Sample records for active protein-coding genes

  1. Activity-Dependent Human Brain Coding/Noncoding Gene Regulatory Networks

    PubMed Central

    Lipovich, Leonard; Dachet, Fabien; Cai, Juan; Bagla, Shruti; Balan, Karina; Jia, Hui; Loeb, Jeffrey A.

    2012-01-01

    While most gene transcription yields RNA transcripts that code for proteins, a sizable proportion of the genome generates RNA transcripts that do not code for proteins, but may have important regulatory functions. The brain-derived neurotrophic factor (BDNF) gene, a key regulator of neuronal activity, is overlapped by a primate-specific, antisense long noncoding RNA (lncRNA) called BDNFOS. We demonstrate reciprocal patterns of BDNF and BDNFOS transcription in highly active regions of human neocortex removed as a treatment for intractable seizures. A genome-wide analysis of activity-dependent coding and noncoding human transcription using a custom lncRNA microarray identified 1288 differentially expressed lncRNAs, of which 26 had expression profiles that matched activity-dependent coding genes and an additional 8 were adjacent to or overlapping with differentially expressed protein-coding genes. The functions of most of these protein-coding partner genes, such as ARC, include long-term potentiation, synaptic activity, and memory. The nuclear lncRNAs NEAT1, MALAT1, and RPPH1, composing an RNAse P-dependent lncRNA-maturation pathway, were also upregulated. As a means to replicate human neuronal activity, repeated depolarization of SY5Y cells resulted in sustained CREB activation and produced an inverse pattern of BDNF-BDNFOS co-expression that was not achieved with a single depolarization. RNAi-mediated knockdown of BDNFOS in human SY5Y cells increased BDNF expression, suggesting that BDNFOS directly downregulates BDNF. Temporal expression patterns of other lncRNA-messenger RNA pairs validated the effect of chronic neuronal activity on the transcriptome and implied various lncRNA regulatory mechanisms. lncRNAs, some of which are unique to primates, thus appear to have potentially important regulatory roles in activity-dependent human brain plasticity. PMID:22960213

  2. Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

    DOE PAGES

    Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...

    2014-10-02

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.

  3. Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.

  4. De Novo Origin of Human Protein-Coding Genes

    PubMed Central

    Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping

    2011-01-01

    The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831

  5. A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.

    PubMed

    Hezroni, Hadas; Ben-Tov Perry, Rotem; Meir, Zohar; Housman, Gali; Lubelsky, Yoav; Ulitsky, Igor

    2017-08-30

    Only a small portion of human long non-coding RNAs (lncRNAs) appear to be conserved outside of mammals, but the events underlying the birth of new lncRNAs in mammals remain largely unknown. One potential source is remnants of protein-coding genes that transitioned into lncRNAs. We systematically compare lncRNA and protein-coding loci across vertebrates, and estimate that up to 5% of conserved mammalian lncRNAs are derived from lost protein-coding genes. These lncRNAs have specific characteristics, such as broader expression domains, that set them apart from other lncRNAs. Fourteen lncRNAs have sequence similarity with the loci of the contemporary homologs of the lost protein-coding genes. We propose that selection acting on enhancer sequences is mostly responsible for retention of these regions. As an example of an RNA element from a protein-coding ancestor that was retained in the lncRNA, we describe in detail a short translated ORF in the JPX lncRNA that was derived from an upstream ORF in a protein-coding gene and retains some of its functionality. We estimate that ~ 55 annotated conserved human lncRNAs are derived from parts of ancestral protein-coding genes, and loss of coding potential is thus a non-negligible source of new lncRNAs. Some lncRNAs inherited regulatory elements influencing transcription and translation from their protein-coding ancestors and those elements can influence the expression breadth and functionality of these lncRNAs.

  6. The Evolution and Expression Pattern of Human Overlapping lncRNA and Protein-coding Gene Pairs.

    PubMed

    Ning, Qianqian; Li, Yixue; Wang, Zhen; Zhou, Songwen; Sun, Hong; Yu, Guangjun

    2017-03-27

    Long non-coding RNA overlapping with protein-coding gene (lncRNA-coding pair) is a special type of overlapping genes. Protein-coding overlapping genes have been well studied and increasing attention has been paid to lncRNAs. By studying lncRNA-coding pairs in human genome, we showed that lncRNA-coding pairs were more likely to be generated by overprinting and retaining genes in lncRNA-coding pairs were given higher priority than non-overlapping genes. Besides, the preference of overlapping configurations preserved during evolution was based on the origin of lncRNA-coding pairs. Further investigations showed that lncRNAs promoting the splicing of their embedded protein-coding partners was a unilateral interaction, but the existence of overlapping partners improving the gene expression was bidirectional and the effect was decreased with the increased evolutionary age of genes. Additionally, the expression of lncRNA-coding pairs showed an overall positive correlation and the expression correlation was associated with their overlapping configurations, local genomic environment and evolutionary age of genes. Comparison of the expression correlation of lncRNA-coding pairs between normal and cancer samples found that the lineage-specific pairs including old protein-coding genes may play an important role in tumorigenesis. This work presents a systematically comprehensive understanding of the evolution and the expression pattern of human lncRNA-coding pairs.

  7. Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs

    PubMed Central

    Cheng, Chao; Ung, Matthew; Grant, Gavin D.; Whitfield, Michael L.

    2013-01-01

    Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements. PMID:23874175

  8. Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

    PubMed

    Seligmann, Hervé

    2013-05-07

    GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
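    The exchange rules quoted in this abstract can be read as cyclic base substitutions. The sketch below is only an illustration of that reading, not the author's pipeline: it assumes that each arrow means "the base on the left is replaced by the base on the right", that T stands in for U/T, and that bases not named in a rule are left unchanged. The function names and the `A>G>C>T>A` rule syntax are hypothetical.

```python
def parse_exchange_rule(rule: str) -> dict[str, str]:
    """Turn a cyclic rule such as 'A>G>C>T>A' into a base-to-base mapping.

    Assumption: each '>' means "left base is replaced by right base".
    """
    bases = rule.upper().split(">")
    if len(bases) < 3 or bases[0] != bases[-1]:
        raise ValueError("rule must be cyclic, e.g. 'A>G>C>T>A'")
    return {src: dst for src, dst in zip(bases, bases[1:])}


def apply_exchange(seq: str, rule: str) -> str:
    """Apply a systematic nucleotide exchange to every position of seq.

    Bases that do not appear in the rule are copied through unchanged.
    """
    mapping = parse_exchange_rule(rule)
    return "".join(mapping.get(base, base) for base in seq.upper())


if __name__ == "__main__":
    # Four-base rule: every base changes.
    print(apply_exchange("ACGT", "A>G>C>T>A"))
    # Three-base rule: A is not part of the cycle and stays as it is.
    print(apply_exchange("ACGT", "C>G>T>C"))
```

    A transformed sequence could then be translated and compared against protein databases, which is the kind of analysis the abstract describes; that step is not sketched here.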

  9. Recognition of Protein-coding Genes Based on Z-curve Algorithms

    PubMed Central

    Guo, Feng-Biao; Lin, Yan; Chen, Ling-Ling

    2014-01-01

    Recognition of protein-coding genes, a classical bioinformatics issue, is an absolutely needed step for annotating newly sequenced genomes. The Z-curve algorithm, as one of the most effective methods on this issue, has been successfully applied in annotating or re-annotating many genomes, including those of bacteria, archaea and viruses. Two Z-curve based ab initio gene-finding programs have been developed: ZCURVE (for bacteria and archaea) and ZCURVE_V (for viruses and phages). ZCURVE_C (for 57 bacteria) and Zfisher (for any bacterium) are web servers for re-annotation of bacterial and archaeal genomes. The above four tools can be used for genome annotation or re-annotation, either independently or combined with the other gene-finding programs. In addition to recognizing protein-coding genes and exons, Z-curve algorithms are also effective in recognizing promoters and translation start sites. Here, we summarize the applications of Z-curve algorithms in gene finding and genome annotation. PMID:24822027
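    The abstract does not define the Z-curve itself. As a rough illustration, the sketch below computes the commonly cited cumulative Z-curve coordinates of a DNA sequence: x separates purines (A, G) from pyrimidines (C, T), y separates amino (A, C) from keto (G, T) bases, and z separates weak (A, T) from strong (G, C) hydrogen-bonding bases. It does not attempt to reproduce ZCURVE, ZCURVE_V, or any other tool named above; the function name and the handling of ambiguous bases are assumptions.

```python
# Per-base step in (x, y, z):
#   x: purine (A, G) = +1, pyrimidine (C, T) = -1
#   y: amino  (A, C) = +1, keto       (G, T) = -1
#   z: weak   (A, T) = +1, strong     (G, C) = -1
_STEPS = {
    "A": (1, 1, 1),
    "G": (1, -1, -1),
    "C": (-1, 1, -1),
    "T": (-1, -1, 1),
}


def z_curve(seq: str) -> list[tuple[int, int, int]]:
    """Return the cumulative Z-curve point after each base of seq.

    Assumption: ambiguous bases (e.g. N) contribute no step, so the
    previous point is repeated at that position.
    """
    x = y = z = 0
    points = []
    for base in seq.upper():
        dx, dy, dz = _STEPS.get(base, (0, 0, 0))
        x, y, z = x + dx, y + dy, z + dz
        points.append((x, y, z))
    return points


if __name__ == "__main__":
    for point in z_curve("ATGGCCNTAA"):
        print(point)
```

    Gene-finding methods built on this representation derive further features from these coordinates, which this sketch leaves out.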

  10. A Dual Origin of the Xist Gene from a Protein-Coding Gene and a Set of Transposable Elements

    PubMed Central

    Elisaphenko, Eugeny A.; Kolesnikov, Nikolay N.; Shevchenko, Alexander I.; Rogozin, Igor B.; Nesterova, Tatyana B.; Brockdorff, Neil; Zakian, Suren M.

    2008-01-01

    X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA. PMID:18575625

  11. Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

    PubMed

    Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

    2017-12-02

    The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
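    GC3, which this abstract correlates with codon usage, is the G+C content at third codon positions. The sketch below is a minimal illustration under stated assumptions, not the authors' procedure: it counts every complete codon of an in-frame coding sequence, including start and stop codons, whereas some published definitions exclude certain codons. The function name is hypothetical.

```python
def gc3(cds: str) -> float:
    """Fraction of G or C at third codon positions of an in-frame CDS.

    Assumptions: the sequence starts in frame; a trailing incomplete
    codon is ignored; all complete codons are counted.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # drop a trailing partial codon
    thirds = cds[2:usable:3]           # every third base, starting at index 2
    if not thirds:
        raise ValueError("sequence contains no complete codon")
    return sum(base in "GC" for base in thirds) / len(thirds)


if __name__ == "__main__":
    # Codons ATG, GCC, TAA have third bases G, C, A.
    print(gc3("ATGGCCTAA"))
```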

  12. Discovery of rare protein-coding genes in model methylotroph Methylobacterium extorquens AM1.

    PubMed

    Kumar, Dhirendra; Mondal, Anupam Kumar; Yadav, Amit Kumar; Dash, Debasis

    2014-12-01

    Proteogenomics involves the use of MS to refine annotation of protein-coding genes and discover genes in a genome. We carried out comprehensive proteogenomic analysis of Methylobacterium extorquens AM1 (ME-AM1) from publicly available proteomics data with a motive to improve annotation for methylotrophs; organisms capable of surviving in reduced carbon compounds such as methanol. Besides identifying 2482(50%) proteins, 29 new genes were discovered and 66 annotated gene models were revised in ME-AM1 genome. One such novel gene is identified with 75 peptides, lacks homolog in other methylobacteria but has glycosyl transferase and lipopolysaccharide biosynthesis protein domains, indicating its potential role in outer membrane synthesis. Many novel genes are present only in ME-AM1 among methylobacteria. Distant homologs of these genes in unrelated taxonomic classes and low GC-content of few genes suggest lateral gene transfer as a potential mode of their origin. Annotations of methylotrophy related genes were also improved by the discovery of a short gene in methylotrophy gene island and redefining a gene important for pyrroquinoline quinone synthesis, essential for methylotrophy. The combined use of proteogenomics and rigorous bioinformatics analysis greatly enhanced the annotation of protein-coding genes in model methylotroph ME-AM1 genome. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. The Drosophila genes CG14593 and CG30106 code for G-protein-coupled receptors specifically activated by the neuropeptides CCHamide-1 and CCHamide-2.

    PubMed

    Hansen, Karina K; Hauser, Frank; Williamson, Michael; Weber, Stine B; Grimmelikhuijzen, Cornelis J P

    2011-01-07

    Recently, a novel neuropeptide, CCHamide, was discovered in the silkworm Bombyx mori (L. Roller et al., Insect Biochem. Mol. Biol. 38 (2008) 1147-1157). We have now found that all insects with a sequenced genome have two genes, each coding for a different CCHamide, CCHamide-1 and -2. We have also cloned and deorphanized two Drosophila G-protein-coupled receptors (GPCRs) coded for by genes CG14593 and CG30106 that are selectively activated by Drosophila CCH-amide-1 (EC(50), 2×10(-9) M) and CCH-amide-2 (EC(50), 5×10(-9) M), respectively. Gene CG30106 (symbol synonym CG14484) has in a previous publication (E.C. Johnson et al., J. Biol. Chem. 278 (2003) 52172-52178) been wrongly assigned to code for an allatostatin-B receptor. This conclusion is based on our findings that the allatostatins-B do not activate the CG30106 receptor and on the recent findings from other research groups that the allatostatins-B activate an unrelated GPCR coded for by gene CG16752. Comparative genomics suggests that a duplication of the CCHamide neuropeptide signalling system occurred after the split of crustaceans and insects, about 410 million years ago, because only one CCHamide neuropeptide gene is found in the water flea Daphnia pulex (Crustacea) and the tick Ixodes scapularis (Chelicerata). Copyright © 2010 Elsevier Inc. All rights reserved.

  14. Transcription of a protein-coding gene on B chromosomes of the Siberian roe deer (Capreolus pygargus)

    PubMed Central

    2013-01-01

    Background: Most eukaryotic species represent stable karyotypes with a particular diploid number. B chromosomes are additional to standard karyotypes and may vary in size, number and morphology even between cells of the same individual. For many years it was generally believed that B chromosomes found in some plant, animal and fungi species lacked active genes. Recently, molecular cytogenetic studies showed the presence of additional copies of protein-coding genes on B chromosomes. However, the transcriptional activity of these genes remained elusive. We studied karyotypes of the Siberian roe deer (Capreolus pygargus) that possess up to 14 B chromosomes to investigate the presence and expression of genes on supernumerary chromosomes. Results: Here, we describe a 2 Mbp region homologous to cattle chromosome 3 and containing TNNI3K (partial), FPGT, LRRIQ3 and a large gene-sparse segment on B chromosomes of the Siberian roe deer. The presence of the copy of the autosomal region was demonstrated by B-specific cDNA analysis, PCR assisted mapping, cattle bacterial artificial chromosome (BAC) clone localization and quantitative polymerase chain reaction (qPCR). By comparative analysis of B-specific and non-B chromosomal sequences we discovered some B chromosome-specific mutations in protein-coding genes, which further enabled the detection of a FPGT-TNNI3K transcript expressed from duplicated genes located on B chromosomes in roe deer fibroblasts. Conclusions: Discovery of a large autosomal segment in all B chromosomes of the Siberian roe deer further corroborates the view of an autosomal origin for these elements. Detection of a B-derived transcript in fibroblasts implies that the protein coding sequences located on Bs are not fully inactivated. The origin, evolution and effect on host of B chromosomal genes seem to be similar to autosomal segmental duplications, which reinforces the view that supernumerary chromosomal elements might play an important role in genome

  15. Emerging Putative Associations between Non-Coding RNAs and Protein-Coding Genes in Neuropathic Pain: Added Value from Reusing Microarray Data.

    PubMed

    Raju, Hemalatha B; Tsinoremas, Nicholas F; Capobianco, Enrico

    2016-01-01

    Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein-protein

  16. Long Non-Coding RNAs Differentially Expressed between Normal versus Primary Breast Tumor Tissues Disclose Converse Changes to Breast Cancer-Related Protein-Coding Genes

    PubMed Central

    Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U.; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N.; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O.

    2014-01-01

    Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the

  17. Long non-coding RNAs differentially expressed between normal versus primary breast tumor tissues disclose converse changes to breast cancer-related protein-coding genes.

    PubMed

    Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O

    2014-01-01

    Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the

  18. Transcriptome interrogation of human myometrium identifies differentially expressed sense-antisense pairs of protein-coding and long non-coding RNA genes in spontaneous labor at term

    PubMed Central

    Romero, Roberto; Tarca, Adi; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S.; Kalita, Cynthia A.; Cai, Juan; Yeo, Lami; Lipovich, Leonard

    2014-01-01

    Objective: The mechanisms responsible for normal and abnormal parturition are poorly understood. Myometrial activation leading to regular uterine contractions is a key component of labor. Dysfunctional labor (arrest of dilatation and/or descent) is a leading indication for cesarean delivery. Compelling evidence suggests that most of these disorders are functional in nature, and not the result of cephalopelvic disproportion. The methodology and the datasets afforded by the post-genomic era provide novel opportunities to understand and target gene functions in these disorders. In 2012, the ENCODE Consortium elucidated the extraordinary abundance and functional complexity of long non-coding RNA genes in the human genome. The purpose of the study was to identify differentially expressed long non-coding RNA genes in human myometrium in women in spontaneous labor at term. Materials and Methods: Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n=19) and women in spontaneous labor at term (n=20). RNA was extracted and profiled using an Illumina® microarray platform. The analysis of the protein coding genes from this study has been previously reported. Here, we have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. Results: Upon considering more than 18,498 distinct lncRNA genes compiled nonredundantly from public experimental data sources, and interrogating 2,634 that matched Illumina microarray probes, we identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an independent experimental method. Intriguingly, one of the two lnc

  19. Biallelic insertion of a transcriptional terminator via the CRISPR/Cas9 system efficiently silences expression of protein-coding and non-coding RNA genes.

    PubMed

    Liu, Yangyang; Han, Xiao; Yuan, Junting; Geng, Tuoyu; Chen, Shihao; Hu, Xuming; Cui, Isabelle H; Cui, Hengmi

    2017-04-07

    The type II bacterial CRISPR/Cas9 system is a simple, convenient, and powerful tool for targeted gene editing. Here, we describe a CRISPR/Cas9-based approach for inserting a poly(A) transcriptional terminator into both alleles of a targeted gene to silence protein-coding and non-protein-coding genes, which often play key roles in gene regulation but are difficult to silence via insertion or deletion of short DNA fragments. The integration of 225 bp of bovine growth hormone poly(A) signals into either the first intron or the first exon or behind the promoter of target genes caused efficient termination of expression of PPP1R12C, NSUN2 (protein-coding genes), and MALAT1 (non-protein-coding gene). Both NeoR and PuroR were used as markers in the selection of clonal cell lines with biallelic integration of a poly(A) signal. Genotyping analysis indicated that the cell lines displayed the desired biallelic silencing after a brief selection period. These combined results indicate that this CRISPR/Cas9-based approach offers an easy, convenient, and efficient novel technique for gene silencing in cell lines, especially for those in which gene integration is difficult because of a low efficiency of homology-directed repair. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  20. Emerging Putative Associations between Non-Coding RNAs and Protein-Coding Genes in Neuropathic Pain: Added Value from Reusing Microarray Data

    PubMed Central

    Raju, Hemalatha B.; Tsinoremas, Nicholas F.; Capobianco, Enrico

    2016-01-01

    Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein–protein

  1. Exogean: a framework for annotating protein-coding genes in eukaryotic genomic DNA

    PubMed Central

    Djebali, Sarah; Delaplace, Franck; Crollius, Hugues Roest

    2006-01-01

    Background Accurate and automatic gene identification in eukaryotic genomic DNA is more than ever of crucial importance to efficiently exploit the large volume of assembled genome sequences available to the community. Automatic methods have always been considered less reliable than human expertise. This is illustrated in the EGASP project, where reference annotations against which all automatic methods are measured are generated by human annotators and experimentally verified. We hypothesized that replicating the accuracy of human annotators in an automatic method could be achieved by formalizing the rules and decisions that they use, in a mathematical formalism. Results We have developed Exogean, a flexible framework based on directed acyclic colored multigraphs (DACMs) that can represent biological objects (for example, mRNA, ESTs, protein alignments, exons) and relationships between them. Graphs are analyzed to process the information according to rules that replicate those used by human annotators. Simple individual starting objects given as input to Exogean are thus combined and synthesized into complex objects such as protein coding transcripts. Conclusion We show here, in the context of the EGASP project, that Exogean is currently the method that best reproduces protein coding gene annotations from human experts, in terms of identifying at least one exact coding sequence per gene. We discuss current limitations of the method and several avenues for improvement. PMID:16925841

  2. The artificial zinc finger coding gene 'Jazz' binds the utrophin promoter and activates transcription.

    PubMed

    Corbi, N; Libri, V; Fanciulli, M; Tinsley, J M; Davies, K E; Passananti, C

    2000-06-01

    Up-regulation of utrophin gene expression is recognized as a plausible therapeutic approach in the treatment of Duchenne muscular dystrophy (DMD). We have designed and engineered new zinc finger-based transcription factors capable of binding and activating transcription from the promoter of the dystrophin-related gene, utrophin. Using the recognition 'code' that proposes specific rules between zinc finger primary structure and potential DNA binding sites, we engineered a new gene named 'Jazz' that encodes for a three-zinc finger peptide. Jazz belongs to the Cys2-His2 zinc finger type and was engineered to target the nine base pair DNA sequence: 5'-GCT-GCT-GCG-3', present in the promoter region of both the human and mouse utrophin gene. The entire zinc finger alpha-helix region, containing the amino acid positions that are crucial for DNA binding, was specifically chosen on the basis of the contacts more frequently represented in the available list of the 'code'. Here we demonstrate that Jazz protein binds specifically to the double-stranded DNA target, with a dissociation constant of about 32 nM. Band shift and super-shift experiments confirmed the high affinity and specificity of Jazz protein for its DNA target. Moreover, we show that chimeric proteins, named Gal4-Jazz and Sp1-Jazz, are able to drive the transcription of a test gene from the human utrophin promoter.
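
    The nine base pair target described above can be located in a DNA sequence with a simple two-strand scan. The following is a minimal sketch, not the authors' method: the function names are hypothetical and the promoter fragment is invented for illustration (it is not the real utrophin promoter).

```python
# Illustrative sketch: scan a sequence for the Jazz target on both strands.
# TARGET is the 9-bp site from the abstract (5'-GCT-GCT-GCG-3', hyphens removed).
# toy_promoter is an invented fragment, NOT the real utrophin promoter.
TARGET = "GCTGCTGCG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_target_sites(seq: str, target: str = TARGET):
    """Return (0-based start, strand) for each occurrence of target on either strand."""
    seq = seq.upper()
    rc_target = reverse_complement(target)
    hits = []
    for i in range(len(seq) - len(target) + 1):
        window = seq[i:i + len(target)]
        if window == target:
            hits.append((i, "+"))
        if window == rc_target:
            hits.append((i, "-"))
    return hits


toy_promoter = "AATTGCTGCTGCGTTAACGCAGCAGCCC"
sites = find_target_sites(toy_promoter)
print(sites)
```

    An exact-match scan only shows where the sequence occurs; it says nothing about binding affinity, which the abstract measures experimentally.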

  3. Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing.

    PubMed

    Kanda, Kojun; Pflug, James M; Sproul, John S; Dasenko, Mark A; Maddison, David R

    2015-01-01

    In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles

  4. Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

    PubMed Central

    Dasenko, Mark A.

    2015-01-01

    In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles

  5. Identification and analysis of unitary loss of long-established protein-coding genes in Poaceae shows evidences for biased gene loss and putatively functional transcription of relics.

    PubMed

    Zhao, Yi; Tang, Liang; Li, Zhe; Jin, Jinpu; Luo, Jingchu; Gao, Ge

    2015-04-18

    Long-established protein-coding genes may lose their coding potential during evolution ("unitary gene loss"). Members of the Poaceae family are a major food source and represent an ideal model clade for plant evolution research. However, the global pattern of unitary gene loss in Poaceae genomes and the evolutionary fate of the lost genes remain largely uninvestigated. Using a locally developed pipeline, we identified 129 unitary gene loss events for long-established protein-coding genes from four representative species of Poaceae, i.e. brachypodium, rice, sorghum and maize. Functional annotation suggested that the genes lost in all or most Poaceae species are enriched for genes involved in development and response to endogenous stimulus. We also found that 44 mutated genomic loci of lost genes, which we refer to as relics, were still actively transcribed, and that 84% of them (37 of 44) showed significantly differential expression across tissues. More interestingly, we found a total of five expressed relics that may function as competitive endogenous RNAs in the brachypodium, rice and sorghum genomes. Based on comparative genomics and transcriptome data, we compiled, for the first time, a comprehensive catalogue of unitary gene loss events in Poaceae species, characterized a statistically significant functional preference for these lost genes, and showed the potential of relics to function as competitive endogenous RNAs in Poaceae genomes.

  6. A compendium of transcription factor and Transcriptionally active protein coding gene families in cowpea (Vigna unguiculata L.).

    PubMed

    Misra, Vikram A; Wang, Yu; Timko, Michael P

    2017-11-22

    Cowpea (Vigna unguiculata (L.) Walp.) is the most important food and forage legume in the semi-arid tropics of sub-Saharan Africa, where approximately 80% of worldwide production takes place, primarily on low-input, subsistence farm sites. Among the major goals of cowpea breeding and improvement programs are the rapid manipulation of agronomic traits for seed size and quality and improved resistance to abiotic and biotic stresses to enhance productivity. Knowing the suite of transcription factors (TFs) and transcriptionally active proteins (TAPs) that control various critical plant cellular processes would contribute tremendously to these improvement aims. We used a computational approach that employed three different predictive pipelines to data mine the cowpea genome and identified over 4400 genes representing 136 different TF and TAP families. We compare the information content of cowpea with that of two evolutionarily close species, common bean (Phaseolus vulgaris) and soybean (Glycine max), to gauge the relative informational content. Our data indicate that, correcting for genome size, cowpea has fewer TF and TAP genes than common bean (4408/5291) and soybean (4408/11,065). Members of the GROWTH-REGULATING FACTOR (GRF) and Auxin/indole-3-acetic acid (Aux/IAA) gene families appear to be over-represented in the genome relative to common bean and soybean, whereas members of the MADS (Minichromosome maintenance deficient 1 (MCM1), AGAMOUS, DEFICIENS, and serum response factor (SRF)) and C2C2-YABBY families appear to be under-represented. Analysis of the APETALA2-Ethylene Responsive Element Binding Protein (AP2-EREBP), NAC (NAM (no apical meristem), ATAF1, 2 (Arabidopsis transcription activation factor), CUC (cup-shaped cotyledon)), and WRKY families, known to be important in defense signaling, revealed changes and phylogenetic rearrangements relative to common bean and soybean that suggest these groups may have evolved different functions. The availability of detailed

  7. Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.

    PubMed

    Zhang, Chun-Ting; Wang, Ju; Zhang, Ren

    2002-02-01

    The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at the three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or in eukaryotic genomes with fewer introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is at most 5579, about 3.8-7.0% fewer than the currently widely accepted 5800-6000. The base compositions at the three codon positions are analyzed in detail using a graphic method. The result shows that the codons preferred by yeast genes are of the RGW type, where R, G and W indicate purine, non-G and A/T bases, respectively, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
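
    The idea of discriminating ORFs by nucleotide frequencies at the three codon positions can be sketched as a nearest-centroid classifier. This is a simplified illustration under stated assumptions, not the published algorithm: the 12-dimensional feature vector, the centroid rule, all function names and the tiny training sequences are invented for the example.

```python
import math

BASES = "ACGT"


def codon_position_frequencies(orf: str):
    """12-dimensional vector: frequency of A, C, G, T at codon positions 1, 2 and 3."""
    orf = orf.upper()
    usable = len(orf) - len(orf) % 3  # ignore a trailing partial codon
    counts = [[0] * 4 for _ in range(3)]
    for i in range(usable):
        base = orf[i]
        if base in BASES:
            counts[i % 3][BASES.index(base)] += 1
    vector = []
    for position_counts in counts:
        total = sum(position_counts) or 1  # avoid division by zero on empty input
        vector.extend(c / total for c in position_counts)
    return vector


def centroid(vectors):
    """Component-wise mean of equally long vectors."""
    n = len(vectors)
    return [sum(v[k] for v in vectors) / n for k in range(len(vectors[0]))]


def euclid(u, v):
    """Euclidean distance between two vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def classify(orf, coding_center, noncoding_center):
    """Assign the ORF to whichever class centroid is nearer (assumed decision rule)."""
    x = codon_position_frequencies(orf)
    if euclid(x, coding_center) < euclid(x, noncoding_center):
        return "coding"
    return "non-coding"


# Invented toy training sets; real use would need thousands of annotated ORFs.
coding_examples = ["ATGGCTGCAGATGCTGAAGCATAA", "ATGGAAGCTGATGGTGCAGCTTAA"]
noncoding_examples = ["TTTATATCCGCGTATAGGCCATAT", "CACACATATTTGGGCCCTATATGC"]

coding_center = centroid([codon_position_frequencies(s) for s in coding_examples])
noncoding_center = centroid([codon_position_frequencies(s) for s in noncoding_examples])

print(classify(coding_examples[0], coding_center, noncoding_center))
```

    Classifying training sequences, as done here, only demonstrates the mechanics; the accuracy quoted in the abstract comes from cross-validation on held-out ORFs.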

  8. Multiple copies of genes coding for electron transport proteins in the bacterium Nitrosomonas europaea.

    PubMed

    McTavish, H; LaQuier, F; Arciero, D; Logan, M; Mundfrom, G; Fuchs, J A; Hooper, A B

    1993-04-01

    The genome of Nitrosomonas europaea contains at least three copies each of the genes coding for hydroxylamine oxidoreductase (HAO) and cytochrome c554. A copy of an HAO gene is always located within 2.7 kb of a copy of a cytochrome c554 gene. Cytochrome P-460, a protein that shares very unusual spectral features with HAO, was found to be encoded by a gene separate from the HAO genes.

  9. Natural selection in avian protein-coding genes expressed in brain.

    PubMed

    Axelsson, Erik; Hultin-Rosenberg, Lina; Brandström, Mikael; Zwahlén, Martin; Clayton, David F; Ellegren, Hans

    2008-06-01

    The evolution of birds from theropod dinosaurs took place approximately 150 million years ago, and was associated with a number of specific adaptations that are still evident among extant birds, including feathers, song and extravagant secondary sexual characteristics. Knowledge about the molecular evolutionary background to such adaptations is lacking. Here, we analyse the evolution of > 5000 protein-coding gene sequences expressed in zebra finch brain by comparison to orthologous sequences in chicken. Mean d(N)/d(S) is 0.085 and genes with their maximal expression in the eye and central nervous system have the lowest mean d(N)/d(S) value, while those expressed in digestive and reproductive tissues exhibit the highest. We find that fast-evolving genes (those which have higher than expected rate of nonsynonymous substitution, indicative of adaptive evolution) are enriched for biological functions such as fertilization, muscle contraction, defence response, response to stress, wounding and endogenous stimulus, and cell death. After alignment to mammalian orthologues, we identify a catalogue of 228 genes that show a significantly higher rate of protein evolution in the two bird lineages than in mammals. These accelerated bird genes, representing candidates for avian-specific adaptations, include genes implicated in vocal learning and other cognitive processes. Moreover, colouration genes evolve faster in birds than in mammals, which may have been driven by sexual selection for extravagant plumage characteristics.
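
    The d(N)/d(S) ratio reported above compares nonsynonymous and synonymous substitution rates. As a rough sketch, assuming the counts of sites and differences are already known and applying the standard Jukes-Cantor correction to each proportion, the ratio can be computed as follows. The abstract does not state which estimator the authors used, and the function names and counts below are hypothetical.

```python
import math


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance for an observed proportion of differences p."""
    if not 0.0 <= p < 0.75:
        raise ValueError("proportion must be in [0, 0.75) for the correction to be defined")
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)


def dn_ds(nonsyn_diffs: float, nonsyn_sites: float,
          syn_diffs: float, syn_sites: float) -> float:
    """Ratio of corrected nonsynonymous to synonymous substitutions per site."""
    dn = jukes_cantor(nonsyn_diffs / nonsyn_sites)
    ds = jukes_cantor(syn_diffs / syn_sites)
    if ds == 0.0:
        raise ValueError("dS is zero; the ratio is undefined")
    return dn / ds


# Hypothetical counts for one aligned gene pair (not data from the study).
omega = dn_ds(nonsyn_diffs=6, nonsyn_sites=700, syn_diffs=20, syn_sites=300)
print(round(omega, 3))
```

    A ratio well below 1, like the mean of 0.085 in the abstract, indicates that most nonsynonymous changes are removed by purifying selection.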

  10. New genes from non-coding sequence: the role of de novo protein-coding genes in eukaryotic evolutionary innovation.

    PubMed

    McLysaght, Aoife; Guerzoni, Daniele

    2015-09-26

    The origin of novel protein-coding genes de novo was once considered so improbable as to be impossible. In less than a decade, and especially in the last five years, this view has been overturned by extensive evidence from diverse eukaryotic lineages. There is now evidence that this mechanism has contributed a significant number of genes to genomes of organisms as diverse as Saccharomyces, Drosophila, Plasmodium, Arabidopisis and human. From simple beginnings, these genes have in some instances acquired complex structure, regulated expression and important functional roles. New genes are often thought of as dispensable late additions; however, some recent de novo genes in human can play a role in disease. Rather than an extremely rare occurrence, it is now evident that there is a relatively constant trickle of proto-genes released into the testing ground of natural selection. It is currently unknown whether de novo genes arise primarily through an 'RNA-first' or 'ORF-first' pathway. Either way, evolutionary tinkering with this pool of genetic potential may have been a significant player in the origins of lineage-specific traits and adaptations. © 2015 The Authors.

  11. Differential protein-coding gene and long noncoding RNA expression in smoking-related lung squamous cell carcinoma.

    PubMed

    Li, Shicheng; Sun, Xiao; Miao, Shuncheng; Liu, Jia; Jiao, Wenjie

    2017-11-01

    Cigarette smoking is one of the greatest preventable risk factors for developing cancer, and most cases of lung squamous cell carcinoma (lung SCC) are associated with smoking. The pathogenic mechanism of tumor progression is unclear. This study aimed to identify biomarkers in smoking-related lung cancer, including protein-coding genes, long noncoding RNAs, and transcription factors. We selected and obtained messenger RNA microarray datasets and clinical data from the Gene Expression Omnibus database to identify gene expression altered by cigarette smoking. Integrated bioinformatic analysis was used to clarify biological functions of the identified genes, including Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses, construction of a protein-protein interaction network, transcription factor prediction, and statistical analyses. Subsequent quantitative real-time PCR was utilized to verify these bioinformatic analyses. Five hundred and ninety-eight differentially expressed genes and 21 long noncoding RNAs were identified in smoking-related lung SCC. GO and KEGG pathway analysis showed that the identified genes were enriched in cancer-related functions and pathways. The protein-protein interaction network revealed seven hub genes in lung SCC. Several transcription factors and their binding sites were predicted. The results of real-time quantitative PCR revealed that AURKA and BIRC5 were significantly upregulated and LINC00094 was downregulated in the tumor tissues of smoking patients. Further statistical analysis indicated that dysregulation of AURKA, BIRC5, and LINC00094 indicated poor prognosis in lung SCC. The protein-coding genes AURKA and BIRC5 and the long noncoding RNA LINC00094 could be biomarkers or therapeutic targets for smoking-related lung SCC. © 2017 The Authors. Thoracic Cancer published by China Lung Oncology Group and John Wiley & Sons Australia, Ltd.

  12. Morphometric Analysis of Recognized Genes for Autism Spectrum Disorders and Obesity in Relationship to the Distribution of Protein-Coding Genes on Human Chromosomes.

    PubMed

    McGuire, Austen B; Rafi, Syed K; Manzardo, Ann M; Butler, Merlin G

    2016-05-05

    Mammalian chromosomes are comprised of complex chromatin architecture with the specific assembly and configuration of each chromosome influencing gene expression and function in yet undefined ways by varying degrees of heterochromatinization that result in Giemsa (G) negative euchromatic (light) bands and G-positive heterochromatic (dark) bands. We carried out morphometric measurements of high-resolution chromosome ideograms for the first time to characterize the total euchromatic and heterochromatic chromosome band length, distribution and localization of 20,145 known protein-coding genes, 790 recognized autism spectrum disorder (ASD) genes and 365 obesity genes. The individual lengths of G-negative euchromatin and G-positive heterochromatin chromosome bands were measured in millimeters and recorded from scaled and stacked digital images of 850-band high-resolution ideograms supplied by the International Society of Chromosome Nomenclature (ISCN) 2013. Our overall measurements followed established banding patterns based on chromosome size. G-negative euchromatic band regions contained 60% of protein-coding genes while the remaining 40% were distributed across the four heterochromatic dark band sub-types. ASD genes were disproportionately overrepresented in the darker heterochromatic sub-bands, while the obesity gene distribution pattern did not significantly differ from protein-coding genes. Our study supports recent trends implicating genes located in heterochromatin regions playing a role in biological processes including neurodevelopment and function, specifically genes associated with ASD.

  13. Transcriptome interrogation of human myometrium identifies differentially expressed sense-antisense pairs of protein-coding and long non-coding RNA genes in spontaneous labor at term.

    PubMed

    Romero, Roberto; Tarca, Adi L; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S; Kalita, Cynthia A; Cai, Juan; Yeo, Lami; Lipovich, Leonard

    2014-09-01

    To identify differentially expressed long non-coding RNA (lncRNA) genes in human myometrium in women with spontaneous labor at term. Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n = 19) and women in spontaneous labor at term (n = 20). RNA was extracted and profiled using an Illumina® microarray platform. We have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. We identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an experimental method completely independent of the microarray analysis. Intriguingly, one of the two lncRNA genes differentially expressed in term labor had a key genomic structure element, a splice site, that lacked evolutionary conservation beyond primates. We provide, for the first time, evidence for coordinated differential expression and correlation of cis-encoded antisense lncRNAs and protein-coding genes with known as well as novel roles in pregnancy in the myometrium of women in spontaneous labor at term.

  14. Evaluation of the efficacy of twelve mitochondrial protein-coding genes as barcodes for mollusk DNA barcoding.

    PubMed

    Yu, Hong; Kong, Lingfeng; Li, Qi

    2016-01-01

    In this study, we evaluated the efficacy of 12 mitochondrial protein-coding genes from 238 mitochondrial genomes of 140 molluscan species as potential DNA barcodes for mollusks. Three barcoding methods (distance, monophyly and character-based methods) were used in species identification. The species recovery rates based on genetic distances for the 12 genes ranged from 70.83 to 83.33%. There were no significant differences in intra- or interspecific variability among the 12 genes. The monophyly and character-based methods provided higher resolution than the distance-based method in species delimitation. Especially in closely related taxa, the character-based method showed some advantages. The results suggested that besides the standard COI barcode, other 11 mitochondrial protein-coding genes could also be potentially used as a molecular diagnostic for molluscan species discrimination. Our results also showed that the combination of mitochondrial genes did not enhance the efficacy for species identification and a single mitochondrial gene would be fully competent.
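
    Distance-based barcoding, one of the three methods compared above, assigns a query to the reference sequence with the smallest genetic distance. The sketch below uses the uncorrected p-distance on pre-aligned sequences. It is an illustration only: the study's actual distance model is not given in the abstract, and the function names, species labels and sequences are invented.

```python
def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences, skipping gaps and ambiguity codes."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    diffs = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "ACGT" and y in "ACGT":
            compared += 1
            if x != y:
                diffs += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return diffs / compared


def identify(query: str, library: dict) -> str:
    """Return the library label whose barcode has the smallest p-distance to the query."""
    return min(library, key=lambda name: p_distance(query, library[name]))


# Invented toy reference library of aligned barcode fragments.
library = {
    "species_A": "ATGGCATTAGGC",
    "species_B": "ATGACTTTGGGA",
}
query = "ATGGCATTAGGA"

best = identify(query, library)
print(best, p_distance(query, library[best]))
```

    In practice a distance threshold is also needed so that a query from a species missing from the library is not forced onto its nearest neighbour.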

  15. Rate heterogeneity in six protein-coding genes from the holoparasite Balanophora (Balanophoraceae) and other taxa of Santalales

    PubMed Central

    Su, Huei-Jiun; Hu, Jer-Ming

    2012-01-01

    Background and Aims The holoparasitic flowering plant Balanophora displays extreme floral reduction and was previously found to have enormous rate acceleration in the nuclear 18S rDNA region. So far, it remains unclear whether non-ribosomal, protein-coding genes of Balanophora also evolve in an accelerated fashion and whether the genes with high substitution rates retain their functionality. To tackle these issues, six different genes were sequenced from two Balanophora species and their rate variation and expression patterns were examined. Methods Sequences including nuclear PI, euAP3, TM6, LFY and RPB2 and mitochondrial matR were determined from two Balanophora spp. and compared with selected hemiparasitic species of Santalales and autotrophic core eudicots. Gene expression was detected for the six protein-coding genes and the expression patterns of the three B-class genes (PI, AP3 and TM6) were further examined across different organs of B. laxiflora using RT-PCR. Key Results Balanophora mitochondrial matR is highly accelerated in both nonsynonymous (dN) and synonymous (dS) substitution rates, whereas the rate variation of nuclear genes LFY, PI, euAP3, TM6 and RPB2 are less dramatic. Significant dS increases were detected in Balanophora PI, TM6, RPB2 and dN accelerations in euAP3. All of the protein-coding genes are expressed in inflorescences, indicative of their functionality. PI is restrictively expressed in tepals, synandria and floral bracts, whereas AP3 and TM6 are widely expressed in both male and female inflorescences. Conclusions Despite the observation that rates of sequence evolution are generally higher in Balanophora than in hemiparasitic species of Santalales and autotrophic core eudicots, the five nuclear protein-coding genes are functional and are evolving at a much slower rate than 18S rDNA. The mechanism or mechanisms responsible for rapid sequence evolution and concomitant rate acceleration for 18S rDNA and matR are currently not well

  16. Inheritance-mode specific pathogenicity prioritization (ISPP) for human protein coding genes.

    PubMed

    Hsu, Jacob Shujui; Kwan, Johnny S H; Pan, Zhicheng; Garcia-Barcelo, Maria-Mercè; Sham, Pak Chung; Li, Miaoxin

    2016-10-15

    Exome sequencing studies have facilitated the detection of causal genetic variants in yet-unsolved Mendelian diseases. However, the identification of disease causal genes among a list of candidates in an exome sequencing study is still not fully settled, and it is often difficult to prioritize candidate genes for follow-up studies. The inheritance mode provides crucial information for understanding Mendelian diseases, but none of the existing gene prioritization tools fully utilize this information. We examined the characteristics of Mendelian disease genes under different inheritance modes. The results suggest that Mendelian disease genes with autosomal dominant (AD) inheritance mode are more sensitive to haploinsufficiency and de novo mutations, whereas autosomal recessive (AR) genes have significantly more non-synonymous variants and regulatory transcript isoforms. In addition, the X-linked (XL) Mendelian disease genes have fewer non-synonymous and synonymous variants. As a result, we derived a new scoring system for prioritizing candidate genes for Mendelian diseases according to the inheritance mode. Our scoring system assigned to each annotated protein-coding gene (N = 18 859) three pathogenic scores according to the inheritance mode (AD, AR and XL). This inheritance mode-specific framework achieved higher accuracy (area under curve = 0.84) in XL mode. The inheritance-mode specific pathogenicity prioritization (ISPP) outperformed other well-known methods including Haploinsufficiency, Recessive, Network centrality, Genic Intolerance, Gene Damage Index and Gene Constraint scores. This systematic study suggests that genes manifesting disease inheritance modes tend to have unique characteristics. ISPP is included in KGGSeq v1.0 (http://grass.cgs.hku.hk/limx/kggseq/), and source code is available from https://github.com/jacobhsu35/ISPP.git. Contact: mxli@hku.hk. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author

  17. MitoNuc: a database of nuclear genes coding for mitochondrial proteins. Update 2002.

    PubMed

    Attimonelli, Marcella; Catalano, Domenico; Gissi, Carmela; Grillo, Giorgio; Licciulli, Flavio; Liuni, Sabino; Santamaria, Monica; Pesole, Graziano; Saccone, Cecilia

    2002-01-01

    Mitochondria, besides their central role in energy metabolism, have recently been found to be involved in a number of basic processes of cell life and to contribute to the pathogenesis of many degenerative diseases. All functions of mitochondria depend on the interaction of nuclear and organelle genomes. Mitochondrial genomes have been extensively sequenced and analysed and data have been collected in several specialised databases. In order to collect information on nuclear coded mitochondrial proteins we developed MitoNuc, a database containing detailed information on sequenced nuclear genes coding for mitochondrial proteins in Metazoa. The MitoNuc database can be retrieved through SRS and is available via the web site http://bighost.area.ba.cnr.it/mitochondriome where other mitochondrial databases developed by our group, the complete list of the sequenced mitochondrial genomes, links to other mitochondrial sites and related information, are available. The MitoAln database, related to MitoNuc in the previous release, reporting the multiple alignments of the relevant homologous protein coding regions, is no longer supported in the present release. In order to keep the links among entries in MitoNuc from homologous proteins, a new field in the database has been defined: the cluster identifier, an alpha numeric code used to identify each cluster of homologous proteins. A comment field derived from the corresponding SWISS-PROT entry has been introduced; this reports clinical data related to dysfunction of the protein. The logic scheme of MitoNuc database has been implemented in the ORACLE DBMS. This will allow the end-users to retrieve data through a friendly interface that will be soon implemented.

  18. cncRNAs: Bi-functional RNAs with protein coding and non-coding functions

    PubMed Central

    Kumari, Pooja; Sampath, Karuna

    2015-01-01

    For many decades, the major function of mRNA was thought to be to provide protein-coding information embedded in the genome. The advent of high-throughput sequencing has led to the discovery of pervasive transcription of eukaryotic genomes and opened the world of RNA-mediated gene regulation. Many regulatory RNAs have been found to be incapable of protein coding and are hence termed non-coding RNAs (ncRNAs). However, studies in recent years have shown that several previously annotated non-coding RNAs have the potential to encode proteins, and conversely, some coding RNAs have regulatory functions independent of the protein they encode. Such bi-functional RNAs, with both protein coding and non-coding functions, which we term ‘cncRNAs’, have emerged as new players in cellular systems. Here, we describe the functions of some cncRNAs identified from bacteria to humans. Because the functions of many RNAs across genomes remain unclear, we propose that RNAs be classified as coding, non-coding or both only after careful analysis of their functions. PMID:26498036

  19. The spatial distribution of fixed mutations within genes coding for proteins

    NASA Technical Reports Server (NTRS)

    Holmquist, R.; Goodman, M.; Conroy, T.; Czelusniak, J.

    1983-01-01

    An examination has been conducted of the extensive amino acid sequence data now available for five protein families - the alpha crystallin A chain, myoglobin, alpha and beta hemoglobin, and the cytochromes c - with the goal of estimating the true spatial distribution of base substitutions within genes that code for proteins. In every case the commonly used Poisson density failed to even approximate the experimental pattern of base substitution. For the 87 species of beta hemoglobin examined, for example, the probability that the observed results were from a Poisson process was the minuscule 10 to the -44th. Analogous results were obtained for the other functional families. All the data were reasonably, but not perfectly, described by the negative binomial density. In particular, most of the data were described by one of the very simple limiting forms of this density, the geometric density. The implications of this for evolutionary inference are discussed. It is evident that most estimates of total base substitutions between genes are badly in need of revision.
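
    The comparison described above, a Poisson density against the geometric limiting form of the negative binomial, can be illustrated with a minimal sketch. The counts below are hypothetical values made up for illustration, not data from the paper, and the function names are assumptions of this example; each model is fitted by maximum likelihood and the log-likelihoods are compared.

```python
import math

def poisson_loglik(counts):
    """Log-likelihood of the counts under a Poisson density fitted by MLE."""
    lam = sum(counts) / len(counts)  # MLE of the Poisson mean (assumed > 0)
    return sum(k * math.log(lam) - lam - math.lgamma(k + 1) for k in counts)

def geometric_loglik(counts):
    """Log-likelihood under a geometric density on 0, 1, 2, ... fitted by MLE.

    The geometric density is the negative binomial with shape parameter 1.
    """
    mean = sum(counts) / len(counts)
    p = 1.0 / (1.0 + mean)  # MLE of the success probability
    return sum(math.log(p) + k * math.log(1.0 - p) for k in counts)

# Hypothetical substitutions-per-site counts (illustrative only):
# many sites with no change and a long tail of heavily substituted sites.
counts = [0] * 60 + [1] * 20 + [2] * 10 + [3] * 5 + [5] * 3 + [8] * 2

if __name__ == "__main__":
    print("Poisson log-likelihood:  ", poisson_loglik(counts))
    print("Geometric log-likelihood:", geometric_loglik(counts))
```

    For overdispersed counts like these, the geometric fit gives the higher log-likelihood, which is the qualitative pattern the abstract reports; the sketch says nothing about the actual magnitudes in the study.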

  20. ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

    PubMed

    Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2017-01-04

    The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) contributes to various biological processes and is linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represents about a 20-fold expansion compared to the previously released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed the 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulation of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins.

    PubMed

    Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro

    2008-01-03

    The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13-nucleotide-long open reading frame and a second exon comprising the remainder of the putative coding region. The expression of these genes is induced at 10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.

  2. Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

    PubMed

    Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

    2017-10-03

    Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.

  3. Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes

    PubMed Central

    Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

    2017-01-01

    Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes. PMID:29108274

  4. Intraarticular expression of biologically active interleukin 1-receptor-antagonist protein by ex vivo gene transfer.

    PubMed Central

    Bandara, G; Mueller, G M; Galea-Lauri, J; Tindal, M H; Georgescu, H I; Suchanek, M K; Hung, G L; Glorioso, J C; Robbins, P D; Evans, C H

    1993-01-01

    Gene therapy offers a radically different approach to the treatment of arthritis. Here we have demonstrated that two marker genes (lacZ and neo) and cDNA coding for a potentially therapeutic protein (human interleukin 1-receptor-antagonist protein; IRAP or IL-1ra) can be delivered, by ex vivo techniques, to the synovial lining of joints; intraarticular expression of IRAP inhibited intraarticular responses to interleukin 1. To achieve this, lapine synoviocytes were first transduced in culture by retroviral infection. The genetically modified synovial cells were then transplanted by intraarticular injection into the knee joints of rabbits, where they efficiently colonized the synovium. Assay of joint lavages confirmed the in vivo expression of biologically active human IRAP. With allografted cells, IRAP expression was lost by 12 days after transfer. In contrast, autografted synoviocytes continued to express IRAP for approximately 5 weeks. Knee joints expressing human IRAP were protected from the leukocytosis that otherwise follows the intraarticular injection of recombinant human interleukin 1 beta. Thus, we report the intraarticular expression and activity of a potentially therapeutic protein by gene-transfer technology; these experiments demonstrate the feasibility of treating arthritis and other joint disorders with gene therapy. PMID:8248169

  5. Evidence for the recent origin of a bacterial protein-coding, overlapping orphan gene by evolutionary overprinting.

    PubMed

    Fellner, Lea; Simon, Svenja; Scherling, Christian; Witting, Michael; Schober, Steffen; Polte, Christine; Schmitt-Kopplin, Philippe; Keim, Daniel A; Scherer, Siegfried; Neuhaus, Klaus

    2015-12-18

    Gene duplication is believed to be the classical way to form novel genes, but overprinting may be an important alternative. Overprinting allows entirely novel proteins to evolve de novo, i.e., formerly non-coding open reading frames within functional genes become expressed. Only three cases have been described for Escherichia coli. Here, a fourth example is presented. RNA sequencing revealed an open reading frame weakly transcribed in cow dung, coding for 101 residues and embedded completely in the -2 reading frame of citC in enterohemorrhagic E. coli. This gene is designated novel overlapping gene, nog1. The promoter region fused to gfp exhibits specific activities and 5' rapid amplification of cDNA ends indicated the transcriptional start 40-bp upstream of the start codon. nog1 was strand-specifically arrested in translation by a nonsense mutation silent in citC. This Nog1-mutant showed a phenotype in competitive growth against wild type in the presence of MgCl2. Small differences in metabolite concentrations were also found. Bioinformatic analyses propose Nog1 to be inner membrane-bound and to possess at least one membrane-spanning domain. A phylogenetic analysis suggests that the orphan gene nog1 arose by overprinting after Escherichia/Shigella separated from the other γ-proteobacteria. Since nog1 is of recent origin, non-essential, short, weakly expressed and only marginally involved in E. coli's central metabolism, we propose that this gene is in an initial stage of evolution. While we present specific experimental evidence for the existence of a fourth overlapping gene in enterohemorrhagic E. coli, we believe that this may be an initial finding only and overlapping genes in bacteria may be more common than is currently assumed by microbiologists.

  6. Development-related expression patterns of protein-coding and miRNA genes involved in porcine muscle growth.

    PubMed

    Wang, F J; Jin, L; Guo, Y Q; Liu, R; He, M N; Li, M Z; Li, X W

    2014-11-27

    Muscle growth and development is associated with remarkable changes in protein-coding and microRNA (miRNA) gene expression. To determine the expression patterns of genes and miRNAs related to muscle growth and development, we measured the expression levels of 25 protein-coding and 16 miRNA genes in skeletal and cardiac muscles throughout 5 developmental stages by quantitative reverse transcription-polymerase chain reaction. The Short Time-Series Expression Miner (STEM) software clustering results showed that growth-related genes were downregulated at all developmental stages in both the psoas major and longissimus dorsi muscles, indicating their involvement in early developmental stages. Furthermore, genes related to muscle atrophy, such as forkhead box 1 and muscle ring finger, showed unregulated expression with increasing age, suggesting a decrease in protein synthesis during the later stages of skeletal muscle development. We found that development of the cardiac muscle was a complex process in which growth-related genes were highly expressed during embryonic development, but they did not show uniform postnatal expression patterns. Moreover, the expression level of miR-499, which enhances the expression of the β-myosin heavy chain, was significantly different in the psoas major and longissimus dorsi muscles, suggesting the involvement of miR-499 in the determination of skeletal muscle fiber types. We also performed correlation analyses of messenger RNA and miRNA expression. We found negative relationships between miR-486 and forkhead box 1, and miR-133a and serum response factor at all developmental stages, suggesting that forkhead box 1 and serum response factor are potential targets of miR-486 and miR-133a, respectively.

  7. Evolution of the alternative AQP2 gene: Acquisition of a novel protein-coding sequence in dolphins.

    PubMed

    Kishida, Takushi; Suzuki, Miwa; Takayama, Asuka

    2018-01-01

    Taxon-specific de novo protein-coding sequences are thought to be important for taxon-specific environmental adaptation. A recent study revealed that bottlenose dolphins acquired a novel isoform of aquaporin 2 generated by alternative splicing (alternative AQP2), which helps dolphins to live in hyperosmotic seawater. The AQP2 gene consists of four exons, but the alternative AQP2 gene lacks the fourth exon and instead has a longer third exon that includes the original third exon and a part of the original third intron. Here, we show that the latter half of the third exon of the alternative AQP2 arose from a non-protein-coding sequence. Intact ORF of this de novo sequence is shared not by all cetaceans, but only by delphinoids. However, this sequence is conservative in all modern cetaceans, implying that this de novo sequence potentially plays important roles for marine adaptation in cetaceans. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Activation of multiple mitogen-activated protein kinases by recombinant calcitonin gene-related peptide receptor.

    PubMed

    Parameswaran, N; Disa, J; Spielman, W S; Brooks, D P; Nambi, P; Aiyar, N

    2000-02-18

    Calcitonin gene-related peptide is a 37-amino-acid neuropeptide and a potent vasodilator. Although calcitonin gene-related peptide has been shown to have a number of effects in a variety of systems, the mechanisms of action and the intracellular signaling pathways, especially the regulation of the mitogen-activated protein kinase (MAPK) pathway, are not known. In the present study we investigated the role of calcitonin gene-related peptide in the regulation of MAPKs in human embryonic kidney (HEK) 293 cells stably transfected with a recombinant porcine calcitonin gene-related peptide-1 receptor. Calcitonin gene-related peptide caused a significant dose-dependent increase in cAMP response and the effect was inhibited by calcitonin gene-related peptide(8-37), the calcitonin gene-related peptide-receptor antagonist. Calcitonin gene-related peptide also caused a time- and concentration-dependent increase in extracellular signal-regulated kinase (ERK) and P38 mitogen-activated protein kinase (P38 MAPK) activities, with apparently no significant change in cjun-N-terminal kinase (JNK) activity. Forskolin, a direct activator of adenylyl cyclase, also stimulated ERK and P38 activities in these cells, suggesting the involvement of cAMP in this process. Calcitonin gene-related peptide-stimulated ERK and P38 MAPK activities were inhibited significantly by the calcitonin gene-related peptide receptor antagonist, calcitonin gene-related peptide-(8-37), suggesting the involvement of calcitonin gene-related peptide-1 receptor. Preincubation of the cells with the cAMP-dependent protein kinase inhibitor, H89 [{N-[2-((p-bromocinnamyl)amino)ethyl]-5-isoquinolinesulfonamide, hydrochloride}] inhibited calcitonin gene-related peptide-mediated activation of ERK and p38 kinases. On the other hand, preincubation of the cells with wortmannin {[1S-(1alpha,6balpha,9abeta,11alpha, 11bbeta)]-11-(acetyloxy)-1,6b,7,8,9a,10,11, 11b-octahydro-1-(methoxymethyl)-9a,11b-dimethyl-3H-furo[4,3, 2-de]indeno[4,5-h]-2

  9. [Convergent origin of repeats in genes coding for globular proteins. An analysis of the factors determining the presence of inverted and symmetrical repeats].

    PubMed

    Solov'ev, V V; Kel', A E; Kolchanov, N A

    1989-01-01

    The factors determining the presence of inverted and symmetrical repeats in genes coding for globular proteins have been analysed. The analysis of symmetrical repeats revealed an interesting property of the genetic code: pairs of symmetrical codons correspond to pairs of amino acids with mostly similar physicochemical parameters. This property may explain why symmetrical repeats and palindromes are present only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physicochemical properties occupy symmetrical positions. A stochastic model of the evolution of polynucleotide sequences has been used for the analysis of inverted repeats. The modelling demonstrated that a constraint on sequences (uneven frequencies of used codons) is alone sufficient for nonrandom inverted repeats to arise in genes.

  10. Amino- and carboxyl-terminal amino acid sequences of proteins coded by gag gene of murine leukemia virus

    PubMed Central

    Oroszlan, Stephen; Henderson, Louis E.; Stephenson, John R.; Copeland, Terry D.; Long, Cedric W.; Ihle, James N.; Gilden, Raymond V.

    1978-01-01

    The amino- and carboxyl-terminal amino acid sequences of proteins (p10, p12, p15, and p30) coded by the gag gene of Rauscher and AKR murine leukemia viruses were determined. Among these proteins, p15 from both viruses appears to have a blocked amino end. Proline was found to be the common NH2 terminus of both p30s and both p12s, and alanine of both p10s. The amino-terminal sequences of p30s are identical, as are those of p10s, while the p12 sequences are clearly distinctive but also show substantial homology. The carboxyl-terminal amino acids of both viral p30s and p12s are leucine and phenylalanine, respectively. Rauscher leukemia virus p15 has tyrosine as the carboxyl terminus while AKR virus p15 has phenylalanine in this position. The compositional and sequence data provide definite chemical criteria for the identification of analogous gag gene products and for the comparison of viral proteins isolated in different laboratories. On the basis of amino acid sequences and the previously proposed H-p15-p12-p30-p10-COOH peptide sequence in the precursor polyprotein, a model for cleavage sites involved in the post-translational processing of the precursor coded for by the gag gene is proposed. PMID:206897

  11. CHIR99021 promotes self-renewal of mouse embryonic stem cells by modulation of protein-encoding gene and long intergenic non-coding RNA expression

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wu, Yongyan; Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi; Ai, Zhiying

    2013-10-15

    Embryonic stem cells (ESCs) can proliferate indefinitely in vitro and differentiate into cells of all three germ layers. These unique properties make them exceptionally valuable for drug discovery and regenerative medicine. However, the practical application of ESCs is limited because it is difficult to derive and culture ESCs. It has been demonstrated that CHIR99021 (CHIR) promotes self-renewal and enhances the derivation efficiency of mouse (m)ESCs. However, the downstream targets of CHIR are not fully understood. In this study, we identified CHIR-regulated genes in mESCs using microarray analysis. Our microarray data demonstrated that CHIR not only influenced the Wnt/β-catenin pathway by stabilizing β-catenin, but also modulated several other pluripotency-related signaling pathways such as TGF-β, Notch and MAPK signaling pathways. More detailed analysis demonstrated that CHIR inhibited Nodal signaling, while activating bone morphogenetic protein signaling in mESCs. In addition, we found that pluripotency-maintaining transcription factors were up-regulated by CHIR, while several developmental-related genes were down-regulated. Furthermore, we found that CHIR altered the expression of epigenetic regulatory genes and long intergenic non-coding RNAs. Quantitative real-time PCR results were consistent with microarray data, suggesting that CHIR alters the expression pattern of protein-encoding genes (especially transcription factors), epigenetic regulatory genes and non-coding RNAs to establish a relatively stable pluripotency-maintaining network. - Highlights: • Combined use of CHIR with LIF promotes self-renewal of J1 mESCs. • CHIR-regulated genes are involved in multiple pathways. • CHIR inhibits Nodal signaling and promotes Bmp4 expression to activate BMP signaling. • Expression of epigenetic regulatory genes and lincRNAs is altered by CHIR.

  12. PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.

    PubMed

    Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay

    2015-12-01

    A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.

  13. Improvement of genome assembly completeness and identification of novel full-length protein-coding genes by RNA-seq in the giant panda genome.

    PubMed

    Chen, Meili; Hu, Yibo; Liu, Jingxing; Wu, Qi; Zhang, Chenglin; Yu, Jun; Xiao, Jingfa; Wei, Fuwen; Wu, Jiayan

    2015-12-11

    High-quality and complete gene models are the basis of whole genome analyses. The giant panda (Ailuropoda melanoleuca) genome was the first genome sequenced on the basis of solely short reads, but the genome annotation had lacked the support of transcriptomic evidence. In this study, we applied RNA-seq to globally improve the genome assembly completeness and to detect novel expressed transcripts in 12 tissues from giant pandas, by using a transcriptome reconstruction strategy that combined reference-based and de novo methods. Several aspects of genome assembly completeness in the transcribed regions were effectively improved by the de novo assembled transcripts, including genome scaffolding, the detection of small-size assembly errors, the extension of scaffold/contig boundaries, and gap closure. Through expression and homology validation, we detected three groups of novel full-length protein-coding genes. A total of 12.62% of the novel protein-coding genes were validated by proteomic data. GO annotation analysis showed that some of the novel protein-coding genes were involved in pigmentation, anatomical structure formation and reproduction, which might be related to the development and evolution of the black-white pelage, pseudo-thumb and delayed embryonic implantation of giant pandas. The updated genome annotation will help further giant panda studies from both structural and functional perspectives.

  14. Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis

    PubMed Central

    Tellgren-Roth, Christian; Baudo, Charles D.; Kennell, John C.; Sun, Sheng; Billmyre, R. Blake; Schröder, Markus S.; Andersson, Anna; Holm, Tina; Sigurgeirsson, Benjamin; Wu, Guangxi; Sankaranarayanan, Sundar Ram; Siddharthan, Rahul; Sanyal, Kaustuv; Lundeberg, Joakim; Nystedt, Björn; Boekhout, Teun; Dawson, Thomas L.; Heitman, Joseph

    2017-01-01

    Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies. PMID:28100699
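
    The percentage gains quoted in this abstract follow directly from the three gene counts it reports. A small sketch of that arithmetic, in which the helper name `percent_increase` and the variable names are assumptions of this example:

```python
def percent_increase(before, after):
    """Relative increase from `before` to `after`, in percent."""
    return 100.0 * (after - before) / before

# Gene counts as quoted in the abstract above
transcriptomics_only = 3612   # annotation from transcriptomics evidence alone
with_proteomics = 4113        # after integrating proteomics data
after_curation = 4493         # after manual curation

if __name__ == "__main__":
    gain_proteomics = percent_increase(transcriptomics_only, with_proteomics)
    gain_curation = percent_increase(with_proteomics, after_curation)
    print(f"Proteomics integration: +{gain_proteomics:.1f}%")
    print(f"Manual curation:        +{gain_curation:.1f}%")
```

    Note that the second percentage is taken relative to the 4113 genes obtained after proteomics integration, not relative to the original 3612.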

  15. Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58

    NASA Astrophysics Data System (ADS)

    Yu, Jia-Feng; Sui, Tian-Xiang; Wang, Hong-Mei; Wang, Chun-Ling; Jing, Li; Wang, Ji-Hua

    2015-12-01

    Agrobacterium tumefaciens strain C58 is a type of pathogen that can cause tumors in some dicotyledonous plants. Ever since the genome of A. tumefaciens strain C58 was sequenced, the quality of annotation of its protein-coding genes has been queried continually, because the annotation varies greatly among different databases. In this paper, the questionable hypothetical genes were re-predicted by integrating the TN curve and Z curve methods. As a result, 30 genes originally annotated as “hypothetical” were discriminated as being non-coding sequences. By testing the re-prediction program 10 times on data sets composed of the function-known genes, the mean accuracy of 99.99% and mean Matthews correlation coefficient value of 0.9999 were obtained. Further sequence analysis and COG analysis showed that the re-annotation results were very reliable. This work can provide an efficient tool and data resources for future studies of A. tumefaciens strain C58. Project supported by the National Natural Science Foundation of China (Grant Nos. 61302186 and 61271378) and the Funding from the State Key Laboratory of Bioelectronics of Southeast University.
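
    The two evaluation measures named above, accuracy and the Matthews correlation coefficient (MCC), are standard functions of a binary confusion matrix. The following is a minimal sketch of their textbook definitions, not the authors' evaluation code; the confusion-matrix counts in the usage lines are hypothetical.

```python
import math

def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)

def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient of a binary confusion matrix.

    Ranges from -1 (total disagreement) to +1 (perfect prediction).
    Returns 0.0 when a marginal is empty and the ratio is undefined,
    which is a common convention.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom

if __name__ == "__main__":
    # Hypothetical counts: coding genes as positives, non-coding as negatives
    tp, tn, fp, fn = 45, 45, 5, 5
    print("accuracy:", accuracy(tp, tn, fp, fn))
    print("MCC:     ", mcc(tp, tn, fp, fn))
```

    Unlike accuracy, MCC stays informative when the two classes are of very different sizes, which is one reason it is often reported alongside accuracy for gene prediction.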

  16. Conserved syntenic clusters of protein coding genes are missing in birds.

    PubMed

    Lovell, Peter V; Wirthlin, Morgan; Wilhelm, Larry; Minx, Patrick; Lazar, Nathan H; Carbone, Lucia; Warren, Wesley C; Mello, Claudio V

    2014-01-01

    Birds are one of the most highly successful and diverse groups of vertebrates, having evolved a number of distinct characteristics, including feathers and wings, a sturdy lightweight skeleton and unique respiratory and urinary/excretion systems. However, the genetic basis of these traits is poorly understood. Using comparative genomics based on extensive searches of 60 avian genomes, we have found that birds lack approximately 274 protein coding genes that are present in the genomes of most vertebrate lineages and are for the most part organized in conserved syntenic clusters in non-avian sauropsids and in humans. These genes are located in regions associated with chromosomal rearrangements, and are largely present in crocodiles, suggesting that their loss occurred subsequent to the split of dinosaurs/birds from crocodilians. Many of these genes are associated with lethality in rodents, human genetic disorders, or biological functions targeting various tissues. Functional enrichment analysis combined with orthogroup analysis and paralog searches revealed enrichments that were shared by non-avian species, present only in birds, or shared between all species. Together these results provide a clearer definition of the genetic background of extant birds, extend the findings of previous studies on missing avian genes, and provide clues about molecular events that shaped avian evolution. They also have implications for fields that largely benefit from avian studies, including development, immune system, oncogenesis, and brain function and cognition. With regards to the missing genes, birds can be considered ‘natural knockouts’ that may become invaluable model organisms for several human diseases.

  17. Analysis of protein-coding genetic variation in 60,706 humans.

    PubMed

    Lek, Monkol; Karczewski, Konrad J; Minikel, Eric V; Samocha, Kaitlin E; Banks, Eric; Fennell, Timothy; O'Donnell-Luria, Anne H; Ware, James S; Hill, Andrew J; Cummings, Beryl B; Tukiainen, Taru; Birnbaum, Daniel P; Kosmicki, Jack A; Duncan, Laramie E; Estrada, Karol; Zhao, Fengmei; Zou, James; Pierce-Hoffman, Emma; Berghout, Joanne; Cooper, David N; Deflaux, Nicole; DePristo, Mark; Do, Ron; Flannick, Jason; Fromer, Menachem; Gauthier, Laura; Goldstein, Jackie; Gupta, Namrata; Howrigan, Daniel; Kiezun, Adam; Kurki, Mitja I; Moonshine, Ami Levy; Natarajan, Pradeep; Orozco, Lorena; Peloso, Gina M; Poplin, Ryan; Rivas, Manuel A; Ruano-Rubio, Valentin; Rose, Samuel A; Ruderfer, Douglas M; Shakir, Khalid; Stenson, Peter D; Stevens, Christine; Thomas, Brett P; Tiao, Grace; Tusie-Luna, Maria T; Weisburd, Ben; Won, Hong-Hee; Yu, Dongmei; Altshuler, David M; Ardissino, Diego; Boehnke, Michael; Danesh, John; Donnelly, Stacey; Elosua, Roberto; Florez, Jose C; Gabriel, Stacey B; Getz, Gad; Glatt, Stephen J; Hultman, Christina M; Kathiresan, Sekar; Laakso, Markku; McCarroll, Steven; McCarthy, Mark I; McGovern, Dermot; McPherson, Ruth; Neale, Benjamin M; Palotie, Aarno; Purcell, Shaun M; Saleheen, Danish; Scharf, Jeremiah M; Sklar, Pamela; Sullivan, Patrick F; Tuomilehto, Jaakko; Tsuang, Ming T; Watkins, Hugh C; Wilson, James G; Daly, Mark J; MacArthur, Daniel G

    2016-08-18

    Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

  18. AMP-Activated Protein Kinase Interacts with the Peroxisome Proliferator-Activated Receptor Delta to Induce Genes Affecting Fatty Acid Oxidation in Human Macrophages.

    PubMed

    Kemmerer, Marina; Finkernagel, Florian; Cavalcante, Marcela Frota; Abdalla, Dulcineia Saes Parra; Müller, Rolf; Brüne, Bernhard; Namgaladze, Dmitry

    2015-01-01

    AMP-activated protein kinase (AMPK) maintains energy homeostasis by suppressing cellular ATP-consuming processes and activating catabolic, ATP-producing pathways such as fatty acid oxidation (FAO). The transcription factor peroxisome proliferator-activated receptor δ (PPARδ) also affects fatty acid metabolism, stimulating the expression of genes involved in FAO. To question the interplay of AMPK and PPARδ in human macrophages we transduced primary human macrophages with lentiviral particles encoding for the constitutively active AMPKα1 catalytic subunit, followed by microarray expression analysis after treatment with the PPARδ agonist GW501516. Microarray analysis showed that co-activation of AMPK and PPARδ increased expression of FAO genes, which were validated by quantitative PCR. Induction of these FAO-associated genes was also observed upon infecting macrophages with an adenovirus coding for AMPKγ1 regulatory subunit carrying an activating R70Q mutation. The pharmacological AMPK activator A-769662 increased expression of several FAO genes in a PPARδ- and AMPK-dependent manner. Although GW501516 significantly increased FAO and reduced the triglyceride amount in very low density lipoproteins (VLDL)-loaded foam cells, AMPK activation failed to potentiate this effect, suggesting that increased expression of fatty acid catabolic genes alone may be not sufficient to prevent macrophage lipid overload.

  19. AMP-Activated Protein Kinase Interacts with the Peroxisome Proliferator-Activated Receptor Delta to Induce Genes Affecting Fatty Acid Oxidation in Human Macrophages

    PubMed Central

    Kemmerer, Marina; Finkernagel, Florian; Cavalcante, Marcela Frota; Abdalla, Dulcineia Saes Parra; Müller, Rolf; Brüne, Bernhard; Namgaladze, Dmitry

    2015-01-01

    AMP-activated protein kinase (AMPK) maintains energy homeostasis by suppressing cellular ATP-consuming processes and activating catabolic, ATP-producing pathways such as fatty acid oxidation (FAO). The transcription factor peroxisome proliferator-activated receptor δ (PPARδ) also affects fatty acid metabolism, stimulating the expression of genes involved in FAO. To question the interplay of AMPK and PPARδ in human macrophages we transduced primary human macrophages with lentiviral particles encoding for the constitutively active AMPKα1 catalytic subunit, followed by microarray expression analysis after treatment with the PPARδ agonist GW501516. Microarray analysis showed that co-activation of AMPK and PPARδ increased expression of FAO genes, which were validated by quantitative PCR. Induction of these FAO-associated genes was also observed upon infecting macrophages with an adenovirus coding for AMPKγ1 regulatory subunit carrying an activating R70Q mutation. The pharmacological AMPK activator A-769662 increased expression of several FAO genes in a PPARδ- and AMPK-dependent manner. Although GW501516 significantly increased FAO and reduced the triglyceride amount in very low density lipoproteins (VLDL)-loaded foam cells, AMPK activation failed to potentiate this effect, suggesting that increased expression of fatty acid catabolic genes alone may be not sufficient to prevent macrophage lipid overload. PMID:26098914

  20. Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis.

    PubMed

    Zhu, Yafeng; Engström, Pär G; Tellgren-Roth, Christian; Baudo, Charles D; Kennell, John C; Sun, Sheng; Billmyre, R Blake; Schröder, Markus S; Andersson, Anna; Holm, Tina; Sigurgeirsson, Benjamin; Wu, Guangxi; Sankaranarayanan, Sundar Ram; Siddharthan, Rahul; Sanyal, Kaustuv; Lundeberg, Joakim; Nystedt, Björn; Boekhout, Teun; Dawson, Thomas L; Heitman, Joseph; Scheynius, Annika; Lehtiö, Janne

    2017-03-17

    Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
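
The percentage increases quoted in the abstract above can be checked from the gene counts it gives. This short Python sketch only redoes that arithmetic; the variable and function names are illustrative.

```python
def percent_increase(before: int, after: int) -> float:
    """Relative increase from `before` to `after`, in percent."""
    return 100.0 * (after - before) / before


# Gene counts as stated in the abstract.
transcriptomics_only = 3612
with_proteomics = 4113
after_manual_curation = 4493

step1 = percent_increase(transcriptomics_only, with_proteomics)
step2 = percent_increase(with_proteomics, after_manual_curation)
total = percent_increase(transcriptomics_only, after_manual_curation)

print(f"adding proteomics evidence: +{step1:.1f}%")
print(f"manual curation:            +{step2:.1f}%")
print(f"overall:                    +{total:.1f}%")
```

Each step is measured against the count that precedes it, so the two rounded figures do not simply add up to the overall increase.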

  1. Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

    PubMed

    Seligmann, Hervé

    2013-03-01

    Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA
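
To make the notion of a "systematic symmetric nucleotide exchange" concrete, the following Python sketch applies an exchange rule such as C↔U, or the double rule A↔U+C↔G, to every position of an RNA string. The rule syntax (`<->` and `+`) and the function name are assumptions made for this illustration; the example sequence is arbitrary and not taken from the paper.

```python
def exchange(seq: str, rule: str) -> str:
    """Apply a symmetric exchange rule to every nucleotide of `seq`.

    `rule` is written like "C<->U" or "A<->U+C<->G" (hypothetical syntax):
    each pair is swapped in both directions, other nucleotides are kept.
    """
    mapping = {}
    for pair in rule.split("+"):
        a, b = pair.split("<->")
        mapping[a] = b
        mapping[b] = a
    return seq.translate(str.maketrans(mapping))


rna = "AUGCCUGA"  # arbitrary example sequence
for rule in ("C<->U", "A<->U", "A<->U+C<->G"):
    print(f"{rule:12s} {rna} -> {exchange(rna, rule)}")
```

Because every rule is symmetric, applying the same rule twice returns the original sequence.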

  2. Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes

    PubMed Central

    Richardson, Casey R.; Luo, Qing-Jun; Gontcharova, Viktoria; Jiang, Ying-Wen; Samanta, Manoj; Youn, Eunseog; Rock, Christopher D.

    2010-01-01

    Background MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. Principal Findings We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. Conclusions Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding

  3. An operon from Lactobacillus helveticus composed of a proline iminopeptidase gene (pepI) and two genes coding for putative members of the ABC transporter family of proteins.

    PubMed

    Varmanen, P; Rantanen, T; Palva, A

    1996-12-01

    A proline iminopeptidase gene (pepI) of an industrial Lactobacillus helveticus strain was cloned and found to be organized in an operon-like structure of three open reading frames (ORF1, ORF2 and ORF3). ORF1 was preceded by a typical prokaryotic promoter region, and a putative transcription terminator was found downstream of ORF3, identified as the pepI gene. Using primer-extension analyses, only one transcription start site, upstream of ORF1, was identifiable in the predicted operon. Although the size of mRNA could not be judged by Northern analysis either with ORF1-, ORF2- or pepI-specific probes, reverse transcription-PCR analyses further supported the operon structure of the three genes. ORF1, ORF2 and ORF3 had coding capacities for 50.7, 24.5 and 33.8 kDa proteins, respectively. The ORF3-encoded PepI protein showed 65% identity with the PepI proteins from Lactobacillus delbrueckii subsp. bulgaricus and Lactobacillus delbrueckii subsp. lactis. The ORF1-encoded protein had significant homology with several members of the ABC transporter family but, with two distinct putative ATP-binding sites, it would represent an unusual type among the bacterial ABC transporters. ORF2 encoded a putative integral membrane protein also characteristic of the ABC transporter family. The pepI gene was overexpressed in Escherichia coli. Purified PepI hydrolysed only di- and tripeptides with proline in the first position. Optimum PepI activity was observed at pH 7.5 and 40 °C. A gel filtration analysis indicated that PepI is a dimer of M(r) 53,000. PepI was shown to be a metal-independent serine peptidase having thiol groups at or near the active site. Kinetic studies with proline-p-nitroanilide as substrate revealed Km and Vmax values of 0.8 mM and 350 mmol min⁻¹ mg⁻¹, respectively, and a very high turnover number of 135,000 s⁻¹.
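
The Km and Vmax values reported above are the two parameters of the Michaelis-Menten rate law, v = Vmax·[S] / (Km + [S]). As a minimal sketch, assuming PepI follows simple Michaelis-Menten kinetics with this substrate, the rate at a few substrate concentrations can be computed as follows. The default parameter values are the ones stated in the abstract, in the units given there; the function name and the chosen concentrations are illustrative.

```python
def michaelis_menten_rate(s_mM: float, km_mM: float = 0.8, vmax: float = 350.0) -> float:
    """Initial rate v = Vmax * [S] / (Km + [S]).

    `vmax` and the returned rate share the units reported in the abstract;
    `s_mM` and `km_mM` are in mM.
    """
    return vmax * s_mM / (km_mM + s_mM)


for s in (0.1, 0.8, 8.0, 80.0):  # substrate concentrations in mM (arbitrary)
    v = michaelis_menten_rate(s)
    print(f"[S] = {s:5.1f} mM  ->  v = {v:6.1f}  ({100 * v / 350.0:4.1f}% of Vmax)")
```

At [S] = Km the rate is exactly half of Vmax, which is the defining property of Km in this model.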

  4. Partitioning of genetic variation between regulatory and coding gene segments: the predominance of software variation in genes encoding introvert proteins.

    PubMed

    Mitchison, A

    1997-01-01

    In considering genetic variation in eukaryotes, a fundamental distinction can be made between variation in regulatory (software) and coding (hardware) gene segments. For quantitative traits the bulk of variation, particularly that near the population mean, appears to reside in regulatory segments. The main exceptions to this rule concern proteins which handle extrinsic substances, here termed extrovert proteins. The immune system includes an unusually large proportion of this exceptional category, but even so its chief source of variation may well be polymorphism in regulatory gene segments. The main evidence for this view emerges from genome scanning for quantitative trait loci (QTL), which in the case of the immune system points to a major contribution of pro-inflammatory cytokine genes. Further support comes from sequencing of major histocompatibility complex (Mhc) class II promoters, where a high level of polymorphism has been detected. These Mhc promoters appear to act, in part at least, by gating the back-signal from T cells into antigen-presenting cells. Both these forms of polymorphism are likely to be sustained by the need for flexibility in the immune response. Future work on promoter polymorphism is likely to benefit from the input from genome informatics.

  5. Selfish DNA in protein-coding genes of Rickettsia.

    PubMed

    Ogata, H; Audic, S; Barbe, V; Artiguenave, F; Fournier, P E; Raoult, D; Claverie, J M

    2000-10-13

    Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.

  6. Origin and evolution of the long non-coding genes in the X-inactivation center.

    PubMed

    Romito, Antonio; Rougeulle, Claire

    2011-11-01

    Random X chromosome inactivation (XCI), the eutherian mechanism of X-linked gene dosage compensation, is controlled by a cis-acting locus termed the X-inactivation center (Xic). One of the striking features that characterize the Xic landscape is the abundance of loci transcribing non-coding RNAs (ncRNAs), including Xist, the master regulator of the inactivation process. Recent comparative genomic analyses have depicted the evolutionary scenario behind the origin of the X-inactivation center, revealing that this locus evolved from a region harboring protein-coding genes. During mammalian radiation, this ancestral protein-coding region was disrupted in the marsupial group, whilst it provided in eutherian lineage the starting material for the non-translated RNAs of the X-inactivation center. The emergence of non-coding genes occurred by a dual mechanism involving loss of protein-coding function of the pre-existing genes and integration of different classes of mobile elements, some of which modeled the structure and sequence of the non-coding genes in a species-specific manner. The rising genes started to produce transcripts that acquired function in regulating the epigenetic status of the X chromosome, as shown for Xist, its antisense Tsix, Jpx, and recently suggested for Ftx. Thus, the appearance of the Xic, which occurred after the divergence between eutherians and marsupials, was the basis for the evolution of random X inactivation as a strategy to achieve dosage compensation. Copyright © 2011. Published by Elsevier Masson SAS.

  7. PAR-CLIP data indicate that Nrd1-Nab3-dependent transcription termination regulates expression of hundreds of protein coding genes in yeast

    PubMed Central

    2014-01-01

    Background Nrd1 and Nab3 are essential sequence-specific yeast RNA binding proteins that function as a heterodimer in the processing and degradation of diverse classes of RNAs. These proteins also regulate several mRNA coding genes; however, it remains unclear exactly what percentage of the mRNA component of the transcriptome these proteins control. To address this question, we used the pyCRAC software package developed in our laboratory to analyze CRAC and PAR-CLIP data for Nrd1-Nab3-RNA interactions. Results We generated high-resolution maps of Nrd1-Nab3-RNA interactions, from which we have uncovered hundreds of new Nrd1-Nab3 mRNA targets, representing between 20 and 30% of protein-coding transcripts. Although Nrd1 and Nab3 showed a preference for binding near 5′ ends of relatively short transcripts, they bound transcripts throughout coding sequences and 3′ UTRs. Moreover, our data for Nrd1-Nab3 binding to 3′ UTRs was consistent with a role for these proteins in the termination of transcription. Our data also support a tight integration of Nrd1-Nab3 with the nutrient response pathway. Finally, we provide experimental evidence for some of our predictions, using northern blot and RT-PCR assays. Conclusions Collectively, our data support the notion that Nrd1 and Nab3 function is tightly integrated with the nutrient response and indicate a role for these proteins in the regulation of many mRNA coding genes. Further, we provide evidence to support the hypothesis that Nrd1-Nab3 represents a failsafe termination mechanism in instances of readthrough transcription. PMID:24393166

  8. Genetic relatedness among human rotavirus genes coding for VP7, a major neutralization protein, and its application to serotype identification.

    PubMed Central

    Midthun, K; Flores, J; Taniguchi, K; Urasawa, S; Kapikian, A Z; Chanock, R M

    1987-01-01

    Antigenic characterization of human rotaviruses by plaque reduction neutralization assay has revealed four distinct serotypes. The outer capsid protein VP7, coded for by gene 8 or 9, is a major neutralization protein; however, studies of rotaviruses derived from genetic reassortment between two strains have confirmed that another outer capsid protein, VP3, is in some cases equally important in neutralization. In this study, the genetic relatedness of the genes coding for VP7 of human rotaviruses belonging to serotypes 1 through 4 was examined by hybridization of their denatured double-stranded genomic RNAs to labeled single-stranded mRNA probes derived from human-animal rotavirus reassortants containing only the VP7 gene of their human rotavirus parent. A high degree of homology was demonstrated between the VP7 genes of strain D and other serotype 1 human rotaviruses, strain DS-1 and other serotype 2 human rotaviruses, strain P and other serotype 3 human rotaviruses, and strain ST3 and other serotype 4 human rotaviruses. Hybrid bands could not be demonstrated between the VP7 gene of D, DS-1, P, or ST3 and the corresponding gene of human rotaviruses belonging to a different serotype. RNA specimens extracted from the stools of 15 Venezuelan children hospitalized with rotavirus diarrhea were hybridized to each of the reassortant probes representing the four human serotypes. All five viruses with short RNA patterns showed homology with the DS-1 strain VP7 gene; two of these were previously adapted to tissue culture and shown to be serotype 2 strains by tissue culture neutralization. Of the remaining 10 viruses with long RNA patterns, 2 hybridized only to the D strain VP7 gene, 6 hybridized only to the P strain VP7 gene, and 2 hybridized only to the ST3 strain VP7 gene. Hybridization using single human rotavirus gene substitution reassortants as probes may provide an alternative method for identifying the VP7 serotype of field isolates that would circumvent the need for

  9. Gene and genon concept: coding versus regulation

    PubMed Central

    2007-01-01

    pieces, as steered by the genon. It emerges finally as an uninterrupted nucleic acid sequence at mRNA level just prior to translation, in faithful correspondence with the amino acid sequence to be produced as a polypeptide. After translation, the genon has fulfilled its role and expires. The distinction between the protein coding information as materialised in the final polypeptide and the processing information represented by the genon allows us to set up a new information theoretic scheme. The standard sequence information determined by the genetic code expresses the relation between coding sequence and product. Backward analysis asks from which coding region in the DNA a given polypeptide originates. The (more interesting) forward analysis asks in how many polypeptides of how many different types a given DNA segment is expressed. This concerns the control of the expression process for which we have introduced the genon concept. Thus, the information theoretic analysis can capture the complementary aspects of coding and regulation, of gene and genon. PMID:18087760

  10. The Arabidopsis TOR Kinase Specifically Regulates the Expression of Nuclear Genes Coding for Plastidic Ribosomal Proteins and the Phosphorylation of the Cytosolic Ribosomal Protein S6

    PubMed Central

    Dobrenel, Thomas; Mancera-Martínez, Eder; Forzani, Céline; Azzopardi, Marianne; Davanture, Marlène; Moreau, Manon; Schepetilnikov, Mikhail; Chicher, Johana; Langella, Olivier; Zivy, Michel; Robaglia, Christophe; Ryabova, Lyubov A.; Hanson, Johannes; Meyer, Christian

    2016-01-01

    Protein translation is an energy consuming process that has to be fine-tuned at both the cell and organism levels to match the availability of resources. The target of rapamycin kinase (TOR) is a key regulator of a large range of biological processes in response to environmental cues. In this study, we have investigated the effects of TOR inactivation on the expression and regulation of Arabidopsis ribosomal proteins at different levels of analysis, namely from transcriptomic to phosphoproteomic. TOR inactivation resulted in a coordinated down-regulation of the transcription and translation of nuclear-encoded mRNAs coding for plastidic ribosomal proteins, which could explain the chlorotic phenotype of the TOR silenced plants. We have identified in the 5′ untranslated regions (UTRs) of this set of genes a conserved sequence related to the 5′ terminal oligopyrimidine motif, which is known to confer translational regulation by the TOR kinase in other eukaryotes. Furthermore, the phosphoproteomic analysis of the ribosomal fraction following TOR inactivation revealed a lower phosphorylation of the conserved Ser240 residue in the C-terminal region of the 40S ribosomal protein S6 (RPS6). These results were confirmed by Western blot analysis using an antibody that specifically recognizes phosphorylated Ser240 in RPS6. Finally, this antibody was used to follow TOR activity in plants. Our results thus uncover a multi-level regulation of plant ribosomal genes and proteins by the TOR kinase. PMID:27877176

  11. The gene coding for the B cell surface protein CD19 is localized on human chromosome 16p11.

    PubMed

    Stapleton, P; Kozmik, Z; Weith, A; Busslinger, M

    1995-02-01

    The CD19 gene codes for one of the earliest markers of the human B cell lineage and is a target for the B lymphoid-specific transcription factor BSAP (Pax-5). The transmembrane protein CD19 has been implicated in controlling proliferation of mature B lymphocytes by modulating signal transduction through the antigen receptor. In this study, we have employed Southern blot and fluorescence in situ hybridization analyses to localize the CD19 gene to human chromosome 16p11.

  12. Td4IN2: A drought-responsive durum wheat (Triticum durum Desf.) gene coding for a resistance like protein with serine/threonine protein kinase, nucleotide binding site and leucine rich domains.

    PubMed

    Rampino, Patrizia; De Pascali, Mariarosaria; De Caroli, Monica; Luvisi, Andrea; De Bellis, Luigi; Piro, Gabriella; Perrotta, Carla

    2017-11-01

    Wheat, the main food source for a third of world population, appears strongly under threat because of predicted increasing temperatures coupled to drought. Plant complex molecular response to drought stress relies on the gene network controlling cell reactions to abiotic stress. In the natural environment, plants are subjected to the combination of abiotic and biotic stresses. Also the response of plants to biotic stress, to cope with pathogens, involves the activation of a molecular network. Investigations on combination of abiotic and biotic stresses indicate the existence of cross-talk between the two networks and a kind of overlapping can be hypothesized. In this work we describe the isolation and characterization of a drought-related durum wheat (Triticum durum Desf.) gene, identified in a previous study, coding for a protein combining features of NBS-LRR type resistance protein with a S/TPK domain, involved in drought stress response. This is one of the few examples reported where all three domains are present in a single protein and, to our knowledge, it is the first report on a gene specifically induced by drought stress and drought-related conditions, with this particular structure. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  13. Abundant RNA editing sites of chloroplast protein-coding genes in Ginkgo biloba and an evolutionary pattern analysis.

    PubMed

    He, Peng; Huang, Sheng; Xiao, Guanghui; Zhang, Yuzhou; Yu, Jianing

    2016-12-01

    RNA editing is a posttranscriptional modification process that alters the RNA sequence so that it deviates from the genomic DNA sequence. RNA editing mainly occurs in chloroplasts and mitochondrial genomes, and the number of editing sites varies in terrestrial plants. Why and how RNA editing systems evolved remains a mystery. Ginkgo biloba is one of the oldest seed plants and has an important evolutionary position. Determining the patterns and distribution of RNA editing in the ancient plant provides insights into the evolutionary trend of RNA editing, and helps us to further understand their biological significance. In this paper, we investigated 82 protein-coding genes in the chloroplast genome of G. biloba and identified 255 editing sites, which is the highest number of RNA editing events reported in a gymnosperm. All of the editing sites were C-to-U conversions, which mainly occurred in the second codon position, biased towards the U_A context, and caused an increase in hydrophobic amino acids. RNA editing could change the secondary structures of 82 proteins, and create or eliminate a transmembrane region in five proteins as determined in silico. Finally, the evolutionary tendencies of RNA editing in different gene groups were estimated using the nonsynonymous-synonymous substitution rate selection mode. The G. biloba chloroplast genome possesses the highest number of RNA editing events reported so far in a seed plant. Most of the RNA editing sites can restore amino acid conservation, increase hydrophobicity, and even influence protein structures. Similar purifying selections constitute the dominant evolutionary force at the editing sites of essential genes, such as the psa, some psb and pet groups, and a positive selection occurred in the editing sites of nonessential genes, such as most ndh and a few psb genes.
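
The abstract above notes that C-to-U edits at the second codon position tend to increase hydrophobic amino acids. The following Python sketch illustrates why, using the standard genetic code for codon assignments. The example codons, the function name, and the set of residues treated as hydrophobic are assumptions chosen for illustration, not data from the paper.

```python
from itertools import product

# Standard genetic code, codons enumerated in U, C, A, G order.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One common grouping of hydrophobic residues (assumption for this sketch).
HYDROPHOBIC = set("AVLIMFW")


def edit_c_to_u(codon: str, pos: int) -> tuple[str, str]:
    """Return (amino acid before, amino acid after) a C-to-U edit at `pos` (0-based)."""
    if codon[pos] != "C":
        raise ValueError(f"position {pos} of {codon} is not C")
    edited = codon[:pos] + "U" + codon[pos + 1:]
    return CODON_TABLE[codon], CODON_TABLE[edited]


# Second-position edits; "UCA" is an instance of the U_A context.
for codon in ("UCA", "CCU", "ACG", "GCU"):
    before, after = edit_c_to_u(codon, 1)
    gained = after in HYDROPHOBIC and before not in HYDROPHOBIC
    print(f"{codon} -> {codon[0]}U{codon[2]}: {before} -> {after}  gains hydrophobic residue: {gained}")
```

A second-position C-to-U edit moves a codon from the xCx column of the code table (Ser, Pro, Thr, Ala) to the xUx column (Phe, Leu, Ile, Met, Val), which consists entirely of hydrophobic residues.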

  14. Kinetic models of gene expression including non-coding RNAs

    NASA Astrophysics Data System (ADS)

    Zhdanov, Vladimir P.

    2011-03-01

    In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.
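
    A feedback-free mean-field model of the kind mentioned in the abstract above can be sketched as follows. The equations and all rate constants here are illustrative assumptions, not taken from the review: mRNA (m) and ncRNA (n) are synthesized at constant rates, degrade linearly, and are removed together by pairing at rate r*m*n, while protein (p) is translated from free mRNA.

```python
def simulate(k_m=1.0, k_n=0.0, k_p=2.0, g_m=0.1, g_n=0.1, g_p=0.05, r=0.0,
             t_end=500.0, dt=0.01):
    """Integrate the assumed mean-field equations with explicit Euler steps.

    dm/dt = k_m - g_m*m - r*m*n   (mRNA)
    dn/dt = k_n - g_n*n - r*m*n   (ncRNA)
    dp/dt = k_p*m - g_p*p         (protein)
    All parameter names and values are hypothetical.
    """
    m = n = p = 0.0
    for _ in range(int(t_end / dt)):
        pairing = r * m * n
        dm = k_m - g_m * m - pairing
        dn = k_n - g_n * n - pairing
        dp = k_p * m - g_p * p
        m += dt * dm
        n += dt * dn
        p += dt * dp
    return m, n, p


# Without ncRNA the steady state is m = k_m/g_m and p = k_p*m/g_p.
print(simulate())
# With ncRNA pairing, free mRNA and protein levels are lower.
print(simulate(k_n=1.0, r=1.0))
```

    Explicit Euler integration is used only to keep the sketch dependency-free; a stiff or feedback-containing model would normally call for a proper ODE solver.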

  15. Evolution of coding and non-coding genes in HOX clusters of a marsupial.

    PubMed

    Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

    2012-06-18

    The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.

  16. Evolution of coding and non-coding genes in HOX clusters of a marsupial

    PubMed Central

    2012-01-01

    Background: The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results: Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions: This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672

  17. Identification of a G protein coupled receptor induced in activated T cells.

    PubMed

    Kaplan, M H; Smith, D I; Sundick, R S

    1993-07-15

    Many genes are induced after T cell activation to make a cell competent for proliferation and ultimately, function. Many of these genes encode surface receptors for growth factors that signal a cell to proliferate. We have cloned a novel gene (clone 6H1) that codes for a member of the G protein-coupled receptor superfamily. This gene was isolated from a chicken activated T cell cDNA library by low level hybridization to mammalian IL-2 cDNA probes. The 308 amino acid open reading frame has seven hydrophobic, presumably transmembrane domains and a consensus site for interaction with G proteins. Tissue distribution studies suggest that gene expression is restricted to activated T cells. The message appears by 1 h after activation and is maintained for at least 45 h. Transcription of 6H1 is induced by a number of T cell stimuli and is inhibited by cyclosporin A, but not by cycloheximide. This is the first description of a member of this superfamily expressed specifically in activated T cells. The gene product may provide a link between T cell growth factors and G protein activation.

  18. Next-Generation Sequencing of Protein-Coding and Long Non-protein-Coding RNAs in Two Types of Exosomes Derived from Human Whole Saliva.

    PubMed

    Ogawa, Yuko; Tsujimoto, Masafumi; Yanoshita, Ryohei

    2016-01-01

    Exosomes are small extracellular vesicles containing microRNAs and mRNAs that are produced by various types of cells. We previously used ultrafiltration and size-exclusion chromatography to isolate two types of human salivary exosomes (exosomes I, II) that are different in size and proteomes. We showed that salivary exosomes contain large repertoires of small RNAs. However, precise information regarding long RNAs in salivary exosomes has not been fully determined. In this study, we investigated the compositions of protein-coding RNAs (pcRNAs) and long non-protein-coding RNAs (lncRNAs) of exosome I, exosome II and whole saliva (WS) by next-generation sequencing technology. Although 11% of all RNAs were commonly detected among the three samples, the compositions of reads mapping to known RNAs were similar. The most abundant pcRNA is ribosomal RNA protein, and pcRNAs of some salivary proteins such as S100 calcium-binding protein A8 (protein S100-A8) were present in salivary exosomes. Interestingly, lncRNAs of pseudogenes (presumably, processed pseudogenes) were abundant in exosome I, exosome II and WS. Translationally controlled tumor protein gene, which plays an important role in cell proliferation, cell death and immune responses, was highly expressed as pcRNA and pseudogenes in salivary exosomes. Our results show that salivary exosomes contain various types of RNAs such as pseudogenes and small RNAs, and may mediate intercellular communication by transferring these RNAs to target cells as gene expression regulators.

  19. A second gene for acyl-(acyl-carrier-protein): glycerol-3-phosphate acyltransferase in squash, Cucurbita moschata cv. Shirogikuza(*), codes for an oleate-selective isozyme: molecular cloning and protein purification studies.

    PubMed

    Nishida, I; Sugiura, M; Enju, A; Nakamura, M

    2000-12-01

    A new isogene for acyl-(acyl-carrier-protein):glycerol-3-phosphate acyltransferase (GPAT; EC 2.3.1.15) in squash has been cloned and the gene product was identified as oleate-selective GPAT. Using PCR primers that could hybridise with exons for a previously cloned squash GPAT, we obtained two PCR products of different size: one coded for a previously cloned squash GPAT corresponding to non-selective isoforms AT2 and AT3, and the other for a new isozyme, probably the oleate-selective isoform AT1. Full-length amino acid sequences of respective isozymes were deduced from the nucleotide sequences of genomic genes and cDNAs, which were cloned by a series of PCR-based methods. Thus, we designated the new gene CmATS1;1 and the other one CmATS1;2. Genome blot analysis revealed that the squash genome contained the two isogenes at non-allelic loci. AT1-active fractions were partially purified, and three polypeptide bands were identified as being AT1 polypeptides, which exhibited relative molecular masses of 39.5-40.5 kDa, pI values of 6.75-7.15, and oleate selectivity over palmitate. Partial amino-terminal sequences obtained from two of these bands verified that the new isogene codes for AT1 polypeptides.

  20. Phylogeny of Anophelinae using mitochondrial protein coding genes

    PubMed Central

    de Oliveira, Tatiane Marques Porangaba; Bergo, Eduardo S.; Conn, Jan E.; Sant’Ana, Denise Cristina; Nagaki, Sandra Sayuri; Nihei, Silvio; Lamas, Carlos Einicker; González, Christian; Moreira, Caio Cesar; Sallum, Maria Anice Mureb

    2017-01-01

    Malaria is a vector-borne disease that is a great burden on the poorest and most marginalized communities of the tropical and subtropical world. Approximately 41 species of Anopheline mosquitoes can effectively spread species of Plasmodium parasites that cause human malaria. Proposing a natural classification for the subfamily Anophelinae has been a continuous effort, addressed using both morphology and DNA sequence data. The monophyly of the genus Anopheles, and phylogenetic placement of the genus Bironella, subgenera Kerteszia, Lophopodomyia and Stethomyia within the subfamily Anophelinae, remain in question. To understand the classification of Anophelinae, we inferred the phylogeny of all three genera (Anopheles, Bironella, Chagasia) and major subgenera by analysing the amino acid sequences of the 13 protein coding genes of 150 newly sequenced mitochondrial genomes of Anophelinae and 18 newly sequenced Culex species as outgroup taxa, supplemented with 23 mitogenomes from GenBank. Our analyses generally place genus Bironella within the genus Anopheles, which implies that the latter as it is currently defined is not monophyletic. With some inconsistencies, Bironella was placed within the major clade that includes Anopheles, Cellia, Kerteszia, Lophopodomyia, Nyssorhynchus and Stethomyia, which were found to be monophyletic groups within Anophelinae. Our findings provided robust evidence for elevating the monophyletic groupings Kerteszia, Lophopodomyia, Nyssorhynchus and Stethomyia to genus level; genus Anopheles to include subgenera Anopheles, Baimaia, Cellia and Christya; Anopheles parvus to be placed into a new genus; Nyssorhynchus to be elevated to genus level; the genus Nyssorhynchus to include subgenera Myzorhynchella and Nyssorhynchus; Anopheles atacamensis and Anopheles pictipennis to be transferred from subgenus Nyssorhynchus to subgenus Myzorhynchella; and subgenus Nyssorhynchus to encompass the remaining species of Argyritarsis and Albimanus Sections

  1. A global analysis of protein expression profiles in Sinorhizobium meliloti: discovery of new genes for nodule occupancy and stress adaptation.

    PubMed

    Djordjevic, Michael A; Chen, Han Cai; Natera, Siria; Van Noorden, Giel; Menzel, Christian; Taylor, Scott; Renard, Clotilde; Geiger, Otto; Weiller, Georg F

    2003-06-01

    A proteomic examination of Sinorhizobium meliloti strain 1021 was undertaken using a combination of 2-D gel electrophoresis, peptide mass fingerprinting, and bioinformatics. Our goal was to identify (i) putative symbiosis- or nutrient-stress-specific proteins, (ii) the biochemical pathways active under different conditions, (iii) potential new genes, and (iv) the extent of posttranslational modifications of S. meliloti proteins. In total, we identified the protein products of 810 genes (13.1% of the genome's coding capacity). The 810 genes generated 1,180 gene products, with chromosomal genes accounting for 78% of the gene products identified (18.8% of the chromosome's coding capacity). The activity of 53 metabolic pathways was inferred from bioinformatic analysis of proteins with assigned Enzyme Commission numbers. Of the remaining proteins that did not encode enzymes, ABC-type transporters composed 12.7% and regulatory proteins 3.4% of the total. Proteins with up to seven transmembrane domains were identified in membrane preparations. A total of 27 putative nodule-specific proteins and 35 nutrient-stress-specific proteins were identified and used as a basis to define genes and describe processes occurring in S. meliloti cells in nodules and under stress. Several nodule proteins from the plant host were present in the nodule bacteria preparations. We also identified seven potentially novel proteins not predicted from the DNA sequence. Post-translational modifications such as N-terminal processing could be inferred from the data. The posttranslational addition of UMP to the key regulator of nitrogen metabolism, PII, was demonstrated. This work demonstrates the utility of combining mass spectrometry with protein arraying or separation techniques to identify candidate genes involved in important biological processes and niche occupations that may be intransigent to other methods of gene expression profiling.

  2. Intron-exon organization of the active human protein S gene PSα and its pseudogene PSβ: Duplication and silencing during primate evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ploos van Amstel, H.; Reitsma, P.H.; van der Logt, C.P.

    The human protein S locus on chromosome 3 consists of two protein S genes, PSα and PSβ. Here the authors report the cloning and characterization of both genes. Fifteen exons of the PSα gene were identified that together code for protein S mRNA as derived from the reported protein S cDNAs. Analysis by primer extension of liver protein S mRNA, however, reveals the presence of two mRNA forms that differ in the length of their 5′-noncoding region. Both transcripts contain a 5′-noncoding region longer than found in the protein S cDNAs. The two products may arise from alternative splicing of an additional intron in this region or from the usage of two start sites for transcription. The intron-exon organization of the PSα gene fully supports the hypothesis that the protein S gene is the product of an evolutional assembling process in which gene modules coding for structural/functional protein units also found in other coagulation proteins have been put upstream of the ancestral gene of a steroid hormone binding protein. The PSβ gene is identified as a pseudogene. It contains a large variety of detrimental aberrations, viz., the absence of exon I, a splice site mutation, three stop codons, and a frame shift mutation. Overall the two genes PSα and PSβ show between their exonic sequences 96.5% homology. Southern analysis of primate DNA showed that the duplication of the ancestral protein S gene has occurred after the branching of the orangutan from the African apes. A nonsense mutation that is present in the pseudogene of man also could be identified in one of the two protein S genes of both chimpanzee and gorilla. This implicates that silencing of one of the two protein S genes must have taken place before the divergence of the three African apes.

  3. A gene family for acidic ribosomal proteins in Schizosaccharomyces pombe: two essential and two nonessential genes.

    PubMed Central

    Beltrame, M; Bianchi, M E

    1990-01-01

    We have cloned the genes for small acidic ribosomal proteins (A-proteins) of the fission yeast Schizosaccharomyces pombe. S. pombe contains four transcribed genes for small A-proteins per haploid genome, as is the case for Saccharomyces cerevisiae. In contrast, multicellular eucaryotes contain two transcribed genes per haploid genome. The four proteins of S. pombe, besides sharing a high overall similarity, form two couples of nearly identical sequences. Their corresponding genes have a very conserved structure and are transcribed to a similar level. Surprisingly, of each couple of genes coding for nearly identical proteins, one is essential for cell growth, whereas the other is not. We suggest that the unequal importance of the four small A-proteins for cell survival is related to their physical organization in 60S ribosomal subunits. PMID:2325655

  4. A Catalogue of Putative cis-Regulatory Interactions Between Long Non-coding RNAs and Proximal Coding Genes Based on Correlative Analysis Across Diverse Human Tumors.

    PubMed

    Basu, Swaraj; Larsson, Erik

    2018-05-31

    Antisense transcripts and other long non-coding RNAs are pervasive in mammalian cells, and some of these molecules have been proposed to regulate proximal protein-coding genes in cis. For example, non-coding transcription can contribute to inactivation of tumor suppressor genes in cancer, and antisense transcripts have been implicated in the epigenetic inactivation of imprinted genes. However, our knowledge is still limited and more such regulatory interactions likely await discovery. Here, we make use of available gene expression data from a large compendium of human tumors to generate hypotheses regarding non-coding-to-coding cis-regulatory relationships with emphasis on negative associations, as these are less likely to arise for reasons other than cis-regulation. We document a large number of possible regulatory interactions, including 193 coding/non-coding pairs that show expression patterns compatible with negative cis-regulation. Importantly, by this approach we capture several known cases, and many of the involved coding genes have known roles in cancer. Our study provides a large catalog of putative non-coding/coding cis-regulatory pairs that may serve as a basis for further experimental validation and characterization. Copyright © 2018 Basu and Larsson.

  5. How to calculate the non-synonymous to synonymous rate ratio of protein-coding genes under the Fisher-Wright mutation-selection framework.

    PubMed

    Dos Reis, Mario

    2015-04-01

    First principles of population genetics are used to obtain formulae relating the non-synonymous to synonymous substitution rate ratio to the selection coefficients acting at codon sites in protein-coding genes. Two theoretical cases are discussed and two examples from real data (a chloroplast gene and a virus polymerase) are given. The formulae give much insight into the dynamics of non-synonymous substitutions and may inform the development of methods to detect adaptive evolution. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
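
    The link between selection coefficients and the rate ratio can be illustrated with the standard Fisher-Wright result that a mutation with scaled selection coefficient S fixes at a rate S / (1 - exp(-S)) relative to a neutral mutation. The sketch below is an assumption-laden illustration, not the paper's formulae: it averages this relative rate over a list of nonsynonymous mutations with equal weights, whereas a full treatment would weight by mutation rates and equilibrium codon frequencies. The function names are hypothetical.

```python
import math


def relative_fixation_rate(S):
    """Fixation rate relative to a neutral mutation for scaled selection coefficient S."""
    if abs(S) < 1e-12:
        return 1.0          # neutral limit of S / (1 - exp(-S))
    if S < -700:
        return 0.0          # avoid overflow; strongly deleterious mutations do not fix
    return S / -math.expm1(-S)


def omega_equal_weights(selection_coefficients):
    """Crude rate ratio: mean relative fixation rate, all mutations weighted equally."""
    rates = [relative_fixation_rate(S) for S in selection_coefficients]
    return sum(rates) / len(rates)


# Mostly deleterious nonsynonymous mutations give a ratio below 1.
print(omega_equal_weights([-5.0, -2.0, 0.0, -10.0]))
```

    Under these assumptions, a ratio above 1 requires a sufficient share of mutations with positive S, which is why the ratio is commonly used as a signal of adaptive evolution.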

  6. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

    PubMed

    Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

    2017-12-03

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence.
These results, taken together, represent the first
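
    The frame comparison described in this abstract can be sketched as a simple count of code trinucleotides in each of the three frames of a gene. The function name count_by_frame is hypothetical and the one-element code used in the demonstration is a toy stand-in, not the 20-trinucleotide set X; the sketch also counts single trinucleotides rather than the longer X motifs analysed in the paper.

```python
def count_by_frame(seq, code):
    """Count trinucleotides belonging to `code` in frames 0, 1 and 2 of `seq`.

    Frame 0 is the reading frame when `seq` starts at the first codon position.
    """
    seq = seq.upper()
    counts = []
    for frame in range(3):
        hits = sum(
            1
            for i in range(frame, len(seq) - 2, 3)
            if seq[i:i + 3] in code
        )
        counts.append(hits)
    return counts


# Toy example: the single-trinucleotide code {"AAC"} occurs only in frame 0 here.
print(count_by_frame("AACAACAAC", {"AAC"}))
```

    Passing the real code and a set of randomly generated codes to the same function would give the raw counts that a comparison of X motifs against R motifs starts from.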

  7. Structural architecture of the human long non-coding RNA, steroid receptor RNA activator

    PubMed Central

    Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.

    2012-01-01

    While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738

  8. Gene-Auto: Automatic Software Code Generation for Real-Time Embedded Systems

    NASA Astrophysics Data System (ADS)

    Rugina, A.-E.; Thomas, D.; Olive, X.; Veran, G.

    2008-08-01

    This paper gives an overview of the Gene-Auto ITEA European project, which aims at building a qualified C code generator from mathematical models under Matlab-Simulink and Scilab-Scicos. The project is driven by major European industry partners, active in the real-time embedded systems domains. The Gene-Auto code generator will significantly improve the current development processes in such domains by shortening the time to market and by guaranteeing the quality of the generated code through the use of formal methods. The first version of the Gene-Auto code generator has already been released and has gone through a validation phase on real-life case studies defined by each project partner. The validation results are taken into account in the implementation of the second version of the code generator. The partners aim at introducing the Gene-Auto results into industrial development by 2010.

  9. Promoter activity of polypyrimidine tract-binding protein genes of potato responds to environmental cues.

    PubMed

    Butler, Nathaniel M; Hannapel, David J

    2012-12-01

    Polypyrimidine tract-binding (PTB) proteins are RNA-binding proteins that target specific RNAs for post-transcriptional processing by binding cytosine/uracil motifs. PTBs have established functions in a range of RNA processes including splicing, translation, stability and long-distance transport. Six PTB-like genes identified in potato have been grouped into two clades based on homology to other known plant PTBs. StPTB1 and StPTB6 are closely related to a PTB protein discovered in pumpkin, designated CmRBP50, and contain four canonical RNA-recognition motifs. CmRBP50 is expressed in phloem tissues and functions as the core protein of a phloem-mobile RNA/protein complex. Sequence from the potato genome database was used to clone the upstream sequence of these two PTB genes and analyzed to identify conserved cis-elements. The promoter of StPTB6 was enriched for regulatory elements for light and sucrose induction and defense. Upstream sequence of both PTB genes was fused to β-glucuronidase and monitored in transgenic potato lines. In whole plants, the StPTB1 promoter was most active in leaf veins and petioles, whereas StPTB6 was most active in leaf mesophyll. Both genes are active in new tubers and tuber sprouts. StPTB6 expression was induced in stems and stolon sections in response to sucrose and in leaves or petioles in response to light, heat, drought and mechanical wounding. These results show that CmRBP50-like genes of potato exhibit distinct expression patterns and respond to both developmental and environmental cues.

  10. Tenebrio molitor antifreeze protein gene identification and regulation.

    PubMed

    Qin, Wensheng; Walker, Virginia K

    2006-02-15

    The yellow mealworm, Tenebrio molitor, is a freeze susceptible, stored product pest. Its winter survival is facilitated by the accumulation of antifreeze proteins (AFPs), encoded by a small gene family. We have now isolated 11 different AFP genomic clones from 3 genomic libraries. All the clones had a single coding sequence, with no evidence of intervening sequences. Three genomic clones were further characterized. All have putative TATA box sequences upstream of the coding regions and multiple potential poly(A) signal sequences downstream of the coding regions. A TmAFP regulatory region, B1037, conferred transcriptional activity when ligated to a luciferase reporter sequence and after transfection into an insect cell line. A 143 bp core promoter including a TATA box sequence was identified. Its promoter activity was increased 4.4 times by inserting an exotic 245 bp intron into the construct, similar to the enhancement of transgenic expression seen in several other systems. The addition of a duplication of the first 120 bp sequence from the 143 bp core promoter decreased promoter activity by half. Although putative hormonal response sequences were identified, none of the five hormones tested enhanced reporter activity. These studies on the mechanisms of AFP transcriptional control are important for the consideration of any transfer of freeze-resistance phenotypes to beneficial hosts.

  11. Avoidance of truncated proteins from unintended ribosome binding sites within heterologous protein coding sequences.

    PubMed

    Whitaker, Weston R; Lee, Hanson; Arkin, Adam P; Dueber, John E

    2015-03-20

    Genetic sequences ported into non-native hosts for synthetic biology applications can gain unexpected properties. In this study, we explored sequences functioning as ribosome binding sites (RBSs) within protein coding DNA sequences (CDSs) that cause internal translation, resulting in truncated proteins. Genome-wide prediction of bacterial RBSs, based on biophysical calculations employed by the RBS calculator, suggests a selection against internal RBSs within CDSs in Escherichia coli, but not those in Saccharomyces cerevisiae. Based on these calculations, silent mutations aimed at removing internal RBSs can effectively reduce truncation products from internal translation. However, a solution for complete elimination of internal translation initiation is not always feasible due to constraints of available coding sequences. Fluorescence assays and Western blot analysis showed that in genes with internal RBSs, increasing the strength of the intended upstream RBS had little influence on the internal translation strength. Another strategy to minimize truncated products from an internal RBS is to increase the relative strength of the upstream RBS with a concomitant reduction in promoter strength to achieve the same protein expression level. Unfortunately, lower transcription levels result in increased noise at the single cell level due to stochasticity in gene expression. At the low expression regimes desired for many synthetic biology applications, this problem becomes particularly pronounced. We found that balancing promoter strengths and upstream RBS strengths to intermediate levels can achieve the target protein concentration while avoiding both excessive noise and truncated protein.
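
    The idea of screening a coding sequence for internal ribosome binding sites can be sketched with a crude motif heuristic. This is not the biophysical model of the RBS calculator used in the study: the sketch simply assumes that a Shine-Dalgarno-like hexamer (AGGAGG with at most one mismatch) lying 5 to 10 nucleotides upstream of an in-frame ATG marks a candidate internal start. The consensus, the spacer window and the function names are all illustrative assumptions.

```python
SD_CONSENSUS = "AGGAGG"  # assumed Shine-Dalgarno-like consensus


def mismatches(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def internal_rbs_candidates(cds, max_mismatch=1, spacer=range(5, 11)):
    """Return (sd_start, atg_start) pairs for candidate internal start sites.

    Only in-frame ATG codons after the annotated start codon are considered.
    """
    cds = cds.upper()
    hits = []
    for atg in range(3, len(cds) - 2, 3):
        if cds[atg:atg + 3] != "ATG":
            continue
        for gap in spacer:
            sd_start = atg - gap - len(SD_CONSENSUS)
            if sd_start < 0:
                continue
            window = cds[sd_start:sd_start + len(SD_CONSENSUS)]
            if mismatches(window, SD_CONSENSUS) <= max_mismatch:
                hits.append((sd_start, atg))
                break
    return hits


# Toy CDS with an AGGAGG motif six nucleotides upstream of an in-frame ATG.
toy_cds = "ATG" + "GCT" + "AGGAGG" + "AAAAAA" + "ATG" + "GCTTAA"
print(internal_rbs_candidates(toy_cds))
```

    Candidates flagged this way would be the positions where a silent mutation could be tried to weaken the motif without changing the encoded protein, subject to the codon constraints the abstract mentions.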

  12. [Physical mapping of the genes px and cld coding peroxidase and cold-regulated protein in maize (Zea mays L.)].

    PubMed

    Ning, S B; Wang, L; Song, Y C

    2000-01-01

    Peroxidase plays a key role in plant disease resistance, cold stress and some developmental processes, and cold-regulated protein is required for the response of plants to cold or heat stress. Recent studies have shown that these processes in plant cells are associated with programmed cell death (PCD). Using a biotin-labelled in situ hybridization (ISH) technique, we physically mapped the genes px and cld, coding for peroxidase and cold-regulated protein respectively, onto maize chromosomes. Both DAB and fluorescence detection systems gave identical results: the probe uaz235, corresponding to gene px, was localized onto the long arms of chromosomes 2 (2L) and 7 (7L), and csu19, corresponding to gene cld, hybridized onto 4L and 5L. The percentage distances (from the hybridization sites to centromeres) of uaz235 in 2L and 7L were 45.4 +/- 1.3 and 67.4 +/- 3.7 respectively, and those of csu19 in 4L and 5L were 68.6 +/- 2.6 and 58.2 +/- 1.6 respectively. The physical positions of px in 2L and cld in 4L coincide with their positions on the genetic map. The results also show that both of these genes have duplicated sites in the maize genome.

  13. Forty-four novel protein-coding loci discovered using a proteomics informed by transcriptomics (PIT) approach in rat male germ cells.

    PubMed

    Chocu, Sophie; Evrard, Bertrand; Lavigne, Régis; Rolland, Antoine D; Aubry, Florence; Jégou, Bernard; Chalmel, Frédéric; Pineau, Charles

    2014-11-01

    Spermatogenesis is a complex process, dependent upon the successive activation and/or repression of thousands of gene products, and ends with the production of haploid male gametes. RNA sequencing of male germ cells in the rat identified thousands of novel testicular unannotated transcripts (TUTs). Although such RNAs are usually annotated as long noncoding RNAs (lncRNAs), it is possible that some of these TUTs code for protein. To test this possibility, we used a "proteomics informed by transcriptomics" (PIT) strategy combining RNA sequencing data with shotgun proteomics analyses of spermatocytes and spermatids in the rat. Among 3559 TUTs and 506 lncRNAs found in meiotic and postmeiotic germ cells, 44 encoded at least one peptide. We showed that these novel high-confidence protein-coding loci exhibit several genomic features intermediate between those of lncRNAs and mRNAs. We experimentally validated the testicular expression pattern of two of these novel protein-coding gene candidates, both highly conserved in mammals: one for a vesicle-associated membrane protein we named VAMP-9, and the other for an enolase domain-containing protein. This study confirms the potential of PIT approaches for the discovery of protein-coding transcripts initially thought to be untranslated or unknown transcripts. Our results contribute to the understanding of spermatogenesis by characterizing two novel proteins, implicated by their strong expression in germ cells. The mass spectrometry proteomics data have been deposited with the ProteomeXchange Consortium under the data set identifier PXD000872. © 2014 by the Society for the Study of Reproduction, Inc.

  14. Activating human genes with zinc finger proteins, transcription activator-like effectors and CRISPR/Cas9 for gene therapy and regenerative medicine.

    PubMed

    Gersbach, Charles A; Perez-Pinera, Pablo

    2014-08-01

    New technologies have recently been developed to control the expression of human genes in their native genomic context by engineering synthetic transcription factors that can be targeted to any DNA sequence. The ability to precisely regulate any gene as it occurs naturally in the genome provides a means to address a variety of diseases and disorders. This approach also circumvents some of the traditional challenges of gene therapy. In this editorial, we review the technologies that have enabled targeted human gene activation, including the engineering of transcription factors based on zinc finger proteins, transcription activator-like effectors and the CRISPR/Cas9 system. Additionally, we highlight examples in which these methods have been developed for therapeutic applications and discuss challenges and opportunities.

  15. Protein inhibitor of activated STAT3 inhibits adipogenic gene expression

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Deng Jianbei; Hua Kunjie; Caveney, Erica J.

    2006-01-20

    Protein inhibitor of activated STAT3 (PIAS3), a cytokine-induced repressor of signal transducer and activator of transcription 3 (STAT3) and a modulator of a broad array of nuclear proteins, is expressed in white adipose tissue, but its role in adipogenesis is not known. Here, we determined that PIAS3 was constitutively expressed in 3T3-L1 cells at all stages of adipogenesis. However, it translocated from the nucleus to the cytoplasm 4 days after induction of differentiation by isobutylmethylxanthine, dexamethasone, and insulin (MDI). In ob/ob mice, PIAS3 expression was increased in white adipose tissue depots compared to lean mice and was found in the cytoplasm of adipocytes. Overexpression of PIAS3 in differentiating preadipocytes, which localized primarily to the nucleus, inhibited mRNA level gene expression of adipogenic transcription factors C/EBPα and PPARγ, as well as their downstream target genes aP2 and adiponectin. PIAS3 also inhibited C/EBPα promoter activation mediated specifically by insulin, but not dexamethasone or isobutylmethylxanthine. Taken together, these data suggest that PIAS3 may play an inhibitory role in adipogenesis by modulating insulin-activated transcriptional activation events. Increased PIAS3 expression in adipose tissue may play a role in the metabolic disturbances of obesity.

  16. Profilin is associated with transcriptionally active genes

    PubMed Central

    Söderberg, Emilia; Hessle, Viktoria; von Euler, Anne; Visa, Neus

    2012-01-01

    We have raised antibodies against the profilin of Chironomus tentans to study the location of profilin relative to chromatin and to active genes in salivary gland polytene chromosomes. We show that a fraction of profilin is located in the nucleus, where profilin is highly concentrated in the nucleoplasm and at the nuclear periphery. Moreover, profilin is associated with multiple bands in the polytene chromosomes. By staining salivary glands with propidium iodide, we show that profilin does not co-localize with dense chromatin. Profilin associates instead with protein-coding genes that are transcriptionally active, as revealed by co-localization with hnRNP and snRNP proteins. We have performed experiments of transcription inhibition with actinomycin D and we show that the association of profilin with the chromosomes requires ongoing transcription. However, the interaction of profilin with the gene loci does not depend on RNA. Our results are compatible with profilin regulating actin polymerization in the cell nucleus. However, the association of actin with the polytene chromosomes of C. tentans is sensitive to RNase, whereas the association of profilin is not, and we propose therefore that the chromosomal location of profilin is independent of actin. PMID:22572953

  17. Computer analysis of protein functional sites projection on exon structure of genes in Metazoa

    PubMed Central

    2015-01-01

    Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the

  18. Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

    PubMed

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2015-01-01

    Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and

  19. Improvement of heterologous protein production in Aspergillus oryzae by RNA interference with alpha-amylase genes.

    PubMed

    Nemoto, Takashi; Maruyama, Jun-ichi; Kitamoto, Katsuhiko

    2009-11-01

    Aspergillus oryzae RIB40 has three alpha-amylase genes (amyA, amyB, and amyC), and secretes alpha-amylase abundantly. However, large amounts of endogenous secretory proteins such as alpha-amylase can compete with heterologous protein in the secretory pathway and decrease its production yields. In this study, we examined the effects of suppression of alpha-amylase on heterologous protein production in A. oryzae, using the bovine chymosin (CHY) as a reporter heterologous protein. The three alpha-amylase genes in A. oryzae have nearly identical DNA sequences from those promoters to the coding regions. Hence we performed silencing of alpha-amylase genes by RNA interference (RNAi) in the A. oryzae CHY producing strain. The silenced strains exhibited a reduction in alpha-amylase activity and an increase in CHY production in the culture medium. This result suggests that suppression of alpha-amylase is effective in heterologous protein production in A. oryzae.

  20. Circular RNAs: Unexpected outputs of many protein-coding genes

    PubMed Central

    Wilusz, Jeremy E.

    2017-01-01

    ABSTRACT Pre-mRNAs from thousands of eukaryotic genes can be non-canonically spliced to generate circular RNAs, some of which accumulate to higher levels than their associated linear mRNA. Recent work has revealed widespread mechanisms that dictate whether the spliceosome generates a linear or circular RNA. For most genes, circular RNA biogenesis via backsplicing is far less efficient than canonical splicing, but circular RNAs can accumulate due to their long half-lives. Backsplicing is often initiated when complementary sequences from different introns base pair and bring the intervening splice sites close together. This process is further regulated by the combinatorial action of RNA binding proteins, which allow circular RNAs to be expressed in unique patterns. Some genes do not require complementary sequences to generate RNA circles and instead take advantage of exon skipping events. It is still unclear what most mature circular RNAs do, but future investigations into their functions will be facilitated by recently described methods to modulate circular RNA levels. PMID:27571848

  1. Characterization of a gene coding for a type IIo bacterial IgG-binding protein.

    PubMed

    Boyle, M D; Weber-Heynemann, J; Raeder, R; Podbielski, A

    1995-06-01

    Two antigenic classes of non-immune IgG-binding proteins can be expressed by group A streptococci. One antigenic group of proteins is recognized by an antibody prepared against the product of a cloned fcrA gene (anti-FcRA). In this study, the immunogen used to prepare the antibody that defines the second antigenic class was shown to be the product of the emm-like (emmL) gene of M serotype 55 group A isolate, A928. The emmL55 gene expressed in E. coli produced an M(r) approximately 58,000 molecule which bound human IgG1, IgG2, IgG3 and IgG4, as well as horse, rabbit and pig IgG in a non-immune fashion. These properties are characteristic of the previously described type IIo IgG-binding protein isolated from this strain. In addition, the recombinant protein was reactive with human serum albumin and fibrinogen. The emmL 55 gene sequence was analysed and found to have the organization and sequence characteristics of a typical class I emm-like gene.

  2. Non-protein coding RNA genes as the novel diagnostic markers for the discrimination of Salmonella species using PCR.

    PubMed

    Nithya, Ravichantar; Ahmed, Siti Aminah; Hoe, Chee-Hock; Gopinath, Subash C B; Citartan, Marimuthu; Chinni, Suresh V; Lee, Li Pin; Rozhdestvensky, Timofey S; Tang, Thean-Hock

    2015-01-01

    Salmonellosis is a communicable disease caused by members of the Salmonella species and transmitted to humans through contaminated food or water. It is of paramount importance to generate accurate detection methods for discriminating the various Salmonella species that cause severe infection in humans, including S. Typhi and S. Paratyphi A. Here, we formulated a strategy of detection and differentiation of salmonellosis by a multiplex polymerase chain reaction assay using S. Typhi non-protein coding RNA (sRNA) genes. With the designed sequences that specifically detect sRNA genes from S. Typhi and S. Paratyphi A, a detection limit of up to 10 pg was achieved. Moreover, in a stool-seeding experiment with S. Typhi and S. Paratyphi A, we have attained a respective detection limit of 15 and 1.5 CFU/mL. The designed strategy using sRNA genes shown here is comparatively sensitive and specific, suitable for clinical diagnosis and disease surveillance, and sRNAs represent an excellent molecular target for infectious disease.

  3. Identification of the G13 (cAMP-response-element-binding protein-related protein) gene product related to activating transcription factor 6 as a transcriptional activator of the mammalian unfolded protein response.

    PubMed

    Haze, K; Okada, T; Yoshida, H; Yanagi, H; Yura, T; Negishi, M; Mori, K

    2001-04-01

    Eukaryotic cells control the levels of molecular chaperones and folding enzymes in the endoplasmic reticulum (ER) by a transcriptional induction process termed the unfolded protein response (UPR). The mammalian UPR is mediated by the cis-acting ER stress response element consisting of 19 nt (CCAATN(9)CCACG), the CCACG part of which is considered to provide specificity. We recently identified the basic leucine zipper (bZIP) protein ATF6 as a mammalian UPR-specific transcription factor; ATF6 is activated by ER stress-induced proteolysis and binds directly to CCACG. Here we report that eukaryotic cells express another bZIP protein closely related to ATF6 in both structure and function. This protein encoded by the G13 (cAMP response element binding protein-related protein) gene is constitutively synthesized as a type II transmembrane glycoprotein anchored in the ER membrane and processed into a soluble form upon ER stress as occurs with ATF6. The proteolytic processing of ATF6 and the G13 gene product is accompanied by their relocation from the ER to the nucleus; their basic regions seem to function as a nuclear localization signal. Overexpression of the soluble form of the G13 product constitutively activates the UPR, whereas overexpression of a mutant lacking the activation domain exhibits a strong dominant-negative effect. Furthermore, the soluble forms of ATF6 and the G13 gene product are unable to bind to several point mutants of the cis-acting ER stress response element in vitro that hardly respond to ER stress in vivo. We thus concluded that the two related bZIP proteins are crucial transcriptional regulators of the mammalian UPR, and propose calling the ATF6 gene product ATF6alpha and the G13 gene product ATF6beta.
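
    The 19-nt ER stress response element quoted in this abstract, CCAATN(9)CCACG, can be read as CCAAT, nine arbitrary nucleotides, then CCACG (5 + 9 + 5 = 19). As a minimal sketch, assuming a plain A/C/G/T sequence string, the following scans both strands for that consensus. The function names `find_erse` and `reverse_complement` are hypothetical and are not taken from the paper.

```python
import re

# ERSE consensus as quoted in the abstract: CCAAT + 9 arbitrary nt + CCACG (19 nt).
# The lookahead lets overlapping occurrences be reported as well.
ERSE = re.compile(r"(?=(CCAAT[ACGT]{9}CCACG))")
ERSE_LENGTH = 19

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_erse(seq):
    """Return (strand, 0-based start on the forward strand, matched 19-mer) per hit."""
    seq = seq.upper()
    hits = [("+", m.start(), m.group(1)) for m in ERSE.finditer(seq)]
    for m in ERSE.finditer(reverse_complement(seq)):
        # Map the reverse-strand coordinate back onto the forward strand.
        start = len(seq) - (m.start() + ERSE_LENGTH)
        hits.append(("-", start, m.group(1)))
    return hits
```

    Matching the literal consensus is only a first-pass filter; the abstract itself notes that point mutants of the element lose binding, so a real analysis would likely need a more careful model than an exact pattern.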

  4. Modeling T-cell activation using gene expression profiling and state-space models.

    PubMed

    Rangel, Claudia; Angus, John; Ghahramani, Zoubin; Lioumi, Maria; Sotheran, Elizabeth; Gaiba, Alessia; Wild, David L; Falciani, Francesco

    2004-06-12

    We have used state-space models to reverse engineer transcriptional networks from highly replicated gene expression profiling time series data obtained from a well-established model of T-cell activation. State space models are a class of dynamic Bayesian networks that assume that the observed measurements depend on some hidden state variables that evolve according to Markovian dynamics. These hidden variables can capture effects that cannot be measured in a gene expression profiling experiment, e.g. genes that have not been included in the microarray, levels of regulatory proteins, the effects of messenger RNA and protein degradation, etc. Bootstrap confidence intervals are developed for parameters representing 'gene-gene' interactions over time. Our models represent the dynamics of T-cell activation and provide a methodology for the development of rational and experimentally testable hypotheses. Supplementary data and Matlab computer source code will be made available on the web at the URL given below. http://public.kgi.edu/~wild/LDS/index.htm
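
    For orientation, the generic linear-Gaussian form of a state-space model is sketched below: hidden states that evolve with Markovian dynamics, and measurements that depend on those states. The symbols (x_t for hidden states, y_t for measured expression levels, matrices A and C, noise covariances Q and R) are assumptions chosen for illustration; the authors' actual parameterization may differ, for example by including additional input terms.

```latex
\begin{align}
  x_{t+1} &= A\,x_t + w_t, & w_t &\sim \mathcal{N}(0, Q), \\
  y_t     &= C\,x_t + v_t, & v_t &\sim \mathcal{N}(0, R).
\end{align}
```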

  5. Gene regulatory network of unfolded protein response genes in endoplasmic reticulum stress.

    PubMed

    Takayanagi, Sayuri; Fukuda, Riga; Takeuchi, Yuuki; Tsukada, Sakiko; Yoshida, Kenichi

    2013-01-01

    In the endoplasmic reticulum (ER), secretory and membrane proteins are properly folded and modified, and the failure of these processes leads to ER stress. At the same time, unfolded protein response (UPR) genes are activated to maintain homeostasis. Despite the thorough characterization of the individual gene regulation of UPR genes to date, further investigation of the mutual regulation among UPR genes is required to understand the complex mechanism underlying the ER stress response. In this study, we aimed to reveal a gene regulatory network formed by UPR genes, including immunoglobulin heavy chain-binding protein (BiP), X-box binding protein 1 (XBP1), C/EBP [CCAAT/enhancer-binding protein]-homologous protein (CHOP), PKR-like endoplasmic reticulum kinase (PERK), inositol-requiring 1 (IRE1), activating transcription factor 6 (ATF6), and ATF4. For this purpose, we focused on promoter-luciferase reporters for BiP, XBP1, and CHOP genes, which bear an ER stress response element (ERSE), and p5 × ATF6-GL3, which bears an unfolded protein response element (UPRE). We demonstrated that the luciferase activities of the BiP and CHOP promoters were upregulated by all the UPR genes, whereas those of the XBP1 promoter and p5 × ATF6-GL3 were upregulated by all the UPR genes except for BiP, CHOP, and ATF4 in HeLa cells. Therefore, an ERSE- and UPRE-centered gene regulatory network of UPR genes could be responsible for the robustness of the ER stress response. Finally, we revealed that BiP protein was degraded when cells were treated with DNA-damaging reagents, such as etoposide and doxorubicin; this finding suggests that the expression level of BiP is tightly regulated at the post-translational level, rather than at the transcriptional level, in the presence of DNA damage.

  6. Comparative analysis of human protein-coding and noncoding RNAs between brain and 10 mixed cell lines by RNA-Seq.

    PubMed

    Chen, Geng; Yin, Kangping; Shi, Leming; Fang, Yuanzhang; Qi, Ya; Li, Peng; Luo, Jian; He, Bing; Liu, Mingyao; Shi, Tieliu

    2011-01-01

    In their expression process, different genes can generate diverse functional products, including various protein-coding or noncoding RNAs. Here, we investigated the protein-coding capacities and the expression levels of their isoforms for human known genes, the conservation and disease association of long noncoding RNAs (ncRNAs) with two transcriptome sequencing datasets from human brain tissues and 10 mixed cell lines. Comparative analysis revealed that about two-thirds of the genes expressed between brain and cell lines are the same, but less than one-third of their isoforms are identical. Besides those genes specially expressed in brain and cell lines, about 66% of genes expressed in common encoded different isoforms. Moreover, most genes dominantly expressed one isoform and some genes only generated protein-coding (or noncoding) RNAs in one sample but not in another. We found 282 human genes could encode both protein-coding and noncoding RNAs through alternative splicing in the two samples. We also identified more than 1,000 long ncRNAs, and most of those long ncRNAs contain conserved elements across either 46 vertebrates or 33 placental mammals or 10 primates. Further analysis showed that some long ncRNAs differentially expressed in human breast cancer or lung cancer, several of those differentially expressed long ncRNAs were validated by RT-PCR. In addition, those validated differentially expressed long ncRNAs were found significantly correlated with certain breast cancer or lung cancer related genes, indicating the important biological relevance between long ncRNAs and human cancers. Our findings reveal that the differences of gene expression profile between samples mainly result from the expressed gene isoforms, and highlight the importance of studying genes at the isoform level for completely illustrating the intricate transcriptome.

  7. Wheat CBF gene family: identification of polymorphisms in the CBF coding sequence.

    PubMed

    Mohseni, Sara; Che, Hua; Djillali, Zakia; Dumont, Estelle; Nankeu, Joseph; Danyluk, Jean

    2012-12-01

    Expression of cold-regulated genes needed for protection against freezing stress is mediated, in part, by the CBF transcription factor family. Previous studies with temperate cereals suggested that the CBF gene family in wheat was large, and that CBF genes were at the base of an important low temperature tolerance trait. Therefore, the goal of our study was to identify the CBF repertoire in the freezing-tolerant hexaploid wheat cultivar Norstar, and then to examine if the coding region of CBF genes in two spring cultivars contain polymorphisms that could affect the protein sequence and structure. Our analyses reveal that hexaploid wheat contains a complex CBF family consisting of at least 65 CBF genes of which 60 are known to be expressed in the cultivar Norstar. They represent 27 paralogous genes with 1-3 homeologous copies for the A, B, and D genomes. The cultivar Norstar contains two pseudogenes and at least 24 additional proteins having sequences and (or) structures that deviate from the consensus in the conserved AP2 DNA-binding and (or) C-terminal activation-domains. This suggests that in cultivars such as Norstar, low temperature tolerance may be increased through breeding of additional optimal alleles. The examination of the CBF repertoire present in the two spring cultivars, Chinese Spring and Manitou, reveals that they have additional polymorphisms affecting conserved positions in these domains. Understanding the effects of these polymorphisms will provide additional information for the selection of optimum CBF alleles in Triticeae breeding programs.

  8. Nucleotide sequence of the L1 ribosomal protein gene of Xenopus laevis: remarkable sequence homology among introns.

    PubMed Central

    Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F

    1985-01-01

    Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyrimidine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. PMID:3841512

  9. Abscisic acid affects transcription of chloroplast genes via protein phosphatase 2C-dependent activation of nuclear genes: repression by guanosine-3'-5'-bisdiphosphate and activation by sigma factor 5.

    PubMed

    Yamburenko, Maria V; Zubo, Yan O; Börner, Thomas

    2015-06-01

    Abscisic acid (ABA) represses the transcriptional activity of chloroplast genes (determined by run-on assays), with the exception of psbD and a few other genes in wild-type Arabidopsis seedlings and mature rosette leaves. Abscisic acid does not influence chloroplast transcription in the mutant lines abi1-1 and abi2-1 with constitutive protein phosphatase 2C (PP2C) activity, suggesting that ABA affects chloroplast gene activity by binding to the pyrabactin resistance (PYR)/PYR1-like or regulatory component of ABA receptor protein family (PYR/PYL/RCAR) and signaling via PP2Cs and sucrose non-fermenting protein-related kinases 2 (SnRK2s). Further we show by quantitative PCR that ABA enhances the transcript levels of RSH2, RSH3, PTF1 and SIG5. RelA/SpoT homolog 2 (RSH2) and RSH3 are known to synthesize guanosine-3'-5'-bisdiphosphate (ppGpp), an inhibitor of the plastid-gene-encoded chloroplast RNA polymerase. We propose, therefore, that ABA leads to an inhibition of chloroplast gene expression via stimulation of ppGpp synthesis. On the other hand, sigma factor 5 (SIG5) and plastid transcription factor 1 (PTF1) are known to be necessary for the transcription of psbD from a specific light- and stress-induced promoter (the blue light responsive promoter, BLRP). We demonstrate that ABA activates the psbD gene by stimulation of transcription initiation at BLRP. Taken together, our data suggest that ABA affects the transcription of chloroplast genes by a PP2C-dependent activation of nuclear genes encoding proteins involved in chloroplast transcription. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

  10. Mitochondrial genomes of the jungle crow Corvus macrorhynchos (Passeriformes: Corvidae) from shed feathers and a phylogenetic analysis of genus Corvus using mitochondrial protein-coding genes.

    PubMed

    Krzeminska, Urszula; Wilson, Robyn; Rahman, Sadequr; Song, Beng Kah; Seneviratne, Sampath; Gan, Han Ming; Austin, Christopher M

    2016-07-01

    The complete mitochondrial genomes of two jungle crows (Corvus macrorhynchos) were sequenced. DNA was extracted from tissue samples obtained from shed feathers collected in the field in Sri Lanka and sequenced using the Illumina MiSeq Personal Sequencer. Jungle crow mitogenomes have a structural organization typical of the genus Corvus and are 16,927 bp and 17,066 bp in length, both comprising 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal subunit genes, and a non-coding control region. In addition, we complement already available house crow (Corvus splendens) mitogenome resources by sequencing an individual from Singapore. A phylogenetic tree constructed from Corvidae family mitogenome sequences available on GenBank is presented. We confirm the monophyly of the genus Corvus and propose to use complete mitogenome resources for further intra- and interspecies genetic studies.

  11. Non-coding RNAs in lung cancer

    PubMed Central

    Ricciuti, Biagio; Mecca, Carmen; Crinò, Lucio; Baglivo, Sara; Cenci, Matteo; Metro, Giulio

    2014-01-01

    The discovery that protein-coding genes represent less than 2% of the human genome, and the evidence that more than 90% of it is actively transcribed, changed the classical point of view of the central dogma of molecular biology, which was always based on the assumption that RNA functions mainly as an intermediate bridge between DNA sequences and protein synthesis machinery. Accumulating data indicate that non-coding RNAs are involved in different physiological processes, providing for the maintenance of cellular homeostasis. They are important regulators of gene expression, cellular differentiation, proliferation, migration, apoptosis, and stem cell maintenance. Alterations and disruptions of their expression or activity have increasingly been associated with pathological changes of cancer cells. This evidence, and the prospect of using these molecules as diagnostic markers and therapeutic targets, currently make non-coding RNAs among the most relevant molecules in cancer research. In this paper we will provide an overview of non-coding RNA function and disruption in lung cancer biology, also focusing on their potential as diagnostic, prognostic and predictive biomarkers. PMID:25593996

  12. What's that gene (or protein)? Online resources for exploring functions of genes, transcripts, and proteins

    PubMed Central

    Hutchins, James R. A.

    2014-01-01

    The genomic era has enabled research projects that use approaches including genome-scale screens, microarray analysis, next-generation sequencing, and mass spectrometry–based proteomics to discover genes and proteins involved in biological processes. Such methods generate data sets of gene, transcript, or protein hits that researchers wish to explore to understand their properties and functions and thus their possible roles in biological systems of interest. Recent years have seen a profusion of Internet-based resources to aid this process. This review takes the viewpoint of the curious biologist wishing to explore the properties of protein-coding genes and their products, identified using genome-based technologies. Ten key questions are asked about each hit, addressing functions, phenotypes, expression, evolutionary conservation, disease association, protein structure, interactors, posttranslational modifications, and inhibitors. Answers are provided by presenting the latest publicly available resources, together with methods for hit-specific and data set–wide information retrieval, suited to any genome-based analytical technique and experimental species. The utility of these resources is demonstrated for 20 factors regulating cell proliferation. Results obtained using some of these are discussed in more depth using the p53 tumor suppressor as an example. This flexible and universally applicable approach for characterizing experimental hits helps researchers to maximize the potential of their projects for biological discovery. PMID:24723265

  13. Cloning, sequence analysis, and expression in Escherichia coli of a gene coding for a β-mannanase from the extremely thermophilic bacterium Caldocellum saccharolyticum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luethi, E.; Jasmat, N.B.; Grayling, R.A.

    1991-03-01

    A λ recombinant phage expressing β-mannanase activity in Escherichia coli has been isolated from a genomic library of the extremely thermophilic anaerobe Caldocellum saccharolyticum. The gene was cloned into pBR322 on a 5-kb BamHI fragment, and its location was obtained by deletion analysis. The sequence of a 2.1-kb fragment containing the mannanase gene has been determined. One open reading frame was found which could code for a protein of Mr 38,904. The mannanase gene (manA) was overexpressed in E. coli by cloning the gene downstream from the lacZ promoter of pUC18. The enzyme was most active at pH 6 and 80 °C and degraded locust bean gum, guar gum, Pinus radiata glucomannan, and konjak glucomannan. The noncoding region downstream from the mannanase gene showed strong homology to celB, a gene coding for a cellulase from the same organism, suggesting that the manA gene might have been inserted into its present position on the C. saccharolyticum genome by homologous recombination.

  14. Expression of the Long Intergenic Non-Protein Coding RNA 665 (LINC00665) Gene and the Cell Cycle in Hepatocellular Carcinoma Using The Cancer Genome Atlas, the Gene Expression Omnibus, and Quantitative Real-Time Polymerase Chain Reaction.

    PubMed

    Wen, Dong-Yue; Lin, Peng; Pang, Yu-Yan; Chen, Gang; He, Yun; Dang, Yi-Wu; Yang, Hong

    2018-05-05

    BACKGROUND Long non-coding RNAs (lncRNAs) have a role in physiological and pathological processes, including cancer. The aim of this study was to investigate the expression of the long intergenic non-protein coding RNA 665 (LINC00665) gene and the cell cycle in hepatocellular carcinoma (HCC) using database analysis including The Cancer Genome Atlas (TCGA), the Gene Expression Omnibus (GEO), and quantitative real-time polymerase chain reaction (qPCR). MATERIAL AND METHODS Expression levels of LINC00665 were compared between human tissue samples of HCC and adjacent normal liver, clinicopathological correlations were made using TCGA and the GEO, and qPCR was performed to validate the findings. Other public databases were searched for other genes associated with LINC00665 expression, including The Atlas of Noncoding RNAs in Cancer (TANRIC), the Multi Experiment Matrix (MEM), Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and protein-protein interaction (PPI) networks. RESULTS Overexpression of LINC00665 in patients with HCC was significantly associated with gender, tumor grade, stage, and tumor cell type. Overexpression of LINC00665 in patients with HCC was significantly associated with overall survival (OS) (HR=1.477; 95% CI: 1.046-2.086). Bioinformatics analysis identified 469 related genes and further analysis supported a hypothesis that LINC00665 regulates pathways in the cell cycle to facilitate the development and progression of HCC through ten identified core genes: CDK1, BUB1B, BUB1, PLK1, CCNB2, CCNB1, CDC20, ESPL1, MAD2L1, and CCNA2. CONCLUSIONS Overexpression of the lncRNA LINC00665 may be involved in the regulation of cell cycle pathways in HCC through ten identified hub genes.

  15. Decoding sORF translation - from small proteins to gene regulation.

    PubMed

    Cabrera-Quio, Luis Enrique; Herberg, Sarah; Pauli, Andrea

    2016-11-01

    Translation is best known as the fundamental mechanism by which the ribosome converts a sequence of nucleotides into a string of amino acids. Extensive research over many years has elucidated the key principles of translation, and the majority of translated regions were thought to be known. The recent discovery of wide-spread translation outside of annotated protein-coding open reading frames (ORFs) came therefore as a surprise, raising the intriguing possibility that these newly discovered translated regions might have unrecognized protein-coding or gene-regulatory functions. Here, we highlight recent findings that provide evidence that some of these newly discovered translated short ORFs (sORFs) encode functional, previously missed small proteins, while others have regulatory roles. Based on known examples we will also speculate about putative additional roles and the potentially much wider impact that these translated regions might have on cellular homeostasis and gene regulation.

  16. CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts.

    PubMed

    Testa, Alison C; Hane, James K; Ellwood, Simon R; Oliver, Richard P

    2015-03-11

    The impact of gene annotation quality on functional and comparative genomics makes gene prediction an important process, particularly in non-model species, including many fungi. Sets of homologous protein sequences are rarely complete with respect to the fungal species of interest and are often small or unreliable, especially when closely related species have not been sequenced or annotated in detail. In these cases, protein homology-based evidence fails to correctly annotate many genes, or significantly improve ab initio predictions. Generalised hidden Markov models (GHMM) have proven to be invaluable tools in gene annotation and, recently, RNA-seq has emerged as a cost-effective means to significantly improve the quality of automated gene annotation. As these methods do not require sets of homologous proteins, improving gene prediction from these resources is of benefit to fungal researchers. While many pipelines now incorporate RNA-seq data in training GHMMs, there has been relatively little investigation into additionally combining RNA-seq data at the point of prediction, and room for improvement in this area motivates this study. CodingQuarry is a highly accurate, self-training GHMM fungal gene predictor designed to work with assembled, aligned RNA-seq transcripts. RNA-seq data informs annotations both during gene-model training and in prediction. Our approach capitalises on the high quality of fungal transcript assemblies by incorporating predictions made directly from transcript sequences. Correct predictions are made despite transcript assembly problems, including those caused by overlap between the transcripts of adjacent gene loci. Stringent benchmarking against high-confidence annotation subsets showed CodingQuarry predicted 91.3% of Schizosaccharomyces pombe genes and 90.4% of Saccharomyces cerevisiae genes perfectly. These results are 4-5% better than those of AUGUSTUS, the next best performing RNA-seq driven gene predictor tested. 
Comparisons against

  17. Maternal transcription of non-protein coding RNAs from the PWS-critical region rescues growth retardation in mice.

    PubMed

    Rozhdestvensky, Timofey S; Robeck, Thomas; Galiveti, Chenna R; Raabe, Carsten A; Seeger, Birte; Wolters, Anna; Gubar, Leonid V; Brosius, Jürgen; Skryabin, Boris V

    2016-02-05

    Prader-Willi syndrome (PWS) is a neurogenetic disorder caused by loss of paternally expressed genes on chromosome 15q11-q13. The PWS-critical region (PWScr) contains an array of non-protein coding IPW-A exons hosting intronic SNORD116 snoRNA genes. Deletion of PWScr is associated with PWS in humans and growth retardation in mice exhibiting ~15% postnatal lethality in C57BL/6 background. Here we analysed a knock-in mouse containing a 5'HPRT-LoxP-Neo(R) cassette (5'LoxP) inserted upstream of the PWScr. When the insertion was inherited maternally in a paternal PWScr-deletion mouse model (PWScr(p-/m5'LoxP)), we observed compensation of growth retardation and postnatal lethality. Genomic methylation pattern and expression of protein-coding genes remained unaltered at the PWS-locus of PWScr(p-/m5'LoxP) mice. Interestingly, ubiquitous Snord116 and IPW-A exon transcription from the originally silent maternal chromosome was detected. In situ hybridization indicated that PWScr(p-/m5'LoxP) mice expressed Snord116 in brain areas similar to wild type animals. Our results suggest that the lack of PWScr RNA expression in certain brain areas could be a primary cause of the growth retardation phenotype in mice. We propose that activation of disease-associated genes on imprinted regions could lead to general therapeutic strategies in associated diseases.

  18. Comparison of the protein-coding gene content of Chlamydia trachomatis and Protochlamydia amoebophila using a Raspberry Pi computer.

    PubMed

    Robson, James F; Barker, Daniel

    2015-10-13

    To demonstrate the bioinformatics capabilities of a low-cost computer, the Raspberry Pi, we present a comparison of the protein-coding gene content of two species in phylum Chlamydiae: Chlamydia trachomatis, a common sexually transmitted infection of humans, and Candidatus Protochlamydia amoebophila, a recently discovered amoebal endosymbiont. Identifying species-specific proteins and differences in protein families could provide insights into the unique phenotypes of the two species. Using a Raspberry Pi computer, sequence similarity-based protein families were predicted across the two species, C. trachomatis and P. amoebophila, and their members counted. Examples include nine multi-protein families unique to C. trachomatis, 132 multi-protein families unique to P. amoebophila and one family with multiple copies in both. Most families unique to C. trachomatis were polymorphic outer-membrane proteins. Additionally, multiple protein families lacking functional annotation were found. Predicted functional interactions suggest one of these families is involved with the exodeoxyribonuclease V complex. The Raspberry Pi computer is adequate for a comparative genomics project of this scope. The protein families unique to P. amoebophila may provide a basis for investigating the host-endosymbiont interaction. However, additional species should be included; and further laboratory research is required to identify the functions of unknown or putative proteins. Multiple outer membrane proteins were found in C. trachomatis, suggesting importance for host evasion. The tyrosine transport protein family is shared between both species, with four proteins in C. trachomatis and two in P. amoebophila. Shared protein families could provide a starting point for discovery of wide-spectrum drugs against Chlamydiae.

  19. Evaluation of 10 genes encoding cardiac proteins in Doberman Pinschers with dilated cardiomyopathy.

    PubMed

    O'Sullivan, M Lynne; O'Grady, Michael R; Pyle, W Glen; Dawson, John F

    2011-07-01

    To identify a causative mutation for dilated cardiomyopathy (DCM) in Doberman Pinschers by sequencing the coding regions of 10 cardiac genes known to be associated with familial DCM in humans. 5 Doberman Pinschers with DCM and congestive heart failure and 5 control mixed-breed dogs that were euthanized or died. RNA was extracted from frozen ventricular myocardial samples from each dog, and first-strand cDNA was synthesized via reverse transcription, followed by PCR amplification with gene-specific primers. Ten cardiac genes were analyzed: cardiac actin, α-actinin, α-tropomyosin, β-myosin heavy chain, metavinculin, muscle LIM protein, myosin-binding protein C, tafazzin, titin-cap (telethonin), and troponin T. Sequences for DCM-affected and control dogs and the published canine genome were compared. None of the coding sequences yielded a common causative mutation among all Doberman Pinscher samples. However, 3 variants were identified in the α-actinin gene in the DCM-affected Doberman Pinschers. One of these variants, identified in 2 of the 5 Doberman Pinschers, resulted in an amino acid change in the rod-forming triple coiled-coil domain. Mutations in the coding regions of several genes associated with DCM in humans did not appear to consistently account for DCM in Doberman Pinschers. However, an α-actinin variant was detected in some Doberman Pinschers that may contribute to the development of DCM given its potential effect on the structure of this protein. Investigation of additional candidate gene coding and noncoding regions and further evaluation of the role of α-actinin in development of DCM in Doberman Pinschers are warranted.

  20. The BET protein FSH functionally interacts with ASH1 to orchestrate global gene activity in Drosophila

    PubMed Central

    2013-01-01

    Background The question of how cells re-establish gene expression states after cell division is still poorly understood. Genetic and molecular analyses have indicated that Trithorax group (TrxG) proteins are critical for the long-term maintenance of active gene expression states in many organisms. A generally accepted model suggests that TrxG proteins contribute to maintenance of transcription by protecting genes from inappropriate Polycomb group (PcG)-mediated silencing, instead of directly promoting transcription. Results and discussion Here we report a physical and functional interaction in Drosophila between two members of the TrxG, the histone methyltransferase ASH1 and the bromodomain and extraterminal family protein FSH. We investigated this interface at the genome level, uncovering a widespread co-localization of both proteins at promoters and PcG-bound intergenic elements. Our integrative analysis of chromatin maps and gene expression profiles revealed that the observed ASH1-FSH binding pattern at promoters is a hallmark of active genes. Inhibition of FSH-binding to chromatin resulted in global down-regulation of transcription. In addition, we found that genes displaying marks of robust PcG-mediated repression also have ASH1 and FSH bound to their promoters. Conclusions Our data strongly favor a global coactivator function of ASH1 and FSH during transcription, as opposed to the notion that TrxG proteins impede inappropriate PcG-mediated silencing, but are dispensable elsewhere. Instead, our results suggest that PcG repression needs to overcome the transcription-promoting function of ASH1 and FSH in order to silence genes. PMID:23442797

  1. Regulated expression of the human cytomegalovirus pp65 gene: Octamer sequence in the promoter is required for activation by viral gene products

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Depto, A.S.; Stenberg, R.M.

    1989-03-01

    To better understand the regulation of late gene expression in human cytomegalovirus (CMV)-infected cells, the authors examined expression of the gene that codes for the 65-kilodalton lower-matrix phosphoprotein (pp65). Analysis of RNA isolated at 72 h from cells infected with CMV Towne or ts66, a DNA-negative temperature-sensitive mutant, supported the fact that pp65 is expressed at low levels prior to viral DNA replication but maximally expressed after the initiation of viral DNA replication. To investigate promoter activation in a transient expression assay, the pp65 promoter was cloned into the indicator plasmid containing the gene for chloramphenicol acetyltransferase (CAT). Transfection of the promoter-CAT construct and subsequent superinfection with CMV resulted in activation of the promoter at early times after infection. Cotransfection with plasmids capable of expressing immediate-early (IE) proteins demonstrated that the promoter was activated by IE proteins and that both IE regions 1 and 2 were necessary. These studies suggest that interactions between IE proteins and this octamer sequence may be important for the regulation and expression of this CMV gene.

  2. Multiple transcription factor codes activate epidermal wound–response genes in Drosophila

    PubMed Central

    Pearson, Joseph C.; Juarez, Michelle T.; Kim, Myungjin; Drivenes, Øyvind; McGinnis, William

    2009-01-01

    Wounds in Drosophila and mouse embryos induce similar genetic pathways to repair epidermal barriers. However, the transcription factors that transduce wound signals to repair epidermal barriers are largely unknown. We characterize the transcriptional regulatory enhancers of 4 genes—Ddc, ple, msn, and kkv—that are rapidly activated in epidermal cells surrounding wounds in late Drosophila embryos and early larvae. These epidermal wound enhancers all contain evolutionarily conserved sequences matching binding sites for JUN/FOS and GRH transcription factors, but vary widely in trans- and cis-requirements for these inputs and their binding sites. We propose that the combination of GRH and FOS is part of an ancient wound–response pathway still used in vertebrates and invertebrates, but that other mechanisms have evolved that result in similar transcriptional output. A common, but largely untested assumption of bioinformatic analyses of gene regulatory networks is that transcription units activated in the same spatial and temporal patterns will require the same cis-regulatory codes. Our results indicate that this is an overly simplistic view. PMID:19168633

  3. Massively Convergent Evolution for Ribosomal Protein Gene Content in Plastid and Mitochondrial Genomes

    PubMed Central

    Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F.; Martin, William F.

    2013-01-01

    Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force. PMID:24259312

  4. Identification of a Conserved Non-Protein-Coding Genomic Element that Plays an Essential Role in Alphabaculovirus Pathogenesis

    PubMed Central

    Kikhno, Irina

    2014-01-01

    Highly homologous sequences 154–157 bp in length grouped under the name of “conserved non-protein-coding element” (CNE) were revealed in all of the sequenced genomes of baculoviruses belonging to the genus Alphabaculovirus. A CNE alignment led to the detection of a set of highly conserved nucleotide clusters that occupy strictly conserved positions in the CNE sequence. The significant length of the CNE and conservation of both its length and cluster architecture were identified as a combination of characteristics that make this CNE different from known viral non-coding functional sequences. The essential role of the CNE in the Alphabaculovirus life cycle was demonstrated through the use of a CNE-knockout Autographa californica multiple nucleopolyhedrovirus (AcMNPV) bacmid. It was shown that the essential function of the CNE was not mediated by the presumed expression activities of the protein- and non-protein-coding genes that overlap the AcMNPV CNE. On the basis of the presented data, the AcMNPV CNE was categorized as a complex-structured, polyfunctional genomic element involved in an essential DNA transaction that is associated with an undefined function of the baculovirus genome. PMID:24740153

  5. Random oligonucleotide mutagenesis: application to a large protein coding sequence of a major histocompatibility complex class I gene, H-2DP.

    PubMed Central

    Murray, R; Pederson, K; Prosser, H; Muller, D; Hutchison, C A; Frelinger, J A

    1988-01-01

    We have used random oligonucleotide mutagenesis (or saturation mutagenesis) to create a library of point mutations in the alpha 1 protein domain of a Major Histocompatibility Complex (MHC) molecule. This protein domain is critical for T cell and B cell recognition. We altered the MHC class I H-2DP gene sequence such that synthetic mutant alpha 1 exons (270 bp of coding sequence), which contain mutations identified by sequence analysis, can replace the wild type alpha 1 exon. The synthetic exons were constructed from twelve overlapping oligonucleotides which contained an average of 1.3 random point mutations per intact exon. DNA sequence analysis of mutant alpha 1 exons has shown a point mutant distribution that fits a Poisson distribution, and thus emphasizes the utility of this mutagenesis technique to "scan" a large protein sequence for important mutations. We report our use of saturation mutagenesis to scan an entire exon of the H-2DP gene, a cassette strategy to replace the wild type alpha 1 exon with individual mutant alpha 1 exons, and analysis of mutant molecules expressed on the surface of transfected mouse L cells. PMID:2903482
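
    As a worked illustration of the Poisson fit mentioned in this abstract, the sketch below computes the fraction of exons expected to carry 0, 1, 2, ... point mutations when the mean is 1.3 per exon. Only the mean comes from the abstract; the function names and the cut-off of four classes are assumptions made for illustration, and this is not the authors' analysis.

```python
import math

# Mean number of point mutations per exon, as reported in the abstract.
MEAN_MUTATIONS_PER_EXON = 1.3


def poisson_pmf(k: int, mean: float) -> float:
    """Probability of exactly k events under a Poisson distribution."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def mutation_class_fractions(mean: float = MEAN_MUTATIONS_PER_EXON,
                             max_k: int = 4) -> dict:
    """Expected fraction of exons with 0..max_k-1 mutations, plus the tail.

    max_k is an illustrative choice, not a value from the source.
    """
    fractions = {k: poisson_pmf(k, mean) for k in range(max_k)}
    # Everything with max_k or more mutations is lumped into one tail class.
    fractions[f">={max_k}"] = 1.0 - sum(fractions.values())
    return fractions


if __name__ == "__main__":
    for mutations, fraction in mutation_class_fractions().items():
        print(f"{mutations} mutation(s): {fraction:.3f}")
```

    Under this model roughly 27% of synthetic exons would be unmutated and about 35% would carry exactly one point mutation, which is the class most useful for scanning individual residues.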

  6. Intra- and inter-isolate variation of ribosomal and protein-coding genes in Pleurotus: implications for molecular identification and phylogeny on fungal groups.

    PubMed

    He, Xiao-Lan; Li, Qian; Peng, Wei-Hong; Zhou, Jie; Cao, Xue-Lian; Wang, Di; Huang, Zhong-Qian; Tan, Wei; Li, Yu; Gan, Bing-Cheng

    2017-06-26

    The internal transcribed spacer (ITS), RNA polymerase II second largest subunit (RPB2), and elongation factor 1-alpha (EF1α) are often used in fungal taxonomy and phylogenetic analysis. An ideal molecular marker for molecular identification and phylogenetic studies is homogeneous within species, with interspecific variation exceeding intraspecific variation. However, while sequencing ITS, RPB2, and EF1α in Pleurotus spp., we found that intra-isolate sequence polymorphism might be present in these genes because direct sequencing of PCR products failed in some isolates. Therefore, we detected intra- and inter-isolate variation of the three genes in Pleurotus by polymerase chain reaction amplification and cloning in this study. Results showed that intra-isolate variation of ITS was not uncommon but the polymorphic level in each isolate was relatively low in Pleurotus; intra-isolate variations of EF1α and RPB2 sequences were present in an unexpectedly high amount. The polymorphism level differed significantly between ITS, RPB2, and EF1α in the same individual, and the intra-isolate heterogeneity level of each gene varied between isolates within the same species. Intra-isolate and intraspecific variation of ITS in the tested isolates was less than interspecific variation, and intra-isolate and intraspecific variation of RPB2 was probably equal to interspecific divergence. Meanwhile, intra-isolate and intraspecific variation of EF1α could exceed interspecific divergence. These findings suggested that RPB2 and EF1α are not desirable barcoding candidates for Pleurotus. We also discussed why rDNA and protein-coding genes showed variants within a single isolate in Pleurotus, although this question must be addressed in further research. Our study demonstrated that intra-isolate variation of ribosomal and protein-coding genes is likely widespread in fungi. This has implications for studies on fungal evolution, taxonomy

  7. Death of a dogma: eukaryotic mRNAs can code for more than one protein

    PubMed Central

    Mouilleron, Hélène; Delcourt, Vivian; Roucou, Xavier

    2016-01-01

    mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5′ UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3′ UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma. PMID:26578573
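
    To make the idea of several coding sequences on one mRNA concrete, the following minimal sketch scans the three forward reading frames of a transcript and reports every AUG-initiated open reading frame. It assumes the standard stop codons and a toy sequence invented for illustration; the function name and the min_codons threshold are hypothetical, and real alternative-ORF annotation relies on ribosome profiling and proteomics evidence as the review describes.

```python
# Stop codons of the standard genetic code (RNA alphabet).
STOP_CODONS = {"UAA", "UAG", "UGA"}


def find_orfs(mrna: str, min_codons: int = 2) -> list:
    """Return (start, end, frame) for each AUG...stop ORF in forward frames.

    start is the 0-based index of the A in AUG, end is the index just past
    the stop codon. min_codons counts codons before the stop and is an
    illustrative threshold, not a value from the source.
    """
    seq = mrna.upper().replace("T", "U")  # accept DNA or RNA input
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "AUG":
                start = i  # first AUG of the current open stretch
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # look for the next ORF in this frame
    return sorted(orfs)


if __name__ == "__main__":
    # Toy transcript: a frame-0 ORF with a second, overlapping ORF in frame 2.
    toy_mrna = "AUGGCAUGCCCUAAGUAA"
    for start, end, frame in find_orfs(toy_mrna):
        print(f"frame {frame}: {toy_mrna[start:end]}")
```

    In the toy transcript the scan recovers the long frame-0 ORF together with a short ORF nested inside it in another frame, which is the kind of overlap a one-CDS-per-gene annotation would miss.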

  8. Molecular mechanisms of ribosomal protein gene coregulation

    PubMed Central

    Reja, Rohit; Vinayachandran, Vinesh; Ghosh, Sujana; Pugh, B. Franklin

    2015-01-01

    The 137 ribosomal protein genes (RPGs) of Saccharomyces provide a model for gene coregulation. We examined the positional and functional organization of their regulators (Rap1 [repressor activator protein 1], Fhl1, Ifh1, Sfp1, and Hmo1), the transcription machinery (TFIIB, TFIID, and RNA polymerase II), and chromatin at near-base-pair resolution using ChIP-exo, as RPGs are coordinately reprogrammed. Where Hmo1 is enriched, Fhl1, Ifh1, Sfp1, and Hmo1 cross-linked broadly to promoter DNA in an RPG-specific manner and demarcated by general minor groove widening. Importantly, Hmo1 extended 20–50 base pairs (bp) downstream from Fhl1. Upon RPG repression, Fhl1 remained in place. Hmo1 dissociated, which was coupled to an upstream shift of the +1 nucleosome, as reflected by the Hmo1 extension and core promoter region. Fhl1 and Hmo1 may create two regulatable and positionally distinct barriers, against which chromatin remodelers position the +1 nucleosome into either an activating or a repressive state. Consistent with in vitro studies, we found that specific TFIID subunits, in addition to cross-linking at the core promoter, made precise cross-links at Rap1 sites, which we interpret to reflect native Rap1–TFIID interactions. Our findings suggest how sequence-specific DNA binding regulates nucleosome positioning and transcription complex assembly >300 bp away and how coregulation coevolved with coding sequences. PMID:26385964

  9. Molecular mechanisms of ribosomal protein gene coregulation.

    PubMed

    Reja, Rohit; Vinayachandran, Vinesh; Ghosh, Sujana; Pugh, B Franklin

    2015-09-15

    The 137 ribosomal protein genes (RPGs) of Saccharomyces provide a model for gene coregulation. We examined the positional and functional organization of their regulators (Rap1 [repressor activator protein 1], Fhl1, Ifh1, Sfp1, and Hmo1), the transcription machinery (TFIIB, TFIID, and RNA polymerase II), and chromatin at near-base-pair resolution using ChIP-exo, as RPGs are coordinately reprogrammed. Where Hmo1 is enriched, Fhl1, Ifh1, Sfp1, and Hmo1 cross-linked broadly to promoter DNA in an RPG-specific manner and demarcated by general minor groove widening. Importantly, Hmo1 extended 20-50 base pairs (bp) downstream from Fhl1. Upon RPG repression, Fhl1 remained in place. Hmo1 dissociated, which was coupled to an upstream shift of the +1 nucleosome, as reflected by the Hmo1 extension and core promoter region. Fhl1 and Hmo1 may create two regulatable and positionally distinct barriers, against which chromatin remodelers position the +1 nucleosome into either an activating or a repressive state. Consistent with in vitro studies, we found that specific TFIID subunits, in addition to cross-linking at the core promoter, made precise cross-links at Rap1 sites, which we interpret to reflect native Rap1-TFIID interactions. Our findings suggest how sequence-specific DNA binding regulates nucleosome positioning and transcription complex assembly >300 bp away and how coregulation coevolved with coding sequences. © 2015 Reja et al.; Published by Cold Spring Harbor Laboratory Press.

  10. Intragenome Diversity of Gene Families Encoding Toxin-like Proteins in Venomous Animals.

    PubMed

    Rodríguez de la Vega, Ricardo C; Giraud, Tatiana

    2016-11-01

    The evolution of venoms is the story of how toxins arise and of the processes that generate and maintain their diversity. For animal venoms these processes include recruitment for expression in the venom gland, neofunctionalization, paralogous expansions, and functional divergence. The systematic study of these processes requires the reliable identification of the venom components involved in antagonistic interactions. High-throughput sequencing has the potential of uncovering the entire set of toxins in a given organism, yet the existence of non-venom toxin paralogs and the misleading effects of a partial census of the molecular diversity of toxins make it necessary to collect complementary evidence to distinguish true toxins from their non-venom paralogs. Here, we analyzed the whole genomes of two scorpions, one spider and one snake, aiming at the identification of the full repertoires of genes encoding toxin-like proteins. We classified the entire set of protein-coding genes into paralogous groups and monotypic genes, identified genes encoding toxin-like proteins based on known toxin families, and quantified their expression in both venom-glands and pooled tissues. Our results confirm that genes encoding toxin-like proteins are part of multigene families, and that these families arise by recruitment events from non-toxin genes followed by limited expansions of the toxin-like protein coding genes. We also show that failing to account for sequence similarity with non-toxin proteins has a considerable misleading effect that can be greatly reduced by comparative transcriptomics. Our study overall contributes to the understanding of the evolutionary dynamics of proteins involved in antagonistic interactions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology. All rights reserved. For permissions please email: journals.permissions@oup.com.

  11. Unraveling patterns of site-to-site synonymous rates variation and associated gene properties of protein domains and families.

    PubMed

    Dimitrieva, Slavica; Anisimova, Maria

    2014-01-01

    In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore not to be subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites have been reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play an important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with a larger number of interactions or forming a larger number of structures, those located in intracellular components, and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.

  12. GA binding protein augments autophagy via transcriptional activation of BECN1-PIK3C3 complex genes

    PubMed Central

    Zhu, Wan; Swaminathan, Gayathri; Plowey, Edward D

    2014-01-01

    Macroautophagy is a vesicular catabolic trafficking pathway that is thought to protect cells from diverse stressors and to promote longevity. Recent studies have revealed that transcription factors play important roles in the regulation of autophagy. In this study, we have identified GA binding protein (GABP) as a transcriptional regulator of the combinatorial expression of BECN1-PIK3C3 complex genes involved in autophagosome initiation. We performed bioinformatics analyses that demonstrated highly conserved putative GABP sites in genes that encode BECN1/Beclin 1, several BECN1 interacting proteins, and downstream autophagy proteins including the ATG12–ATG5-ATG16L1 complex. We demonstrate that GABP binds to the promoter regions of BECN1-PIK3C3 complex genes and activates their transcriptional activities. Knockdown of GABP reduced BECN1-PIK3C3 complex transcripts, BECN1-PIK3C3 complex protein levels and autophagy in cultured cells. Conversely, overexpression of GABP increased autophagy. Nutrient starvation increased GABP-dependent transcriptional activity of BECN1-PIK3C3 complex gene promoters and increased the recruitment of GABP to the BECN1 promoter. Our data reveal a novel function of GABP in the regulation of autophagy via transcriptional activation of the BECN1-PIK3C3 complex. PMID:25046113

  13. Cloning and sequence analysis of a cDNA clone coding for the mouse GM2 activator protein.

    PubMed Central

    Bellachioma, G; Stirling, J L; Orlacchio, A; Beccari, T

    1993-01-01

    A cDNA (1.1 kb) containing the complete coding sequence for the mouse GM2 activator protein was isolated from a mouse macrophage library using a cDNA for the human protein as a probe. There was a single ATG located 12 bp from the 5' end of the cDNA clone followed by an open reading frame of 579 bp. Northern blot analysis of mouse macrophage RNA showed that there was a single band with a mobility corresponding to a size of 2.3 kb. We deduce from this that the mouse mRNA, in common with the mRNA for the human GM2 activator protein, has a long 3' untranslated sequence of approx. 1.7 kb. Alignment of the mouse and human deduced amino acid sequences showed 68% identity overall and 75% identity for the sequence on the C-terminal side of the first 31 residues, which in the human GM2 activator protein contains the signal peptide. Hydropathicity plots showed great similarity between the mouse and human sequences even in regions of low sequence similarity. There is a single N-glycosylation site in the mouse GM2 activator protein sequence (Asn151-Phe-Thr) which differs in its location from the single site reported in the human GM2 activator protein sequence (Asn63-Val-Thr). PMID:7689829
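
    The 3' untranslated length quoted in this abstract can be checked with simple arithmetic. The sketch below is only an illustration: it assumes that the 5' end of the cDNA clone coincides with the 5' end of the mRNA and it ignores the poly(A) tail, neither of which the abstract states. The function and constant names are hypothetical.

```python
# Figures quoted in the abstract, in base pairs.
MRNA_LENGTH = 2300       # Northern blot band of about 2.3 kb
FIVE_PRIME_LEADER = 12   # the single ATG lies 12 bp from the 5' end of the clone
ORF_LENGTH = 579         # open reading frame

def three_prime_utr_length(mrna, leader, orf):
    """Return what is left of the transcript after the 5' leader and the ORF.

    Assumption (not from the abstract): the clone's 5' end is the mRNA's
    5' end, and the poly(A) tail is ignored.
    """
    return mrna - leader - orf

utr = three_prime_utr_length(MRNA_LENGTH, FIVE_PRIME_LEADER, ORF_LENGTH)
codons = ORF_LENGTH // 3  # number of triplets in the 579 bp reading frame

print(f"estimated 3' UTR: {utr} bp")
print(f"triplets in ORF: {codons}")
```

    Under these assumptions the remainder is close to the approx. 1.7 kb given by the authors, and the 579 bp frame divides evenly into triplets.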

  14. Novel methods for the molecular discrimination of Fasciola spp. on the basis of nuclear protein-coding genes.

    PubMed

    Shoriki, Takuya; Ichikawa-Seki, Madoka; Suganuma, Keisuke; Naito, Ikunori; Hayashi, Kei; Nakao, Minoru; Aita, Junya; Mohanta, Uday Kumar; Inoue, Noboru; Murakami, Kenji; Itagaki, Tadashi

    2016-06-01

    Fasciolosis is an economically important disease of livestock caused by Fasciola hepatica, Fasciola gigantica, and aspermic Fasciola flukes. The aspermic Fasciola flukes have been discriminated morphologically from the two other species by the absence of sperm in their seminal vesicles. To date, the molecular discrimination of F. hepatica and F. gigantica has relied on the nucleotide sequences of the internal transcribed spacer 1 (ITS1) region. However, ITS1 genotypes of aspermic Fasciola flukes cannot be clearly differentiated from those of F. hepatica and F. gigantica. Therefore, more precise and robust methods are required to discriminate Fasciola spp. In this study, we developed PCR restriction fragment length polymorphism and multiplex PCR methods to discriminate F. hepatica, F. gigantica, and aspermic Fasciola flukes on the basis of the nuclear protein-coding genes phosphoenolpyruvate carboxykinase and DNA polymerase delta, which are single-locus genes in most eukaryotes. All aspermic Fasciola flukes used in this study had a mixed fragment pattern of F. hepatica and F. gigantica for both of these genes, suggesting that the flukes are descended through hybridization between the two species. These molecular methods will facilitate the identification of F. hepatica, F. gigantica, and aspermic Fasciola flukes, and will also prove useful in etiological studies of fasciolosis. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  15. Molecular characterization demonstrates that the Zea mays gene sugary2 codes for the starch synthase isoform SSIIa.

    PubMed

    Zhang, Xiaoli; Colleoni, Christophe; Ratushna, Vlada; Sirghie-Colleoni, Mirella; James, Martha G; Myers, Alan M

    2004-04-01

    Mutations in the maize gene sugary2 (su2) affect starch structure and its resultant physiochemical properties in useful ways, although the gene has not been characterized previously at the molecular level. This study tested the hypothesis that su2 codes for starch synthase IIa (SSIIa). Two independent mutations of the su2 locus, su2-2279 and su2-5178, were identified in a Mutator-active maize population. The nucleotide sequence of the genomic locus that codes for SSIIa was compared between wild type plants and those homozygous for either novel mutation. Plants bearing su2-2279 invariably contained a Mutator transposon in exon 3 of the SSIIa gene, and su2-5178 mutants always contained a small retrotransposon-like insertion in exon 10. Six allelic su2(-) mutations conditioned loss or reduction in abundance of the SSIIa protein detected by immunoblot. These data indicate that su2 codes for SSIIa and that deficiency in this isoform is ultimately responsible for the altered physiochemical properties of su2(-) mutant starches. A specific starch synthase isoform among several identified in soluble endosperm extracts was absent in su2-2279 or su2-5178 mutants, indicating that SSIIa is active in the soluble phase during kernel development. The immediate structural effect of the su2(-) mutations was shown to be increased abundance of short glucan chains in amylopectin and a proportional decrease in intermediate length chains, similar to the effects of SSII deficiency in other species.

  16. Amplification of the groESL operon in Pseudomonas putida increases siderophore gene promoter activity.

    PubMed

    Venturi, V; Wolfs, K; Leong, J; Weisbeek, P J

    1994-10-17

    Pseudobactin 358 is the yellow-green fluorescent siderophore [microbial iron(III) transport agent] produced by Pseudomonas putida WCS358 under iron-limiting conditions. The genes encoding pseudobactin 358 biosynthesis are iron-regulated at the level of transcription. In this study, the molecular characterization is reported of a cosmid clone of WCS358 DNA that can stimulate, in an iron-dependent manner, the activity of a WCS358 siderophore gene promoter in the heterologous Pseudomonas strain A225. The functional region in the clone was identified by subcloning, transposon mutagenesis and DNA sequencing as the groESL operon of strain WCS358. This increase in promoter activity was not observed when the groESL genes of strain WCS358 were integrated via a transposon vector into the genome of Pseudomonas A225, indicating that multiple copies of the operon are necessary for the increase in siderophore gene promoter activity. Amplification of the Escherichia coli and WCS358 groESL genes also increased iron-regulated promoter activity in the parent strain WCS358. The groESL operon codes for the chaperone proteins GroES and GroEL, which are responsible for mediating the folding and assembly of many proteins.

  17. Ethylene Regulates Monomeric GTP-Binding Protein Gene Expression and Activity in Arabidopsis

    PubMed Central

    Moshkov, Igor E.; Mur, Luis A.J.; Novikova, Galina V.; Smith, Aileen R.; Hall, Michael A.

    2003-01-01

    Ethylene rapidly and transiently up-regulates the activity of several monomeric GTP-binding proteins (monomeric G proteins) in leaves of Arabidopsis as determined by two-dimensional gel electrophoresis and autoradiographic analyses. The activation is suppressed by the receptor-directed inhibitor 1-methylcyclopropene. In the etr1-1 mutant, constitutive activity of all the monomeric G proteins activated by ethylene is down-regulated relative to wild type, and ethylene treatment has no effect on the levels of activity. Conversely, in the ctr1-1 mutant, several of the monomeric G proteins activated by ethylene are constitutively up-regulated. However, the activation profile of ctr1-1 does not exactly mimic that of ethylene-treated wild type. Biochemical and molecular evidence suggested that some of these monomeric G proteins are of the Rab class. Expression of the genes for a number of monomeric G proteins in response to ethylene was investigated by reverse transcriptase-PCR. Rab8 and Ara3 expression was increased within 10 min of ethylene treatment, although levels fell back significantly by 40 min. In the etr1-1 mutant, expression of Rab8 was lower than wild type and unaffected by ethylene; in ctr1-1, expression of Rab8 was much higher than wild type and comparable with that seen in ethylene treatments. Expression in ctr1-1 was also unaffected by ethylene. Thus, the data indicate a role for monomeric G proteins in ethylene signal transduction. PMID:12692329

  18. Maternal transcription of non-protein coding RNAs from the PWS-critical region rescues growth retardation in mice

    PubMed Central

    Rozhdestvensky, Timofey S.; Robeck, Thomas; Galiveti, Chenna R.; Raabe, Carsten A.; Seeger, Birte; Wolters, Anna; Gubar, Leonid V.; Brosius, Jürgen; Skryabin, Boris V.

    2016-01-01

    Prader-Willi syndrome (PWS) is a neurogenetic disorder caused by loss of paternally expressed genes on chromosome 15q11-q13. The PWS-critical region (PWScr) contains an array of non-protein coding IPW-A exons hosting intronic SNORD116 snoRNA genes. Deletion of PWScr is associated with PWS in humans and growth retardation in mice exhibiting ~15% postnatal lethality in the C57BL/6 background. Here we analysed a knock-in mouse containing a 5′HPRT-LoxP-NeoR cassette (5′LoxP) inserted upstream of the PWScr. When the insertion was inherited maternally in a paternal PWScr-deletion mouse model (PWScr(p−/m5′LoxP)), we observed compensation of growth retardation and postnatal lethality. Genomic methylation pattern and expression of protein-coding genes remained unaltered at the PWS-locus of PWScr(p−/m5′LoxP) mice. Interestingly, ubiquitous Snord116 and IPW-A exon transcription from the originally silent maternal chromosome was detected. In situ hybridization indicated that PWScr(p−/m5′LoxP) mice expressed Snord116 in brain areas similar to wild type animals. Our results suggest that the lack of PWScr RNA expression in certain brain areas could be a primary cause of the growth retardation phenotype in mice. We propose that activation of disease-associated genes on imprinted regions could lead to general therapeutic strategies in associated diseases. PMID:26848093

  19. Methylation of miRNA genes and oncogenesis.

    PubMed

    Loginov, V I; Rykov, S V; Fridman, M V; Braga, E A

    2015-02-01

    Interaction between microRNA (miRNA) and messenger RNA of target genes at the posttranscriptional level provides fine-tuned dynamic regulation of cell signaling pathways. Each miRNA can be involved in regulating hundreds of protein-coding genes, and, conversely, a number of different miRNAs usually target a structural gene. Epigenetic gene inactivation associated with methylation of promoter CpG-islands is common to both protein-coding genes and miRNA genes. Here, data on functions of miRNAs in development of tumor-cell phenotype are reviewed. Genomic organization of promoter CpG-islands of the miRNA genes located in inter- and intragenic areas is discussed. The literature and our own results on frequency of CpG-island methylation in miRNA genes from tumors are summarized, and data regarding a link between such modification and changed activity of miRNA genes and, consequently, protein-coding target genes are presented. Moreover, the impact of miRNA gene methylation on key oncogenetic processes as well as affected signaling pathways is discussed.

  20. Streptococcus mutans genes that code for extracellular proteins in Escherichia coli K-12.

    PubMed

    Holt, R G; Abiko, Y; Saito, S; Smorawinska, M; Hansen, J B; Curtiss, R

    1982-10-01

    Chromosomal DNA from Streptococcus mutans 6715 (serotype g) was cloned into Escherichia coli K-12 by using the cosmid pJC74 cloning vector and a bacteriophage lambda in vitro packaging system. Rabbit antiserum against S. mutans extracellular proteins was used for immunological screening of the clone bank. Twenty-one clones produced weak to strong precipitin bands around the colonies, but only after the lambda c1857 prophage was induced by being heated to lyse the E. coli cells. None of the clones expressed enzyme activity for several known S. mutans extracellular enzymes. One of these clones contained a 45-kilobase recombinant plasmid designated pYA721. An 8.5-kilobase fragment of S. mutans DNA from pYA721 was isolated and recloned into the BamHI restriction site of the plasmid vector pACYC184 to construct pYA726. pYA726 contained all, or nearly all, of the gene for a surface protein antigen (the spaA protein) of S. mutans 6715. This was deduced from immunological studies in which extracts of cells harboring pYA726 reacted with antisera against both purified 6715 spaA protein (about 210,000 daltons) and the immunologically similar antigen I/II of serotype c strains of S. mutans. In addition, the S. mutans spaA protein was found to possess at least one antigenic determinant not present on the protein specified by pYA726. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis of E. coli clone extracts revealed that pYA726 produced a polypeptide with a molecular mass of about 180,000 daltons which was predominantly found in the periplasmic space of E. coli cells. Antisera to the spaA protein of S. mutans reacted with extracellular protein from representative strains of S. mutans serotypes a, c, d, e, f, and g, but not b.

  1. Functional characterization of the MKC1 gene of Candida albicans, which encodes a mitogen-activated protein kinase homolog related to cell integrity.

    PubMed Central

    Navarro-García, F; Sánchez, M; Pla, J; Nombela, C

    1995-01-01

    Mitogen-activated protein (MAP) kinases represent a group of serine/threonine protein kinases playing a central role in signal transduction processes in eukaryotic cells. Using a strategy based on the complementation of the thermosensitive autolytic phenotype of slt2 null mutants, we have isolated a Candida albicans homolog of Saccharomyces cerevisiae MAP kinase gene SLT2 (MPK1), which is involved in the recently outlined PKC1-controlled signalling pathway. The isolated gene, named MKC1 (MAP kinase from C. albicans), coded for a putative protein, Mkc1p, of 58,320 Da that displayed all the characteristic domains of MAP kinases and was 55% identical to S. cerevisiae Slt2p (Mpk1p). The MKC1 gene was deleted in a diploid Candida strain, and heterozygous and homozygous strains, in both Ura+ and Ura- backgrounds, were obtained to facilitate the analysis of the function of the gene. Deletion of the two alleles of the MKC1 gene gave rise to viable cells that grew at 28 and 37 degrees C but, nevertheless, displayed a variety of phenotypic traits under more stringent conditions. These included a low growth yield and a loss of viability in cultures grown at 42 degrees C, a high sensitivity to thermal shocks at 55 degrees C, an enhanced susceptibility to caffeine that was osmotically remediable, and the formation of a weak cell wall with a very low resistance to complex lytic enzyme preparations. The analysis of the functions downstream of the MKC1 gene should contribute to understanding of the connection of growth and morphogenesis in pathogenic fungi. PMID:7891715

  2. Dietary Intervention by Phytochemicals and Their Role in Modulating Coding and Non-Coding Genes in Cancer

    PubMed Central

    Budisan, Liviuta; Gulei, Diana; Zanoaga, Oana Mihaela; Irimie, Alexandra Iulia; Chira, Sergiu; Braicu, Cornelia; Gherman, Claudia Diana; Berindan-Neagoe, Ioana

    2017-01-01

    Phytochemicals are natural compounds synthesized as secondary metabolites in plants, representing an important source of molecules with a wide range of therapeutic applications. These natural agents are important regulators of key pathological processes/conditions, including cancer, as they are able to modulate the expression of coding and non-coding transcripts with an oncogenic or tumour suppressor role. These natural agents are currently exploited for the development of therapeutic strategies alone or in tandem with conventional treatments for cancer. The aim of this paper is to review the recent studies regarding the role of these natural phytochemicals in different processes related to cancer inhibition, including apoptosis activation, angiogenesis and metastasis suppression. From the large palette of phytochemicals we selected epigallocatechin gallate (EGCG), caffeic acid phenethyl ester (CAPE), genistein, morin and kaempferol, due to their increased activity in modulating multiple coding and non-coding genes, targeting the main hallmarks of cancer. PMID:28587155

  3. Dietary Intervention by Phytochemicals and Their Role in Modulating Coding and Non-Coding Genes in Cancer.

    PubMed

    Budisan, Liviuta; Gulei, Diana; Zanoaga, Oana Mihaela; Irimie, Alexandra Iulia; Sergiu, Chira; Braicu, Cornelia; Gherman, Claudia Diana; Berindan-Neagoe, Ioana

    2017-06-01

    Phytochemicals are natural compounds synthesized as secondary metabolites in plants, representing an important source of molecules with a wide range of therapeutic applications. These natural agents are important regulators of key pathological processes/conditions, including cancer, as they are able to modulate the expression of coding and non-coding transcripts with an oncogenic or tumour suppressor role. These natural agents are currently exploited for the development of therapeutic strategies alone or in tandem with conventional treatments for cancer. The aim of this paper is to review the recent studies regarding the role of these natural phytochemicals in different processes related to cancer inhibition, including apoptosis activation, angiogenesis and metastasis suppression. From the large palette of phytochemicals we selected epigallocatechin gallate (EGCG), caffeic acid phenethyl ester (CAPE), genistein, morin and kaempferol, due to their increased activity in modulating multiple coding and non-coding genes, targeting the main hallmarks of cancer.

  4. In silico study of protein to protein interaction analysis of AMP-activated protein kinase and mitochondrial activity in three different farm animal species

    NASA Astrophysics Data System (ADS)

    Prastowo, S.; Widyas, N.

    2018-03-01

    AMP-activated protein kinase (AMPK) is a cellular energy sensor that responds to ATP and AMP concentrations. This protein interacts with mitochondria in determining their activity to generate energy for cell metabolism. This paper therefore aims to compare the protein-to-protein interactions of AMPK and mitochondrial activity genes in the metabolism of three domesticated farm animals: cattle (Bos taurus), pig (Sus scrofa) and chicken (Gallus gallus). The in silico study was done using STRING V.10, a prominent protein interaction database, followed by a comparison of biological function in the KEGG PATHWAY database. A set of 12 genes was used as input for the analysis: PRKAA1, PRKAA2, PRKAB1, PRKAB2, PRKAG1, PRKAG2, PRKAG3, PPARGC1, ACC, CPT1B, NRF2 and SOD. The first 7 genes belong to the AMPK family, while the last 5 are mitochondrial activity genes. The protein interaction results show 11, 8 and 5 metabolism pathways in Bos taurus, Sus scrofa and Gallus gallus, respectively. The top pathway in Bos taurus is the AMPK signaling pathway (10 genes), in Sus scrofa the Adipocytokine signaling pathway (8 genes) and in Gallus gallus the FoxO signaling pathway (5 genes). Moreover, the pathways common to all 3 species are the Adipocytokine signaling pathway, the Insulin signaling pathway and the FoxO signaling pathway. The genes clustered in the Adipocytokine and Insulin signaling pathways are PRKAA2, PPARGC1A, PRKAB1 and PRKAG2, while those in the FoxO signaling pathway are PRKAA2, PRKAB1 and PRKAG2. Accordingly, we found that PRKAA2, PRKAB1 and PRKAG2 are the common genes. Based on the bioinformatics analysis, we can demonstrate that protein-to-protein interactions show distinct differences in metabolism between species. However, further validation is needed to give a clear explanation.
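
    The final step of this abstract, finding the genes shared by the three common pathways, amounts to a set intersection. The sketch below uses only the gene lists quoted in the abstract; the dictionary layout and the function name are assumptions made for illustration and are not part of STRING or KEGG.

```python
# Gene lists as quoted in the abstract for the three pathways common to
# cattle, pig and chicken.
pathway_genes = {
    "Adipocytokine signaling pathway": {"PRKAA2", "PPARGC1A", "PRKAB1", "PRKAG2"},
    "Insulin signaling pathway": {"PRKAA2", "PPARGC1A", "PRKAB1", "PRKAG2"},
    "FoxO signaling pathway": {"PRKAA2", "PRKAB1", "PRKAG2"},
}

def common_genes(gene_sets):
    """Return the genes present in every one of the given sets."""
    return set.intersection(*gene_sets)

shared = sorted(common_genes(pathway_genes.values()))
print(shared)
```

    The intersection contains the three genes the authors name as common; PPARGC1A drops out because it is absent from the FoxO list.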

  5. Death of a dogma: eukaryotic mRNAs can code for more than one protein.

    PubMed

    Mouilleron, Hélène; Delcourt, Vivian; Roucou, Xavier

    2016-01-08

    mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5' UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3' UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
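
    To make the idea of several coding sequences on one mRNA concrete, the sketch below scans the three forward reading frames of a short RNA string for AUG-to-stop open reading frames. It is a simplified illustration only: the toy sequence is invented, only canonical AUG starts are considered, and the function name and `min_codons` parameter are hypothetical. The studies summarized in the abstract rely on mass spectrometry and ribosome profiling, not on a scan of this kind.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}

def find_orfs(mrna, min_codons=2):
    """Return (start, end, frame) for each AUG...stop ORF in the forward frames.

    start and end are 0-based, end-exclusive indices; the stop codon is
    included in the span. min_codons counts codons before the stop.
    """
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(mrna) - 2, 3):
            codon = mrna[i:i + 3]
            if start is None and codon == "AUG":
                start = i                      # first AUG opens an ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None                   # look for the next ORF
    return orfs

# Invented 18-nt toy transcript with overlapping ORFs in frames 0 and 2.
TOY_MRNA = "AUGGCAUGCCUUAGGUGA"
for start, end, frame in find_orfs(TOY_MRNA):
    print(frame, TOY_MRNA[start:end])
```

    In the toy transcript the shorter ORF lies entirely inside the longer one but in a different frame, which is the kind of overlap a one-CDS-per-gene annotation would miss.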

  6. HMGN proteins modulate chromatin regulatory sites and gene expression during activation of naïve B cells

    PubMed Central

    Zhang, Shaofei; Zhu, Iris; Deng, Tao; Furusawa, Takashi; Rochman, Mark; Vacchio, Melanie S.; Bosselut, Remy; Yamane, Arito; Casellas, Rafael; Landsman, David; Bustin, Michael

    2016-01-01

    The activation of naïve B lymphocytes involves rapid and major changes in chromatin organization and gene expression; however, the complete repertoire of nuclear factors affecting these genomic changes is not known. We report that HMGN proteins, which bind to nucleosomes and affect chromatin structure and function, co-localize with, and maintain the intensity of, DNase I hypersensitive sites genome wide in resting but not in activated B cells. Transcription analyses of resting and activated B cells from wild-type and Hmgn−/− mice show that loss of HMGNs dampens the magnitude of the transcriptional response and alters the pattern of gene expression during the course of B-cell activation; defense response genes are most affected at the onset of activation. Our study provides insights into the biological function of the ubiquitous HMGN chromatin binding proteins and into epigenetic processes that affect the fidelity of the transcriptional response during the activation of B cell lymphocytes. PMID:27112571

  7. Differential DNA methylation profiles of coding and non-coding genes define hippocampal sclerosis in human temporal lobe epilepsy

    PubMed Central

    Miller-Delaney, Suzanne F.C.; Bryan, Kenneth; Das, Sudipto; McKiernan, Ross C.; Bray, Isabella M.; Reynolds, James P.; Gwinn, Ryder; Stallings, Raymond L.

    2015-01-01

    Temporal lobe epilepsy is associated with large-scale, wide-ranging changes in gene expression in the hippocampus. Epigenetic changes to DNA are attractive mechanisms to explain the sustained hyperexcitability of chronic epilepsy. Here, through methylation analysis of all annotated C-phosphate-G islands and promoter regions in the human genome, we report a pilot study of the methylation profiles of temporal lobe epilepsy with or without hippocampal sclerosis. Furthermore, by comparative analysis of expression and promoter methylation, we identify methylation sensitive non-coding RNA in human temporal lobe epilepsy. A total of 146 protein-coding genes exhibited altered DNA methylation in temporal lobe epilepsy hippocampus (n = 9) when compared to control (n = 5), with 81.5% of the promoters of these genes displaying hypermethylation. Unique methylation profiles were evident in temporal lobe epilepsy with or without hippocampal sclerosis, in addition to a common methylation profile regardless of pathology grade. Gene ontology terms associated with development, neuron remodelling and neuron maturation were over-represented in the methylation profile of Watson Grade 1 samples (mild hippocampal sclerosis). In addition to genes associated with neuronal, neurotransmitter/synaptic transmission and cell death functions, differential hypermethylation of genes associated with transcriptional regulation was evident in temporal lobe epilepsy, but overall few genes previously associated with epilepsy were among the differentially methylated. Finally, a panel of 13 methylation-sensitive microRNAs was identified in temporal lobe epilepsy, including MIR27A, miR-193a-5p (MIR193A) and miR-876-3p (MIR876), and the differential methylation of long non-coding RNA was documented for the first time. The present study therefore reports select, genome-wide DNA methylation changes in human temporal lobe epilepsy that may contribute to the molecular architecture of the epileptic brain. PMID

  8. Genes encoding cuticular proteins are components of the Nimrod gene cluster in Drosophila.

    PubMed

    Cinege, Gyöngyi; Zsámboki, János; Vidal-Quadras, Maite; Uv, Anne; Csordás, Gábor; Honti, Viktor; Gábor, Erika; Hegedűs, Zoltán; Varga, Gergely I B; Kovács, Attila L; Juhász, Gábor; Williams, Michael J; Andó, István; Kurucz, Éva

    2017-08-01

    The Nimrod gene cluster, located on the second chromosome of Drosophila melanogaster, is the largest syntenic unit of the Drosophila genome. Nimrod genes show blood cell specific expression and code for phagocytosis receptors that play a major role in fruit fly innate immune functions. We previously identified three homologous genes (vajk-1, vajk-2 and vajk-3) located within the Nimrod cluster, which are unrelated to the Nimrod genes, but are homologous to a fourth gene (vajk-4) located outside the cluster. Here we show that, unlike the Nimrod candidates, the Vajk proteins are expressed in cuticular structures of the late embryo and the late pupa, indicating that they contribute to cuticular barrier functions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. The Epstein–Barr virus nuclear protein SM is both a post-transcriptional inhibitor and activator of gene expression

    PubMed Central

    Ruvolo, Vivian; Wang, Eryu; Boyle, Sarah; Swaminathan, Sankar

    1998-01-01

    The Epstein–Barr virus (EBV) nuclear protein BS-MLF1 (SM) is expressed early after entry of EBV into the lytic cycle. SM transactivates reporter gene constructs driven by a wide variety of promoters, but the mechanism of SM action is poorly understood. In this study, we demonstrate that the SM protein inhibits expression of intron-containing genes and activates expression of intron-less genes. We demonstrate that SM has the predicted inhibitory effect on expression of a spliced EBV gene but activates an unspliced early EBV gene. SM inhibited gene expression at the post-transcriptional level by preventing the accumulation of nuclear and cytoplasmic RNA transcripts. Conversely, SM led to increased accumulation of nuclear mRNA from intron-less genes without affecting the rate of transcription, indicating that SM enhances nuclear RNA stability. The ratio of cytoplasmic to nuclear polyadenylated mRNA was increased in the presence of SM, suggesting that SM also enhances nucleo-cytoplasmic mRNA transport. The degree of transactivation by SM was dependent on the sequence of the 3′-untranslated region of the target mRNA. Finally, we demonstrate that the amino-terminal portion of SM fused to glutathione-S-transferase binds radioactively labeled RNA in vitro, indicating that SM is a single-stranded RNA binding protein. Importantly, the latent and immediate-early genes of EBV contain introns whereas many early and late genes do not. Thus, SM may down-regulate synthesis of host cell proteins and latent EBV proteins while simultaneously enhancing expression of specific lytic EBV genes by binding to mRNA and modulating its stability and transport. PMID:9671768

  10. The Epstein-Barr virus nuclear protein SM is both a post-transcriptional inhibitor and activator of gene expression.

    PubMed

    Ruvolo, V; Wang, E; Boyle, S; Swaminathan, S

    1998-07-21

    The Epstein-Barr virus (EBV) nuclear protein BS-MLF1 (SM) is expressed early after entry of EBV into the lytic cycle. SM transactivates reporter gene constructs driven by a wide variety of promoters, but the mechanism of SM action is poorly understood. In this study, we demonstrate that the SM protein inhibits expression of intron-containing genes and activates expression of intron-less genes. We demonstrate that SM has the predicted inhibitory effect on expression of a spliced EBV gene but activates an unspliced early EBV gene. SM inhibited gene expression at the post-transcriptional level by preventing the accumulation of nuclear and cytoplasmic RNA transcripts. Conversely, SM led to increased accumulation of nuclear mRNA from intron-less genes without affecting the rate of transcription, indicating that SM enhances nuclear RNA stability. The ratio of cytoplasmic to nuclear polyadenylated mRNA was increased in the presence of SM, suggesting that SM also enhances nucleo-cytoplasmic mRNA transport. The degree of transactivation by SM was dependent on the sequence of the 3'-untranslated region of the target mRNA. Finally, we demonstrate that the amino-terminal portion of SM fused to glutathione-S-transferase binds radioactively labeled RNA in vitro, indicating that SM is a single-stranded RNA binding protein. Importantly, the latent and immediate-early genes of EBV contain introns whereas many early and late genes do not. Thus, SM may down-regulate synthesis of host cell proteins and latent EBV proteins while simultaneously enhancing expression of specific lytic EBV genes by binding to mRNA and modulating its stability and transport.

  11. Novel coding, translation, and gene expression of a replicating covalently closed circular RNA of 220 nt.

    PubMed

    AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer

    2014-10-07

    The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap in the sequence UGAUGA (the initiation codon AUG occupies positions 3-5 of this combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact "nanogenome."
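
    The frame shift at the end of each round follows from the genome length: 220 is not a multiple of 3, so a ribosome that keeps reading around the circle re-enters it one position out of register on each lap. The sketch below illustrates this with a made-up 7-nt circle (7 leaves the same remainder as 220 when divided by 3); it is not the virusoid sequence, and the function names are hypothetical.

```python
def frame_after_laps(genome_length, laps):
    """Reading-frame offset (0, 1 or 2) after reading `laps` full circles."""
    return (genome_length * laps) % 3

def circular_codons(seq, start, n_codons):
    """Read n_codons consecutive triplets from a circular sequence."""
    n = len(seq)
    return [
        "".join(seq[(start + 3 * k + j) % n] for j in range(3))
        for k in range(n_codons)
    ]

# For the 220-nt RNA: one lap shifts the frame by 1, two laps by 2,
# and three laps (660 nt, 220 codons) return to the original frame.
print([frame_after_laps(220, laps) for laps in (1, 2, 3)])

# Toy 7-nt circle (invented): three laps read 7 codons, one in every register.
TOY_CIRCLE = "AUGGCAU"
print(circular_codons(TOY_CIRCLE, 0, len(TOY_CIRCLE)))

# Position of AUG inside the combined initiation-termination motif (0-based).
print("UGAUGA".find("AUG"))
```

    The same arithmetic is why the abstract can describe completely overlapping ORFs on a molecule with no noncoding sequence: every nucleotide is read in more than one register as translation continues around the circle.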

  12. The Complete Mitochondrial Genome of the Land Snail Cornu aspersum (Helicidae: Mollusca): Intra-Specific Divergence of Protein-Coding Genes and Phylogenetic Considerations within Euthyneura

    PubMed Central

    Gaitán-Espitia, Juan Diego; Nespolo, Roberto F.; Opazo, Juan C.

    2013-01-01

    The complete sequences of three mitochondrial genomes from the land snail Cornu aspersum were determined. The mitogenome has a length of 14050 bp, and it encodes 13 protein-coding genes, 22 transfer RNA genes and two ribosomal RNA genes. It also includes nine small intergenic spacers and a large AT-rich intergenic spacer. The intra-specific divergence analysis revealed that COX1 has the lowest genetic differentiation, while the most divergent genes were NADH1, NADH3 and NADH4. With the exception of Euhadra herklotsi, the structural comparisons showed the same gene order within the family Helicidae, and nearly identical gene organization to that found in order Pulmonata. Phylogenetic reconstruction recovered Basommatophora as a polyphyletic group, and Eupulmonata and Pulmonata as paraphyletic groups. Bayesian and Maximum Likelihood analyses showed that C. aspersum is a close relative of Cepaea nemoralis, and that together with the other Helicidae species they form a sister group of Albinaria caerulea, supporting the monophyly of the Stylommatophora clade. PMID:23826260

  13. A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

    PubMed

    Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

    2012-01-01

    Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.

  14. The Ever-Evolving Concept of the Gene: The Use of RNA/Protein Experimental Techniques to Understand Genome Functions

    PubMed Central

    Cipriano, Andrea; Ballarino, Monica

    2018-01-01

    The completion of the human genome sequence together with advances in sequencing technologies have shifted the paradigm of the genome, as composed of discrete and hereditable coding entities, and have shown the abundance of functional noncoding DNA. This part of the genome, previously dismissed as “junk” DNA, increases proportionally with organismal complexity and contributes to gene regulation beyond the boundaries of known protein-coding genes. Different classes of functionally relevant nonprotein-coding RNAs are transcribed from noncoding DNA sequences. Among them are the long noncoding RNAs (lncRNAs), which are thought to participate in the basal regulation of protein-coding genes at both transcriptional and post-transcriptional levels. Although knowledge of this field is still limited, the ability of lncRNAs to localize in different cellular compartments, to fold into specific secondary structures and to interact with different molecules (RNA or proteins) endows them with multiple regulatory mechanisms. It is becoming evident that lncRNAs may play a crucial role in most biological processes such as the control of development, differentiation and cell growth. This review places the evolution of the concept of the gene in its historical context, from Darwin's hypothetical mechanism of heredity to the post-genomic era. We discuss how the original idea of protein-coding genes as unique determinants of phenotypic traits has been reconsidered in light of the existence of noncoding RNAs. We summarize the technological developments which have been made in the genome-wide identification and study of lncRNAs and emphasize the methodologies that have aided our understanding of the complexity of lncRNA-protein interactions in recent years. PMID:29560353

  15. The equine herpesvirus-1 IR3 gene that lies antisense to the sole immediate-early (IE) gene is trans-activated by the IE protein, and is poorly expressed to a protein

    PubMed Central

    Ahn, Byung Chul; Breitenbach, Jonathan E.; Kim, Seong K.; O’Callaghan, Dennis J.

    2007-01-01

    The unique IR3 gene of equine herpesvirus 1 (EHV-1) is expressed as a late 1.0-kb transcript. Previous studies confirmed the IR3 transcription initiation site and tentatively identified other cis-acting elements specific to IR3 such as a TATA box, a 443 base pair 5′untranslated region (UTR), a 285 base pair open reading frame (ORF) and a poly adenylation (A) signal (Holden et al., 1992 DNA Seq 3, 143-52). Transient transfection assays revealed that the IR3 promoter is strongly trans-activated by the IE protein (IEP) and that coexpression of the IEP with the early EICP0 and IR4 regulatory proteins results in maximal trans-activation of the IR3 promoter. Gel shift assays revealed that the IEP directly binds to the IR3 promoter region. Western blot analysis showed that the IR3 protein produced in E. coli was detected by antibodies to IR3 synthetic peptides; however, the IR3 protein was not detected in EHV-1 infected cell extracts by these same anti-IR3 antibodies, even though the IR3 transcript was detected by northern blot. These findings suggest that the IR3 may not be expressed to a protein. Expression of an IR3/GFP fusion gene was not observed, but expression of a GFP/IR3 fusion gene was detected by fluorescent microscopy. In further attempts to detect the IR3/GFP fusion protein using anti-GFP antibody, western blot analysis showed that the IR3/GFP fusion protein was not detected in vivo. Interestingly, a truncated form of the GFP/IR3 protein was synthesized from the GFP/IR3 fusion gene. However, GFP/IR3 and IR3/GFP fusion proteins of the predicted sizes were synthesized by in vitro coupled transcription and translation of the fusion genes, suggesting poor expression of the IR3 protein in vivo. The possible role of the IR3 transcript in EHV-1 infection is discussed. PMID:17306852

  16. Gene 2 of the sigma rhabdovirus genome encodes the P protein, and gene 3 encodes a protein related to the reverse transcriptase of retroelements.

    PubMed

    Landès-Devauchelle, C; Bras, F; Dezélée, S; Teninges, D

    1995-11-10

    The nucleotide sequence of the genes 2 and 3 of the Drosophila rhabdovirus sigma was determined from cDNAs to viral genome and poly(A)+ mRNAs. Gene 2 comprises 1032 nucleotides and contains a long ORF encoding a molecular weight 35,208 polypeptide present in infected cells and in virions which migrates in SDS-PAGE as a doublet of M(r) about 60 kDa. The distribution of acidic charges as well as the electrophoretic properties of the protein are characteristic of the rhabdovirus P proteins. Gene 3 comprises 923 nucleotides and contains a long ORF capable of coding a polypeptide of 298 amino acids of MW 33,790. The putative protein (PP3) is similar in size to a minor component of the virions. Computer analysis shows that the sequence of PP3 contains three motifs related to the conserved motifs of reverse transcriptases.

  17. Foxo3 activity promoted by non-coding effects of circular RNA and Foxo3 pseudogene in the inhibition of tumor growth and angiogenesis.

    PubMed

    Yang, W; Du, W W; Li, X; Yee, A J; Yang, B B

    2016-07-28

    It has recently been shown that the upregulation of a pseudogene specific to a protein-coding gene could function as a sponge to bind multiple potential targeting microRNAs (miRNAs), resulting in increased gene expression. Similarly, it was recently demonstrated that circular RNAs can function as sponges for miRNAs, and could upregulate expression of mRNAs containing an identical sequence. Furthermore, some mRNAs are now known to not only translate protein, but also function to sponge miRNA binding, facilitating gene expression. Collectively, these appear to be effective mechanisms to ensure gene expression and protein activity. Here we show that expression of a member of the forkhead family of transcription factors, Foxo3, is regulated by the Foxo3 pseudogene (Foxo3P), and Foxo3 circular RNA, both of which bind to eight miRNAs. We found that the ectopic expression of the Foxo3P, Foxo3 circular RNA and Foxo3 mRNA could all suppress tumor growth and cancer cell proliferation and survival. Our results showed that at least three mechanisms are used to ensure protein translation of Foxo3, which reflects an essential role of Foxo3 and its corresponding non-coding RNAs.

  18. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    PubMed

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.
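
    The random walk with restart (RWR) step named in this abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy network, the gene labels G1 to G4, the restart probability of 0.3, and the function name are all assumptions chosen for illustration. The walker starts from known disease genes (seeds), moves to neighbours in proportion to edge weight, and jumps back to the seeds with a fixed probability; the steady-state visiting probabilities are then used to rank candidate genes.

```python
def random_walk_with_restart(adj, seeds, restart=0.3, tol=1e-10, max_iter=1000):
    """Rank nodes by steady-state RWR probability.

    adj   : dict mapping node -> {neighbour: edge weight}; assumed symmetric
            (every neighbour also appears as a key).
    seeds : set of starting nodes (e.g. known disease genes).
    """
    nodes = sorted(adj)
    out_weight = {u: sum(adj[u].values()) for u in nodes}
    # Initial distribution: probability mass spread evenly over the seeds.
    p0 = {u: (1.0 / len(seeds) if u in seeds else 0.0) for u in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Restart term: with probability `restart`, jump back to the seeds.
        nxt = {u: restart * p0[u] for u in nodes}
        for u in nodes:
            if out_weight[u] == 0:
                # Isolated node: return its mass to the seeds so the total stays 1.
                for s in seeds:
                    nxt[s] += (1 - restart) * p[u] / len(seeds)
                continue
            # Walk term: move to each neighbour in proportion to edge weight.
            for v, w in adj[u].items():
                nxt[v] += (1 - restart) * p[u] * w / out_weight[u]
        change = sum(abs(nxt[u] - p[u]) for u in nodes)
        p = nxt
        if change < tol:
            break
    return p


# Hypothetical four-gene chain G1 - G2 - G3 - G4 with unit edge weights.
toy_network = {
    "G1": {"G2": 1.0},
    "G2": {"G1": 1.0, "G3": 1.0},
    "G3": {"G2": 1.0, "G4": 1.0},
    "G4": {"G3": 1.0},
}
scores = random_walk_with_restart(toy_network, seeds={"G1"})
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

    In an integrated setting such as the one described, edges from two networks could in principle be merged into one adjacency dictionary before running the walk; how the authors combined the HPRD PPI and GGCRN networks is not stated here.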

  19. Gene cloning and prokaryotic expression of recombinant outer membrane protein from Vibrio parahaemolyticus

    NASA Astrophysics Data System (ADS)

    Yuan, Ye; Wang, Xiuli; Guo, Sheping; Qiu, Xuemei

    2011-06-01

    Gram-negative Vibrio parahaemolyticus is a common pathogen in humans and marine animals. The outer membrane protein of bacteria plays an important role in the infection and pathogenicity to the host. Thus, the outer membrane proteins are an ideal target for vaccines. We amplified a complete outer membrane protein gene (ompW) from V. parahaemolyticus ATCC 17802. We then cloned the gene and expressed it in Escherichia coli BL21 (DE3) cells. The gene coded for a 42.78 kDa protein. We purified the protein using Ni-NTA affinity chromatography and detected it by anti-His antibody Western blotting. Our results provide a basis for future application of the OmpW protein as a vaccine candidate against infection by V. parahaemolyticus. In addition, the purified OmpW protein can be used for further functional and structural studies.

  20. RNA editing differently affects protein-coding genes in D. melanogaster and H. sapiens.

    PubMed

    Grassi, Luigi; Leoni, Guido; Tramontano, Anna

    2015-07-14

    When an RNA editing event occurs within a coding sequence it can lead to a different encoded amino acid. The biological significance of these events remains an open question: they can modulate protein functionality, increase the complexity of transcriptomes or arise from a loose specificity of the involved enzymes. We analysed the editing events in coding regions that do or do not produce a change in the encoded amino acid (nonsynonymous and synonymous events, respectively) in D. melanogaster and in H. sapiens and compared them with the appropriate random models. Interestingly, our results show that the phenomenon has rather different characteristics in the two organisms. For example, we confirm the observation that editing events occur more frequently in non-coding than in coding regions, and report that this effect is much more evident in H. sapiens. Additionally, in this latter organism, editing events tend to affect less conserved residues. The less frequently occurring editing events in Drosophila tend to avoid drastic amino acid changes. Interestingly, we find that, in Drosophila, changes from less frequently used codons to more frequently used ones are favoured, while this is not the case in H. sapiens.
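
    The synonymous versus nonsynonymous distinction used in this abstract can be sketched as a codon lookup. The function name and the example codons below are illustrative assumptions, not taken from the study; the sketch also assumes a single-base substitution read against the standard genetic code (for instance, an edited adenosine read as guanosine).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_edit(codon, position, new_base):
    """Classify a single-base edit within one codon.

    codon    : three-letter codon (DNA or RNA alphabet).
    position : 0-based index of the edited base within the codon.
    new_base : base that the edited site is read as.
    Returns "synonymous" if the encoded amino acid is unchanged,
    otherwise "nonsynonymous".
    """
    codon = codon.upper().replace("U", "T")
    new_base = new_base.upper().replace("U", "T")
    edited = codon[:position] + new_base + codon[position + 1:]
    before = CODON_TABLE[codon]
    after = CODON_TABLE[edited]
    return "synonymous" if before == after else "nonsynonymous"


# AAA and AAG both encode lysine, so an edit at the third position is silent.
print(classify_edit("AAA", 2, "G"))
# CAG encodes glutamine and CGG encodes arginine, so this edit changes the protein.
print(classify_edit("CAG", 1, "G"))
```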

  1. In silico screening of the chicken genome for overlaps between genomic regions: microRNA genes, coding and non-coding transcriptional units, QTL, and genetic variations.

    PubMed

    Zorc, Minja; Kunej, Tanja

    2016-05-01

    MicroRNAs (miRNAs) are a class of non-coding RNAs involved in posttranscriptional regulation of target genes. Regulation requires complementarity between target mRNA and the mature miRNA seed region, responsible for their recognition and binding. It has been estimated that each miRNA targets approximately 200 genes, and genetic variability of miRNA genes has been reported to affect phenotypic variability and disease susceptibility in humans, livestock species, and model organisms. Polymorphisms in miRNA genes could therefore represent biomarkers for phenotypic traits in livestock animals. In our previous study, we collected polymorphisms within miRNA genes in chicken. In the present study, we identified miRNA-related genomic overlaps to prioritize genomic regions of interest for further functional studies and biomarker discovery. Overlapping genomic regions in chicken were analyzed using the following bioinformatics tools and databases: miRNA SNiPer, Ensembl, miRBase, NCBI Blast, and QTLdb. Out of 740 known pre-miRNA genes, 263 (35.5 %) contain polymorphisms; among them, 35 contain more than three polymorphisms. The most polymorphic miRNA genes in chicken are gga-miR-6662, containing 23 single nucleotide polymorphisms (SNPs) within the pre-miRNA region, including five consecutive SNPs, and gga-miR-6688, containing ten polymorphisms including three consecutive polymorphisms. Several miRNA-related genomic hotspots have been revealed in chicken genome; polymorphic miRNA genes are located within protein-coding and/or non-coding transcription units and quantitative trait loci (QTL) associated with production traits. The present study includes the first description of an exonic miRNA in a chicken genome, an overlap between the miRNA gene and the exon of the protein-coding gene (gga-miR-6578/HADHB), and the first report of a missense polymorphism located within a mature miRNA seed region. Identified miRNA-related genomic hotspots in chicken can serve researchers as a
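
    The idea of screening for overlaps between genomic regions can be sketched as an interval comparison. This is only an illustration and does not reproduce the tools listed in the abstract: the feature names and coordinates below are invented, and the sketch assumes 1-based, inclusive coordinates on named chromosomes.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    name: str
    chrom: str
    start: int  # 1-based, inclusive (assumed convention)
    end: int    # 1-based, inclusive


def overlaps(a, b):
    """Two features overlap if they share a chromosome and at least one base."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end


def find_overlaps(queries, targets):
    """Return (query name, target name) pairs for every overlapping pair."""
    return [
        (q.name, t.name)
        for q in queries
        for t in targets
        if overlaps(q, t)
    ]


# Hypothetical features; names and coordinates are made up for illustration.
mirnas = [
    Feature("mirna_a", "chr3", 1050, 1130),
    Feature("mirna_b", "chr3", 9000, 9080),
]
exons = [
    Feature("gene_x_exon2", "chr3", 1000, 1200),
    Feature("gene_y_exon1", "chr5", 1000, 1200),
]
print(find_overlaps(mirnas, exons))
```

    The all-against-all comparison is adequate for a few hundred features; a genome-scale screen would more likely sort the intervals or use an interval tree.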

  2. Supplementation of chitosan alleviates high-fat diet-enhanced lipogenesis in rats via adenosine monophosphate (AMP)-activated protein kinase activation and inhibition of lipogenesis-associated genes.

    PubMed

    Chiu, Chen-Yuan; Chan, Im-Lam; Yang, Tsung-Han; Liu, Shing-Hwa; Chiang, Meng-Tsan

    2015-03-25

    This study investigated the role of chitosan in lipogenesis in high-fat diet-induced obese rats. The lipogenesis-associated genes and their upstream regulatory proteins were explored. Dietary chitosan supplementation efficiently reduced the elevated body, liver, and adipose tissue weights of high-fat diet-fed rats. Chitosan supplementation significantly raised the lipolysis rate; attenuated the adipocyte hypertrophy, triglyceride accumulation, and lipoprotein lipase activity in epididymal adipose tissues; and decreased hepatic enzyme activities of lipid biosynthesis. Chitosan supplementation significantly activated adenosine monophosphate (AMP)-activated protein kinase (AMPK) phosphorylation and attenuated high-fat diet-induced protein expression of lipogenic transcription factors (PPAR-γ and SREBP1c) in livers and adipose tissues. Moreover, chitosan supplementation significantly inhibited the expression of downstream lipogenic genes (FAS, HMGCR, FATP1, and FABP4) in livers and adipose tissues of high-fat diet-fed rats. These results demonstrate for the first time that chitosan supplementation alleviates high-fat diet-enhanced lipogenesis in rats via AMPK activation and lipogenesis-associated gene inhibition.

  3. Novel coding, translation, and gene expression of a replicating covalently closed circular RNA of 220 nt

    PubMed Central

    AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer

    2014-01-01

    The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap UGAUGA (underline highlights the initiation codon AUG within the combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact “nanogenome.” PMID:25253891

  4. Regulation of Lactobacillus casei Sorbitol Utilization Genes Requires DNA-Binding Transcriptional Activator GutR and the Conserved Protein GutM

    PubMed Central

    Alcántara, Cristina; Sarmiento-Rubiano, Luz Adriana; Monedero, Vicente; Deutscher, Josef; Pérez-Martínez, Gaspar; Yebra, María J.

    2008-01-01

    Sequence analysis of the five genes (gutRMCBA) downstream from the previously described sorbitol-6-phosphate dehydrogenase-encoding Lactobacillus casei gutF gene revealed that they constitute a sorbitol (glucitol) utilization operon. The gutRM genes encode putative regulators, while the gutCBA genes encode the EIIC, EIIBC, and EIIA proteins of a phosphoenolpyruvate-dependent sorbitol phosphotransferase system (PTSGut). The gut operon is transcribed as a polycistronic gutFRMCBA messenger, the expression of which is induced by sorbitol and repressed by glucose. gutR encodes a transcriptional regulator with two PTS-regulated domains, a galactitol-specific EIIB-like domain (EIIBGat domain) and a mannitol/fructose-specific EIIA-like domain (EIIAMtl domain). Its inactivation abolished gut operon transcription and sorbitol uptake, indicating that it acts as a transcriptional activator. In contrast, cells carrying a gutB mutation expressed the gut operon constitutively, but they failed to transport sorbitol, indicating that EIIBCGut negatively regulates GutR. A footprint analysis showed that GutR binds to a 35-bp sequence upstream from the gut promoter. A sequence comparison with the presumed promoter region of gut operons from various firmicutes revealed a GutR consensus motif that includes an inverted repeat. The regulation mechanism of the L. casei gut operon is therefore likely to be operative in other firmicutes. Finally, gutM codes for a conserved protein of unknown function present in all sequenced gut operons. A gutM mutant, the first constructed in a firmicute, showed drastically reduced gut operon expression and sorbitol uptake, indicating a regulatory role also for GutM. PMID:18676710

  5. Cell cycle, oncogenic and tumor suppressor pathways regulate numerous long and macro non-protein-coding RNAs

    PubMed Central

    2014-01-01

    Background: The genome is pervasively transcribed but most transcripts do not code for proteins, constituting non-protein-coding RNAs. Despite increasing numbers of functional reports of individual long non-coding RNAs (lncRNAs), assessing the extent of functionality among the non-coding transcriptional output of mammalian cells remains intricate. In the protein-coding world, transcripts differentially expressed in the context of processes essential for the survival of multicellular organisms have been instrumental in the discovery of functionally relevant proteins and their deregulation is frequently associated with diseases. We therefore systematically identified lncRNAs expressed differentially in response to oncologically relevant processes and cell-cycle, p53 and STAT3 pathways, using tiling arrays. Results: We found that up to 80% of the pathway-triggered transcriptional responses are non-coding. Among these we identified very large macroRNAs with pathway-specific expression patterns and demonstrated that these are likely continuous transcripts. MacroRNAs contain elements conserved in mammals and sauropsids, which in part exhibit conserved RNA secondary structure. Comparing evolutionary rates of a macroRNA to adjacent protein-coding genes suggests a local action of the transcript. Finally, in different grades of astrocytoma, a tumor disease unrelated to the initially used cell lines, macroRNAs are differentially expressed. Conclusions: It has been shown previously that the majority of expressed non-ribosomal transcripts are non-coding. We now conclude that differential expression triggered by signaling pathways gives rise to a similar abundance of non-coding content. It is thus unlikely that the prevalence of non-coding transcripts in the cell is a trivial consequence of leaky or random transcription events. PMID:24594072

  6. Use of fluorescent proteins and color-coded imaging to visualize cancer cells with different genetic properties.

    PubMed

    Hoffman, Robert M

    2016-03-01

    Fluorescent proteins, which are very bright and available in spectrally distinct colors, enable the imaging of color-coded cancer cells growing in vivo and therefore the distinction of cancer cells with different genetic properties. Non-invasive and intravital imaging of cancer cells with fluorescent proteins allows the visualization of distinct genetic variants of cancer cells down to the cellular level in vivo. Cancer cells with increased or decreased ability to metastasize can be distinguished in vivo. Gene exchange in vivo, which enables low-metastatic cancer cells to convert to high-metastatic ones, can be visualized by color-coded imaging. Cancer stem-like and non-stem cells can be distinguished in vivo by color-coded imaging. These properties also demonstrate the vast superiority of imaging cancer cells in vivo with fluorescent proteins over photon counting of luciferase-labeled cancer cells.

  7. Comparative architecture of silks, fibrous proteins and their encoding genes in insects and spiders.

    PubMed

    Craig, Catherine L; Riekel, Christian

    2002-12-01

    The known silk fibroins and fibrous glues are thought to be encoded by members of the same gene family. All silk fibroins sequenced to date contain regions of long-range order (crystalline regions) and/or short-range order (non-crystalline regions). All of the sequenced fibroin silks (Flag or silk from flagelliform gland in spiders; Fhc or heavy chain fibroin silks produced by Lepidoptera larvae) are made up of hierarchically organized, repetitive arrays of amino acids. Fhc fibroin genes are characterized by a similar molecular genetic architecture of two exons and one intron, but the organization and size of these units differ. The Flag, Ser (sericin gene) and BR (Balbiani ring genes; both fibrous proteins) genes are made up of multiple exons and introns. Sequences coding for crystalline and non-crystalline protein domains are integrated in the repetitive regions of Fhc and MA exons, but not in the protein glues Ser1 and BR-1. Genetic 'hot-spots' promote recombination errors in Fhc, MA, and Flag. Codon bias, structural constraint, point mutations, and shortened coding arrays may be alternative means of stabilizing precursor mRNA transcripts. Differential regulation of gene expression and selective splicing of the mRNA transcript may allow rapid adaptation of silk functional properties to different physical environments.

  8. Light-Regulated Transcription of Genes Encoding Peridinin Chlorophyll a Proteins and the Major Intrinsic Light-Harvesting Complex Proteins in the Dinoflagellate Amphidinium carterae Hulburt (Dinophycae)

    PubMed Central

    ten Lohuis, Michael R.; Miller, David J.

    1998-01-01

    In the dinoflagellate Amphidinium carterae, photoadaptation involves changes in the transcription of genes encoding both of the major classes of light-harvesting proteins, the peridinin chlorophyll a proteins (PCPs) and the major a/c-containing intrinsic light-harvesting proteins (LHCs). PCP and LHC transcript levels increased up to 86- and 6-fold, respectively, under low-light conditions relative to cells grown at high illumination. These increases in transcript abundance were accompanied by decreases in the extent of methylation of CpG and CpNpG motifs within or near PCP- and LHC-coding regions. Cytosine methylation levels in A. carterae are therefore nonstatic and may vary with environmental conditions in a manner suggestive of involvement in the regulation of gene expression. However, chemically induced undermethylation was insufficient to activate transcription, because treatment with two methylation inhibitors had no effect on PCP mRNA or protein levels. Regulation of gene activity through changes in DNA methylation has traditionally been assumed to be restricted to higher eukaryotes (deuterostomes and green plants); however, the atypically large genomes of dinoflagellates may have generated the requirement for systems of this type in a relatively “primitive” organism. Dinoflagellates may therefore provide a unique perspective on the evolution of eukaryotic DNA-methylation systems. PMID:9576788

  9. A chromatin activity based chemoproteomic approach reveals a transcriptional repressome for gene-specific silencing

    PubMed Central

    Liu, Cui; Yu, Yanbao; Liu, Feng; Wei, Xin; Wrobel, John A.; Gunawardena, Harsha P.; Zhou, Li; Jin, Jian; Chen, Xian

    2015-01-01

    Immune cells develop endotoxin tolerance (ET) after prolonged stimulation. ET increases the level of the repressive mark H3K9me2 in the transcriptionally silent chromatin specifically associated with pro-inflammatory genes. However, it is not clear what proteins are functionally involved in this process. Here we show that a novel chromatin activity based chemoproteomic (ChaC) approach can dissect the functional chromatin protein complexes that regulate ET-associated inflammation. Using UNC0638, which binds the enzymatically active H3K9-specific methyltransferase G9a/GLP, ChaC reveals that G9a is constitutively active at a G9a-dependent mega-dalton repressome in primary endotoxin-tolerant macrophages. G9a/GLP broadly impacts the ET-specific reprogramming of the histone code landscape, chromatin remodeling, and the activities of select transcription factors. We discover that the G9a-dependent epigenetic environment promotes the transcriptional repression activity of c-Myc for gene-specific co-regulation of chronic inflammation. ChaC may also be applicable to dissect other functional protein complexes in the context of phenotypic chromatin architectures. PMID:25502336

  10. Deep Sequencing Reveals Uncharted Isoform Heterogeneity of the Protein-Coding Transcriptome in Cerebral Ischemia.

    PubMed

    Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh

    2018-06-03

    Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.

  11. ProClaT, a new bioinformatics tool for in silico protein reclassification: case study of DraB, a protein coded from the draTGB operon in Azospirillum brasilense.

    PubMed

    Rubel, Elisa Terumi; Raittz, Roberto Tadeu; Coimbra, Nilson Antonio da Rocha; Gehlen, Michelly Alves Coutinho; Pedrosa, Fábio de Oliveira

    2016-12-15

    Azospirillum brasilense is a plant-growth-promoting, nitrogen-fixing bacterium that is used as a bio-fertilizer in agriculture. Since nitrogen fixation has a high energy demand, the reduction of N2 to NH4+ by nitrogenase occurs only under limiting conditions of NH4+ and O2. Moreover, the synthesis and activity of nitrogenase is highly regulated to prevent energy waste. In A. brasilense nitrogenase activity is regulated by the products of draG and draT. The product of the draB gene, located downstream in the draTGB operon, may be involved in the regulation of nitrogenase activity by an as yet unknown mechanism. A deep in silico analysis of the product of draB was undertaken to suggest its possible function and involvement with DraT and DraG in the regulation of nitrogenase activity in A. brasilense. In this work, we present a new artificial intelligence strategy for protein classification, named ProClaT. The features used by the pattern recognition model were derived from the primary structure of the DraB homologous proteins, calculated by a ProClaT internal algorithm. ProClaT was applied to this case study and the results revealed that the A. brasilense draB gene codes for a protein highly similar to the nitrogenase associated NifO protein of Azotobacter vinelandii. This tool allowed the reclassification of DraB/NifO homologous proteins, hypothetical, conserved hypothetical and those annotated as putative arsenate reductase, ArsC, as NifO-like. An analysis of co-occurrence of draB, draT, draG and of other nif genes was performed, suggesting the involvement of draB (nifO) in nitrogen fixation, although without defining a specific function.

  12. Evaluating the protein coding potential of exonized transposable element sequences

    PubMed Central

    Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King

    2007-01-01

    encode protein sequences. Conclusion: The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggest that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258

  13. Two novel heat shock genes encoding proteins produced in response to heterologous protein expression in Escherichia coli.

    PubMed Central

    Allen, S P; Polazzi, J O; Gierse, J K; Easton, A M

    1992-01-01

    In Escherichia coli high-level production of some heterologous proteins (specifically, human prorenin, renin, and bovine insulin-like growth factor 2) resulted in the induction of two new E. coli heat shock proteins, both of which have molecular masses of 16 kDa and are tightly associated with inclusion bodies formed during heterologous protein production. We named these inclusion body-associated proteins IbpA and IbpB. The coding sequences for IbpA and IbpB were identified and isolated from the Kohara E. coli gene bank. The genes for these proteins (ibpA and ibpB) are located at 82.5 min on the chromosome. Nucleotide sequencing of the two genes revealed that they are transcribed in the same direction and are separated by 110 bp. Putative Shine-Dalgarno sequences are located upstream from the initiation codons of both genes. A putative heat shock promoter is located upstream from ibpA, and a putative transcription terminator is located downstream from ibpB. A temperature upshift experiment in which we used a wild-type E. coli strain and an isogenic rpoH mutant strain indicated that a sigma 32-containing RNA polymerase is involved in the regulation of expression of these genes. There is 57.5% identity between the genes at the nucleotide level and 52.2% identity at the amino acid level. A search of the protein data bases showed that both of these 16-kDa proteins exhibit low levels of homology to low-molecular-weight heat shock proteins from eukaryotic species. PMID:1356969

  14. Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.

    PubMed

    Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil

    2007-11-29

    Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication; all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constraints.

  15. Non-homeodomain regions of Hox proteins mediate activation versus repression of Six2 via a single enhancer site in vivo

    PubMed Central

    Yallowitz, Alisha R.; Gong, Ke-Qin; Swinehart, Ilea T.; Nelson, Lisa T.; Wellik, Deneen M.

    2009-01-01

    Summary: Hox genes control many developmental events along the AP axis, but few target genes have been identified. Whether target genes are activated or repressed, what enhancer elements are required for regulation, and how different domains of the Hox proteins contribute to regulatory specificity are poorly understood. Six2 is genetically downstream of both the Hox11 paralogous genes in the developing mammalian kidney and Hoxa2 in branchial arch and facial mesenchyme. Loss-of-function of Hox11 leads to loss of Six2 expression and loss-of-function of Hoxa2 leads to expanded Six2 expression. Herein we demonstrate that a single enhancer site upstream of the Six2 coding sequence is responsible for both activation by Hox11 proteins in the kidney and repression by Hoxa2 in the branchial arch and facial mesenchyme in vivo. DNA binding activity is required for both activation and repression, but differential activity is not controlled by differences in the homeodomains. Rather, protein domains N- and C-terminal to the homeodomain confer activation versus repression activity. These data support a model in which the DNA binding specificity of Hox proteins in vivo may be similar, consistent with accumulated in vitro data, and that unique functions result mainly from differential interactions mediated by non-homeodomain regions of Hox proteins. PMID:19716816

  16. A novel TaqMan® assay for Nosema ceranae quantification in honey bee, based on the protein coding gene Hsp70.

    PubMed

    Cilia, Giovanni; Cabbri, Riccardo; Maiorana, Giacomo; Cardaio, Ilaria; Dall'Olio, Raffaele; Nanetti, Antonio

    2018-04-01

    Nosema ceranae is now a widespread honey bee pathogen with high incidence in apiculture. Rapid and reliable detection and quantification methods are a matter of concern for the research community, which nowadays relies mainly on biomolecular techniques such as PCR, RT-PCR or HRMA. The aim of this technical paper is to provide a new qPCR assay, based on the highly conserved protein coding gene Hsp70, to detect and quantify the microsporidian Nosema ceranae affecting the western honey bee Apis mellifera. The validation steps to assess efficiency, sensitivity, specificity and robustness of the assay are also described. Copyright © 2018 Elsevier GmbH. All rights reserved.

  17. Peroxisome Proliferator-Activated Receptor γ Target Gene Encoding a Novel Angiopoietin-Related Protein Associated with Adipose Differentiation

    PubMed Central

    Yoon, J. Cliff; Chickering, Troy W.; Rosen, Evan D.; Dussault, Barry; Qin, Yubin; Soukas, Alexander; Friedman, Jeffrey M.; Holmes, William E.; Spiegelman, Bruce M.

    2000-01-01

    The nuclear receptor peroxisome proliferator-activated receptor γ regulates adipose differentiation and systemic insulin signaling via ligand-dependent transcriptional activation of target genes. However, the identities of the biologically relevant target genes are largely unknown. Here we describe the isolation and characterization of a novel target gene induced by PPARγ ligands, termed PGAR (for PPARγ angiopoietin related), which encodes a novel member of the angiopoietin family of secreted proteins. The transcriptional induction of PGAR follows a rapid time course typical of immediate-early genes and occurs in the absence of protein synthesis. The expression of PGAR is predominantly localized to adipose tissues and placenta and is consistently elevated in genetic models of obesity. Hormone-dependent adipocyte differentiation coincides with a dramatic early induction of the PGAR transcript. Alterations in nutrition and leptin administration are found to modulate the PGAR expression in vivo. Taken together, these data suggest a possible role for PGAR in the regulation of systemic lipid metabolism or glucose homeostasis. PMID:10866690

  18. Occurrence of genes coding for MSCRAMM and biofilm-associated protein Bap in Staphylococcus spp. isolated from bovine subclinical mastitis and relationship with somatic cell counts.

    PubMed

    Zuniga, Eveline; Melville, Priscilla A; Saidenberg, André B S; Laes, Marco A; Gonsales, Fernanda F; Salaberry, Sandra R S; Gregori, Fabio; Brandão, Paulo E; dos Santos, Franklin G B; Lincopan, Nilton E; Benites, Nilson R

    2015-12-01

    This study aimed to elucidate aspects of the epidemiology of bovine subclinical mastitis through the assessment of genes encoding MSCRAMM (microbial surface components recognizing adhesive matrix molecules - a group of adhesins) and protein Bap (implicated in biofilm formation), in coagulase-positive (CPS) and coagulase-negative (CNS) Staphylococcus isolated from subclinical mastitis. Milk samples were collected for microbiological exams, somatic cell count (SCC) and a survey of the genes coding for MSCRAMM (cna, eno, ebpS, fnbA, fnbB and fib) and biofilm-associated protein Bap (bap) in 106 Staphylococcus spp. isolates using PCR. The frequencies of occurrence of eno (82.1%), fnbA (72.6%), fib (71.7%) and bap (56.6%) were higher (P < 0.0001) compared with the other assessed genes (cna, ebpS and fnbB). The higher frequency of occurrence (P < 0.005) of the bap gene in CNS compared with CPS suggests that in these species biofilm formation is an important mechanism for the persistence of the infection. The medians of the SCCs in the samples where eno, fnbA, fib and bap genes were detected were higher compared with Staphylococcus without the assessed genes (P < 0.05) and negative samples (P < 0.01), which indicated that the presence of these MSCRAMM may be related to a higher intensity of the inflammatory process. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. [Regulation of heat shock gene expression in response to stress].

    PubMed

    Garbuz, D G

    2017-01-01

    Heat shock (HS) genes, or stress genes, code for a number of proteins that collectively form the most ancient and universal stress defense system. The system determines the cell capability of adaptation to various adverse factors and performs a variety of auxiliary functions in normal physiological conditions. Common stress factors, such as higher temperatures, hypoxia, heavy metals, and others, suppress transcription and translation for the majority of genes, while HS genes are upregulated. Transcription of HS genes is controlled by transcription factors of the HS factor (HSF) family. Certain HSFs are activated on exposure to higher temperatures or other adverse factors to ensure stress-induced HS gene expression, while other HSFs are specifically activated at particular developmental stages. The regulation of the main mammalian stress-inducible factor HSF1 and Drosophila melanogaster HSF includes many components, such as a variety of early warning signals indicative of abnormal cell activity (e.g., increases in intracellular ceramide, cytosolic calcium ions, or partly denatured proteins); protein kinases, which phosphorylate HSFs at various Ser residues; acetyltransferases; and regulatory proteins, such as SUMO and HSBP1. Transcription factors other than HSFs are also involved in activating HS gene transcription; the set includes D. melanogaster GAF, mammalian Sp1 and NF-Y, and other factors. Transcription of several stress genes coding for molecular chaperones of the glucose-regulated protein (GRP) family is predominantly regulated by another stress-detecting system, which is known as the unfolded protein response (UPR) system and is activated in response to massive protein misfolding in the endoplasmic reticulum and mitochondrial matrix. A translational fine tuning of HS protein expression occurs via changing the phosphorylation status of several proteins involved in translation initiation. In addition, specific signal sequences in the 5'-UTRs of some HS

  20. The evolution of small insertions and deletions in the coding genes of Drosophila melanogaster.

    PubMed

    Chong, Zechen; Zhai, Weiwei; Li, Chunyan; Gao, Min; Gong, Qiang; Ruan, Jue; Li, Juan; Jiang, Lan; Lv, Xuemei; Hungate, Eric; Wu, Chung-I

    2013-12-01

    Studies of protein evolution have focused on amino acid substitutions with much less systematic analysis of insertions and deletions (indels) in protein coding genes. We hence surveyed 7,500 genes between Drosophila melanogaster and D. simulans, using D. yakuba as an outgroup for this purpose. The evolutionary rate of coding indels is indeed low, at only 3% of that of nonsynonymous substitutions. As coding indels follow a geometric distribution in size and tend to fall in low-complexity regions of proteins, it is unclear whether selection or mutation underlies this low rate. To resolve the issue, we collected genomic sequences from an isogenic African line of D. melanogaster (ZS30) at a high coverage of 70× and analyzed indel polymorphism between ZS30 and the reference genome. In comparing polymorphism and divergence, we found that the divergence to polymorphism ratio (i.e., fixation index) for smaller indels (size ≤ 10 bp) is very similar to that for synonymous changes, suggesting that most of the within-species polymorphism and between-species divergence for indels are selectively neutral. Interestingly, deletions of larger sizes (size ≥ 11 bp and ≤ 30 bp) have a much higher fixation index than synonymous mutations and 44.4% of fixed middle-sized deletions are estimated to be adaptive. To our surprise, this pattern is not found for insertions. Protein indel evolution appears to be in a dynamic flux of neutrally driven expansion (insertions) together with adaptive-driven contraction (deletions), and these observations provide important insights for understanding the fitness of new mutations as well as the evolutionary driving forces for genomic evolution in Drosophila species.
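
    The divergence-versus-polymorphism comparison described in this abstract can be illustrated with a minimal Python sketch. It bins indels by the size cut-offs quoted above (≤ 10 bp, 11 to 30 bp) and compares a class's divergence-to-polymorphism ratio with that of a synonymous (putatively neutral) class. The class and function names are hypothetical, every count is made up for illustration (the abstract reports no raw counts), and the McDonald-Kreitman-style adaptive fraction is an assumption about how such an estimate could be obtained, not a statement of the authors' procedure.

```python
from dataclasses import dataclass


@dataclass
class SiteClass:
    """Counts for one class of variants (hypothetical container)."""
    name: str
    fixed: int        # between-species differences (divergence)
    polymorphic: int  # within-species variants (polymorphism)


def size_class(length_bp: int) -> str:
    """Bin an indel by length, using the cut-offs quoted in the abstract."""
    if length_bp < 1:
        raise ValueError("indel length must be at least 1 bp")
    if length_bp <= 10:
        return "small"
    if length_bp <= 30:
        return "middle"
    return "large"


def dp_ratio(c: SiteClass) -> float:
    """Divergence-to-polymorphism ratio for one class."""
    if c.polymorphic == 0:
        raise ValueError(f"no polymorphism observed for {c.name}")
    return c.fixed / c.polymorphic


def relative_to_neutral(c: SiteClass, neutral: SiteClass) -> float:
    """Ratio of ratios: about 1 is consistent with neutrality, above 1 with excess fixation."""
    return dp_ratio(c) / dp_ratio(neutral)


def adaptive_fraction(c: SiteClass, neutral: SiteClass) -> float:
    """McDonald-Kreitman-style estimate: 1 - (D_neutral / P_neutral) / (D_class / P_class)."""
    return 1.0 - dp_ratio(neutral) / dp_ratio(c)


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    synonymous = SiteClass("synonymous", fixed=100, polymorphic=100)
    middle_deletions = SiteClass("middle deletions", fixed=20, polymorphic=10)
    print(size_class(12))
    print(relative_to_neutral(middle_deletions, synonymous))
    print(adaptive_fraction(middle_deletions, synonymous))
```

    Normalizing by the synonymous class keeps the comparison independent of the overall divergence time and sample depth, which is why a ratio of ratios is used here rather than raw counts.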

  1. Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene

    PubMed Central

    Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

    2008-01-01

    Background: The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results: We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion: GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also

  2. Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene.

    PubMed

    Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

    2008-10-28

    The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential
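
    The statement that a transcript is "riddled with stop codons" rests on scanning its reading frames. A minimal Python sketch of such a scan follows; it counts in-frame stop codons in the three forward frames and the three reverse-complement frames of a DNA string. The function names and the toy sequence are hypothetical and are not taken from GHRLOS; the abstract does not describe the tool the authors used, so this is only an illustration of the general idea.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}           # standard genetic code, DNA alphabet
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def stops_per_frame(seq: str) -> list[int]:
    """Count in-frame stop codons for reading frames 0, 1 and 2 of one strand."""
    seq = seq.upper()
    counts = []
    for frame in range(3):
        # Step through complete codons only; a trailing partial codon is ignored.
        codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
        counts.append(sum(codon in STOP_CODONS for codon in codons))
    return counts


def six_frame_stops(seq: str) -> dict[str, list[int]]:
    """Stop-codon counts for all six frames, keyed by strand."""
    return {
        "+": stops_per_frame(seq),
        "-": stops_per_frame(reverse_complement(seq)),
    }


if __name__ == "__main__":
    toy = "ATGTAACCC"  # made-up sequence, for illustration only
    print(six_frame_stops(toy))
```

    A sequence in which every frame is interrupted at short intervals can only contain short open reading frames, which is one of the features the abstract lists for non-coding RNA genes.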

  3. Gene evolution and functions of extracellular matrix proteins in teeth

    PubMed Central

    Yoshizaki, Keigo; Yamada, Yoshihiko

    2013-01-01

    The extracellular matrix (ECM) not only provides physical support for tissues, but it is also critical for tissue development, homeostasis and disease. Over 300 ECM molecules have been defined as comprising the “core matrisome” in mammals through the analysis of whole genome sequences. During tooth development, the structure and functions of the ECM dynamically change. In the early stages, basement membranes (BMs) separate two cell layers of the dental epithelium and the mesenchyme. Later in the differentiation stages, the BM layer is replaced with the enamel matrix and the dentin matrix, which are secreted by ameloblasts and odontoblasts, respectively. The enamel matrix genes and the dentin matrix genes are each clustered in two closed regions located on human chromosome 4 (mouse chromosome 5), except for the gene coding for amelogenin, the major enamel matrix protein, which is located on the sex chromosomes. These genes for enamel and dentin matrix proteins are derived from a common ancestral gene, but as a result of evolution, they diverged in terms of their specific functions. These matrix proteins play important roles in cell adhesion, polarity, and differentiation and mineralization of enamel and dentin matrices. Mutations of these genes cause diseases such as odontogenesis imperfecta (OI) and amelogenesis imperfecta (AI). In this review, we discuss the recently defined terms matrisome and matrixome for ECMs, as well as focus on genes and functions of enamel and dentin matrix proteins. PMID:23539364

  4. Common and specific signatures of gene expression and protein-protein interactions in autoimmune diseases.

    PubMed

    Tuller, T; Atar, S; Ruppin, E; Gurevich, M; Achiron, A

    2013-03-01

    The aim of this study is to understand intracellular regulatory mechanisms in peripheral blood mononuclear cells (PBMCs), which are either common to many autoimmune diseases or specific to some of them. We incorporated large-scale data such as protein-protein interactions, gene expression and demographical information of hundreds of patients and healthy subjects, related to six autoimmune diseases with available large-scale gene expression measurements: multiple sclerosis (MS), systemic lupus erythematosus (SLE), juvenile rheumatoid arthritis (JRA), Crohn's disease (CD), ulcerative colitis (UC) and type 1 diabetes (T1D). These data were analyzed concurrently by statistical and systems biology approaches tailored for this purpose. We found that chemokines such as CXCL1-3, 5, 6 and the interleukin (IL) IL8 tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In addition, the anti-apoptotic gene BCL3, interferon-γ (IFNG), and the vitamin D receptor (VDR) gene physically interact with significantly many genes that tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In general, similar cellular processes tend to be differentially expressed in PBMC in the analyzed autoimmune diseases. Specifically, the cellular processes related to cell proliferation (for example, epidermal growth factor, platelet-derived growth factor, nuclear factor-κB, Wnt/β-catenin signaling, stress-activated protein kinase c-Jun NH2-terminal kinase), inflammatory response (for example, interleukins IL2 and IL6, the cytokine granulocyte-macrophage colony-stimulating factor and the B-cell receptor), general signaling cascades (for example, mitogen-activated protein kinase, extracellular signal-regulated kinase, p38 and TRK) and apoptosis are activated in most of the analyzed autoimmune diseases. However, our results suggest that in each of the analyzed diseases, apoptosis and chemotaxis are activated via

  5. Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants.

    PubMed

    Fu, Wenqing; O'Connor, Timothy D; Jun, Goo; Kang, Hyun Min; Abecasis, Goncalo; Leal, Suzanne M; Gabriel, Stacey; Rieder, Mark J; Altshuler, David; Shendure, Jay; Nickerson, Deborah A; Bamshad, Michael J; Akey, Joshua M

    2013-01-10

    Establishing the age of each mutation segregating in contemporary human populations is important to fully understand our evolutionary history and will help to facilitate the development of new approaches for disease-gene discovery. Large-scale surveys of human genetic variation have reported signatures of recent explosive population growth, notable for an excess of rare genetic variants, suggesting that many mutations arose recently. To more quantitatively assess the distribution of mutation ages, we resequenced 15,336 genes in 6,515 individuals of European American and African American ancestry and inferred the age of 1,146,401 autosomal single nucleotide variants (SNVs). We estimate that approximately 73% of all protein-coding SNVs and approximately 86% of SNVs predicted to be deleterious arose in the past 5,000-10,000 years. The average age of deleterious SNVs varied significantly across molecular pathways, and disease genes contained a significantly higher proportion of recently arisen deleterious SNVs than other genes. Furthermore, European Americans had an excess of deleterious variants in essential and Mendelian disease genes compared to African Americans, consistent with weaker purifying selection due to the Out-of-Africa dispersal. Our results better delimit the historical details of human protein-coding variation, show the profound effect of recent human history on the burden of deleterious SNVs segregating in contemporary populations, and provide important practical information that can be used to prioritize variants in disease-gene discovery.

  6. SOA genes encode proteins controlling lipase expression in response to triacylglycerol utilization in the yeast Yarrowia lipolytica.

    PubMed

    Desfougères, Thomas; Haddouche, Ramdane; Fudalej, Franck; Neuvéglise, Cécile; Nicaud, Jean-Marc

    2010-02-01

    The oleaginous yeast Yarrowia lipolytica efficiently metabolizes hydrophobic substrates such as alkanes, fatty acids or triacylglycerol. This yeast has been identified in oil-polluted water and in lipid-rich food. The enzymes involved in lipid breakdown, for use as a carbon source, are known, but the molecular mechanisms controlling the expression of the genes encoding these enzymes are still poorly understood. The study of mRNAs obtained from cells grown on oleic acid identified a new group of genes called SOA genes (specific for oleic acid). SOA1 and SOA2 are two small genes coding for proteins with no known homologs. Single- and double-disrupted strains were constructed. Wild-type and mutant strains were grown on dextrose, oleic acid and triacylglycerols. The double mutant presents a clear phenotype consisting of a growth defect on tributyrin and triolein, but not on dextrose or oleic acid media. Lipase activity was 50-fold lower in this mutant than in the wild-type strain. The impact of SOA deletion on the expression of the main extracellular lipase gene (LIP2) was monitored using a LIP2-beta-galactosidase promoter fusion protein. These data suggest that Soa proteins are components of a molecular mechanism controlling lipase gene expression in response to extracellular triacylglycerol.

  7. Gene end-like sequences within the 3' non-coding region of the Nipah virus genome attenuate viral gene transcription.

    PubMed

    Sugai, Akihiro; Sato, Hiroki; Yoneda, Misako; Kai, Chieko

    2017-08-01

    The regulation of transcription during Nipah virus (NiV) replication is poorly understood. Using a bicistronic minigenome system, we investigated the involvement of non-coding regions (NCRs) in the transcriptional re-initiation efficiency of NiV RNA polymerase. Reporter assays revealed that attenuation of NiV gene expression was not constant at each gene junction, and that the attenuating property was controlled by the 3' NCR. However, this regulation was independent of the gene-end, gene-start and intergenic regions. Northern blot analysis indicated that regulation of viral gene expression by the phosphoprotein (P) and large protein (L) 3' NCRs occurred at the transcription level. We identified uridine-rich tracts within the L 3' NCR that are similar to gene-end signals. These gene-end-like sequences were recognized as weak transcription termination signals by the viral RNA polymerase, thereby reducing downstream gene transcription. Thus, we suggest that NiV has a unique mechanism of transcriptional regulation. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Genetic coding and gene expression - new Quadruplet genetic coding model

    NASA Astrophysics Data System (ADS)

    Shankar Singh, Rama

    2012-07-01

    The successful demonstration of the human genome project has opened the door not only to developing personalized medicine and cures for genetic diseases, but may also help answer the complex and difficult question of the origin of life. It may also make the 21st century a century of Biological Sciences. Based on the central dogma of Biology, genetic codons in conjunction with tRNA play a key role in translating the RNA bases into the sequence of amino acids that forms a synthesized protein. This is the most critical step in synthesizing the right protein needed for personalized medicine and curing genetic diseases. So far, only triplet codons involving three bases of RNA, transcribed from DNA bases, have been used. Since this approach has several inconsistencies and limitations, even the promise of personalized medicine has not been realized. The new Quadruplet genetic coding model proposed and developed here involves all four RNA bases which in conjunction with tRNA will synthesize the right protein. The transcription and translation process used will be the same, but the Quadruplet codons will help overcome most of the inconsistencies and limitations of the triplet codes. Details of this new Quadruplet genetic coding model and its subsequent potential applications including relevance to the origin of life will be presented.

  9. A splice junction-targeted CRISPR approach (spJCRISPR) reveals human FOXO3B to be a protein-coding gene.

    PubMed

    Santo, Evan E; Paik, Jihye

    2018-06-17

    The rapid development of CRISPR technology is revolutionizing molecular approaches to the dissection of complex biological phenomena. Here we describe an alternative generally applicable implementation of the CRISPR-Cas9 system that allows for selective knockdown of extremely homologous genes. This strategy employs the lentiviral delivery of paired sgRNAs and nickase Cas9 (Cas9D10A) to achieve targeted deletion of splice junctions. This general strategy offers several advantages over standard single-guide exon-targeting CRISPR-Cas9 such as greatly reduced off-target effects, more restricted genomic editing, routine disruption of target gene mRNA expression and the ability to differentiate between closely related genes. Here we demonstrate the utility of this strategy by achieving selective knockdown of the highly homologous human genes FOXO3A and suspected pseudogene FOXO3B. We find the spJCRISPR strategy to efficiently and selectively disrupt FOXO3A and FOXO3B mRNA and protein expression, thus revealing that the human FOXO3B locus encodes a bona fide human gene. Unlike FOXO3A, we find the FOXO3B protein to be cytosolically localized in both the presence and absence of active Akt. The ability to selectively target and efficiently disrupt the expression of the closely related FOXO3A and FOXO3B genes demonstrates the efficacy of the spJCRISPR approach. Copyright © 2018. Published by Elsevier B.V.

  10. The high-level expression of human tissue plasminogen activator in the milk of transgenic mice with hybrid gene locus strategy.

    PubMed

    Zhou, Yanrong; Lin, Yanli; Wu, Xiaojie; Xiong, Fuyin; Lv, Yuemeng; Zheng, Tao; Huang, Peitang; Chen, Hongxing

    2012-02-01

    Transgene expression for the mammary gland bioreactor aimed at producing recombinant proteins requires optimized expression vector construction. Previously we presented a hybrid gene locus strategy, which was originally tested with human lactoferrin (hLF) as the target transgene and achieved an extremely high expression level of rhLF, up to 29.8 g/l in mouse milk. Here, to demonstrate the broad applicability of this strategy, another 38.4-kb mWAP-htPA hybrid gene locus was constructed, in which the 3-kb genomic coding sequence in the 24-kb mouse whey acidic protein (mWAP) gene locus was substituted by the 17.4-kb genomic coding sequence of human tissue plasminogen activator (htPA), exactly from the start codon to the stop codon. Five corresponding transgenic mouse lines were generated, and the highest expression level of rhtPA in the milk reached 3.3 g/l. Our strategy will provide a universal way for the large-scale production of pharmaceutical proteins in the mammary gland of transgenic animals.
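
    The locus sizes quoted in this abstract are mutually consistent: replacing a 3-kb coding segment inside a 24-kb host locus with a 17.4-kb insert gives 24 − 3 + 17.4 = 38.4 kb. The short Python sketch below only restates that arithmetic; the function name and parameter names are hypothetical, and the three input values are the ones given in the abstract.

```python
def hybrid_locus_size_kb(host_locus_kb: float, replaced_kb: float, insert_kb: float) -> float:
    """Size of a hybrid locus after swapping one segment of the host locus for an insert."""
    if replaced_kb > host_locus_kb:
        raise ValueError("the replaced segment cannot be longer than the host locus")
    return host_locus_kb - replaced_kb + insert_kb


if __name__ == "__main__":
    # mWAP locus 24 kb, mWAP coding segment 3 kb, htPA coding segment 17.4 kb
    size = hybrid_locus_size_kb(24.0, 3.0, 17.4)
    print(f"{size:.1f} kb")
```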

  11. Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene

    PubMed Central

    Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil

    2007-01-01

    Background: Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results: Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication; all three copies of the gene are transcriptionally active, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein-coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion: The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single-copy genes, with duplicate genes free from such constraints. PMID:18047649

  12. Behind the curtain of non-coding RNAs; long non-coding RNAs regulating hepatocarcinogenesis

    PubMed Central

    El Khodiry, Aya; Afify, Menna; El Tayebi, Hend M

    2018-01-01

    Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers worldwide. HCC is the fifth most common malignancy in the world and the second leading cause of cancer death in Asia. Long non-coding RNAs (lncRNAs) are RNAs with a length greater than 200 nucleotides that do not encode proteins. lncRNAs can regulate gene expression and protein synthesis in several ways by interacting with DNA, RNA and proteins in a sequence-specific manner. They can regulate cellular and developmental processes through either gene inhibition or gene activation. Many studies have shown that dysregulation of lncRNAs is related to many human diseases, such as cardiovascular diseases, genetic disorders, neurological diseases, immune-mediated disorders and cancers. However, the study of lncRNAs is challenging because they are poorly conserved between species, their expression levels are lower than those of mRNAs, and they show great interpatient variation. The study of lncRNA expression in cancers has been a breakthrough, as it unveils potential biomarkers and drug targets for cancer therapy and helps to explain the mechanism of pathogenesis. This review discusses many long non-coding RNAs and their contribution to HCC, their role in the development, metastasis, and prognosis of HCC, and how these lncRNAs might be regulated and targeted as a therapeutic tool in future HCC treatment. PMID:29434445

  13. Evolutionary history of mitogen-activated protein kinase (MAPK) genes in Lotus, Medicago, and Phaseolus

    PubMed Central

    Neupane, Achal; Nepal, Madhav P; Benson, Benjamin V; MacArthur, Kenton J; Piya, Sarbottam

    2013-01-01

    Mitogen-Activated Protein Kinase (MAPK) genes encode proteins that mediate various signaling pathways associated with biotic and abiotic stress responses in eukaryotes. The MAPK genes form a 3-tier signal transduction cascade between cellular stimuli and physiological responses. Recent identification of soybean MAPKs and the availability of genome sequences from other legume species allowed us to identify their MAPK genes. The main objectives of this study were to identify MAPKs in 3 legume species, Lotus japonicus, Medicago truncatula, and Phaseolus vulgaris, and to assess their phylogenetic relationships. We used comparative genomics approaches for MAPK gene identification and named the newly identified genes following the Arabidopsis MAPK nomenclature model. We identified 19, 18, and 15 MAPKs and 7, 4, and 9 MAPKKs in the genomes of Lotus japonicus, Medicago truncatula, and Phaseolus vulgaris, respectively. Within-clade placement of MAPKs and MAPKKs in the 3 legume species was consistent with that in soybean and Arabidopsis. Among 5 clades of MAPKs, 4 founder clades were consistent with the MAPKs of other plant species, and orthologs of MAPK genes in the fifth clade, "Clade E", were consistent with those in soybean. Our results also indicated that some gene duplication events might have occurred prior to the eudicot-monocot divergence. The highly diversified MAPKs in soybean relative to those in the 3 other legume species are attributable to the polyploidization events in soybean. The identification of the MAPK genes in these legume species is important for legume crop improvement, and the evolutionary relationships and functional divergence of these gene members provide insights into plant genome evolution. PMID:24317362

  14. Genes uniquely expressed in human growth plate chondrocytes uncover a distinct regulatory network.

    PubMed

    Li, Bing; Balasubramanian, Karthika; Krakow, Deborah; Cohn, Daniel H

    2017-12-20

    Chondrogenesis is the earliest stage of skeletal development and is a highly dynamic process, integrating the activities and functions of transcription factors, cell signaling molecules and extracellular matrix proteins. The molecular mechanisms underlying chondrogenesis have been extensively studied and multiple key regulators of this process have been identified. However, a genome-wide overview of the gene regulatory network in chondrogenesis has not been achieved. In this study, employing RNA sequencing, we identified 332 protein coding genes and 34 long non-coding RNA (lncRNA) genes that are highly selectively expressed in human fetal growth plate chondrocytes. Among the protein coding genes, 32 genes were associated with 62 distinct human skeletal disorders and 153 genes were associated with skeletal defects in knockout mice, confirming their essential roles in skeletal formation. These gene products formed a comprehensive physical interaction network and participated in multiple cellular processes regulating skeletal development. The data also revealed 34 transcription factors and 11,334 distal enhancers that were uniquely active in chondrocytes, functioning as transcriptional regulators for the cartilage-selective genes. Our findings revealed a complex gene regulatory network controlling skeletal development whereby transcription factors, enhancers and lncRNAs participate in chondrogenesis by transcriptional regulation of key genes. Additionally, the cartilage-selective genes represent candidate genes for unsolved human skeletal disorders.

  15. Complex interplay among DNA modification, noncoding RNA expression and protein-coding RNA expression in Salvia miltiorrhiza chloroplast genome.

    PubMed

    Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

    2014-01-01

    Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggests that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs are a TATA box-like motif (CPGDMM1, "TATANNNATNA") and an unknown motif (CPGDMM2, "WNYANTGAW"). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed a significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1 motif (p<0.01). Taken together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in the chloroplast genome.
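
    The two motifs quoted in this abstract are written with IUPAC degenerate nucleotide codes (N = any base, W = A or T, Y = C or T). As a rough illustration only, the sketch below shows how such motifs could be located in a sequence and how the reported percentages follow from the counts. It is not the SMRT Portal's method; the function names and the test sequences are hypothetical, and only the forward strand is scanned.

```python
import re

# Subset of the IUPAC nucleotide codes needed for the two motifs in the abstract.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "W": "[AT]",    # weak: A or T
    "Y": "[CT]",    # pyrimidine: C or T
    "N": "[ACGT]",  # any base
}


def motif_to_regex(motif):
    """Translate an IUPAC motif into a regex; the lookahead allows overlapping hits."""
    body = "".join(IUPAC[base] for base in motif.upper())
    return re.compile("(?=(" + body + "))")


def find_motif(motif, sequence):
    """Return 0-based start positions of the motif on the forward strand."""
    return [m.start() for m in motif_to_regex(motif).finditer(sequence.upper())]


def percent_modified(modified, total):
    """Percentage of motif occurrences that are modified, to one decimal place."""
    return round(100.0 * modified / total, 1)


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the chloroplast genome.
    print(find_motif("TATANNNATNA", "GGTATACGTATGACC"))
    print(find_motif("WNYANTGAW", "CCACTAGTGATCC"))
    # Counts reported in the abstract.
    print(percent_modified(35, 97), percent_modified(91, 369))
```

    A fuller analysis would also scan the reverse complement and handle the remaining IUPAC codes; both are omitted here to keep the sketch short.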

  16. Association of MAOA, 5-HTT, and NET promoter polymorphisms with gene expression and protein activity in human placentas

    PubMed Central

    Zhang, Huiping; Smith, Graeme N.; Liu, Xudong

    2010-01-01

    Monoamine oxidase A (MAOA) and the transporters for serotonin (5-HTT) and norepinephrine (NET) may play important roles in regulating maternal monoamine neurotransmitters transferred across the placenta to the fetus. We investigated whether promoter polymorphisms in MAOA (uVNTR), 5-HTT (5-HTTLPR), and NET (NETpPR AAGG4) could influence gene expression and protein activity in human placentas. Normal term human placentas (n = 73) were collected, and placental MAOA, 5-HTT, and NET mRNA levels and protein activity were determined. The mRNA levels or protein activities were compared between different genotype groups. Placentas hemizygous (male fetus) or homozygous (female fetus) for MAOA uVNTR 4-repeat allele had significantly higher MAOA mRNA levels than those hemizygous or homozygous for the 3-repeat allele (P = 0.001). However, no significant difference in MAOA enzyme activity was found for these two groups of genotypes (P = 0.161). Placentas with the 5-HTTLPR short (S)-allele (S/S+S/L) had significantly lower 5-HTT mRNA levels and serotonin uptake rate than those homozygous for the long (L)-allele (L/L) (mRNA: P < 0.001; serotonin transporting activity: P < 0.001). Placentas homozygous for the NET AAGG4 L4 allele had significantly higher NET mRNA levels, as well as dopamine and norepinephrine uptake rates, than those with the S4/L4 genotype (mRNA: P < 0.001; dopamine transporting activity: P = 0.012; norepinephrine transporting activity: P = 0.011). These findings suggest that the three promoter polymorphisms of MAOA, 5-HTT, and NET influence gene expression levels and protein activity of these genes in human placentas, potentially leading to different fetal levels of maternal monoamine neurotransmitters, which may have an impact on fetal neurodevelopment. PMID:20332182

  17. Association of MAOA, 5-HTT, and NET promoter polymorphisms with gene expression and protein activity in human placentas.

    PubMed

    Zhang, Huiping; Smith, Graeme N; Liu, Xudong; Holden, Jeanette J A

    2010-06-01

    Monoamine oxidase A (MAOA) and the transporters for serotonin (5-HTT) and norepinephrine (NET) may play important roles in regulating maternal monoamine neurotransmitters transferred across the placenta to the fetus. We investigated whether promoter polymorphisms in MAOA (uVNTR), 5-HTT (5-HTTLPR), and NET (NETpPR AAGG(4)) could influence gene expression and protein activity in human placentas. Normal term human placentas (n = 73) were collected, and placental MAOA, 5-HTT, and NET mRNA levels and protein activity were determined. The mRNA levels or protein activities were compared between different genotype groups. Placentas hemizygous (male fetus) or homozygous (female fetus) for MAOA uVNTR 4-repeat allele had significantly higher MAOA mRNA levels than those hemizygous or homozygous for the 3-repeat allele (P = 0.001). However, no significant difference in MAOA enzyme activity was found for these two groups of genotypes (P = 0.161). Placentas with the 5-HTTLPR short (S)-allele (S/S+S/L) had significantly lower 5-HTT mRNA levels and serotonin uptake rate than those homozygous for the long (L)-allele (L/L) (mRNA: P < 0.001; serotonin transporting activity: P < 0.001). Placentas homozygous for the NET AAGG(4) L(4) allele had significantly higher NET mRNA levels, as well as dopamine and norepinephrine uptake rates, than those with the S(4)/L(4) genotype (mRNA: P < 0.001; dopamine transporting activity: P = 0.012; norepinephrine transporting activity: P = 0.011). These findings suggest that the three promoter polymorphisms of MAOA, 5-HTT, and NET influence gene expression levels and protein activity of these genes in human placentas, potentially leading to different fetal levels of maternal monoamine neurotransmitters, which may have an impact on fetal neurodevelopment.

  18. Mitogen activated protein kinase (MAPK) pathway regulates heme oxygenase-1 gene expression by hypoxia in vascular cells.

    PubMed

    Ryter, Stefan W; Xi, Sichuan; Hartsfield, Cynthia L; Choi, Augustine M K

    2002-08-01

    Hypoxia induces the stress protein heme oxygenase-1 (HO-1), which participates in cellular adaptation. The molecular pathways that regulate ho-1 gene expression under hypoxia may involve mitogen activated protein kinase (MAPK) signaling and reactive oxygen. Hypoxia (8 h) increased HO-1 mRNA in rat pulmonary aortic endothelial cells (PAEC), and also activated both extracellular signal-regulated kinase 1 (ERK1)/ERK2 and p38 MAPK pathways. The role of these kinases in hypoxia-induced ho-1 gene expression was examined using chemical inhibitors of these pathways. Surprisingly, SB203580, an inhibitor of p38 MAPK, and PD98059, an inhibitor of mitogen-activated protein kinase kinase (MEK1), strongly enhanced hypoxia-induced HO-1 mRNA expression in PAEC. UO126, a MEK1/2 inhibitor, enhanced HO-1 expression in PAEC under normoxia, but not hypoxia. Diphenylene iodonium, an inhibitor of NADPH oxidase, also induced the expression of HO-1 in PAEC under both normoxia and hypoxia. Similar results were observed in aortic vascular smooth muscle cells. Furthermore, hypoxia induced activator protein (AP-1) DNA-binding activity in PAEC. Pretreatment with SB203580 and PD98059 enhanced AP-1 binding activity under hypoxia in PAEC; UO126 stimulated AP-1 binding under normoxia, whereas diphenylene iodonium stimulated AP-1 binding under normoxia and hypoxia. These results suggest a relationship between MAPK and hypoxic regulation of ho-1 in vascular cells, involving AP-1.

  19. Fragile X mental retardation protein participates in non-coding RNA pathways.

    PubMed

    Li, En-Hui; Zhao, Xin; Zhang, Ce; Liu, Wei

    2018-02-20

    Fragile X syndrome is one of the most common forms of inherited intellectual disability. It is caused by mutations of the Fragile X mental retardation 1 (FMR1) gene, resulting in either the loss or abnormal expression of the Fragile X mental retardation protein (FMRP). Recent research showed that FMRP participates in non-coding RNA pathways and plays various important roles in physiology, thereby extending our knowledge of the pathogenesis of Fragile X syndrome. Initial studies showed that the Drosophila FMRP participates in the siRNA and miRNA pathways by interacting with Dicer, Ago1 and Ago2, and is involved in neural activity and the fate determination of germline stem cells. Subsequent studies showed that the Drosophila FMRP participates in the piRNA pathway by interacting with Aub, Ago1 and Piwi in the maintenance of normal chromatin structures and genomic stability. More recent studies showed that FMRP is associated with the lncRNA pathway, suggesting a potential role in the clinical manifestations. In this review, we summarize the novel findings and explore the relationship between FMRP and non-coding RNA pathways, particularly the piRNA pathway, thereby providing critical insights into the molecular pathogenesis of Fragile X syndrome and potential translational applications in clinical management of the disease.

  20. Automated conserved non-coding sequence (CNS) discovery reveals differences in gene content and promoter evolution among grasses

    PubMed Central

    Turco, Gina; Schnable, James C.; Pedersen, Brent; Freeling, Michael

    2013-01-01

    Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize. PMID:23874343

  1. The putative protein methyltransferase LAE1 controls cellulase gene expression in Trichoderma reesei

    PubMed Central

    Seiboth, Bernhard; Karimi, Razieh Aghcheh; Phatale, Pallavi A; Linke, Rita; Hartl, Lukas; Sauer, Dominik G; Smith, Kristina M; Baker, Scott E; Freitag, Michael; Kubicek, Christian P

    2012-01-01

    Summary: Trichoderma reesei is an industrial producer of enzymes that degrade lignocellulosic polysaccharides to soluble monomers, which can be fermented to biofuels. Here we show that the expression of genes for lignocellulose degradation is controlled by the orthologous T. reesei protein methyltransferase LAE1. In a lae1 deletion mutant we observed a complete loss of expression of all seven cellulases; auxiliary factors for cellulose degradation, β-glucosidases and xylanases were also no longer expressed. Conversely, enhanced expression of lae1 resulted in significantly increased cellulase gene transcription. Lae1-modulated cellulase gene expression was dependent on the function of the general cellulase regulator XYR1, but xyr1 expression was also LAE1-dependent. LAE1 was also essential for conidiation of T. reesei. Chromatin immunoprecipitation followed by high-throughput sequencing ('ChIP-seq') showed that lae1 expression was not obviously correlated with H3K4 di- or trimethylation (indicative of active transcription) or H3K9 trimethylation (typical for heterochromatin regions) in CAZyme coding regions, suggesting that LAE1 does not affect CAZyme gene expression by directly modulating H3K4 or H3K9 methylation. Our data demonstrate that the putative protein methyltransferase LAE1 is essential for cellulase gene expression in T. reesei through mechanisms that remain to be identified. PMID:22554051

  2. The compositional transition of vertebrate genomes: an analysis of the secondary structure of the proteins encoded by human genes.

    PubMed

    D'Onofrio, Giuseppe; Ghosh, Tapash Chandra

    2005-01-17

    Fluctuations and increments of both C(3) and G(3) levels along the human coding sequences were investigated by comparing two sets of Xenopus/human orthologous genes. The first set of genes shows minor differences in the GC(3) levels; the second shows considerable increments of the GC(3) levels in the human genes. In both data sets, the fluctuations of C(3) and G(3) levels along the coding sequences correlated with the secondary structures of the encoded proteins. The human genes that underwent the compositional transition showed a different increment of the C(3) and G(3) levels within and among the structural units of the proteins. The relative synonymous codon usage (RSCU) of several amino acids was also affected during the compositional transition, showing that there exists a correlation between RSCU and protein secondary structures in human genes. The importance of natural selection for the formation of the isochore organization of the human genome is discussed on the basis of these results.
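
    For readers unfamiliar with the two quantities named in this abstract, the following minimal sketch computes them under the usual definitions: GC(3) is the fraction of codons with G or C in the third position, and the RSCU of a codon is its observed count divided by the mean count of all synonymous codons for the same amino acid. The function names and the toy sequence are hypothetical, the standard genetic code is assumed, and this is not the authors' own pipeline.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codons(cds):
    """Split an in-frame coding sequence into codons, dropping any trailing partial codon."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def gc3(cds):
    """Fraction of codons whose third position is G or C."""
    cs = codons(cds)
    return sum(c[2] in "GC" for c in cs) / len(cs)


def rscu(cds):
    """RSCU per codon: observed count / mean count over the synonymous family."""
    counts = Counter(c for c in codons(cds) if c in CODON_TABLE)
    result = {}
    for aa in set(AMINO) - {"*"}:
        family = [c for c, a in CODON_TABLE.items() if a == aa]
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in family:
            result[c] = counts[c] * len(family) / total
    return result


if __name__ == "__main__":
    toy = "GCTGCCGCCGCG"  # hypothetical sequence: four alanine codons
    print(gc3(toy))
    print({c: v for c, v in rscu(toy).items() if c.startswith("GC")})
```

    An RSCU of 1.0 means a codon is used exactly as often as expected under uniform synonymous usage; values above 1.0 indicate preferred codons.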

  3. Identification and characterization of moonlighting long non-coding RNAs based on RNA and protein interactome.

    PubMed

    Cheng, Lixin; Leung, Kwong-Sak

    2018-05-16

    Moonlighting proteins are a class of proteins having multiple distinct functions, which play essential roles in a variety of cellular and enzymatic functioning systems. Although there have long been calls for computational algorithms for the identification of moonlighting proteins, research on approaches to identify moonlighting long non-coding RNAs (lncRNAs) has never been undertaken. Here, we introduce a novel methodology, MoonFinder, for the identification of moonlighting lncRNAs. MoonFinder is a statistical algorithm that identifies moonlighting lncRNAs without a priori knowledge through the integration of the protein interactome, RNA-protein interactions, and functional annotation of proteins. We identify 155 moonlighting lncRNA candidates and find that they are a distinct class of lncRNAs characterized by specific sequence and cellular localization features. The non-coding genes that transcribe moonlighting lncRNAs tend to have shorter but more numerous exons, and the moonlighting lncRNAs have a variable localization pattern with a high chance of residing in the cytoplasmic compartment in comparison to other lncRNAs. Moreover, moonlighting lncRNAs and moonlighting proteins are rather mutually exclusive in terms of both their direct interactions and interacting partners. Our results also shed light on how the moonlighting candidates and their interacting proteins are implicated in the formation and development of cancers and other diseases. The code implementing MoonFinder is supplied as an R package in the supplementary material. Contact: lxcheng@cse.cuhk.edu.hk or ksleung@cse.cuhk.edu.hk. Supplementary data are available at Bioinformatics online.

  4. Spirulina non-protein components induce BDNF gene transcription via HO-1 activity in C6 glioma cells.

    PubMed

    Morita, Kyoji; Itoh, Mari; Nishibori, Naoyoshi; Her, Song; Lee, Mi-Sook

    2015-01-01

    Blue-green algae are known to contain biologically active proteins and non-protein substances and are considered useful materials for manufacturing nutritional supplements. In particular, Spirulina has been reported to contain a variety of antioxidants, such as flavonoids, carotenoids, and vitamin C, which exert protective effects against oxidative damage to cells. In addition to their antioxidant actions, polyphenolic compounds have been speculated to protect neuronal cells and promote the recovery of neurologic function in the brain through the production of brain-derived neurotrophic factor (BDNF) in glial cells. A protein-deprived extract was therefore prepared by removing most of the protein components from an aqueous extract of Spirulina platensis, and the effect of this extract on BDNF gene transcription was examined in C6 glioma cells. The protein-deprived extract was shown to elevate BDNF mRNA levels following the expression of heme oxygenase-1 (HO-1) in the glioma cells. The non-protein components of S. platensis are therefore considered to stimulate BDNF gene transcription through HO-1 induction in glial cells, suggesting a potential ability of the algae to indirectly modulate brain function through glial cell activity.

  5. The divergently transcribed genes encoding yeast ribosomal proteins L46 and S24 are activated by shared RPG-boxes.

    PubMed Central

    Kraakman, L S; Mager, W H; Maurer, K T; Nieuwint, R T; Planta, R J

    1989-01-01

    Transcription of the majority of the ribosomal protein (rp) genes in yeast is activated through common cis-acting elements, designated RPG-boxes. These elements have been shown to act as specific binding sites for the protein factor TUF/RAP1/GRF1 in vitro. Two such elements occur in the intergenic region separating the divergently transcribed genes encoding L46 and S24. To investigate whether the two RPG-boxes mediate transcription activation of both the L46 and S24 genes, two experimental strategies were followed: cloning of the respective genes on multicopy vectors and construction of fusion genes. Cloning of the L46 + S24 gene including the intergenic region in a multicopy yeast vector indicated that both genes are transcriptionally active. Using constructs in which only the S24 or the L46 gene is present, with or without the intergenic region, we obtained evidence that the intergenic region is indispensable for transcription activation of either gene. To demarcate the element(s) responsible for this activation, fusions of the intergenic region in either orientation to the galK reporter gene were made. Northern analysis of the levels of hybrid mRNA demonstrated that the intergenic region can serve as a heterologous promoter when it is in the 'S24-orientation'. Surprisingly, however, when fused in the reverse orientation the intergenic region hardly conferred transcription activity on the fusion gene. Furthermore, a 274 bp FnuDII-FnuDII fragment from the intergenic region that contains the RPG-boxes could replace the naturally occurring upstream activation site (UASrpg) of the L25 rp-gene only when inserted in the 'S24-orientation'. Removal of 15 bp from the FnuDII fragment appeared to be sufficient to obtain transcription activation in the 'L46 orientation' as well. Analysis of a construct in which the RPG-boxes were selectively deleted from the promoter region of the L46 gene indicated that the RPG-boxes are needed for efficient transcriptional activation of

  6. The divergently transcribed genes encoding yeast ribosomal proteins L46 and S24 are activated by shared RPG-boxes.

    PubMed

    Kraakman, L S; Mager, W H; Maurer, K T; Nieuwint, R T; Planta, R J

    1989-12-11

    Transcription of the majority of the ribosomal protein (rp) genes in yeast is activated through common cis-acting elements, designated RPG-boxes. These elements have been shown to act as specific binding sites for the protein factor TUF/RAP1/GRF1 in vitro. Two such elements occur in the intergenic region separating the divergently transcribed genes encoding L46 and S24. To investigate whether the two RPG-boxes mediate transcription activation of both the L46 and S24 genes, two experimental strategies were followed: cloning of the respective genes on multicopy vectors and construction of fusion genes. Cloning of the L46 + S24 gene including the intergenic region in a multicopy yeast vector indicated that both genes are transcriptionally active. Using constructs in which only the S24 or the L46 gene is present, with or without the intergenic region, we obtained evidence that the intergenic region is indispensable for transcription activation of either gene. To demarcate the element(s) responsible for this activation, fusions of the intergenic region in either orientation to the galK reporter gene were made. Northern analysis of the levels of hybrid mRNA demonstrated that the intergenic region can serve as a heterologous promoter when it is in the 'S24-orientation'. Surprisingly, however, when fused in the reverse orientation the intergenic region hardly conferred transcription activity on the fusion gene. Furthermore, a 274 bp FnuDII-FnuDII fragment from the intergenic region that contains the RPG-boxes could replace the naturally occurring upstream activation site (UASrpg) of the L25 rp-gene only when inserted in the 'S24-orientation'. Removal of 15 bp from the FnuDII fragment appeared to be sufficient to obtain transcription activation in the 'L46 orientation' as well. Analysis of a construct in which the RPG-boxes were selectively deleted from the promoter region of the L46 gene indicated that the RPG-boxes are needed for efficient transcriptional activation of

  7. PPARγ activates ABCA1 gene transcription but reduces the level of ABCA1 protein in HepG2 cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mogilenko, Denis A., E-mail: denis@iem.sp.ru; Department of Embryology, St. Petersburg State University, 199034 St. Petersburg; Shavva, Vladimir S.

    Research highlights: PPARγ activates ABCA1 gene expression but decreases ABCA1 protein content in the human hepatoma cell line HepG2. Treatment of HepG2 cells with the PPARγ agonist GW1929 leads to dissociation of LXRβ from the ABCA1-LXRβ complex. Inhibition of the protein kinases MEK1/2 abolishes PPARγ-mediated dissociation of LXRβ from the ABCA1/LXRβ complex. Activation of PPARγ increases the level of LXRβ associated with the LXRE within the ABCA1 gene promoter. Abstract: Synthesis of ABCA1 protein in the liver is necessary for high-density lipoprotein (HDL) formation in mammals. The nuclear receptor PPARγ is known as an activator of ABCA1 expression, but the details of PPARγ-mediated regulation of ABCA1 at both the transcriptional and post-transcriptional levels in hepatocytes have not yet been well elucidated. In this study we show that PPARγ activates ABCA1 gene transcription in human hepatoma HepG2 cells by increasing LXRβ binding to the promoter region of the ABCA1 gene. Treatment of HepG2 cells with the PPARγ agonist GW1929 leads to dissociation of LXRβ from the ABCA1/LXRβ complex and to nuclear translocation of this nuclear receptor, resulting in a reduction of the ABCA1 protein level 24 h after treatment. Inhibition of the protein kinases MEK1/2 abolishes PPARγ-mediated dissociation of LXRβ from the ABCA1/LXRβ complex, but does not block PPARγ-dependent down-regulation of ABCA1 protein in HepG2 cells. These data suggest that PPARγ may be important for regulating the level of hepatic ABCA1 protein and indicate new interplay between PPARγ, LXRβ and MEK1/2 in the regulation of ABCA1 mRNA and protein expression.

  8. Genome-Wide Identification and Expression Analysis of the Mitogen-Activated Protein Kinase Gene Family in Cassava

    PubMed Central

    Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei

    2016-01-01

    Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars. PMID:27625666

  9. HSP70 in human polymorphonuclear and mononuclear leukocytes: comparison of the protein content and transcriptional activity of HSPA genes.

    PubMed

    Boyko, Anna A; Azhikina, Tatyana L; Streltsova, Maria A; Sapozhnikov, Alexander M; Kovalenko, Elena I

    2017-01-01

    Cell-type specific variations are typical for the expression of different members of the HSP70 family. In circulating immune cells, HSP70 proteins interact with units of signaling pathways involved in the immune responses and may promote cell survival in sites of inflammation. In this work, we compared basal HSP70 expression and stress-induced HSP70 response in polymorphonuclear and mononuclear human leukocytes. The intracellular content of inducible and constitutive forms of HSP70 was analyzed in relation to the transcriptional activity of HSPA genes. Hyperthermia was used as the stress model for induction of HSP70 synthesis in the cells. Our results demonstrated that granulocytes (mainly neutrophils) and mononuclear cells differ significantly in both basal HSP70 expression and levels of HSP70 induction under hyperthermia. The differences were observed at the levels of HSPA gene transcription and intracellular HSP70 content. The expression of constitutive Hsc70 protein was much higher in mononuclear cells consisting of monocytes and lymphocytes than in granulocytes. At the same time, intact neutrophils showed increased expression of inducible Hsp70 protein compared to mononuclear cells. Heat treatment induced additional expression of HSPA genes in leukocytes. The most pronounced increase in the expression was observed in polymorphonuclear and mononuclear leukocytes for HSPA1A/B. However, in granulocytes, the induction of the transcription of the HSPA8 gene encoding the Hsc70 protein was significantly higher than in mononuclear cells. These variations in transcriptional activity of HSPA genes and intracellular HSP70 content in different populations of leukocytes may reflect specific requirements for the chaperone activity in the cells with a distinct functional role in the immune system.

  10. Isolation and sequencing of the gene encoding Sp23, a structural protein of spermatophore of the mealworm beetle, Tenebrio molitor.

    PubMed

    Feng, X; Happ, G M

    1996-11-14

    The cDNA for Sp23, a structural protein of the spermatophore of Tenebrio molitor, had been previously cloned and characterized (Paesen, G.C., Schwartz, M.B., Peferoen, M., Weyda, F. and Happ, G.M. (1992a) Amino acid sequence of Sp23, a structure protein of the spermatophore of the mealworm beetle, Tenebrio molitor. J. Biol. Chem. 257, 18852-18857). Using the labeled cDNA for Sp23 as a probe to screen a library of genomic DNA from Tenebrio molitor, we isolated a genomic clone for Sp23. A 5373-base pair (bp) restriction fragment containing the Sp23 gene was sequenced. The coding region is separated by a 55-bp intron which is located close to the translation start site. Three putative ecdysone response elements (EcRE) are identified in the 5' flanking region of the Sp23 gene. Comparison of the flanking regions of the Sp23 gene with those of the D-protein gene expressed in the accessory glands of Tenebrio reveals similar sequences present in the flanking regions of the two genes. The genomic organization of the coding region of the Sp23 gene shares similarities with that of the D-protein gene, three Drosophila accessory gland genes and two Drosophila 20-OH ecdysone-responsive genes.

  11. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Raymond, Amy; Lovell, Scott; Lorimer, Don

    2009-12-01

    With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. In this report, we compare heterologous protein expression levels from native sequences to those of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38α), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.

  12. Temporal expression of the human alcohol dehydrogenase gene family during liver development correlates with differential promoter activation by hepatocyte nuclear factor 1, CCAAT/enhancer-binding protein alpha, liver activator protein, and D-element-binding protein.

    PubMed Central

    van Ooij, C; Snyder, R C; Paeper, B W; Duester, G

    1992-01-01

    The human class I alcohol dehydrogenase (ADH) gene family consists of ADH1, ADH2, and ADH3, which are sequentially activated in early fetal, late fetal, and postnatal liver, respectively. Analysis of ADH promoters revealed differential activation by several factors previously shown to control liver transcription. In cotransfection assays, the ADH1 promoter, but not the ADH2 or ADH3 promoter, was shown to respond to hepatocyte nuclear factor 1 (HNF-1), which has previously been shown to regulate transcription in early liver development. The ADH2 promoter, but not the ADH1 or ADH3 promoter, was shown to respond to CCAAT/enhancer-binding protein alpha (C/EBP alpha), a transcription factor particularly active during late fetal liver and early postnatal liver development. The ADH1, ADH2, and ADH3 promoters all responded to the liver transcription factors liver activator protein (LAP) and D-element-binding protein (DBP), which are most active in postnatal liver. For all three promoters, the activation by LAP or DBP was higher than that seen by HNF-1 or C/EBP alpha, and a significant synergism between C/EBP alpha and LAP was noticed for the ADH2 and ADH3 promoters when both factors were simultaneously cotransfected. A hierarchy of ADH promoter responsiveness to C/EBP alpha and LAP homo- and heterodimers is suggested. In all three ADH genes, LAP bound to the same four sites previously reported for C/EBP alpha (i.e., -160, -120, -40, and -20 bp), but DBP bound strongly only to the site located at -40 bp relative to the transcriptional start. Mutational analysis of ADH2 indicated that the -40 bp element accounts for most of the promoter regulation by the bZIP factors analyzed. These studies suggest that HNF-1 and C/EBP alpha help establish ADH gene family transcription in fetal liver and that LAP and DBP help maintain high-level ADH gene family transcription in postnatal liver. PMID:1620113

  13. Phylogenetic relationships within Echinococcus and Taenia tapeworms (Cestoda: Taeniidae): an inference from nuclear protein-coding genes.

    PubMed

    Knapp, Jenny; Nakao, Minoru; Yanagida, Tetsuya; Okamoto, Munehiro; Saarma, Urmas; Lavikainen, Antti; Ito, Akira

    2011-12-01

    The family Taeniidae of tapeworms is composed of two genera, Echinococcus and Taenia, which obligately parasitize mammals including humans. Inferring phylogeny via molecular markers is the only way to trace back their evolutionary histories. However, molecular dating approaches are lacking so far. Here we established new markers from nuclear protein-coding genes for RNA polymerase II second largest subunit (rpb2), phosphoenolpyruvate carboxykinase (pepck) and DNA polymerase delta (pold). Bayesian inference and maximum likelihood analyses of the concatenated gene sequences allowed us to reconstruct phylogenetic trees for taeniid parasites. The tree topologies clearly demonstrated that Taenia is paraphyletic and that the clade of Echinococcus oligarthrus and Echinococcus vogeli is sister to all other members of Echinococcus. Both species are endemic in Central and South America, and their definitive hosts originated from carnivores that immigrated from North America after the formation of the Panamanian land bridge about 3 million years ago (Ma). A time-calibrated phylogeny was estimated by a Bayesian relaxed-clock method based on the assumption that the most recent common ancestor of E. oligarthrus and E. vogeli existed during the late Pliocene (3.0 Ma). The results suggest that a clade of Taenia including human-pathogenic species diversified primarily in the late Miocene (11.2 Ma), whereas Echinococcus started to diversify later, in the end of the Miocene (5.8 Ma). Close genetic relationships among the members of Echinococcus imply that the genus is a young group in which speciation and global radiation occurred rapidly. Copyright © 2011 Elsevier Inc. All rights reserved.

  14. Ubiquitin--conserved protein or selfish gene?

    PubMed

    Catic, André; Ploegh, Hidde L

    2005-11-01

    The posttranslational modifier ubiquitin is encoded by a multigene family containing three primary members, which yield the precursor protein polyubiquitin and two ubiquitin moieties, Ub(L40) and Ub(S27), that are fused to the ribosomal proteins L40 and S27, respectively. The gene encoding polyubiquitin is highly conserved and, until now, those encoding Ub(L40) and Ub(S27) have been generally considered to be equally invariant. The evolution of the ribosomal ubiquitin moieties is, however, proving to be more dynamic. It seems that the genes encoding Ub(L40) and Ub(S27) are actively maintained by homologous recombination with the invariant polyubiquitin locus. Failure to recombine leads to deterioration of the sequence of the ribosomal ubiquitin moieties in several phyla, although this deterioration is evidently constrained by the structural requirements of the ubiquitin fold. Only a few amino acids in ubiquitin are vital for its function, and we propose that conservation of all three ubiquitin genes is driven not only by functional properties of the ubiquitin protein, but also by the propensity of the polyubiquitin locus to act as a 'selfish gene'.

  15. Sponge non-metastatic Group I Nme gene/protein - structure and function is conserved from sponges to humans

    PubMed Central

    2011-01-01

    Background: Nucleoside diphosphate kinases (NDPK) are evolutionarily conserved enzymes present in Bacteria, Archaea and Eukarya, with human Nme1 the most studied representative of the family and the first identified metastasis suppressor. Sponges (Porifera) are simple metazoans without tissues, closest to the common ancestor of all animals. They changed little during evolution and probably provide the best insight into the metazoan ancestor's genomic features. Recent studies show that sponges have a wide repertoire of genes, many of which are involved in diseases in more complex metazoans. The original function of those genes and the way it has evolved in the animal lineage is largely unknown. Here we report new results on the metastasis suppressor gene/protein homolog from the marine sponge Suberites domuncula, NmeGp1Sd. The purpose of this study was to investigate the properties of the sponge Group I Nme gene and protein, and compare it to its human homolog in order to elucidate the evolution of the structure and function of Nme. Results: We found that sponge genes coding for Group I Nme protein are intron-rich. Furthermore, we discovered that the sponge NmeGp1Sd protein has a similar level of kinase activity as its human homolog Nme1, does not cleave negatively supercoiled DNA and shows nonspecific DNA-binding activity. The sponge NmeGp1Sd forms a hexamer, like human Nme1, and all other eukaryotic Nme proteins. NmeGp1Sd interacts with human Nme1 in human cells and exhibits the same subcellular localization. Stable clones expressing sponge NmeGp1Sd inhibited the migratory potential of CAL 27 cells, as already reported for human Nme1, which suggests that Nme's function in migratory processes was engaged long before the composition of true tissues. Conclusions: This study suggests that the ancestor of all animals possessed a NmeGp1 protein with properties and functions similar to evolutionarily recent versions of the protein, even before the appearance of true tissues

  16. How the Sequence of a Gene Specifies Structural Symmetry in Proteins

    PubMed Central

    Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

    2015-01-01

    Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668

  17. Rye B chromosomes encode a functional Argonaute-like protein with in vitro slicer activities similar to its A chromosome paralog.

    PubMed

    Ma, Wei; Gabriel, Tobias Sebastian; Martis, Mihaela Maria; Gursinsky, Torsten; Schubert, Veit; Vrána, Jan; Doležel, Jaroslav; Grundlach, Heidrun; Altschmied, Lothar; Scholz, Uwe; Himmelbach, Axel; Behrens, Sven-Erik; Banaei-Moghaddam, Ali Mohammad; Houben, Andreas

    2017-01-01

    B chromosomes (Bs) are supernumerary, dispensable parts of the nuclear genome, which appear in many different species of eukaryote. So far, Bs have been considered to be genetically inert elements without any functional genes. Our comparative transcriptome analysis and the detection of active RNA polymerase II (RNAPII) in the proximity of B chromatin demonstrate that the Bs of rye (Secale cereale) contribute to the transcriptome. In total, 1954 and 1218 B-derived transcripts with an open reading frame were expressed in generative and vegetative tissues, respectively. In addition to B-derived transposable element transcripts, a high percentage of short transcripts without detectable similarity to known proteins and gene fragments from A chromosomes (As) were found, suggesting an ongoing gene erosion process. In vitro analysis of the A- and B-encoded AGO4B protein variants demonstrated that both possess RNA slicer activity. These data demonstrate unambiguously the presence of a functional AGO4B gene on Bs and that these Bs carry both functional protein coding genes and pseudogene copies. Thus, B-encoded genes may provide an additional level of gene control and complexity in combination with their related A-located genes. Hence, physiological effects, associated with the presence of Bs, may partly be explained by the activity of B-located (pseudo)genes. © 2016 IPK Gatersleben. New Phytologist © 2016 New Phytologist Trust.

  18. Proteomic Analysis and Identification of the Structural and Regulatory Proteins of the Rhodobacter capsulatus Gene Transfer Agent

    PubMed Central

    Chen, Frank; Spano, Anthony; Goodman, Benjamin E.; Blasier, Kiev R.; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F.; Lebedev, Nikolai

    2010-01-01

    The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf’s 3, 5, 6–9, 11, 13, and 15. PMID:19105630

  19. Proteomic analysis and identification of the structural and regulatory proteins of the Rhodobacter capsulatus gene transfer agent.

    PubMed

    Chen, Frank; Spano, Anthony; Goodman, Benjamin E; Blasier, Kiev R; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F; Lebedev, Nikolai

    2009-02-01

    The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf's 3, 5, 6-9, 11, 13, and 15.

  20. Structure of the beta-galactosidase gene from Thermus sp. strain T2: expression in Escherichia coli and purification in a single step of an active fusion protein.

    PubMed

    Vian, A; Carrascosa, A V; García, J L; Cortés, E

    1998-06-01

    The nucleotide sequence of both the bgaA gene, coding for a thermostable beta-galactosidase of Thermus sp. strain T2, and its flanking regions was determined. The deduced amino acid sequence of the enzyme predicts a polypeptide of 645 amino acids (Mr, 73,595). Comparative analysis of the open reading frames located in the flanking regions of the bgaA gene revealed that they might encode proteins involved in the transport and hydrolysis of sugars. The observed homology between the deduced amino acid sequences of BgaA and the beta-galactosidase of Bacillus stearothermophilus allows us to classify the new enzyme within family 42 of glycosyl hydrolases. BgaA was overexpressed in its active form in Escherichia coli, but more interestingly, an active chimeric beta-galactosidase was constructed by fusing the BgaA protein to the choline-binding domain of the major pneumococcal autolysin. This chimera illustrates a novel approach for producing an active and thermostable hybrid enzyme that can be purified in a single step by affinity chromatography on DEAE-cellulose, retaining the catalytic properties of the native enzyme. The chimeric enzyme showed a specific activity of 191,000 U/mg at 70 degrees C and a Km value of 1.6 mM with o-nitrophenyl-beta-D-galactopyranoside as a substrate, and it retained 50% of its initial activity after 1 h of incubation at 70 degrees C.

  1. An open reading frame in intron seven of the sea urchin DNA-methyltransferase gene codes for a functional AP1 endonuclease.

    PubMed

    Cioffi, Anna Valentina; Ferrara, Diana; Cubellis, Maria Vittoria; Aniello, Francesco; Corrado, Marcella; Liguori, Francesca; Amoroso, Alessandro; Fucci, Laura; Branno, Margherita

    2002-08-01

    Analysis of the genome structure of the Paracentrotus lividus (sea urchin) DNA methyltransferase (DNA MTase) gene showed the presence of an open reading frame, named METEX, in intron 7 of the gene. METEX expression is developmentally regulated, showing no correlation with DNA MTase expression. In fact, DNA MTase transcripts are present at high concentrations in the early developmental stages, while METEX is expressed at late stages of development. Two METEX cDNA clones (Met1 and Met2) that are different in the 3' end have been isolated in a cDNA library screening. The putative translated protein from Met2 cDNA clone showed similarity with Escherichia coli endonuclease III on the basis of sequence and predictive three-dimensional structure. The protein, overexpressed in E. coli and purified, had functional properties similar to the endonuclease specific for apurinic/apyrimidinic (AP) sites on the basis of the lyase activity. Therefore the open reading frame, present in intron 7 of the P. lividus DNA MTase gene, codes for a functional AP endonuclease designated SuAP1.

  2. Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4.

    PubMed

    Abbott, Geoffrey W

    2016-08-01

    The 5 human (h)KCNE β subunits each regulate various cation channels and are linked to inherited cardiac arrhythmias. Reported here are previously undiscovered protein-coding regions in exon 1 of hKCNE3 and hKCNE4 that extend their encoded extracellular domains by 44 and 51 residues, which yields full-length proteins of 147 and 221 residues, respectively. Full-length hKCNE3 and hKCNE4 transcript and protein are expressed in multiple human tissues; for hKCNE4, only the longer protein isoform is detectable. Two-electrode voltage-clamp electrophysiology revealed that, when coexpressed in Xenopus laevis oocytes with various potassium channels, the newly discovered segment preserved conversion of KCNQ1 by hKCNE3 to a constitutively open channel, but prevented its inhibition of Kv4.2 and KCNQ4. hKCNE4 slowing of Kv4.2 inactivation and positive-shifted steady-state inactivation were also preserved in the longer form. In contrast, full-length hKCNE4 inhibition of KCNQ1 was limited to 40% at +40 mV vs. 80% inhibition by the shorter form, and augmentation of KCNQ4 activity by hKCNE4 was entirely abolished by the additional segment. Among the genome databases analyzed, the longer KCNE3 is confined to primates; full-length KCNE4 is widespread in vertebrates but is notably absent from Mus musculus. Findings highlight unexpected KCNE gene diversity, raise the possibility of dynamic regulation of KCNE partner modulation via splice variation, and suggest that the longer hKCNE3 and hKCNE4 proteins should be adopted in future mechanistic and genetic screening studies.-Abbott, G. W. Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4. © FASEB.

  3. Long non-coding RNAs and mRNAs profiling during spleen development in pig.

    PubMed

    Che, Tiandong; Li, Diyan; Jin, Long; Fu, Yuhua; Liu, Yingkai; Liu, Pengliang; Wang, Yixin; Tang, Qianzi; Ma, Jideng; Wang, Xun; Jiang, Anan; Li, Xuewei; Li, Mingzhou

    2018-01-01

    Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.

  4. Complex Interplay among DNA Modification, Noncoding RNA Expression and Protein-Coding RNA Expression in Salvia miltiorrhiza Chloroplast Genome

    PubMed Central

    Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

    2014-01-01

    Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggests that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box–like motif (CPGDMM1, “TATANNNATNA”), and an unknown motif (CPGDMM2, “WNYANTGAW”). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed a significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1 motif (p<0.01). Taken together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in the chloroplast genome. PMID:24914614

  5. XGC developments for a more efficient XGC-GENE code coupling

    NASA Astrophysics Data System (ADS)

    Dominski, Julien; Hager, Robert; Ku, Seung-Hoe; Chang, Cs

    2017-10-01

    In the Exascale Computing Program, the High-Fidelity Whole Device Modeling project initially aims at delivering a tightly-coupled simulation of plasma neoclassical and turbulence dynamics from the core to the edge of the tokamak. To permit such simulations, the gyrokinetic codes GENE and XGC will be coupled together. Numerical efforts are made to improve the agreement of the numerical schemes in the coupling region. One of the difficulties of coupling those codes together is the incompatibility of their grids. GENE is a continuum grid-based code and XGC is a Particle-In-Cell code using an unstructured triangular mesh. A field-aligned filter is thus implemented in XGC. Even though XGC originally had an approximately field-following mesh, this field-aligned filter yields a perturbation discretization closer to the one solved in the field-aligned code GENE. Additionally, new XGC gyro-averaging matrices are implemented on a velocity grid adapted to the plasma properties, thus ensuring the same accuracy from the core to the edge regions.

  6. Amino acid codes in mitochondria as possible clues to primitive codes

    NASA Technical Reports Server (NTRS)

    Jukes, T. H.

    1981-01-01

    Differences between mitochondrial codes and the universal code indicate that an evolutionary simplification has taken place, rather than a return to a more primitive code. However, these differences make it evident that the universal code is not the only code possible, and therefore earlier codes may have differed markedly from the present code. The present universal code is probably a 'frozen accident.' The change in CUN codons from leucine to threonine (Neurospora vs. yeast mitochondria) indicates that neutral or near-neutral changes occurred in the corresponding proteins when this code change took place, caused presumably by a mutation in a tRNA gene.

  7. Gene expression profiling of porcine skeletal muscle in the early recovery phase following acute physical activity.

    PubMed

    Jensen, Jeanette H; Conley, Lene N; Hedegaard, Jakob; Nielsen, Mathilde; Young, Jette F; Oksbjerg, Niels; Hornshøj, Henrik; Bendixen, Christian; Thomsen, Bo

    2012-07-01

    Acute physical activity elicits changes in gene expression in skeletal muscles to promote metabolic changes and to repair exercise-induced muscle injuries. In the present time-course study, pigs were submitted to an acute bout of treadmill running until near exhaustion to determine the impact of unaccustomed exercise on global transcriptional profiles in porcine skeletal muscles. Using a combined microarray and candidate gene approach, we identified a suite of genes that are differentially expressed in muscles during postexercise recovery. Several members of the heat shock protein family and proteins associated with proteolytic events, such as the muscle-specific E3 ubiquitin ligase atrogin-1, were significantly upregulated, suggesting that protein breakdown, prevention of protein aggregation and stabilization of unfolded proteins are important processes for restoration of cellular homeostasis. We also detected an upregulation of genes that are associated with muscle cell proliferation and differentiation, including MUSTN1, ASB5 and CSRP3, possibly reflecting activation, differentiation and fusion of satellite cells to facilitate repair of muscle damage. In addition, exercise increased expression of the orphan nuclear hormone receptor NR4A3, which regulates metabolic functions associated with lipid, carbohydrate and energy homeostasis. Finally, we observed an unanticipated induction of the long non-coding RNA transcript NEAT1, which has been implicated in RNA processing and nuclear retention of adenosine-to-inosine edited mRNAs in the ribonucleoprotein bodies called paraspeckles. These findings expand the complexity of pathways affected by acute contractile activity of skeletal muscle, contributing to a better understanding of the molecular processes that occur in muscle tissue in the recovery phase.

  8. Evolutionary relationships between miRNA genes and their activity.

    PubMed

    Zhu, Yan; Skogerbø, Geir; Ning, Qianqian; Wang, Zhen; Li, Biqing; Yang, Shuang; Sun, Hong; Li, Yixue

    2012-12-22

    The emergence of vertebrates is characterized by a strong increase in miRNA families. MicroRNAs interact broadly with many transcripts, and the evolution of such a system is intriguing. However, evolutionary questions concerning the origin of miRNA genes and their subsequent evolution remain unexplained. To systematically understand the evolutionary relationship between miRNA genes and their function, we classified known human miRNAs into eight groups based on their evolutionary ages, estimated by the maximum parsimony method. New miRNA genes with new functional sequences accumulated more dynamically in vertebrates than in Drosophila. Different levels of evolutionary selection were observed over miRNA gene sequences with different times of origin. Most genic miRNAs differ from their host genes in time of origin: there is no particular relationship between the age of a miRNA and the age of its host gene, and genic miRNAs are mostly younger than the corresponding host genes. MicroRNAs that originated over different time-scales are often predicted or verified to target the same or overlapping sets of genes, opening the possibility of substantial functional redundancy among miRNAs of different ages. A higher degree of tissue specificity and a lower expression level were found in young miRNAs. Our data showed that, compared with protein-coding genes, miRNA genes are more dynamic in terms of emergence and decay. Evolution patterns are quite different between miRNAs of different ages. MicroRNA activity is under tight control, with well-regulated expression increasing and targeting decreasing over time. Our work calls attention to the study of miRNA activity with a consideration of their origin time.

  9. Coding of Class I and II aminoacyl-tRNA synthetases

    PubMed Central

    Carter, Charles W.

    2018-01-01

    SUMMARY The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels—protozymes and Urzymes—associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric—middle base-pairing frequencies in sense/antisense alignments—that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins. PMID:28828732

  10. Protein geranylgeranyltransferase of Saccharomyces cerevisiae is specific for Cys-Xaa-Xaa-Leu motif proteins and requires the CDC43 gene product but not the DPR1 gene product.

    PubMed Central

    Finegold, A A; Johnson, D I; Farnsworth, C C; Gelb, M H; Judd, S R; Glomset, J A; Tamanoi, F

    1991-01-01

    Protein prenylation occurs by modification of proteins with one of at least two isoprenoids, the farnesyl group and the geranylgeranyl group. Protein farnesyltransferases have been identified, but no such enzyme has been identified for geranylgeranylation. We report the identification of an activity in crude soluble yeast extracts that catalyzes the transfer of a geranylgeranyl moiety from geranylgeranyl pyrophosphate to proteins having the C-terminal sequence Cys-Ile-Ile-Leu or Cys-Val-Leu-Leu but not to a similar protein ending with Cys-Ile-Ile-Ser. This activity is dependent upon the CDC43/CAL1 gene, which is involved in budding and the control of cell polarity, but does not require the DPR1/RAM1 gene, which is known to be required for the farnesylation of Ras proteins. These results indicate that the protein geranylgeranyltransferase activity is distinct from the protein farnesyltransferase activity and that its specificity depends in part on the extreme C-terminal leucine in the protein to be prenylated. PMID:2034682

  11. The Yersinia pestis gcvB gene encodes two small regulatory RNA molecules

    PubMed Central

    McArthur, Sarah D; Pulvermacher, Sarah C; Stauffer, George V

    2006-01-01

    Background In recent years it has become clear that small non-coding RNAs function as regulatory elements in bacterial virulence and bacterial stress responses. We tested for the presence of the small non-coding GcvB RNAs in Y. pestis as possible regulators of gene expression in this organism. Results In this study, we report that the Yersinia pestis KIM6 gcvB gene encodes two small RNAs. Transcription of gcvB is activated by the GcvA protein and repressed by the GcvR protein. The gcvB-encoded RNAs are required for repression of the Y. pestis dppA gene, encoding the periplasmic-binding protein component of the dipeptide transport system, showing that the GcvB RNAs have regulatory activity. A deletion of the gcvB gene from the Y. pestis KIM6 chromosome results in a decrease in the generation time of the organism as well as a change in colony morphology. Conclusion The results of this study indicate that the Y. pestis gcvB gene encodes two small non-coding regulatory RNAs that repress dppA expression. A gcvB deletion is pleiotropic, suggesting that the sRNAs are likely involved in controlling genes in addition to dppA. PMID:16768793

  12. The transcriptional activator ZNF143 is essential for normal development in zebrafish

    PubMed Central

    2012-01-01

    Background ZNF143 is a sequence-specific DNA-binding protein that stimulates transcription of both small RNA genes, by RNA polymerase II or III, and protein-coding genes, by RNA polymerase II, using separable activating domains. We describe phenotypic effects following knockdown of this protein in developing Danio rerio (zebrafish) embryos by injection of morpholino antisense oligonucleotides that target znf143 mRNA. Results The loss-of-function phenotype is pleiotropic and includes a broad array of abnormalities, including defects in heart, blood, ear and midbrain-hindbrain boundary. Defects are rescued by coinjection of synthetic mRNA encoding full-length ZNF143 protein, but not by protein lacking the amino-terminal activation domains. Accordingly, expression of several marker genes is affected following knockdown, including GATA-binding protein 1 (gata1), cardiac myosin light chain 2 (cmlc2) and paired box gene 2a (pax2a). The zebrafish pax2a gene proximal promoter contains two binding sites for ZNF143, and reporter gene transcription driven by this promoter in transfected cells is activated by this protein. Conclusions Normal development of zebrafish embryos requires ZNF143. Furthermore, the pax2a gene is probably one example of many protein-coding gene targets of ZNF143 during zebrafish development. PMID:22268977

  13. The transcriptional activator ZNF143 is essential for normal development in zebrafish.

    PubMed

    Halbig, Kari M; Lekven, Arne C; Kunkel, Gary R

    2012-01-23

    ZNF143 is a sequence-specific DNA-binding protein that stimulates transcription of both small RNA genes, by RNA polymerase II or III, and protein-coding genes, by RNA polymerase II, using separable activating domains. We describe phenotypic effects following knockdown of this protein in developing Danio rerio (zebrafish) embryos by injection of morpholino antisense oligonucleotides that target znf143 mRNA. The loss-of-function phenotype is pleiotropic and includes a broad array of abnormalities, including defects in heart, blood, ear and midbrain-hindbrain boundary. Defects are rescued by coinjection of synthetic mRNA encoding full-length ZNF143 protein, but not by protein lacking the amino-terminal activation domains. Accordingly, expression of several marker genes is affected following knockdown, including GATA-binding protein 1 (gata1), cardiac myosin light chain 2 (cmlc2) and paired box gene 2a (pax2a). The zebrafish pax2a gene proximal promoter contains two binding sites for ZNF143, and reporter gene transcription driven by this promoter in transfected cells is activated by this protein. Normal development of zebrafish embryos requires ZNF143. Furthermore, the pax2a gene is probably one example of many protein-coding gene targets of ZNF143 during zebrafish development.

  14. General theory for integrated analysis of growth, gene, and protein expression in biofilms.

    PubMed

    Zhang, Tianyu; Pabst, Breana; Klapper, Isaac; Stewart, Philip S

    2013-01-01

    A theory for analysis and prediction of spatial and temporal patterns of gene and protein expression within microbial biofilms is derived. The theory integrates phenomena of solute reaction and diffusion, microbial growth, mRNA or protein synthesis, biomass advection, and gene transcript or protein turnover. Case studies illustrate the capacity of the theory to simulate heterogeneous spatial patterns and predict microbial activities in biofilms that are qualitatively different from those of planktonic cells. Specific scenarios analyzed include an inducible GFP or fluorescent protein reporter, a denitrification gene repressed by oxygen, an acid stress response gene, and a quorum sensing circuit. It is shown that the patterns of activity revealed by inducible stable fluorescent proteins or reporter unstable proteins overestimate the region of activity. This is due to advective spreading and finite protein turnover rates. In the cases of a gene induced by either limitation for a metabolic substrate or accumulation of a metabolic product, maximal expression is predicted in an internal stratum of the biofilm. A quorum sensing system that includes an oxygen-responsive negative regulator exhibits behavior that is distinct from any stage of a batch planktonic culture. Though here the analyses have been limited to simultaneous interactions of up to two substrates and two genes, the framework applies to arbitrarily large networks of genes and metabolites. Extension of reaction-diffusion modeling in biofilms to the analysis of individual genes and gene networks is an important advance that dovetails with the growing toolkit of molecular and genetic experimental techniques.
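
    The abstract above does not give its equations, so the following is only a minimal sketch of the solute reaction-diffusion ingredient it mentions, not the authors' theory. It assumes a flat one-dimensional biofilm at steady state, first-order substrate uptake, a fixed bulk concentration at the biofilm surface, and no flux at the substratum; the function name and all parameter values are hypothetical. Under those assumptions the textbook solution is C(x) = C_bulk * cosh(phi * x / L) / cosh(phi), with phi = L * sqrt(k / D).

```python
import math


def substrate_profile(c_bulk, thickness, diffusivity, rate_constant, n_points=11):
    """Steady-state substrate concentration across a flat biofilm.

    Assumes (for illustration only) first-order uptake, D * C'' = k * C,
    with no flux at the substratum (x = 0) and C = c_bulk at the
    biofilm surface (x = thickness).
    """
    # Thiele modulus: ratio of reaction rate to diffusion rate.
    phi = thickness * math.sqrt(rate_constant / diffusivity)
    xs = [thickness * i / (n_points - 1) for i in range(n_points)]
    cs = [c_bulk * math.cosh(phi * x / thickness) / math.cosh(phi) for x in xs]
    return xs, cs


# Hypothetical values: 200 micrometre biofilm, oxygen-like diffusivity.
depths, conc = substrate_profile(
    c_bulk=8.0,          # g/m^3 at the biofilm surface (assumed)
    thickness=200e-6,    # m (assumed)
    diffusivity=2e-9,    # m^2/s (assumed)
    rate_constant=0.5,   # 1/s (assumed)
)
for x, c in zip(depths, conc):
    print(f"x = {x * 1e6:6.1f} um   C = {c:6.3f} g/m^3")
```

    The profile falls from the bulk value at the surface toward the substratum, which is the kind of gradient that makes gene expression in a biofilm depend on depth.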

  15. Correlation of rare coding variants in the gene encoding human glucokinase regulatory protein with phenotypic, cellular, and kinetic outcomes.

    PubMed

    Rees, Matthew G; Ng, David; Ruppert, Sarah; Turner, Clesson; Beer, Nicola L; Swift, Amy J; Morken, Mario A; Below, Jennifer E; Blech, Ilana; Mullikin, James C; McCarthy, Mark I; Biesecker, Leslie G; Gloyn, Anna L; Collins, Francis S

    2012-01-01

    Defining the genetic contribution of rare variants to common diseases is a major basic and clinical science challenge that could offer new insights into disease etiology and provide potential for directed gene- and pathway-based prevention and treatment. Common and rare nonsynonymous variants in the GCKR gene are associated with alterations in metabolic traits, most notably serum triglyceride levels. GCKR encodes glucokinase regulatory protein (GKRP), a predominantly nuclear protein that inhibits hepatic glucokinase (GCK) and plays a critical role in glucose homeostasis. The mode of action of rare GCKR variants remains unexplored. We identified 19 nonsynonymous GCKR variants among 800 individuals from the ClinSeq medical sequencing project. Excluding the previously described common missense variant p.Pro446Leu, all variants were rare in the cohort. Accordingly, we functionally characterized all variants to evaluate their potential phenotypic effects. Defects were observed for the majority of the rare variants after assessment of cellular localization, ability to interact with GCK, and kinetic activity of the encoded proteins. Comparing the individuals with functional rare variants to those without such variants showed associations with lipid phenotypes. Our findings suggest that, while nonsynonymous GCKR variants, excluding p.Pro446Leu, are rare in individuals of mixed European descent, the majority do affect protein function. In sum, this study utilizes computational, cell biological, and biochemical methods to present a model for interpreting the clinical significance of rare genetic variants in common disease.

  16. Systematic screening for mutations in the promoter and the coding region of the 5-HT1A gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Erdmann, J.; Shimron-Abarbanell, D.; Cichon, S.

    1995-10-09

    In the present study we sought to identify genetic variation in the 5-HT1A receptor gene which through alteration of protein function or level of expression might contribute to the genetic predisposition to neuropsychiatric diseases. Genomic DNA samples from 159 unrelated subjects (including 45 schizophrenic, 46 bipolar affective, and 43 patients with Tourette's syndrome, as well as 25 healthy controls) were investigated by single-strand conformation analysis. Overlapping PCR (polymerase chain reaction) fragments covered the whole coding sequence as well as the 5′ untranslated region of the 5-HT1A gene. The region upstream to the coding sequence we investigated contains a functional promoter. We found two rare nucleotide sequence variants. Both mutations are located in the coding region of the gene: a coding mutation (A→G) in nucleotide position 82 which leads to an amino acid exchange (Ile→Val) in position 28 of the receptor protein and a silent mutation (C→T) in nucleotide position 549. The occurrence of the Ile-28-Val substitution was studied in an extended sample of patients (n = 352) and controls (n = 210) but was found in similar frequencies in all groups. Thus, this mutation is unlikely to play a significant role in the genetic predisposition to the diseases investigated. In conclusion, our study does not provide evidence that the 5-HT1A gene plays either a major or a minor role in the genetic predisposition to schizophrenia, bipolar affective disorder, or Tourette's syndrome. 29 refs., 4 figs., 1 tab.
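
    The position arithmetic in the abstract above can be checked with a short sketch. It assumes that nucleotide positions are 1-based and counted from the first base of the start codon, and it uses only the isoleucine and valine entries of the standard genetic code. The abstract does not state which isoleucine codon the gene carries, so all three are tried; the helper names are hypothetical.

```python
# Only the standard-genetic-code entries needed for this illustration.
CODON_TABLE = {
    "ATT": "Ile", "ATC": "Ile", "ATA": "Ile",
    "GTT": "Val", "GTC": "Val", "GTA": "Val", "GTG": "Val",
}


def locate_in_codon(nt_position):
    """Map a 1-based coding-sequence position to (codon number, base within codon)."""
    codon_number = (nt_position - 1) // 3 + 1
    base_in_codon = (nt_position - 1) % 3 + 1
    return codon_number, base_in_codon


def substitute(codon, base_in_codon, new_base):
    """Return the codon with the given 1-based base replaced."""
    i = base_in_codon - 1
    return codon[:i] + new_base + codon[i + 1:]


print(locate_in_codon(82))   # (28, 1): first base of codon 28
print(locate_in_codon(549))  # (183, 3): third base of codon 183

# An A->G change at the first base turns every isoleucine codon into a valine codon.
for codon in ("ATT", "ATC", "ATA"):
    mutated = substitute(codon, 1, "G")
    print(codon, CODON_TABLE[codon], "->", mutated, CODON_TABLE[mutated])
```

    Position 82 falls on the first base of codon 28, which matches the reported Ile-28-Val exchange, and position 549 falls on the third base of codon 183, the position at which substitutions are most often silent.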

  17. The HMG-I/Y-related protein p8 binds to p300 and Pax2 trans-activation domain-interacting protein to regulate the trans-activation activity of the Pax2A and Pax2B transcription factors on the glucagon gene promoter.

    PubMed

    Hoffmeister, Albrecht; Ropolo, Alejandro; Vasseur, Sophie; Mallo, Gustavo V; Bodeker, Hans; Ritz-Laser, Beate; Dressler, Gregory R; Vaccaro, Maria Ines; Dagorn, Jean-Charles; Moreno, Silvia; Iovanna, Juan Lucio

    2002-06-21

    p8 is a nuclear DNA-binding protein, which was identified because its expression is strongly activated in response to several stresses. Biochemical and biophysical studies revealed that despite a weak sequence homology p8 is an HMG-I/Y-like protein, suggesting that p8 may be involved in transcription regulation. Results reported here strongly support this hypothesis. Using a pull-down approach, we found that p8 interacts with the general co-activator p300. We also found that, similar to the HMG proteins, p300 was able to acetylate recombinant p8 in vitro, although the significance of such modification remains to be determined. Then a screening by the two-hybrid system, using p8 as bait, allowed us to identify the Pax2 trans-activation domain-interacting protein (PTIP) as another partner of p8. Transient transfection studies revealed that PTIP is a strong inhibitor of the trans-activation activities of Pax2A and Pax2B on the glucagon gene promoter, which was chosen as a model because it is a target of the Pax2A and Pax2B transcription factors. This effect is completely abolished by co-transfection of p8 in glucagon-producing InRIG9 cells, indicating that p8 binding to PTIP prevents inhibition of the glucagon gene promoter. This was not observed in NIH3T3 fibroblasts that do not express glucagon. Finally, expression of p8 enhances the effect of p300 on Pax2A and Pax2B trans-activation of the glucagon gene promoter. These observations suggest that in glucagon-producing cells p8 is a positive cofactor of the activation of the glucagon gene promoter by Pax2A and Pax2B, both by recruiting the p300 cofactor to increase the Pax2A and Pax2B activities and by binding the Pax2-interacting protein PTIP to suppress its inhibition.

  18. Umchs5, a gene coding for a class IV chitin synthase in Ustilago maydis.

    PubMed

    Xoconostle-Cázares, B; Specht, C A; Robbins, P W; Liu, Y; León, C; Ruiz-Herrera, J

    1997-12-01

    A fragment corresponding to a conserved region of a fifth gene coding for chitin synthase in the plant pathogenic fungus Ustilago maydis was amplified by means of the polymerase chain reaction (PCR). The amplified fragment was utilized as a probe for the identification of the whole gene in a genomic library of the fungus. The predicted gene product of Umchs5 has highest similarity with class IV chitin synthases encoded by the CHS3 genes from Saccharomyces cerevisiae and Candida albicans, chs-4 from Neurospora crassa, and chsE from Aspergillus nidulans. Umchs5 null mutants were constructed by substitution of most of the coding sequence with the hygromycin B resistance cassette. Mutants displayed significant reduction in growth rate, chitin content, and chitin synthase activity, especially in the mycelial form. Virulence to corn plantules was also reduced in the mutants. PCR was also used to obtain a fragment of a sixth chitin synthase, Umchs6. It is suggested that multigenic control of chitin synthesis in U. maydis operates as a protection mechanism for fungal viability in which the loss of one activity is partially compensated by the remaining enzymes. Copyright 1997 Academic Press.

  19. The Triticum aestivum non-specific lipid transfer protein (TaLtp) gene family: comparative promoter activity of six TaLtp genes in transgenic rice.

    PubMed

    Boutrot, Freddy; Meynard, Donaldo; Guiderdoni, Emmanuel; Joudrier, Philippe; Gautier, Marie-Françoise

    2007-03-01

    Plant non-specific lipid transfer proteins (nsLTPs) are encoded by a multigene family and support physiological functions, which remain unclear. We adapted an efficient ligation-mediated polymerase chain reaction (LM-PCR) procedure that enabled isolation of 22 novel Triticum aestivum nsLtp (TaLtp) genes encoding types 1 and 2 nsLTPs. A phylogenetic tree clustered the wheat nsLTPs into ten subfamilies comprising 1-7 members. We also studied the activity of four type 1 and two type 2 TaLtp gene promoters in transgenic rice using the beta-Glucuronidase (GUS) reporter gene. The activities of the six promoters displayed both overlapping and distinct features in rice. In vegetative organs, these promoters were active in leaves and root vascular tissues while no GUS activity was detected in stems. In flowers, the GUS activity driven by the TaLtp7.2a, TaLtp9.1a, TaLtp9.2d, and TaLtp9.3e gene promoters was associated with vascular tissues in glumes and in the extremities of anther filaments whereas only the TaLtp9.4a gene promoter was active in anther epidermal cells. In developing grains, GUS activity and GUS immunolocalization data evidenced complex patterns of activity of the TaLtp7.1a, TaLtp9.2d, and TaLtp9.4a gene promoters in embryo scutellum and in the grain epicarp cell layer. In contrast, GUS activity driven by TaLtp7.2a, TaLtp9.1a, and TaLtp9.3e promoters was restricted to the vascular bundle of the embryo scutellum. This diversity of TaLtp gene promoter activity supports the hypothesis that the encoded TaLTPs possess distinct functions in planta.

  20. Identification and Validation of Selected Universal Stress Protein Domain Containing Drought-Responsive Genes in Pigeonpea (Cajanus cajan L.)

    PubMed Central

    Sinha, Pallavi; Pazhamala, Lekha T.; Singh, Vikas K.; Saxena, Rachit K.; Krishnamurthy, L.; Azam, Sarwar; Khan, Aamir W.; Varshney, Rajeev K.

    2016-01-01

    Pigeonpea is a resilient crop, which is relatively more drought tolerant than many other legume crops. To understand the molecular mechanisms of this unique feature of pigeonpea, 51 genes that code for proteins closely similar to the universal stress protein domain were selected using Hidden Markov Models (HMM). Validation of these genes was conducted on three pigeonpea genotypes (ICPL 151, ICPL 8755, and ICPL 227) having different levels of drought tolerance. Gene expression analysis using qRT-PCR revealed 6, 8, and 18 genes to be ≥2-fold differentially expressed in ICPL 151, ICPL 8755, and ICPL 227, respectively. A total of 10 differentially expressed genes showed ≥2-fold up-regulation in the more drought tolerant genotype, which encoded four different classes of proteins. These include plant U-box protein (four genes), universal stress protein A-like protein (four genes), cation/H(+) antiporter protein (one gene) and an uncharacterized protein (one gene). Genes C.cajan_29830 and C.cajan_33874, belonging to uspA, were found to be significantly expressed in all three genotypes with ≥2-fold expression variations. Expression profiling of these two genes in four other legume crops revealed their specific role in pigeonpea. Therefore, these genes seem to be promising candidates for conferring drought tolerance specifically to pigeonpea. PMID:26779199

  1. In-depth comparative analysis of malaria parasite genomes reveals protein-coding genes linked to human disease in Plasmodium falciparum genome.

    PubMed

    Liu, Xuewu; Wang, Yuanyuan; Liang, Jiao; Wang, Luojun; Qin, Na; Zhao, Ya; Zhao, Gang

    2018-05-02

    Plasmodium falciparum is the most virulent malaria parasite capable of parasitizing human erythrocytes. The identification of genes related to this capability can enhance our understanding of the molecular mechanisms underlying human malaria and lead to the development of new therapeutic strategies for malaria control. With the availability of several malaria parasite genome sequences, performing computational analysis is now a practical strategy to identify genes contributing to this disease. Here, we developed and used a virtual genome method to assign 33,314 genes from three human malaria parasites, namely, P. falciparum, P. knowlesi and P. vivax, and three rodent malaria parasites, namely, P. berghei, P. chabaudi and P. yoelii, to 4605 clusters. Each cluster consisted of genes whose protein sequences were significantly similar and was considered as a virtual gene. Comparing the enriched values of all clusters in human malaria parasites with those in rodent malaria parasites revealed 115 P. falciparum genes putatively responsible for parasitizing human erythrocytes. These genes are mainly located in the chromosome internal regions and participate in many biological processes, including membrane protein trafficking and thiamine biosynthesis. Meanwhile, 289 P. berghei genes were included in the rodent parasite-enriched clusters. Most are located in subtelomeric regions and encode erythrocyte surface proteins. Comparing cluster values in P. falciparum with those in P. vivax and P. knowlesi revealed 493 candidate genes linked to virulence. Some of them encode proteins present on the erythrocyte surface and participate in cytoadhesion, virulence factor trafficking, or erythrocyte invasion, but many genes with unknown function were also identified. Cerebral malaria is characterized by the accumulation of infected erythrocytes at the trophozoite stage in the brain microvasculature. To discover cerebral malaria-related genes, fast Fourier transformation (FFT) was introduced to extract

  2. GCN-2 dependent inhibition of protein synthesis activates osmosensitive gene transcription via WNK and Ste20 kinase signaling

    PubMed Central

    Lee, Elaine Choung-Hee

    2012-01-01

    Increased gpdh-1 transcription is required for accumulation of the organic osmolyte glycerol and survival of Caenorhabditis elegans during hypertonic stress. Our previous work has shown that knockdown of regulators of gpdh-1 (rgpd) genes constitutively activates gpdh-1 expression. Fifty-five rgpd genes play essential roles in translation, suggesting that inhibition of protein synthesis is an important signal for regulating osmoprotective gene transcription. We demonstrate here that translation is reduced dramatically by hypertonic stress or knockdown of rgpd genes encoding aminoacyl-tRNA synthetases and eukaryotic translation initiation factors (eIFs). Toxin-induced inhibition of translation also activates gpdh-1 expression. Hypertonicity-induced translation inhibition is mediated by general control nonderepressible (GCN)-2 kinase signaling and eIF-2α phosphorylation. Loss of gcn-1 or gcn-2 function prevents eIF-2α phosphorylation, completely blocks reductions in translation, and inhibits gpdh-1 transcription. gpdh-1 expression is regulated by the highly conserved with-no-lysine kinase (WNK) and Ste20 kinases WNK-1 and GCK-3, which function in the GCN-2 signaling pathway downstream from eIF-2α phosphorylation. Our previous work has shown that hypertonic stress causes rapid and dramatic protein damage in C. elegans and that inhibition of translation reduces this damage. The current studies demonstrate that reduced translation also serves as an essential signal for activation of WNK-1/GCK-3 kinase signaling and subsequent transcription of gpdh-1 and possibly other osmoprotective genes. PMID:23076791

  3. Role of Accessory Proteins of HTLV-1 in Viral Replication, T Cell Activation, and Cellular Gene Expression

    PubMed Central

    Michael, Bindhu; Nair, Amithraj; Lairmore, Michael D.

    2010-01-01

    Human T-cell lymphotropic virus type 1 (HTLV-1) causes adult T cell leukemia/lymphoma (ATLL) and initiates a variety of immune-mediated disorders. The viral genome encodes common structural and enzymatic proteins characteristic of all retroviruses and utilizes alternative splicing and alternate codon usage to make several regulatory and accessory proteins encoded in the pX region (pX ORF I to IV). Recent studies indicate that the accessory proteins p12I, p27I, p13II, and p30II, encoded by pX ORF I and II, contribute to viral replication and the ability of the virus to maintain typical in vivo expression levels. Proviral clones that are mutated in either pX ORF I or II, while fully competent in cell culture, are severely limited in their replicative capacity in a rabbit model. These HTLV-1 accessory proteins are critical for establishment of viral infectivity, enhance T-lymphocyte activation and potentially alter gene transcription and mitochondrial function. HTLV-1 pX ORF I expression is critical to viral infectivity in resting primary lymphocytes, suggesting a role for the calcineurin-binding protein p12I in lymphocyte activation. The endoplasmic reticulum and cis-Golgi localizing p12I activates NFAT, a key T cell transcription factor, through calcium-mediated signaling pathways and may lower the threshold of lymphocyte activation via the JAK/STAT pathway. In contrast, p30II localizes to the nucleus and represses viral promoter activity, but may regulate cellular gene expression through p300/CBP or related co-activators of transcription. The mitochondrial localizing p13II induces morphologic changes in the organelle and may influence energy metabolism in infected cells. Future studies of the molecular details of HTLV-1 “accessory” protein interactions will provide important new directions for investigations of HTLV-1 and related viruses associated with lymphoproliferative diseases. Thus, the accessory proteins of HTLV-1, once thought to be dispensable for

  4. Compound A, a Selective Glucocorticoid Receptor Modulator, Enhances Heat Shock Protein Hsp70 Gene Promoter Activation

    PubMed Central

    Beck, Ilse M.; Drebert, Zuzanna J.; Hoya-Arias, Ruben; Bahar, Ali A.; Devos, Michael; Clarisse, Dorien; Desmet, Sofie; Bougarne, Nadia; Ruttens, Bart; Gossye, Valerie; Denecker, Geertrui; Lievens, Sam; Bracke, Marc; Tavernier, Jan; Declercq, Wim; Gevaert, Kris; Berghe, Wim Vanden; Haegeman, Guy; De Bosscher, Karolien

    2013-01-01

    Compound A possesses glucocorticoid receptor (GR)-dependent anti-inflammatory properties. Just like classical GR ligands, Compound A can repress NF-κB-mediated gene expression. However, the monomeric Compound A-activated GR is unable to trigger glucocorticoid response element-regulated gene expression. The heat shock response potently activates heat shock factor 1 (HSF1), upregulates Hsp70, a known GR chaperone, and also modulates various aspects of inflammation. We found that the selective GR modulator Compound A and heat shock trigger similar cellular effects in A549 lung epithelial cells. With regard to their anti-inflammatory mechanism, heat shock and Compound A are both able to reduce TNF-stimulated IκBα degradation and NF-κB p65 nuclear translocation. We established an interaction between Compound A-activated GR and Hsp70, but remarkably, although the presence of the Hsp70 chaperone as such appears pivotal for the Compound A-mediated inflammatory gene repression, subsequent novel Hsp70 protein synthesis is uncoupled from an observed CpdA-induced Hsp70 mRNA upregulation and hence obsolete in mediating CpdA’s anti-inflammatory effect. The lack of a Compound A-induced increase in Hsp70 protein levels in A549 cells is not mediated by a rapid proteasomal degradation of Hsp70 or by a Compound A-induced general block on translation. Similar to heat shock, Compound A can upregulate transcription of Hsp70 genes in various cell lines and BALB/c mice. Interestingly, whereas Compound A-dependent Hsp70 promoter activation is GR-dependent but HSF1-independent, heat shock-induced Hsp70 expression alternatively occurs in a GR-independent and HSF1-dependent manner in A549 lung epithelial cells. PMID:23935933

  5. CCAAT/enhancer-binding protein delta activates insulin-like growth factor-I gene transcription in osteoblasts. Identification of a novel cyclic AMP signaling pathway in bone

    NASA Technical Reports Server (NTRS)

    Umayahara, Y.; Ji, C.; Centrella, M.; Rotwein, P.; McCarthy, T. L.

    1997-01-01

    Insulin-like growth factor-I (IGF-I) plays a key role in skeletal growth by stimulating bone cell replication and differentiation. We previously showed that prostaglandin E2 (PGE2) and other cAMP-activating agents enhanced IGF-I gene transcription in cultured primary rat osteoblasts through promoter 1, the major IGF-I promoter, and identified a short segment of the promoter, termed HS3D, that was essential for hormonal regulation of IGF-I gene expression. We now demonstrate that CCAAT/enhancer-binding protein (C/EBP) delta is a major component of a PGE2-stimulated DNA-protein complex involving HS3D and find that C/EBPdelta transactivates IGF-I promoter 1 through this site. Competition gel shift studies first indicated that a core C/EBP half-site (GCAAT) was required for binding of a labeled HS3D oligomer to osteoblast nuclear proteins. Southwestern blotting and UV-cross-linking studies showed that the HS3D probe recognized an approximately 35-kDa nuclear protein, and antibody supershift assays indicated that C/EBPdelta comprised most of the PGE2-activated gel-shifted complex. C/EBPdelta was detected by Western immunoblotting in osteoblast nuclear extracts after treatment of cells with PGE2. An HS3D oligonucleotide competed effectively with a high affinity C/EBP site from the rat albumin gene for binding to osteoblast nuclear proteins. Co-transfection of osteoblast cell cultures with a C/EBPdelta expression plasmid enhanced basal and PGE2-activated IGF-I promoter 1-luciferase activity but did not stimulate a reporter gene lacking an HS3D site. By contrast, an expression plasmid for the related protein, C/EBPbeta, did not alter basal IGF-I gene activity but did increase the response to PGE2. In osteoblasts and in COS-7 cells, C/EBPdelta, but not C/EBPbeta, transactivated a reporter gene containing four tandem copies of HS3D fused to a minimal promoter; neither transcription factor stimulated a gene with four copies of an HS3D mutant that was unable to bind osteoblast

  6. A Hox Gene, Antennapedia, Regulates Expression of Multiple Major Silk Protein Genes in the Silkworm Bombyx mori*

    PubMed Central

    Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki

    2016-01-01

    Hox genes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hox genes can also function in terminally differentiated tissue of the lepidopteran Bombyx mori. In this species, Antennapedia (Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antp can regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antp in the posterior silk gland induced ectopic expression of major silk protein genes such as sericin-3, fhxh4, and fhxh5. These genes are normally expressed specifically in the middle silk gland as is Antp. Therefore, the evidence strongly suggests that Antp activates these silk protein genes in the middle silk gland. The putative sericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antp directly activates their expression. We also found that the pattern of gene expression was well conserved between B. mori and the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori. We suggest that Hox genes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. PMID:26814126

  7. Estimation of divergence times in cnidarian evolution based on mitochondrial protein-coding genes and the fossil record.

    PubMed

    Park, Eunji; Hwang, Dae-Sik; Lee, Jae-Seong; Song, Jun-Im; Seo, Tae-Kun; Won, Yong-Jin

    2012-01-01

    The phylum Cnidaria is comprised of remarkably diverse and ecologically significant taxa, such as the reef-forming corals, and occupies a basal position in metazoan evolution. The origin of this phylum and the most recent common ancestors (MRCAs) of its modern classes remain mostly unknown, although scattered fossil evidence provides some insights on this topic. Here, we investigate the molecular divergence times of the major taxonomic groups of Cnidaria (27 Hexacorallia, 16 Octocorallia, and 5 Medusozoa) on the basis of mitochondrial DNA sequences of 13 protein-coding genes. For this analysis, the complete mitochondrial genomes of seven octocoral and two scyphozoan species were newly sequenced and combined with all available mitogenomic data from GenBank. Five reliable fossil dates were used to calibrate the Bayesian estimates of divergence times. The molecular evidence suggests that cnidarians originated 741 million years ago (Ma) (95% credible region of 686-819), and the major taxa diversified prior to the Cambrian (543 Ma). The Octocorallia and Scleractinia may have originated from radiations of survivors of the Permian-Triassic mass extinction, which matches their fossil record well. Copyright © 2011 Elsevier Inc. All rights reserved.

  8. Rat leucine-rich protein binds and activates the promoter of the beta isoform of Ca2+/calmodulin-dependent protein kinase II gene.

    PubMed

    Ochiai, Nagahiro; Masumoto, Shuji; Sakagami, Hiroyuki; Yoshimura, Yoshiyuki; Yamauchi, Takashi

    2007-05-01

    We previously found the neuronal cell-type specific promoter and binding partner of the beta isoform of Ca(2+)/calmodulin-dependent protein kinase II (beta CaM kinase II) in rat brain [Donai, H., Morinaga, H., Yamauchi, T., 2001. Genomic organization and neuronal cell type specific promoter activity of beta isoform of Ca(2+)/calmodulin-dependent protein kinase II of rat brain. Mol. Brain Res. 94, 35-47]. In the present study, we purified a protein that binds specifically to a promoter region of the beta CaM kinase II gene from a nuclear extract of the rat cerebellum using DEAE-cellulose column chromatography, ammonium sulfate fractionation, gel filtration and polyacrylamide gel electrophoresis. The purified protein was identified as rat leucine-rich protein 157 (rLRP157) using tandem mass spectrometry. Then, we prepared its cDNA by reverse transcriptase-polymerase chain reaction (RT-PCR) from poly(A)+ RNA of rat cerebellum. The rLRP157 cDNA was introduced into mouse neuroblastoma x rat glioma hybrid NG108-15 cells, and cells stably expressing rLRP157 (NG/LRP cells) were isolated. Binding of rLRP157 to the promoter sequence was confirmed by electrophoretic mobility shift assay using nuclear extract of NG/LRP cells. A luciferase reporter gene containing a promoter of beta CaM kinase II was transiently expressed in NG/LRP cells. Under these conditions, the promoter activity was enhanced about 2.6-fold in NG/LRP cells as compared with wild-type cells. The expression of rLRP157 mRNA, detected by in situ hybridization, paralleled that of beta CaM kinase II in the adult and embryo rat brain. Nuclear localization of rLRP157 was confirmed by confocal microscopy of a GFP-rLRP157 fusion protein. These results indicate that rLRP157 is one of the proteins binding to, and regulating the activity of, the promoter of beta CaM kinase II.

  9. A Conserved p38 Mitogen-Activated Protein Kinase Pathway Regulates Drosophila Immunity Gene Expression

    PubMed Central

    Han, Zhiqiang Stanley; Enslen, Hervé; Hu, Xiaodi; Meng, Xiangjun; Wu, I-Huan; Barrett, Tamera; Davis, Roger J.; Ip, Y. Tony

    1998-01-01

    Accumulating evidence suggests that the insect and mammalian innate immune response is mediated by homologous regulatory components. Proinflammatory cytokines and bacterial lipopolysaccharide stimulate mammalian immunity by activating transcription factors such as NF-κB and AP-1. One of the responses evoked by these stimuli is the initiation of a kinase cascade that leads to the phosphorylation of p38 mitogen-activated protein (MAP) kinase on Thr and Tyr within the motif Thr-Gly-Tyr, which is located within subdomain VIII. We have investigated the possible involvement of the p38 MAP kinase pathway in the Drosophila immune response. Two genes that are highly homologous to the mammalian p38 MAP kinase were molecularly cloned and characterized. Furthermore, genes that encode two novel Drosophila MAP kinase kinases, D-MKK3 and D-MKK4, were identified. D-MKK3 is an efficient activator of both Drosophila p38 MAP kinases, while D-MKK4 is an activator of D-JNK but not D-p38. These data establish that Drosophila indeed possesses a conserved p38 MAP kinase signaling pathway. We have examined the role of the D-p38 MAP kinases in the regulation of insect immunity. The results revealed that one of the functions of D-p38 is to attenuate antimicrobial peptide gene expression following exposure to lipopolysaccharide. PMID:9584193

  10. Light-Inducible Gene Regulation with Engineered Zinc Finger Proteins

    PubMed Central

    Polstein, Lauren R.; Gersbach, Charles A.

    2014-01-01

    The coupling of light-inducible protein-protein interactions with gene regulation systems has enabled the control of gene expression with light. In particular, heterodimer protein pairs from plants can be used to engineer a gene regulation system in mammalian cells that is reversible, repeatable, tunable, controllable in a spatiotemporal manner, and targetable to any DNA sequence. This system, Light-Inducible Transcription using Engineered Zinc finger proteins (LITEZ), is based on the blue light-induced interaction of GIGANTEA and the LOV domain of FKF1 that drives the localization of a transcriptional activator to the DNA-binding site of a highly customizable engineered zinc finger protein. This chapter provides methods for modifying LITEZ to target new DNA sequences, engineering a programmable LED array to illuminate cell cultures, and using the modified LITEZ system to achieve spatiotemporal control of transgene expression in mammalian cells. PMID:24718797

  11. A comparative study of disease genes and drug targets in the human protein interactome

    PubMed Central

    2015-01-01

    Background: Disease genes cause or contribute genetically to the development of most complex diseases. Drugs are the major approach to treating complex diseases, acting through interaction with their targets. Thus, drug targets are critical for treatment efficacy. However, the interrelationship between disease genes and drug targets is not clear. Results: In this study, we comprehensively compared the network properties of disease genes and drug targets for five major disease categories (cancer, cardiovascular disease, immune system disease, metabolic disease, and nervous system disease). We first collected disease genes from genome-wide association studies (GWAS) for five disease categories and collected their corresponding drugs based on drugs' Anatomical Therapeutic Chemical (ATC) classification. Then, we obtained the drug targets for these five different disease categories. We found that, though the intersections between disease genes and drug targets were small, disease genes were significantly enriched in targets compared to their enrichment in human protein-coding genes. We further compared network properties of the proteins encoded by disease genes and drug targets in human protein-protein interaction networks (interactome). The results showed that the drug targets tended to have higher degree, higher betweenness, and lower clustering coefficient in cancer. Furthermore, we observed a clear fraction increase of disease proteins or drug targets in the near neighborhood compared with the randomized genes. Conclusions: The study presents the first comprehensive comparison of the disease genes and drug targets in the context of interactome. The results provide some foundational network characteristics for further designing computational strategies to predict novel drug targets and drug repurposing. PMID:25861037

  12. A comparative study of disease genes and drug targets in the human protein interactome.

    PubMed

    Sun, Jingchun; Zhu, Kevin; Zheng, W; Xu, Hua

    2015-01-01

    Disease genes cause or contribute genetically to the development of most complex diseases. Drugs are the major approach to treating complex diseases, acting through interaction with their targets. Thus, drug targets are critical for treatment efficacy. However, the interrelationship between disease genes and drug targets is not clear. In this study, we comprehensively compared the network properties of disease genes and drug targets for five major disease categories (cancer, cardiovascular disease, immune system disease, metabolic disease, and nervous system disease). We first collected disease genes from genome-wide association studies (GWAS) for five disease categories and collected their corresponding drugs based on drugs' Anatomical Therapeutic Chemical (ATC) classification. Then, we obtained the drug targets for these five different disease categories. We found that, though the intersections between disease genes and drug targets were small, disease genes were significantly enriched in targets compared to their enrichment in human protein-coding genes. We further compared network properties of the proteins encoded by disease genes and drug targets in human protein-protein interaction networks (interactome). The results showed that the drug targets tended to have higher degree, higher betweenness, and lower clustering coefficient in cancer. Furthermore, we observed a clear fraction increase of disease proteins or drug targets in the near neighborhood compared with the randomized genes. The study presents the first comprehensive comparison of the disease genes and drug targets in the context of interactome. The results provide some foundational network characteristics for further designing computational strategies to predict novel drug targets and drug repurposing.
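
    As a rough illustration of two of the network properties named in this abstract (degree and clustering coefficient), the sketch below computes them on a toy undirected graph. The graph, the node labels, and the function names are hypothetical and are not taken from the study; betweenness is left out to keep the example short, and the authors' actual tooling is not described in the abstract.

```python
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from edge pairs."""
    adjacency = {}
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions in this toy example
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def degree(adjacency, node):
    """Number of direct interaction partners of a node."""
    return len(adjacency[node])


def clustering_coefficient(adjacency, node):
    """Fraction of a node's neighbor pairs that are themselves connected."""
    neighbors = adjacency[node]
    k = len(neighbors)
    if k < 2:
        return 0.0  # convention: undefined cases are reported as 0
    linked_pairs = sum(1 for u, v in combinations(neighbors, 2) if v in adjacency[u])
    return 2.0 * linked_pairs / (k * (k - 1))


# Hypothetical toy interactome: five proteins, five interactions.
toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]
toy = build_adjacency(toy_edges)

for protein in sorted(toy):
    print(protein, degree(toy, protein), round(clustering_coefficient(toy, protein), 3))
```

    In this toy graph, node A has the highest degree but a low clustering coefficient, which is the kind of profile the abstract reports for drug targets in cancer.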

  13. Molecular cloning and sequence analysis of the gene coding for the 57kDa soluble antigen of the salmonid fish pathogen Renibacterium salmoninarum

    USGS Publications Warehouse

    Chien, Maw-Sheng; Gilbert, Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.

    1992-01-01

    The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an open reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein is synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.
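
    The figures in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check on the quoted numbers. The mature-protein residue count it derives is not stated in the abstract and assumes cleavage directly after residue 26; the exact division 1671 / 3 = 557 suggests the quoted length does not count a stop codon, which is likewise an inference.

```python
# Figures quoted in the abstract.
ORF_NUCLEOTIDES = 1671
SIGNAL_PEPTIDE_RESIDUES = 26
SEQUENCED_WINDOW = (27, 61)  # residues matched by N-terminal microsequencing


def codon_count(nucleotides: int) -> int:
    """Number of complete codons in a reading frame of the given length."""
    if nucleotides % 3 != 0:
        raise ValueError("reading frame length is not a multiple of three")
    return nucleotides // 3


precursor_residues = codon_count(ORF_NUCLEOTIDES)

# Derived value (assumption: the signal peptide is removed after residue 26).
mature_residues = precursor_residues - SIGNAL_PEPTIDE_RESIDUES

# Inclusive length of the residue window 27..61.
window_length = SEQUENCED_WINDOW[1] - SEQUENCED_WINDOW[0] + 1

print(precursor_residues, mature_residues, window_length)
```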

  14. Mucin acts as a nutrient source and a signal for the differential expression of genes coding for cellular processes and virulence factors in Acinetobacter baumannii

    PubMed Central

    Ohneck, Emily J.; Arivett, Brock A.; Fiester, Steven E.; Wood, Cecily R.; Metz, Maeva L.; Simeone, Gabriella M.

    2018-01-01

    The capacity of Acinetobacter baumannii to persist and cause infections depends on its interaction with abiotic and biotic surfaces, including those found on medical devices and host mucosal surfaces. However, the extracellular stimuli affecting these interactions are poorly understood. Based on our previous observations, we hypothesized that mucin, a glycoprotein secreted by lung epithelial cells, particularly during respiratory infections, significantly alters A. baumannii’s physiology and its interaction with the surrounding environment. Biofilm, virulence and growth assays showed that mucin enhances the interaction of A. baumannii ATCC 19606T with abiotic and biotic surfaces and its cytolytic activity against epithelial cells while serving as a nutrient source. The global effect of mucin on the physiology and virulence of this pathogen is supported by RNA-Seq data showing that its presence in a low nutrient medium results in the differential transcription of 427 predicted protein-coding genes. The reduced expression of ion acquisition genes and the increased transcription of genes coding for energy production together with the detection of mucin degradation indicate that this host glycoprotein is a nutrient source. The increased expression of genes coding for adherence and biofilm biogenesis on abiotic and biotic surfaces, the degradation of phenylacetic acid and the production of an active type VI secretion system further supports the role mucin plays in virulence. Taken together, our observations indicate that A. baumannii recognizes mucin as an environmental signal, which triggers a response cascade that allows this pathogen to acquire critical nutrients and promotes host-pathogen interactions that play a role in the pathogenesis of bacterial infections. PMID:29309434

  15. The genomic structure of the human Charcot-Leyden crystal protein gene is analogous to those of the galectin genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dyer, K.D.; Handen, J.S.; Rosenberg, H.F.

    The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of β-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire β-galactoside binding site encoded by exon III. We have isolated CLC β-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the β-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.

  16. Abscisic acid-activated SNRK2 protein kinases function in the gene-regulation pathway of ABA signal transduction by phosphorylating ABA response element-binding factors.

    PubMed

    Kobayashi, Yuhko; Murata, Michiharu; Minami, Hideyuki; Yamamoto, Shuhei; Kagaya, Yasuaki; Hobo, Tokunori; Yamamoto, Akiko; Hattori, Tsukaho

    2005-12-01

    The plant hormone abscisic acid (ABA) induces gene expression via the ABA-response element (ABRE) present in the promoters of ABA-regulated genes. A group of bZIP proteins have been identified as ABRE-binding factors (ABFs) that activate transcription through this cis element. A rice ABF, TRAB1, has been shown to be activated via ABA-dependent phosphorylation. While a large number of signalling factors have been identified that are involved in stomatal regulation by ABA, relatively little is known about the ABA-signalling pathway that leads to gene expression. We have shown recently that three members of the rice SnRK2 protein kinase family, SAPK8, SAPK9 and SAPK10, are activated by ABA signal as well as by hyperosmotic stress. Here we show that transient overexpression in cultured cell protoplasts of these ABA-activated SnRK2 protein kinases leads to the activation of an ABRE-regulated promoter, suggesting that these kinases are involved in the gene-regulation pathway of ABA signalling. We further show several lines of evidence that these ABA-activated SnRK2 protein kinases directly phosphorylate TRAB1 in response to ABA. Kinetic analysis of SAPK10 activation and TRAB1 phosphorylation indicated that the latter immediately followed the former. TRAB1 was found to be phosphorylated not only in response to ABA, but also in response to hyperosmotic stress, which was interpreted as the consequence of phosphorylation of TRAB1 by hyperosmotically activated SAPKs. Physical interaction between TRAB1 and SAPK10 in vivo was demonstrated by a co-immunoprecipitation experiment. Finally, TRAB1 was phosphorylated in vitro by the ABA-activated SnRK2 protein kinases at Ser102, which is phosphorylated in vivo in response to ABA and is critical for the activation function.

  17. Genome co-amplification upregulates a mitotic gene network activity that predicts outcome and response to mitotic protein inhibitors in breast cancer

    DOE PAGES

    Hu, Zhi; Mao, Jian-Hua; Curtis, Christina; ...

    2016-07-01

    Background: High mitotic activity is associated with the genesis and progression of many cancers. Small molecule inhibitors of mitotic apparatus proteins are now being developed and evaluated clinically as anticancer agents. With clinical trials of several of these experimental compounds underway, it is important to understand the molecular mechanisms that determine high mitotic activity, identify tumor subtypes that carry molecular aberrations that confer high mitotic activity, and to develop molecular markers that distinguish which tumors will be most responsive to mitotic apparatus inhibitors. Methods: We identified a coordinately regulated mitotic apparatus network by analyzing gene expression profiles for 53 malignant and non-malignant human breast cancer cell lines and two separate primary breast tumor datasets. We defined the mitotic network activity index (MNAI) as the sum of the transcriptional levels of the 54 coordinately regulated mitotic apparatus genes. The effect of those genes on cell growth was evaluated by small interfering RNA (siRNA). Results: High MNAI was enriched in basal-like breast tumors and was associated with reduced survival duration and preferential sensitivity to inhibitors of the mitotic apparatus proteins, polo-like kinase, centromere associated protein E and aurora kinase designated GSK462364, GSK923295 and GSK1070916, respectively. Co-amplification of regions of chromosomes 8q24, 10p15-p12, 12p13, and 17q24-q25 was associated with the transcriptional upregulation of this network of 54 mitotic apparatus genes, and we identify transcription factors that localize to these regions and putatively regulate mitotic activity. Knockdown of the mitotic network by siRNA identified 22 genes that might be considered as additional therapeutic targets for this clinically relevant patient subgroup. Conclusions: We define a molecular signature which may guide therapeutic approaches for tumors with high mitotic network activity.
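
    The abstract defines the mitotic network activity index (MNAI) as the sum of the transcriptional levels of the network genes. A minimal sketch of that definition follows. The gene identifiers and expression values are placeholders rather than the 54 genes of the study, and the abstract does not say how expression values were scaled or normalized, so the inputs here are simply assumed to be comparable numeric levels.

```python
def mitotic_network_activity_index(expression, network_genes):
    """Sum the expression levels of the network genes for one sample.

    expression: dict mapping gene identifier -> numeric expression level
    network_genes: iterable of gene identifiers that make up the network
    """
    network_genes = list(network_genes)
    missing = [gene for gene in network_genes if gene not in expression]
    if missing:
        raise KeyError(f"no expression value for: {', '.join(missing)}")
    return sum(expression[gene] for gene in network_genes)


# Placeholder network and sample (hypothetical names and values).
toy_network = ["GENE_A", "GENE_B", "GENE_C"]
toy_sample = {"GENE_A": 1.5, "GENE_B": 2.0, "GENE_C": -0.5, "GENE_X": 7.0}

toy_mnai = mitotic_network_activity_index(toy_sample, toy_network)
print(toy_mnai)
```

    Genes outside the network (GENE_X here) do not contribute, and a missing network gene raises an error instead of being silently treated as zero.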

  18. Genome co-amplification upregulates a mitotic gene network activity that predicts outcome and response to mitotic protein inhibitors in breast cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, Zhi; Mao, Jian-Hua; Curtis, Christina

    Background: High mitotic activity is associated with the genesis and progression of many cancers. Small molecule inhibitors of mitotic apparatus proteins are now being developed and evaluated clinically as anticancer agents. With clinical trials of several of these experimental compounds underway, it is important to understand the molecular mechanisms that determine high mitotic activity, identify tumor subtypes that carry molecular aberrations that confer high mitotic activity, and to develop molecular markers that distinguish which tumors will be most responsive to mitotic apparatus inhibitors. Methods: We identified a coordinately regulated mitotic apparatus network by analyzing gene expression profiles for 53 malignant and non-malignant human breast cancer cell lines and two separate primary breast tumor datasets. We defined the mitotic network activity index (MNAI) as the sum of the transcriptional levels of the 54 coordinately regulated mitotic apparatus genes. The effect of those genes on cell growth was evaluated by small interfering RNA (siRNA). Results: High MNAI was enriched in basal-like breast tumors and was associated with reduced survival duration and preferential sensitivity to inhibitors of the mitotic apparatus proteins, polo-like kinase, centromere associated protein E and aurora kinase designated GSK462364, GSK923295 and GSK1070916, respectively. Co-amplification of regions of chromosomes 8q24, 10p15-p12, 12p13, and 17q24-q25 was associated with the transcriptional upregulation of this network of 54 mitotic apparatus genes, and we identify transcription factors that localize to these regions and putatively regulate mitotic activity. Knockdown of the mitotic network by siRNA identified 22 genes that might be considered as additional therapeutic targets for this clinically relevant patient subgroup. Conclusions: We define a molecular signature which may guide therapeutic approaches for tumors with high mitotic network activity.

  19. Involvement of adenosine monophosphate-activated protein kinase in the influence of timed high-fat evening diet on the hepatic clock and lipogenic gene expression in mice.

    PubMed

    Huang, Yan; Zhu, Zengyan; Xie, Meilin; Xue, Jie

    2015-09-01

    A high-fat diet may result in changes in hepatic clock gene expression, but potential mechanisms are not yet elucidated. Adenosine monophosphate-activated protein kinase (AMPK) is a serine/threonine protein kinase that is recognized as a key regulator of energy metabolism and certain clock genes. Therefore, we hypothesized that AMPK may be involved in the alteration of hepatic clock gene expression under a high-fat environment. This study aimed to examine the effects of timed high-fat evening diet on the activity of hepatic AMPK, clock genes, and lipogenic genes. Mice with hyperlipidemic fatty livers were induced by orally administering high-fat milk via gavage every evening (19:00-20:00) for 6 weeks. Results showed that timed high-fat diet in the evening not only decreased the hepatic AMPK protein expression and activity but also disturbed its circadian rhythm. Accordingly, the hepatic clock genes, including clock, brain-muscle-Arnt-like 1, cryptochrome 2, and period 2, exhibited prominent changes in their expression rhythms and/or amplitudes. The diurnal rhythms of the messenger RNA expression of peroxisome proliferator-activated receptor α, acetyl-CoA carboxylase 1α, and carnitine palmitoyltransferase 1 were also disrupted; the amplitude of peroxisome proliferator-activated receptor γ coactivator 1α was significantly decreased at 3 time points, and fatty liver was observed. These findings demonstrate that timed high-fat diet at night can change hepatic AMPK protein levels, activity, and circadian rhythm, which may subsequently alter the circadian expression of several hepatic clock genes and finally result in the disorder of hepatic lipogenic gene expression and the formation of fatty liver. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates

    PubMed Central

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-01-01

    The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. PMID:28981708

  1. GeneBuilder: interactive in silico prediction of gene structure.

    PubMed

    Milanesi, L; D'Angelo, D; Rogozin, I B

    1999-01-01

    Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene at the URL: http://www.itba.mi.cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
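
    The nucleotide-level sensitivity, specificity, and correlation coefficient quoted in this abstract are standard gene-prediction accuracy measures. The sketch below shows how such figures are commonly computed from per-nucleotide coding/non-coding labels, using the usual definitions from the gene-finding evaluation literature, in which "specificity" is the fraction of predicted coding nucleotides that are truly coding. The abstract does not spell out its formulas, so that they match these definitions is an assumption; the label strings are made-up toy data, and the function name is hypothetical.

```python
import math


def nucleotide_level_accuracy(actual, predicted):
    """Return (sensitivity, specificity, correlation) for two label strings.

    Each string has one character per nucleotide: 'C' for coding, anything
    else for non-coding. Specificity follows the gene-finding convention
    TP / (TP + FP).
    """
    if len(actual) != len(predicted):
        raise ValueError("label strings must have the same length")
    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        a_coding, p_coding = a == "C", p == "C"
        if a_coding and p_coding:
            tp += 1
        elif p_coding:
            fp += 1
        elif a_coding:
            fn += 1
        else:
            tn += 1
    denominator = math.sqrt((tp + fn) * (tn + fp) * (tp + fp) * (tn + fn))
    if tp + fn == 0 or tp + fp == 0 or denominator == 0:
        raise ValueError("measures are undefined for these labels")
    sensitivity = tp / (tp + fn)
    specificity = tp / (tp + fp)
    correlation = (tp * tn - fn * fp) / denominator
    return sensitivity, specificity, correlation


# Toy example: the prediction is shifted one nucleotide to the left.
actual_labels = "NNCCCCNNNN"
predicted_labels = "NCCCCNNNNN"
sn, sp, cc = nucleotide_level_accuracy(actual_labels, predicted_labels)
print(sn, sp, round(cc, 3))
```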

  2. Protein-coding genes combined with long noncoding RNA as a novel transcriptome molecular staging model to predict the survival of patients with esophageal squamous cell carcinoma.

    PubMed

    Guo, Jin-Cheng; Wu, Yang; Chen, Yang; Pan, Feng; Wu, Zhi-Yong; Zhang, Jia-Sheng; Wu, Jian-Yi; Xu, Xiu-E; Zhao, Jian-Mei; Li, En-Min; Zhao, Yi; Xu, Li-Yan

    2018-04-09

    Esophageal squamous cell carcinoma (ESCC) is the predominant subtype of esophageal carcinoma in China. This study aimed to develop a staging model to predict outcomes of patients with ESCC. Using Cox regression analysis, principal component analysis (PCA), partitioning clustering, Kaplan-Meier analysis, receiver operating characteristic (ROC) curve analysis, and classification and regression tree (CART) analysis, we mined the Gene Expression Omnibus database to determine the expression profiles of genes in 179 patients with ESCC from the GSE63624 and GSE63622 datasets. Univariate Cox regression analysis of the GSE63624 dataset revealed that 2404 protein-coding genes (PCGs) and 635 long non-coding RNAs (lncRNAs) were associated with the survival of patients with ESCC. PCA categorized these PCGs and lncRNAs into three principal components (PCs), which were used to cluster the patients into three groups. ROC analysis demonstrated that the predictive ability of PCG-lncRNA PCs when applied to new patients was better than that of the tumor-node-metastasis staging (area under ROC curve [AUC]: 0.69 vs. 0.65, P < 0.05). Accordingly, we constructed a molecular disaggregated model comprising one lncRNA and two PCGs, which we designated as the LSB staging model using CART analysis in the GSE63624 dataset. This LSB staging model classified the GSE63622 dataset of patients into three different groups, and its effectiveness was validated by analysis of another cohort of 105 patients. The LSB staging model has clinical significance for the prognosis prediction of patients with ESCC and may serve as a three-gene staging microarray.

  3. Dual inhibition of γ-oryzanol on cellular melanogenesis: inhibition of tyrosinase activity and reduction of melanogenic gene expression by a protein kinase A-dependent mechanism.

    PubMed

    Jun, Hee-jin; Lee, Ji Hae; Cho, Bo-Ram; Seo, Woo-Duck; Kang, Hang-Won; Kim, Dong-Woo; Cho, Kang-Jin; Lee, Sung-Joon

    2012-10-26

    The in vitro effects on melanogenesis of γ-oryzanol (1), a rice bran-derived phytosterol, were investigated. The melanin content in B16F1 cells was significantly and dose-dependently reduced (-13% and -28% at 3 and 30 μM, respectively). Tyrosinase enzyme activity was inhibited by 1 both in a cell-free assay and when analyzed based on the measurement of cellular tyrosinase activity. Transcriptome analysis was performed to investigate the biological pathways altered by 1, and it was found that gene expression involving protein kinase A (PKA) signaling was markedly altered. Subsequent analyses revealed that 1 stimulation in B16 cells reduced cytosolic cAMP concentrations, PKA activity (-13% for cAMP levels and -40% for PKA activity), and phosphorylation of the cAMP-response element binding protein (-57%), which, in turn, downregulated the expression of microphthalmia-associated transcription factor (MITF; -59% for mRNA and -64% for protein), a key melanogenic gene transcription factor. Accordingly, tyrosinase-related protein 1 (TRP-1; -69% for mRNA and -82% for protein) and dopachrome tautomerase (-51% for mRNA and -92% for protein) in 1-stimulated B16F1 cells were also downregulated. These results suggest that 1 has dual inhibitory activities for cellular melanogenesis by inhibiting tyrosinase enzyme activity and reducing MITF and target genes in the PKA-dependent pathway.

  4. Recombinant Vaccinia Viruses Coding Transgenes of Apoptosis-Inducing Proteins Enhance Apoptosis But Not Immunogenicity of Infected Tumor Cells

    PubMed Central

    Tkachenko, Anastasiya; Richter, Vladimir

    2017-01-01

    Genetic modifications of the oncolytic vaccinia virus (VV) improve selective tumor cell infection and death, as well as activation of antitumor immunity. We have engineered a double recombinant VV, coding human GM-CSF, and apoptosis-inducing protein apoptin (VV-GMCSF-Apo) for comparison with the earlier constructed double recombinant VV-GMCSF-Lact, coding another apoptosis-inducing protein, lactaptin, which activated different cell death pathways than apoptin. We showed that both these recombinant VVs more considerably activated a set of critical apoptosis markers in infected cells than the recombinant VV coding GM-CSF alone (VV-GMCSF-dGF): these were phosphatidylserine externalization, caspase-3 and caspase-7 activation, DNA fragmentation, and upregulation of proapoptotic protein BAX. However, only VV-GMCSF-Lact efficiently decreased the mitochondrial membrane potential of infected cancer cells. Investigating immunogenic cell death markers in cancer cells infected with recombinant VVs, we demonstrated that all tested recombinant VVs were efficient in calreticulin and HSP70 externalization, decrease of cellular HMGB1, and ATP secretion. The comparison of antitumor activity against advanced MDA-MB-231 tumor revealed that both recombinants VV-GMCSF-Lact and VV-GMCSF-Apo efficiently delay tumor growth. Our results demonstrate that the composition of GM-CSF and apoptosis-inducing proteins in the VV genome is a very efficient tool for specific killing of cancer cells and for activation of antitumor immunity. PMID:28951871

  5. APG: an Active Protein-Gene network model to quantify regulatory signals in complex biological systems.

    PubMed

    Wang, Jiguang; Sun, Yidan; Zheng, Si; Zhang, Xiang-Sun; Zhou, Huarong; Chen, Luonan

    2013-01-01

    Synergistic interactions among transcription factors (TFs) and their cofactors collectively determine gene expression in complex biological systems. In this work, we develop a novel graphical model, called the Active Protein-Gene (APG) network model, to quantify regulatory signals of transcription in complex biomolecular networks through integrating both TF upstream-regulation and downstream-regulation high-throughput data. Firstly, we theoretically and computationally demonstrate the effectiveness of APG by comparing with the traditional strategy based only on TF downstream-regulation information. We then apply this model to study spontaneous type 2 diabetic Goto-Kakizaki (GK) and Wistar control rats. Our biological experiments validate the theoretical results. In particular, SP1 is found to be a hidden TF with changed regulatory activity, and the loss of SP1 activity contributes to the increased glucose production during diabetes development. The APG model provides a theoretical basis to quantitatively elucidate transcriptional regulation by modelling TF combinatorial interactions and exploiting multilevel high-throughput information.

  6. APG: an Active Protein-Gene Network Model to Quantify Regulatory Signals in Complex Biological Systems

    PubMed Central

    Wang, Jiguang; Sun, Yidan; Zheng, Si; Zhang, Xiang-Sun; Zhou, Huarong; Chen, Luonan

    2013-01-01

    Synergistic interactions among transcription factors (TFs) and their cofactors collectively determine gene expression in complex biological systems. In this work, we develop a novel graphical model, called the Active Protein-Gene (APG) network model, to quantify regulatory signals of transcription in complex biomolecular networks through integrating both TF upstream-regulation and downstream-regulation high-throughput data. Firstly, we theoretically and computationally demonstrate the effectiveness of APG by comparing with the traditional strategy based only on TF downstream-regulation information. We then apply this model to study spontaneous type 2 diabetic Goto-Kakizaki (GK) and Wistar control rats. Our biological experiments validate the theoretical results. In particular, SP1 is found to be a hidden TF with changed regulatory activity, and the loss of SP1 activity contributes to the increased glucose production during diabetes development. The APG model provides a theoretical basis to quantitatively elucidate transcriptional regulation by modelling TF combinatorial interactions and exploiting multilevel high-throughput information. PMID:23346354

  7. Genes for Drosophila small heat shock proteins are regulated differently by ecdysterone

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Amin, J.; Voellmy, R.; Mestril, R.

    Genes for small heat shock proteins (hsp27 to hsp22) are activated in late third-instar larvae of Drosophila melanogaster in the absence of heat stress. This regulation has been simulated in cultured Drosophila cells in which the genes are activated by the addition of ecdysterone. Sequence elements (HERE) involved in ecdysterone regulation of the hsp27 and hsp23 genes have been defined by transfection studies and have recently been identified as binding sites for ecdysterone receptor. The authors report here that the hsp27 and hsp23 genes are regulated differently by ecdysterone. The hsp27 gene is activated rapidly by ecdysterone, even in the absence of protein synthesis. In contrast, high-level expression of the hsp23 gene begins only after a lag of about 6 h, is dependent on the continuous presence of ecdysterone, and is sensitive to low concentrations of protein synthesis inhibitors. Transfection experiments with reporter constructs show that this difference in regulation is at the transcriptional level. Synthetic hsp27 or hsp23 HERE sequences confer hsp27- or hsp23-type ecdysterone regulation on a basal promoter. These findings indicate that the hsp27 gene is a primary, and the hsp23 gene is mainly a secondary, hormone-responsive gene. Ecdysterone receptor is implied to play a role in the regulation of both genes.

  8. Heat Shock Protein Genes Undergo Dynamic Alteration in Their Three-Dimensional Structure and Genome Organization in Response to Thermal Stress

    PubMed Central

    Chowdhary, Surabhi; Kainth, Amoldeep S.

    2017-01-01

    Three-dimensional (3D) chromatin organization is important for proper gene regulation, yet how the genome is remodeled in response to stress is largely unknown. Here, we use a highly sensitive version of chromosome conformation capture in combination with fluorescence microscopy to investigate Heat Shock Protein (HSP) gene conformation and 3D nuclear organization in budding yeast. In response to acute thermal stress, HSP genes undergo intense intragenic folding interactions that go well beyond 5′-3′ gene looping previously described for RNA polymerase II genes. These interactions include looping between upstream activation sequence (UAS) and promoter elements, promoter and terminator regions, and regulatory and coding regions (gene “crumpling”). They are also dynamic, being prominent within 60 s, peaking within 2.5 min, and attenuating within 30 min, and correlate with HSP gene transcriptional activity. With similarly striking kinetics, activated HSP genes, both chromosomally linked and unlinked, coalesce into discrete intranuclear foci. Constitutively transcribed genes also loop and crumple yet fail to coalesce. Notably, a missense mutation in transcription factor TFIIB suppresses gene looping, yet neither crumpling nor HSP gene coalescence is affected. An inactivating promoter mutation, in contrast, obviates all three. Our results provide evidence for widespread, transcription-associated gene crumpling and demonstrate the de novo assembly and disassembly of HSP gene foci. PMID:28970326

  9. Heat Shock Protein Genes Undergo Dynamic Alteration in Their Three-Dimensional Structure and Genome Organization in Response to Thermal Stress.

    PubMed

    Chowdhary, Surabhi; Kainth, Amoldeep S; Gross, David S

    2017-12-15

    Three-dimensional (3D) chromatin organization is important for proper gene regulation, yet how the genome is remodeled in response to stress is largely unknown. Here, we use a highly sensitive version of chromosome conformation capture in combination with fluorescence microscopy to investigate Heat Shock Protein (HSP) gene conformation and 3D nuclear organization in budding yeast. In response to acute thermal stress, HSP genes undergo intense intragenic folding interactions that go well beyond 5'-3' gene looping previously described for RNA polymerase II genes. These interactions include looping between upstream activation sequence (UAS) and promoter elements, promoter and terminator regions, and regulatory and coding regions (gene "crumpling"). They are also dynamic, being prominent within 60 s, peaking within 2.5 min, and attenuating within 30 min, and correlate with HSP gene transcriptional activity. With similarly striking kinetics, activated HSP genes, both chromosomally linked and unlinked, coalesce into discrete intranuclear foci. Constitutively transcribed genes also loop and crumple yet fail to coalesce. Notably, a missense mutation in transcription factor TFIIB suppresses gene looping, yet neither crumpling nor HSP gene coalescence is affected. An inactivating promoter mutation, in contrast, obviates all three. Our results provide evidence for widespread, transcription-associated gene crumpling and demonstrate the de novo assembly and disassembly of HSP gene foci. Copyright © 2017 American Society for Microbiology.

  10. Inducible Knockout of the Cyclin-Dependent Kinase 5 Activator p35 Alters Hippocampal Spatial Coding and Neuronal Excitability

    PubMed Central

    Kamiki, Eriko; Boehringer, Roman; Polygalov, Denis; Ohshima, Toshio; McHugh, Thomas J.

    2018-01-01

    p35 is an activating co-factor of Cyclin-dependent kinase 5 (Cdk5), a protein whose dysfunction has been implicated in a wide range of neurological disorders including cognitive impairment and disease. Inducible deletion of the p35 gene in adult mice results in profound deficits in hippocampal-dependent spatial learning and synaptic physiology; however, the impact of the loss of p35 function on hippocampal in vivo physiology and spatial coding remains unknown. Here, we recorded CA1 pyramidal cell activity in freely behaving p35 cKO and control mice and found that place cells in the mutant mice have elevated firing rates and impaired spatial coding, accompanied by changes in the temporal organization of spiking both during exploration and rest. These data shed light on the role of p35 in maintaining cellular and network excitability and provide a physiological correlate of the spatial learning deficits in these mice. PMID:29867369

  11. Hox proteins activate the IGFBP-1 promoter and suppress the function of hPR in human endometrial cells.

    PubMed

    Gao, Jiaguo; Mazella, James; Tseng, Linda

    2002-11-01

    Previous studies have shown that progestin activates the transcription of IGFBP-1 (insulin-like growth factor binding protein-1). Four regions in the IGFBP-1 promoter have been identified to enhance the transcription. Two of the regions, located at -73 to -65 bp and -319 to -311 bp, formed identical DNA-protein complexes with the nuclear extracts of endometrial stromal/decidual cells. To identify the binding protein(s) in endometrial cells that interact with these two regions, we have used the TGTCAATTA repeats (-319 to -11 bp of the IGFBP-1 promoter) to screen the human decidual cDNA library by yeast one-hybrid system. We found that HoxA10, HoxA11, HoxB2, HoxB4, and HoxD11 interacted with the TGTCAATTA repeats in yeast cells. Among these Hox genes, the full-length coding regions of HoxA10, HoxA11, and HoxB4 were used for functional analysis in three types of endometrial cells: undifferentiated endometrial stromal cells, decidual cells (differentiated stromal cells), and an endometrial adenocarcinoma cell line (HEC-1B). All these endometrial cells produce IGFBP-1. Transient transfection assay showed that HoxA10 expression vector increased the promoter activity (the IGFBP-1 proximal promoter containing TGC/TCAATTA and two functional PRE sites) in endometrial stromal cells and in HEC-1B cells, but not in decidual cells. HoxB4 enhanced the promoter activity only in decidual cells, while HoxA11 had no apparent effect in all three types of cells. To evaluate whether Hox proteins would interact with progesterone receptor (hPR), cells were transfected with the promoter construct, Hox and hPR expression vectors. hPR alone activated the IGFBP-1 promoter activity, but expression of Hox gene suppressed the activation. Hox proteins also suppressed the hPR enhanced promoter activities of MMTV (containing consensus-PRE sites) and glycodelin (GdA, containing Sp1 site which mediates the hPR function). These data showed that Hox genes selectively activate the transcription of the IGFBP

  12. The mRNA cap-binding protein Cbc1 is required for high and timely expression of genes by promoting the accumulation of gene-specific activators at promoters.

    PubMed

    Li, Tianlu; De Clercq, Nikki; Medina, Daniel A; Garre, Elena; Sunnerhagen, Per; Pérez-Ortín, José E; Alepuz, Paula

    2016-02-01

    The highly conserved Saccharomyces cerevisiae cap-binding protein Cbc1/Sto1 binds mRNA co-transcriptionally and acts as a key coordinator of mRNA fate. Recently, Cbc1 has also been implicated in transcription elongation and pre-initiation complex (PIC) formation. Previously, we described Cbc1 to be required for cell growth under osmotic stress and to mediate osmostress-induced translation reprogramming. Here, we observe delayed global transcription kinetics in cbc1Δ during osmotic stress that correlates with delayed recruitment of TBP and RNA polymerase II to osmo-induced promoters. Interestingly, we detect an interaction between Cbc1 and the MAPK Hog1, which controls most gene expression changes during osmostress, and observe that deletion of CBC1 delays the accumulation of the activator complex Hot1-Hog1 at osmostress promoters. Additionally, CBC1 deletion specifically reduces transcription rates of highly transcribed genes under non-stress conditions, such as ribosomal protein (RP) genes, while having low impact on transcription of weakly expressed genes. For RP genes, we show that recruitment of the specific activator Rap1, and subsequently TBP, to promoters is Cbc1-dependent. Altogether, our results indicate that binding of Cbc1 to the capped mRNAs is necessary for the accumulation of specific activators as well as PIC components at the promoters of genes whose expression requires high and rapid transcription. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. Investigation of genes coding for inflammatory components in Parkinson's disease.

    PubMed

    Håkansson, Anna; Westberg, Lars; Nilsson, Staffan; Buervenich, Silvia; Carmine, Andrea; Holmberg, Björn; Sydow, Olof; Olson, Lars; Johnels, Bo; Eriksson, Elias; Nissbrandt, Hans

    2005-05-01

    Several findings obtained recently indicate that inflammation may contribute to the pathogenesis in Parkinson's disease (PD). Genetic variants of genes coding for components involved in immune reactions in the brain might therefore influence the risk of developing PD or the age of disease onset. Five single nucleotide polymorphisms (SNPs) in the genes coding for interferon-gamma (IFN-gamma; T874A in intron 1), interferon-gamma receptor 2 (IFN-gamma R2; Gln64Arg), interleukin-10 (IL-10; G1082A in the promoter region), platelet-activating factor acetylhydrolase (PAF-AH; Val379Ala), and intercellular adhesion molecule 1 (ICAM-1; Lys469Glu) were genotyped, using pyrosequencing, in 265 patients with PD and 308 controls. None of the investigated SNPs was found to be associated with PD; however, the G1082A polymorphism in the IL-10 gene promoter was found to be related to the age of disease onset. Linear regression showed a significantly earlier onset with more A-alleles (P = 0.0095; after Bonferroni correction, P = 0.048), resulting in a 5-year delayed age of onset of the disease for individuals having two G-alleles compared with individuals having two A-alleles. The results indicate that the IL-10 G1082A SNP could possibly be related to the age of onset of PD. Copyright 2005 Movement Disorder Society.
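
The Bonferroni-corrected value quoted above can be reproduced with simple arithmetic. The sketch below assumes that the correction factor is 5, one test per genotyped SNP; the abstract does not state the number of tests explicitly, so this is an inference from the reported figures. The function name `bonferroni` is illustrative.

```python
def bonferroni(p_value: float, n_tests: int) -> float:
    """Bonferroni-adjusted p-value: multiply by the number of tests, cap at 1."""
    return min(1.0, p_value * n_tests)


# Assumption: five tests, one per SNP listed in the abstract.
adjusted = bonferroni(0.0095, 5)
print(f"{adjusted:.4f}")  # 0.0475, consistent with the reported P = 0.048
```

Under that assumption the adjusted value stays just below the conventional 0.05 threshold, which matches the cautious wording of the abstract's conclusion.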

  14. Methylation of an alpha-foetoprotein gene intragenic site modulates gene activity.

    PubMed Central

    Opdecamp, K; Rivière, M; Molné, M; Szpirer, J; Szpirer, C

    1992-01-01

    By comparing the methylation pattern of MspI/HpaII sites in the 5' region of the mouse alpha-foetoprotein (AFP) gene of different cells (hepatoma cells, foetal and adult liver, fibroblasts), we found a correlation between gene expression and unmethylation of a site located in the first intron of the gene. Other sites did not show this correlation. In transfection experiments of unmethylated and methylated AFP-CAT chimeric constructions, we then showed that methylation of the intronic site negatively modulates expression of CAT activity. We also found that a DNA segment centered on this site binds nuclear proteins; however, methylation did not affect protein binding. PMID:1371343

  15. Molecular characterization of dihydroneopterin aldolase and aminodeoxychorismate synthase in common bean-genes coding for enzymes in the folate synthesis pathway.

    PubMed

    Xie, Weilong; Perry, Gregory; Martin, C Joe; Shim, Youn-Seb; Navabi, Alireza; Pauls, K Peter

    2017-07-01

    Common beans (Phaseolus vulgaris) are excellent sources of dietary folates, but different varieties contain different amounts of these compounds. Genes coding for dihydroneopterin aldolase (DHNA) and aminodeoxychorismate synthase (ADCS) of the folate synthesis pathway were characterized by PCR amplification, BAC clone sequencing, and whole genome sequencing. All DHNA and ADCS genes in the Mesoamerican cultivar OAC Rex were isolated and compared with those genes in the genome of Andean genotype G19833. Both genotypes have two functional DHNA genes and one pseudo gene. PvDHNA1 and PvDHNA2 proteins have similar secondary structures and conserved residues as DHNA homologs in Staphylococcus aureus and Arabidopsis. Sequence analysis and synteny mapping indicated that PvDHNA1 might be a duplicated and transposed copy of PvDHNA2. There is only one ADCS gene (PvADCS) identified in the bean genome and it is identical in OAC Rex and G19833. PvADCS has the conserved motifs required for catalytic activity similar to other plant ADCS homologs. DHNA and ADCS gene-specific markers were developed, mapped, and compared to their physical locations on chromosomes 1 and 7, respectively. The gene-specific markers developed in this study should be useful for detection and selection of varieties with enhanced folate contents in bean breeding programs.

  16. GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains

    PubMed Central

    Lu, Zhiyong

    2015-01-01

    The automatic recognition of gene names and their associated database identifiers from biomedical text has been widely studied in recent years, as these tasks play an important role in many downstream text-mining applications. Despite significant previous research, only a small number of tools are publicly available and these tools are typically restricted to detecting only mention level gene names or only document level gene identifiers. In this work, we report GNormPlus: an end-to-end and open source system that handles both gene mention and identifier detection. We created a new corpus of 694 PubMed articles to support our development of GNormPlus, containing manual annotations for not only gene names and their identifiers, but also closely related concepts useful for gene name disambiguation, such as gene families and protein domains. GNormPlus integrates several advanced text-mining techniques, including SimConcept for resolving composite gene names. As a result, GNormPlus compares favorably to other state-of-the-art methods when evaluated on two widely used public benchmarking datasets, achieving 86.7% F1-score on the BioCreative II Gene Normalization task dataset and 50.1% F1-score on the BioCreative III Gene Normalization task dataset. The GNormPlus source code and its annotated corpus are freely available, and the results of applying GNormPlus to the entire PubMed are freely accessible through our web-based tool PubTator. PMID:26380306
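
The F1-scores quoted above combine precision and recall into a single number. As a minimal sketch of how such a score is derived from raw counts (the counts below are hypothetical and are not taken from the GNormPlus evaluation), the harmonic mean can be computed as follows:

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1 = harmonic mean of precision and recall, from raw counts.

    tp: correctly predicted gene identifiers (true positives)
    fp: predicted identifiers that are wrong (false positives)
    fn: gold-standard identifiers that were missed (false negatives)
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only.
print(round(f1_score(tp=80, fp=20, fn=20), 3))  # precision = recall = 0.8 -> 0.8
```

Because the harmonic mean is dominated by the smaller of the two values, a system cannot reach a high F1-score by maximizing precision or recall alone.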

  17. Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

    PubMed

    Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

    2018-01-01

    We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation.  Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases.  We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes.  Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code is available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.
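
The feature described above as most informative, stop codons inside the presumed translation of homologous DNA, can be illustrated with a short sketch. This is not Spurio's implementation: it skips the tblastn search and the Gaussian process scoring entirely, and the function name `count_internal_stops` and the example sequences are hypothetical. It simply counts in-frame stop codons, ignoring the final codon, which may be a legitimate terminator.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def count_internal_stops(dna: str, frame: int = 0) -> int:
    """Count in-frame stop codons, excluding the last complete codon."""
    seq = dna.upper()
    codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    return sum(1 for codon in codons[:-1] if codon in STOP_CODONS)


# Hypothetical sequences for illustration only.
intact = "ATGAAACCCTAA"        # ATG AAA CCC TAA: only the terminal stop
degraded = "ATGAAATAGCCCTAA"   # ATG AAA TAG CCC TAA: one premature stop
print(count_internal_stops(intact))    # 0
print(count_internal_stops(degraded))  # 1
```

A homolog with many internal stops is unlikely to be translated as annotated, which is the intuition behind using this signal to flag spurious predictions.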

  18. Activity-dependent neuroprotective protein (ADNP): a case study for highly conserved chordata-specific genes shaping the brain and mutated in cancer.

    PubMed

    Gozes, Illana; Yeheskel, Adva; Pasmanik-Chor, Metsada

    2015-01-01

    The recent finding of activity-dependent neuroprotective protein (ADNP) as a protein decreased in serum of patients with Alzheimer's disease (AD) compared to controls, alongside the discovery of ADNP mutations in autism and coupled with the original description of cancer mutations, ignited an interest in a comparative analysis of ADNP with other AD/autism/cancer-associated genes. We strive toward a better understanding of the molecular structure of key players in psychiatric/neurodegenerative diseases including autism, schizophrenia, and AD. This article includes data mining and bioinformatics analysis on the ADNP gene and protein, in addition to other related genes, with emphasis on recent literature. ADNP is discovered here as unique to chordata with specific autism mutations different from cancer-associated mutations. Furthermore, ADNP exhibits similarities to other cancer/autism-associated genes. We suggest that key genes, which shape and maintain our brain and are prone to mutations, are by and large unique to chordata. Furthermore, these brain-controlling genes, like ADNP, are linked to cell growth and differentiation, and under different stress conditions may mutate or exhibit expression changes leading to cancer propagation. Better understanding of these genes could lead to better therapeutics.

  19. Robust expression of a bioactive mammalian protein in chlamydomonas chloroplast

    DOEpatents

    Mayfield, Stephen P.

    2010-03-16

    Methods and compositions are disclosed to engineer chloroplast comprising heterologous mammalian genes via a direct replacement of chloroplast Photosystem II (PSII) reaction center protein coding regions to achieve expression of recombinant protein above 5% of total protein. When algae is used, algal expressed protein is produced predominantly as a soluble protein where the functional activity of the peptide is intact. As the host algae is edible, production of biologics in this organism for oral delivery of proteins/peptides, especially gut active proteins, without purification is disclosed.

  20. Robust expression of a bioactive mammalian protein in Chlamydomonas chloroplast

    DOEpatents

    Mayfield, Stephen P

    2015-01-13

    Methods and compositions are disclosed to engineer chloroplast comprising heterologous mammalian genes via a direct replacement of chloroplast Photosystem II (PSII) reaction center protein coding regions to achieve expression of recombinant protein above 5% of total protein. When algae is used, algal expressed protein is produced predominantly as a soluble protein where the functional activity of the peptide is intact. As the host algae is edible, production of biologics in this organism for oral delivery of proteins/peptides, especially gut active proteins, without purification is disclosed.

  1. The radical induced cell death protein 1 (RCD1) supports transcriptional activation of genes for chloroplast antioxidant enzymes

    PubMed Central

    Hiltscher, Heiko; Rudnik, Radoslaw; Shaikhali, Jehad; Heiber, Isabelle; Mellenthin, Marina; Meirelles Duarte, Iuri; Schuster, Günter; Kahmann, Uwe; Baier, Margarete

    2014-01-01

    The rimb1 (redox imbalanced 1) mutation was mapped to the RCD1 locus (radical-induced cell death 1; At1g32230) demonstrating that a major factor involved in redox regulation of genes for chloroplast antioxidant enzymes and protection against photooxidative stress, RIMB1, is identical to the regulator of disease response reactions and cell death, RCD1. Discovering this link led to our investigation of its regulatory mechanism. We show in yeast that RCD1 can physically interact with the transcription factor Rap2.4a which provides redox-sensitivity to nuclear expression of genes for chloroplast antioxidant enzymes. In the rimb1 (rcd1-6) mutant, a single nucleotide exchange results in a truncated RCD1 protein lacking the transcription factor binding site. Protein-protein interaction between full-length RCD1 and Rap2.4a is supported by H2O2, but not sensitive to the antioxidants dithiothreitol and ascorbate. In combination with transcript abundance analysis in Arabidopsis, it is concluded that RCD1 stabilizes the Rap2.4-dependent redox-regulation of the genes encoding chloroplast antioxidant enzymes in a widely redox-independent manner. Over the years, rcd1-mutant alleles have been described to develop symptoms like chlorosis, lesions along the leaf rims and in the mesophyll and (secondary) induction of extra- and intra-plastidic antioxidant defense mechanisms. All these rcd1 mutant characteristics were observed in rcd1-6 to succeed low activation of the chloroplast antioxidant system and glutathione biosynthesis. We conclude that RCD1 protects plant cells from running into reactive oxygen species (ROS)-triggered programs, such as cell death and activation of pathogen-responsive genes (PR genes) and extra-plastidic antioxidant enzymes, by supporting the induction of the chloroplast antioxidant system. PMID:25295044

  2. NetDecoder: a network biology platform that decodes context-specific biological networks and gene activities.

    PubMed

    da Rocha, Edroaldo Lummertz; Ung, Choong Yong; McGehee, Cordelia D; Correia, Cristina; Li, Hu

    2016-06-02

    The sequential chain of interactions altering the binary state of a biomolecule represents the 'information flow' within a cellular network that determines phenotypic properties. Given the lack of computational tools to dissect context-dependent networks and gene activities, we developed NetDecoder, a network biology platform that models context-dependent information flows using pairwise phenotypic comparative analyses of protein-protein interactions. Using breast cancer, dyslipidemia and Alzheimer's disease as case studies, we demonstrate NetDecoder dissects subnetworks to identify key players significantly impacting cell behaviour specific to a given disease context. We further show genes residing in disease-specific subnetworks are enriched in disease-related signalling pathways and information flow profiles, which drive the resulting disease phenotypes. We also devise a novel scoring scheme to quantify key genes: network routers, which influence many genes; key targets, which are influenced by many genes; and high impact genes, which experience a significant change in regulation. We show the robustness of our results against parameter changes. Our network biology platform includes freely available source code (http://www.NetDecoder.org) for researchers to explore genome-wide context-dependent information flow profiles and key genes, given a set of genes of particular interest and transcriptome data. More importantly, NetDecoder will enable researchers to uncover context-dependent drug targets. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Hidden Structural Codes in Protein Intrinsic Disorder.

    PubMed

    Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo

    2017-10-17

    Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovers the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could have not been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.

  4. Rare Variant of GM2 Gangliosidosis through Activator-Protein Deficiency.

    PubMed

    Brackmann, Florian; Kehrer, Christiane; Kustermann, Wibke; Böhringer, Judith; Krägeloh-Mann, Ingeborg; Trollmann, Regina

    2017-04-01

    GM2 gangliosidosis, AB variant, is a very rare form of GM2 gangliosidosis due to a deficiency of GM2 activator protein. We report on two patients with typical clinical features suggestive of GM2 gangliosidosis, but normal results for hexosaminidase A and hexosaminidase B as well as their corresponding genes. Genetic analysis of the gene encoding the activator protein, the GM2A gene, elucidated the cause of the disease, adding a novel mutation to the spectrum of GM2 AB variant. This report points out that in typical clinical constellations with normal enzyme results, genetic diagnostic for activator protein defects should be performed. Georg Thieme Verlag KG Stuttgart · New York.

  5. Hypolipidemic effect of dietary pea proteins: Impact on genes regulating hepatic lipid metabolism.

    PubMed

    Rigamonti, Elena; Parolini, Cinzia; Marchesi, Marta; Diani, Erika; Brambilla, Stefano; Sirtori, Cesare R; Chiesa, Giulia

    2010-05-01

    Controversial data on the lipid-lowering effect of dietary pea proteins have been provided and the mechanisms behind this effect are not completely understood. The aim of the study was to evaluate a possible hypolipidemic activity of a pea protein isolate and to determine whether pea proteins could affect the hepatic lipid metabolism through regulation of genes involved in cholesterol and fatty acid homeostasis. Rats were fed Nath's hypercholesterolemic diets for 28 days, the protein sources being casein or a pea protein isolate from Pisum sativum. After 14 and 28 days of dietary treatment, rats fed pea proteins had markedly lower plasma cholesterol and triglyceride levels than rats fed casein (p<0.05). Pea protein-fed rats displayed higher hepatic mRNA levels of LDL receptor versus those fed casein (p<0.05). Hepatic mRNA concentration of genes involved in fatty acids synthesis, such as fatty acid synthase and stearoyl-CoA desaturase, was lower in pea protein-fed rats than in rats fed casein (p<0.05). In conclusion, the present study demonstrates a marked cholesterol and triglyceride-lowering activity of pea proteins in rats. Moreover, pea proteins appear to affect cellular lipid homeostasis by upregulating genes involved in hepatic cholesterol uptake and by downregulating fatty acid synthesis genes.

  6. Protein phosphatase 2ACα gene knock-out results in cortical atrophy through activating hippo cascade in neuronal progenitor cells.

    PubMed

    Liu, Bo; Sun, Li-Hua; Huang, Yan-Fei; Guo, Li-Jun; Luo, Li-Shu

    2018-02-01

    Protein phosphatase 2ACα (PP2ACα), a vital member of the protein phosphatase family, has been studied primarily as a regulator of the development, growth and protein synthesis of many cell types. Dysfunction of PP2ACα protein results in neurodegenerative disease; however, this finding has not been directly confirmed in a mouse model with PP2ACα gene knock-out. Therefore, in the present study, we generated a PP2ACα gene knock-out mouse model using the Cre-loxP gene targeting system, in order to directly observe the regulatory role of the PP2ACα gene in the development of the mouse cerebral cortex. We observe that knocking out the PP2ACα gene in the central nervous system (CNS) results in cortical neuronal shrinkage, synaptic plasticity impairments, and learning/memory deficits. Further study reveals that PP2ACα gene knock-out initiates the Hippo cascade in cortical neuroprogenitor cells (NPCs), which blocks YAP translocation into the nuclei of NPCs. Notably, p73, directly targeted by the Hippo cascade, can bind to the promoter of glutaminase2 (GLS2), which plays a dominant role in the enzymatic regulation of the glutamate/glutamine cycle. Finally, we find that PP2ACα gene knock-out inhibits glutamine synthesis by up-regulating the activity of phosphorylated p73 in cortical NPCs. Taken together, we conclude that PP2ACα critically supports cortical neuronal growth and cognitive function by regulating the signal transduction of the Hippo-p73 cascade. In addition, PP2ACα indirectly modulates the glutamine synthesis of cortical NPCs by targeting p73, which plays a direct transcriptional regulatory role in the gene expression of GLS2. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

    PubMed

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-11-01

    The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  8. Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution

    PubMed Central

    Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.

    2014-01-01

    Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106

  9. UNC-13L, UNC-13S, and Tomosyn form a protein code for fast and slow neurotransmitter release in Caenorhabditis elegans

    PubMed Central

    Hu, Zhitao; Tong, Xia-Jing; Kaplan, Joshua M

    2013-01-01

    Synaptic transmission consists of fast and slow components of neurotransmitter release. Here we show that these components are mediated by distinct exocytic proteins. The Caenorhabditis elegans unc-13 gene is required for SV exocytosis, and encodes long and short isoforms (UNC-13L and S). Fast release was mediated by UNC-13L, whereas slow release required both UNC-13 proteins and was inhibited by Tomosyn. The spatial location of each protein correlated with its effect. Proteins adjacent to the dense projection mediated fast release, while those controlling slow release were more distal or diffuse. Two UNC-13L domains accelerated release. C2A, which binds RIM (a protein associated with calcium channels), anchored UNC-13 at active zones and shortened the latency of release. A calmodulin binding site accelerated release but had little effect on UNC-13’s spatial localization. These results suggest that UNC-13L, UNC-13S, and Tomosyn form a molecular code that dictates the timing of neurotransmitter release. DOI: http://dx.doi.org/10.7554/eLife.00967.001 PMID:23951547

  10. Molecular cloning of a Candida albicans gene (SSB1) coding for a protein related to the Hsp70 family.

    PubMed

    Maneu, V; Cervera, A M; Martinez, J P; Gozalbo, D

    1997-06-15

    We have cloned and sequenced a Candida albicans gene (SSB1) encoding a potential member of the heat-shock protein seventy (hsp70) family. The protein encoded by this gene contains 613 amino acids and shows a high degree (85%) of sequence identity to the ssb subfamily (ssb1 and ssb2) of the Saccharomyces cerevisiae hsp70 family. The transcribed mRNA (2.1 kb) is present in similar amounts both in yeast and germ tube cells of C. albicans.

  11. Osteogenic potential of the human bone morphogenetic protein 2 gene activated nanobone putty.

    PubMed

    Tian, Xiao-bin; Sun, Li; Yang, Shu-hua; Zhang, Yu-kun; Hu, Ru-yin; Fu, De-hao

    2008-04-20

    Nanobone putty is an injectable and bioresorbable bone substitute. The neutral-pH putty resembles hard bone tissue, does not contain polymers or plasticizers, and is self-setting and nearly isothermic, properties which are helpful for the adhesion, proliferation, and function of bone cells. The aim of this study was to investigate the osteogenic potential of human bone morphogenetic protein 2 (hBMP2) gene activated nanobone putty in inducing ectopic bone formation, and the effects of the hBMP2 gene activated nanobone putty on repairing bone defects. Twenty four Kunming mice were randomly divided into two groups. The nanobone putty + hBMP2 plasmid was injected into the right thigh muscle pouches of the mice (experiment side). The nanobone putty + blank plasmid or nanobone putty was injected into the left thigh muscle pouches of the group 1 (control side 1) or group 2 (control side 2), respectively. The effects of ectopic bone formation were evaluated by radiography, histology, and molecular biology analysis at 2 and 4 weeks after operation. Bilateral 15 mm radial defects were made in forty-eight rabbits. These rabbits were randomly divided into three groups: Group A, nanobone putty + hBMP2 plasmid; Group B, putty + blank plasmid; Group C, nanobone putty only. Six rabbits with left radial defects served as blank controls. The effect of bone repairing was evaluated by radiography, histology, molecular biology, and biomechanical analysis at 4, 8, and 12 weeks after operation. The tissue from the experimental side of the mice expressed hBMP2. Obvious cartilage and island-distributed immature bone formation in implants of the experiment side were observed at 2 weeks after operation, and massive mature bone observed at 4 weeks. No bone formation was observed in the control side of the mice. The ALP activity in the experiment side of the mice was higher than that in the control side. The tissue of Group A rabbits expressed hBMP2 protein and higher ALP level. 
The new bone

  12. In Vitro Anti-Echinococcal and Metabolic Effects of Metformin Involve Activation of AMP-Activated Protein Kinase in Larval Stages of Echinococcus granulosus.

    PubMed

    Loos, Julia A; Cumino, Andrea C

    2015-01-01

    Metformin (Met) is a biguanide anti-hyperglycemic agent, which also exerts antiproliferative effects on cancer cells. This drug inhibits complex I of the mitochondrial electron transport chain, inducing a fall in the cell energy charge and leading to 5'-AMP-activated protein kinase (AMPK) activation. AMPK is a highly conserved heterotrimeric complex that coordinates metabolic and growth pathways in order to maintain energy homeostasis and cell survival, mainly under nutritional stress conditions, in a Liver Kinase B1 (LKB1)-dependent manner. This work describes, for the first time, the in vitro anti-echinococcal effect of Met on Echinococcus granulosus larval stages, as well as the molecular characterization of AMPK (Eg-AMPK) in this parasite of clinical importance. The drug exerted a dose-dependent effect on the viability of both larval stages. Based on this, we proceeded with the identification of the genes encoding the different subunits of Eg-AMPK. We cloned one gene coding for the catalytic subunit (Eg-ampkɑ) and two genes coding for the regulatory subunits (Eg-ampkβ and Eg-ampkγ), all of them constitutively transcribed in E. granulosus protoscoleces and metacestodes. Their deduced amino acid sequences show all the conserved functional domains, including key amino acids involved in catalytic activity and protein-protein interactions. In protoscoleces, the drug induced the activation of AMPK (Eg-AMPKɑ-P176), possibly as a consequence of cellular energy charge depletion evidenced by assays with the fluorescent indicator JC-1. Met also led to carbohydrate starvation, increased glucogenolysis and homolactic fermentation, and decreased transcription of intermediary metabolism genes. By in toto immunolocalization assays, we detected Eg-AMPKɑ-P176 expression both in the nucleus and the cytoplasm of cells, as well as in the larval tegument, the posterior bladder and the calcareous corpuscles of control and Met-treated protoscoleces. Interestingly, expression of Eg

  13. In Vitro Anti-Echinococcal and Metabolic Effects of Metformin Involve Activation of AMP-Activated Protein Kinase in Larval Stages of Echinococcus granulosus

    PubMed Central

    Loos, Julia A.; Cumino, Andrea C.

    2015-01-01

    Metformin (Met) is a biguanide anti-hyperglycemic agent, which also exerts antiproliferative effects on cancer cells. This drug inhibits complex I of the mitochondrial electron transport chain, inducing a fall in the cell energy charge and leading to 5'-AMP-activated protein kinase (AMPK) activation. AMPK is a highly conserved heterotrimeric complex that coordinates metabolic and growth pathways in order to maintain energy homeostasis and cell survival, mainly under nutritional stress conditions, in a Liver Kinase B1 (LKB1)-dependent manner. This work describes, for the first time, the in vitro anti-echinococcal effect of Met on Echinococcus granulosus larval stages, as well as the molecular characterization of AMPK (Eg-AMPK) in this parasite of clinical importance. The drug exerted a dose-dependent effect on the viability of both larval stages. Based on this, we proceeded with the identification of the genes encoding the different subunits of Eg-AMPK. We cloned one gene coding for the catalytic subunit (Eg-ampkɑ) and two genes coding for the regulatory subunits (Eg-ampkβ and Eg-ampkγ), all of them constitutively transcribed in E. granulosus protoscoleces and metacestodes. Their deduced amino acid sequences show all the conserved functional domains, including key amino acids involved in catalytic activity and protein-protein interactions. In protoscoleces, the drug induced the activation of AMPK (Eg-AMPKɑ-P176), possibly as a consequence of cellular energy charge depletion evidenced by assays with the fluorescent indicator JC-1. Met also led to carbohydrate starvation, increased glucogenolysis and homolactic fermentation, and decreased transcription of intermediary metabolism genes. By in toto immunolocalization assays, we detected Eg-AMPKɑ-P176 expression both in the nucleus and the cytoplasm of cells, as well as in the larval tegument, the posterior bladder and the calcareous corpuscles of control and Met-treated protoscoleces. Interestingly, expression of Eg

  14. Non-coding RNAs—Novel targets in neurotoxicity

    PubMed Central

    Tal, Tamara L.; Tanguay, Robert L.

    2012-01-01

    Over the past ten years non-coding RNAs (ncRNAs) have emerged as pivotal players in fundamental physiological and cellular processes and have been increasingly implicated in cancer, immune disorders, and cardiovascular, neurodegenerative, and metabolic diseases. MicroRNAs (miRNAs) represent a class of ncRNA molecules that function as negative regulators of post-transcriptional gene expression. miRNAs are predicted to regulate 60% of all human protein-coding genes and as such, play key roles in cellular and developmental processes, human health, and disease. Relative to counterparts that lack bindings sites for miRNAs, genes encoding proteins that are post-transcriptionally regulated by miRNAs are twice as likely to be sensitive to environmental chemical exposure. Not surprisingly, miRNAs have been recognized as targets or effectors of nervous system, developmental, hepatic, and carcinogenic toxicants, and have been identified as putative regulators of phase I xenobiotic-metabolizing enzymes. In this review, we give an overview of the types of ncRNAs and highlight their roles in neurodevelopment, neurological disease, activity-dependent signaling, and drug metabolism. We then delve into specific examples that illustrate their importance as mediators, effectors, or adaptive agents of neurotoxicants or neuroactive pharmaceutical compounds. Finally, we identify a number of outstanding questions regarding ncRNAs and neurotoxicity. PMID:22394481

  15. Molecular characterization of a phloem-specific gene encoding the filament protein, phloem protein 1 (PP1), from Cucurbita maxima.

    PubMed

    Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A

    1997-07-01

    Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lectin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation after which mRNA levels declined, while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled the phloem lectin, further suggesting an interaction between these phloem-specific proteins.

  16. Activation of AMP-activated Protein Kinase by Metformin Induces Protein Acetylation in Prostate and Ovarian Cancer Cells*

    PubMed Central

    Galdieri, Luciano; Gatla, Himavanth; Vancurova, Ivana; Vancura, Ales

    2016-01-01

    AMP-activated protein kinase (AMPK) is an energy sensor and master regulator of metabolism. AMPK functions as a fuel gauge monitoring systemic and cellular energy status. Activation of AMPK occurs when the intracellular AMP/ATP ratio increases and leads to a metabolic switch from anabolism to catabolism. AMPK phosphorylates and inhibits acetyl-CoA carboxylase (ACC), which catalyzes carboxylation of acetyl-CoA to malonyl-CoA, the first and rate-limiting reaction in de novo synthesis of fatty acids. AMPK thus regulates homeostasis of acetyl-CoA, a key metabolite at the crossroads of metabolism, signaling, chromatin structure, and transcription. Nucleocytosolic concentration of acetyl-CoA affects histone acetylation and links metabolism and chromatin structure. Here we show that activation of AMPK with the widely used antidiabetic drug metformin or with the AMP mimetic 5-aminoimidazole-4-carboxamide ribonucleotide increases the inhibitory phosphorylation of ACC and decreases the conversion of acetyl-CoA to malonyl-CoA, leading to increased protein acetylation and altered gene expression in prostate and ovarian cancer cells. Direct inhibition of ACC with allosteric inhibitor 5-(tetradecyloxy)-2-furoic acid also increases acetylation of histones and non-histone proteins. Because AMPK activation requires liver kinase B1, metformin does not induce protein acetylation in liver kinase B1-deficient cells. Together, our data indicate that AMPK regulates the availability of nucleocytosolic acetyl-CoA for protein acetylation and that AMPK activators, such as metformin, have the capacity to increase protein acetylation and alter patterns of gene expression, further expanding the plethora of metformin's physiological effects. PMID:27733682

  17. Isolation and characterization of the promoter sequence of a cassava gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in storage roots.

    PubMed

    de Souza, C R; Aragão, F J; Moreira, E C O; Costa, C N M; Nascimento, S B; Carvalho, L J

    2009-03-24

    Cassava is one of the most important tropical food crops for more than 600 million people worldwide. Transgenic technologies can be useful for increasing its nutritional value and its resistance to viral diseases and insect pests. However, tissue-specific promoters that guarantee correct expression of transgenes would be necessary. We used inverse polymerase chain reaction to isolate a promoter sequence of the Mec1 gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in cassava storage roots. In silico analysis revealed putative cis-acting regulatory elements within this promoter sequence, including root-specific elements that may be required for its expression in vascular tissues. Transient expression experiments showed that the Mec1 promoter is functional, since this sequence was able to drive GUS expression in bean embryonic axes. Results from our computational analysis can serve as a guide for functional experiments to identify regions with tissue-specific Mec1 promoter activity. The DNA sequence that we identified is a new promoter that could be a candidate for genetic engineering of cassava roots.

  18. Calpain activation by ROS mediates human ether-a-go-go-related gene protein degradation by intermittent hypoxia.

    PubMed

    Wang, N; Kang, H S; Ahmmed, G; Khan, S A; Makarenko, V V; Prabhakar, N R; Nanduri, J

    2016-03-01

    Human ether-a-go-go-related gene (hERG) channels conduct delayed rectifier K(+) current. However, little information is available on physiological situations affecting hERG channel protein and function. In the present study we examined the effects of intermittent hypoxia (IH), which is a hallmark manifestation of sleep apnea, on hERG channel protein and function. Experiments were performed on SH-SY5Y neuroblastoma cells, which express hERG protein. Cells were exposed to IH consisting of alternating cycles of 30 s of hypoxia (1.5% O2) and 5 min of 20% O2. IH decreased hERG protein expression in a stimulus-dependent manner. A similar reduction in hERG protein was also seen in adrenal medullary chromaffin cells from IH-exposed neonatal rats. The decreased hERG protein was associated with attenuated hERG K(+) current. IH-evoked hERG protein degradation was not due to reduced transcription or increased proteasome/lysosomal degradation. Rather, it was mediated by calcium-activated calpain proteases. Both COOH- and NH2-terminal sequences of the hERG protein were the targets of calpain-dependent degradation. IH increased reactive oxygen species (ROS) levels, intracellular Ca(2+) concentration ([Ca(2+)]i), calpain enzyme activity, and hERG protein degradation, and all these effects were prevented by manganese-(III)-tetrakis-(1-methyl-4-pyridyl)-porphyrin pentachloride, a membrane-permeable ROS scavenger. These results demonstrate that activation of calpains by ROS-dependent elevation of [Ca(2+)]i mediates hERG protein degradation by IH. Copyright © 2016 the American Physiological Society.

  19. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.

    PubMed

    Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi

    2007-10-04

    In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic, linkable, three-color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.

  20. Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

    PubMed Central

    Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789

  1. Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

    PubMed

    Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.

  2. Identification, Nomenclature, and Evolutionary Relationships of Mitogen-Activated Protein Kinase (MAPK) Genes in Soybean

    PubMed Central

    Neupane, Achal; Nepal, Madhav P.; Piya, Sarbottam; Subramanian, Senthil; Rohila, Jai S.; Reese, R. Neil; Benson, Benjamin V.

    2013-01-01

    Mitogen-activated protein kinase (MAPK) genes in eukaryotes regulate various developmental and physiological processes including those associated with biotic and abiotic stresses. Although MAPKs in some plant species including Arabidopsis have been identified, they are yet to be identified in soybean. Major objectives of this study were to identify GmMAPKs, assess their evolutionary relationships, and analyze their functional divergence. We identified a total of 38 MAPKs, eleven MAPKKs, and 150 MAPKKKs in soybean. Within the GmMAPK family, we also identified a new clade of six genes: four genes with TEY and two genes with TQY motifs requiring further investigation into possible legume-specific functions. The results indicated the expansion of the GmMAPK families attributable to the ancestral polyploidy events followed by chromosomal rearrangements. The GmMAPK and GmMAPKKK families were substantially larger than those in other plant species. The duplicated GmMAPK members presented complex evolutionary relationships and functional divergence when compared to their counterparts in Arabidopsis. We also highlighted existing nomenclatural issues, stressing the need for nomenclatural consistency. GmMAPK identification is vital to soybean crop improvement, and novel insights into the evolutionary relationships will enhance our understanding about plant genome evolution. PMID:24137047

  3. Molecular cloning of chitinase 33 (chit33) gene from Trichoderma atroviride

    PubMed Central

    Matroudi, S.; Zamani, M.R.; Motallebi, M.

    2008-01-01

    In this study, Trichoderma atroviride was selected as an overproducer of chitinase enzyme among 30 different isolates of Trichoderma sp. on the basis of chitinase specific activity. From this isolate, the genomic and cDNA clones encoding chit33 have been isolated and sequenced. Comparison of genomic and cDNA sequences for defining gene structure indicates that this gene contains three short introns and also an open reading frame coding for a protein of 321 amino acids. The deduced amino acid sequence includes a 19 aa putative signal peptide. Homology between this sequence and other reported Trichoderma Chit33 proteins is discussed. The coding sequence of the chit33 gene was cloned in the pET26b(+) expression vector and expressed in E. coli. PMID:24031242

  4. Modeling the Activity of Single Genes

    NASA Technical Reports Server (NTRS)

    Mjolsness, Eric; Gibson, Michael

    1999-01-01

    -scale sequencing began with simple organisms, viruses and bacteria, progressed to eukaryotes such as yeast, and more recently (1998) progressed to a multi-cellular animal, the nematode Caenorhabditis elegans. Sequencers have now moved on to the fruit fly Drosophila melanogaster, whose sequence is slated for completion by the end of 1999. The human genome project is expected to determine the complete sequence of all 3 billion bases of human DNA within the next five years. In the wake of genome-scale sequencing, further instrumentation is being developed to assay gene expression and function on a comparably large scale. Much of the work in computational biology focuses on computational tools used in sequencing, finding genes that are related to a particular gene, finding which parts of the DNA code for proteins and which do not, understanding what proteins will be formed from a given length of DNA, predicting how the proteins will fold from a one-dimensional structure into a three-dimensional structure, and so on. Much less computational work has been done regarding the function of proteins. One reason for this is that different proteins function very differently, and so work on protein function is very specific to certain classes of proteins. There are, for example, proteins such as enzymes that catalyze various intracellular reactions, receptors that respond to extracellular signals, and ion channels that regulate the flow of charged particles into and out of the cell. In this chapter, we will consider a particular class of proteins called transcription factors (TFs), which are responsible for regulating when a certain gene is expressed in a certain cell, which cells it is expressed in, and how much is expressed. Understanding these processes will involve developing a deeper understanding of transcription, translation, and the cellular processes that control those processes. All of these elements fall under the aegis of gene regulation or, more narrowly, transcriptional regulation. Some of

  5. A class of circadian long non-coding RNAs mark enhancers modulating long-range circadian gene regulation

    PubMed Central

    Fan, Zenghua; Zhao, Meng; Joshi, Parth D.; Li, Ping; Zhang, Yan; Guo, Weimin; Xu, Yichi; Wang, Haifang; Zhao, Zhihu

    2017-01-01

    Circadian rhythm exerts its influence on animal physiology and behavior by regulating gene expression at various levels. Here we systematically explored circadian long non-coding RNAs (lncRNAs) in mouse liver and examined their circadian regulation. We found that a significant proportion of circadian lncRNAs are expressed at enhancer regions, mostly bound by two key circadian transcription factors, BMAL1 and REV-ERBα. These circadian lncRNAs showed circadian phases similar to those of their nearby genes. The extent of their nuclear localization is higher than that of protein-coding genes but lower than that of enhancer RNAs. The association between enhancers and circadian lncRNAs is also observed in tissues other than liver. Comparative analysis between mouse and rat circadian liver transcriptomes showed that circadian transcription at lncRNA loci tends to be conserved despite the low sequence conservation of lncRNAs. One such circadian lncRNA, termed lnc-Crot, led us to identify a super-enhancer region interacting with a cluster of genes involved in circadian regulation of metabolism through long-range interactions. Further experiments showed that the lnc-Crot locus has enhancer function independent of lnc-Crot's transcription. Our results suggest that the enhancer-associated circadian lncRNAs mark the genomic loci modulating long-range circadian gene regulation and shed new light on the evolutionary origin of lncRNAs. PMID:28335007

  6. ReTrOS: a MATLAB toolbox for reconstructing transcriptional activity from gene and protein expression data.

    PubMed

    Minas, Giorgos; Momiji, Hiroshi; Jenkins, Dafyd J; Costa, Maria J; Rand, David A; Finkenstädt, Bärbel

    2017-06-26

    Given the development of high-throughput experimental techniques, an increasing number of whole genome transcription profiling time series data sets, with good temporal resolution, are becoming available to researchers. The ReTrOS toolbox (Reconstructing Transcription Open Software) provides MATLAB-based implementations of two related methods, namely ReTrOS-Smooth and ReTrOS-Switch, for reconstructing the temporal transcriptional activity profile of a gene from given mRNA expression time series or protein reporter time series. The methods are based on fitting a differential equation model incorporating the processes of transcription, translation and degradation. The toolbox provides a framework for model fitting along with statistical analyses of the model with a graphical interface and model visualisation. We highlight several applications of the toolbox, including the reconstruction of the temporal cascade of transcriptional activity inferred from mRNA expression data and protein reporter data in the core circadian clock in Arabidopsis thaliana, and how such reconstructed transcription profiles can be used to study the effects of different cell lines and conditions. The ReTrOS toolbox allows users to analyse gene and/or protein expression time series where, with appropriate formulation of prior information about a minimum of kinetic parameters, in particular rates of degradation, users are able to infer timings of changes in transcriptional activity. Data from any organism and obtained from a range of technologies can be used as input due to the flexible and generic nature of the model and implementation. The output from this software provides a useful analysis of time series data and can be incorporated into further modelling approaches or in hypothesis generation.
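
The kind of differential equation model described in this abstract (transcription, translation and degradation) can be illustrated with a minimal sketch. This is not the ReTrOS implementation, which is a MATLAB toolbox; the equations below are the generic two-variable mRNA/protein model, and every name and parameter value (`simulate_expression`, `switch_profile`, the rates) is an assumption chosen only for illustration.

```python
def simulate_expression(tau, delta_m, alpha, delta_p, t_end,
                        dt=0.01, m0=0.0, p0=0.0):
    """Forward-Euler integration of a generic expression model:

        dM/dt = tau(t) - delta_m * M      (transcription, mRNA degradation)
        dP/dt = alpha * M - delta_p * P   (translation, protein degradation)

    tau is a function of time giving the transcriptional activity.
    Returns a list of (time, mRNA, protein) tuples.
    """
    steps = int(round(t_end / dt))
    m, p = m0, p0
    trajectory = [(0.0, m, p)]
    for i in range(steps):
        t = i * dt
        dm = tau(t) - delta_m * m
        dp = alpha * m - delta_p * p
        m += dt * dm
        p += dt * dp
        trajectory.append(((i + 1) * dt, m, p))
    return trajectory


def switch_profile(t):
    """Hypothetical switch-like activity: on between t=5 and t=15, off otherwise."""
    return 2.0 if 5.0 <= t < 15.0 else 0.0


if __name__ == "__main__":
    # Illustrative rates only; real degradation rates come from prior knowledge.
    run = simulate_expression(switch_profile, delta_m=0.5, alpha=1.0,
                              delta_p=0.25, t_end=30.0)
    for t, m, p in run[::500]:
        print(f"t={t:5.1f}  mRNA={m:6.3f}  protein={p:6.3f}")
```

Reconstruction runs this logic in reverse: given observed mRNA or protein time series and prior information on the degradation rates, one infers when the activity `tau(t)` changed. The forward simulation above only shows why degradation rates matter, since they set how quickly the observed levels follow a change in transcription.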

  7. The Coding of Biological Information: From Nucleotide Sequence to Protein Recognition

    NASA Astrophysics Data System (ADS)

    Štambuk, Nikola

    The paper reviews the classic results of Swanson, Dayhoff, Grantham, Blalock and Root-Bernstein, which link genetic code nucleotide patterns to the protein structure, evolution and molecular recognition. Symbolic representation of the binary addresses defining particular nucleotide and amino acid properties is discussed, with consideration of: structure and metric of the code, direct correspondence between amino acid and nucleotide information, and molecular recognition of the interacting protein motifs coded by the complementary DNA and RNA strands.

  8. Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Su, Y.; Zhang, H.; Madrid, R.

    1994-09-01

    Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene, which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses is to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pairs to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent labels was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and, when possible, by restriction enzyme analysis. This protocol can be used to distinguish CMT1B patients from other CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.

  9. Gene and protein nomenclature in public databases

    PubMed Central

    Fundel, Katrin; Zimmer, Ralf

    2006-01-01

    Background: Frequently, several alternative names are in use for biological objects such as genes and proteins. Applications like manual literature search, automated text-mining, named entity identification, gene/protein annotation, and linking of knowledge from different information sources require the knowledge of all used names referring to a given gene or protein. Various organism-specific or general public databases aim at organizing knowledge about genes and proteins. These databases can be used for deriving gene and protein name dictionaries. So far, little is known about the differences between databases in terms of size, ambiguities and overlap. Results: We compiled five gene and protein name dictionaries for each of the five model organisms (yeast, fly, mouse, rat, and human) from different organism-specific and general public databases. We analyzed the degree of ambiguity of gene and protein names within and between dictionaries, to a lexicon of common English words and domain-related non-gene terms, and we compared different data sources in terms of size of extracted dictionaries and overlap of synonyms between those. The study shows that the number of genes/proteins and synonyms covered in individual databases varies significantly for a given organism, and that the degree of ambiguity of synonyms varies significantly between different organisms. Furthermore, it shows that, despite considerable efforts of co-curation, the overlap of synonyms in different data sources is rather moderate and that the degree of ambiguity of gene names with common English words and domain-related non-gene terms varies depending on the considered organism. Conclusion: In conclusion, these results indicate that the combination of data contained in different databases allows the generation of gene and protein name dictionaries that contain significantly more used names than dictionaries obtained from individual data sources. Furthermore, curation of combined dictionaries

  10. [Gene expression and activity regulation of two calmodulin binding protein kinases in tobacco seedling].

    PubMed

    Hua, Wei; Li, Rong-Jun; Liang, Shu-Ping; Lu, Ying-Tang

    2005-06-01

    Two different calmodulin-binding protein kinase cDNAs (NtCBK1/2) have been isolated from tobacco. To understand how CBK protein activity is regulated, we compared the regulation of NtCBK1 and NtCBK2 activity by pH, Mg(2+) concentration and Na(+) concentration. We found that autophosphorylation of NtCBK1 and NtCBK2 reached its maximum at pH 7.5 and 8, respectively. Mg(2+) and Na(+) showed different effects on the activity of NtCBKs: both high and low Mg(2+) concentrations inhibited their activity, whereas Na(+) had little effect on the kinase activity. In addition, to obtain further insight into the physiological roles of individual NtCBKs, we determined the expression profiles of the CBKs. The results revealed different patterns of expression of NtCBK1 and NtCBK2. Both are largely expressed in leaf and flower, but in stem and root the NtCBK1 gene had stronger expression than NtCBK2. NtCBK2 expression was induced by GA treatment, while NtCBK1 expression remained unchanged under GA treatment. Expression of both NtCBK1 and NtCBK2 increased in response to salt stress, the former to a greater extent, and neither changed under high/low temperature, drought, NAA or ABA treatments.

  11. Analysis of bHLH coding genes using gene co-expression network approach.

    PubMed

    Srivastava, Swati; Sanchita; Singh, Garima; Singh, Noopur; Srivastava, Gaurava; Sharma, Ashok

    2016-07-01

    Network analysis provides a powerful framework for the interpretation of data. It uses novel reference network-based metrics for module evolution. These can be used to identify modules of highly connected genes showing variation in a co-expression network. In this study, a co-expression network-based approach was used for analyzing genes from microarray data. Our approach consists of a simple but robust rank-based network construction. The publicly available gene expression data of Solanum tuberosum under cold and heat stresses were used to create and analyze a gene co-expression network. The analysis provides a highly co-expressed module of bHLH coding genes based on correlation values. Our approach was to analyze the variation in gene expression according to the time period of stress through a co-expression network approach. As a result, seed genes showing multiple connections with other genes in the same cluster were identified. The seed genes were found to vary across different time periods of stress. These seed genes may be utilized further as marker genes for developing stress-tolerant plant species.
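
A rank-based co-expression network of the general kind mentioned in this abstract can be sketched as follows. The abstract does not specify the authors' construction, so this is only one common reading: Spearman correlation (Pearson correlation of ranks) between expression profiles, an edge wherever the absolute correlation passes a threshold, and "seed" genes taken as the most connected nodes. The function names, the threshold of 0.9 and the use of absolute correlation are all assumptions.

```python
from itertools import combinations


def ranks(values):
    """Return 1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either profile is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def coexpression_network(expr, threshold=0.9):
    """Connect gene pairs whose absolute Spearman correlation >= threshold.

    expr maps gene name -> list of expression values over the same samples.
    """
    ranked = {gene: ranks(profile) for gene, profile in expr.items()}
    edges = set()
    for a, b in combinations(sorted(ranked), 2):
        if abs(pearson(ranked[a], ranked[b])) >= threshold:
            edges.add((a, b))
    return edges


def hub_genes(edges):
    """Genes with the highest degree, used here as a stand-in for seed genes."""
    degree = {}
    for a, b in edges:
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    if not degree:
        return []
    top = max(degree.values())
    return sorted(g for g, d in degree.items() if d == top)
```

Working on ranks rather than raw intensities makes the edges insensitive to monotone rescaling of the microarray signal, which is one plausible reason to call such a construction robust.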

  12. GTP cyclohydrolase I gene transfer augments intracellular tetrahydrobiopterin in human endothelial cells: effects on nitric oxide synthase activity, protein levels and dimerisation.

    PubMed

    Cai, Shijie; Alp, Nicholas J; McDonald, Denise; Smith, Ian; Kay, Jonathan; Canevari, Laura; Heales, Simon; Channon, Keith M

    2002-09-01

    Tetrahydrobiopterin (BH4) is an essential cofactor for endothelial nitric oxide synthase (eNOS) activity. BH4 levels are regulated by de novo biosynthesis; the rate-limiting enzyme is GTP cyclohydrolase I (GTPCH). BH4 activates and promotes homodimerisation of purified eNOS protein, but the intracellular mechanisms underlying BH4-mediated eNOS regulation in endothelial cells remain less clear. We aimed to investigate the role of BH4 levels in intracellular eNOS regulation, by targeting the BH4 synthetic pathway as a novel strategy to modulate intracellular BH4 levels. We constructed a recombinant adenovirus, AdGCH, encoding human GTPCH. We infected human endothelial cells with AdGCH, investigated the changes in intracellular biopterin levels, and determined the effects on eNOS enzymatic activity, protein levels and dimerisation. GTPCH gene transfer in EAhy926 endothelial cells increased BH4 >10-fold compared with controls (cells alone or control adenovirus infection), and greatly enhanced NO production in a dose-dependent, eNOS-specific manner. We found that eNOS was principally monomeric in control cells, whereas GTPCH gene transfer resulted in a striking increase in eNOS homodimerisation. Furthermore, the total amounts of both native eNOS protein and a recombinant eNOS-GFP fusion protein were significantly increased following GTPCH gene transfer. These findings suggest that GTPCH gene transfer is a valid approach to increase BH4 levels in human endothelial cells, and provide new evidence for the relative importance of different mechanisms underlying BH4-mediated eNOS regulation in intact human endothelial cells. Additionally, these observations suggest that GTPCH may be a rational target to augment endothelial BH4 and normalise eNOS activity in endothelial dysfunction states.

  13. Activation of endothelial-leukocyte adhesion molecule 1 (ELAM-1) gene transcription

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Montgomery, K.F.; Tarr, P.I.; Bomsztyk, K.

    1991-08-01

    Leukocyte adherence to endothelium is in part mediated by the transient expression of endothelial-leukocyte adhesion molecule 1 (ELAM-1) on endothelial surfaces stimulated by tumor necrosis factor α (TNF), interleukin (IL) 1, or bacterial lipopolysaccharide (LPS). The intracellular factors controlling induction of ELAM-1 mRNA and protein are unknown. In nuclear runoff experiments with cultured human umbilical vein endothelial cells (HUVEC), the authors demonstrate that transcriptional activation of the ELAM-1 gene occurs following stimulation with TNF. Sequence analysis of the 5′ flanking region of the ELAM-1 gene reveals consensus DNA-binding sequences for two known transcription factors, NF-κB and AP-1. Gel mobility shift assays demonstrate that TNF, IL-1, or LPS induces activation of NF-κB-like DNA binding activity in HUVEC. Phorbol 12-myristate 13-acetate, a known activator of protein kinase C (PKC), weakly induces NF-κB-like activity, ELAM-1 mRNA, and ELAM-1 surface expression in HUVEC. However, TNF, IL-1, and LPS do not activate PKC in HUVEC at doses that strongly induce NF-κB-like protein activation and ELAM-1 gene expression. PKC blockade with H7 does not inhibit activation of these NF-κB-like proteins but does inhibit ELAM-1 gene transcription. They conclude that PKC-independent activation of NF-κB in HUVEC with TNF, IL-1, or LPS is associated with, but not sufficient for, activation of ELAM-1 gene transcription.

  14. Androgen receptor stimulates bone sialoprotein (BSP) gene transcription via cAMP response element and activator protein 1/glucocorticoid response elements.

    PubMed

    Takai, Hideki; Nakayama, Youhei; Kim, Dong-Soon; Arai, Masato; Araki, Shouta; Mezawa, Masaru; Nakajima, Yu; Kato, Naoko; Masunaga, Hiroshi; Ogata, Yorimasa

    2007-09-01

    Bone sialoprotein (BSP) is an early marker of osteoblast differentiation. Androgens are steroid hormones that are essential for skeletal development. The androgen receptor (AR) is a transcription factor and a member of the steroid receptor superfamily that plays an important role in male sexual differentiation and prostate cell proliferation. To determine the molecular mechanism involved in the stimulation of bone formation, we analyzed the effects of androgens and AR on BSP gene transcription. AR protein levels were increased after AR overexpression in ROS17/2.8 cells. BSP mRNA levels were increased by AR overexpression. However, the endogenous and overexpressed BSP mRNA levels were not changed by DHT (10(-8) M, 24 h). Whereas luciferase (LUC) activities in all constructs, including a short construct (nts -116 to +60), were increased by AR overexpression, the basal and LUC activities enhanced by AR overexpression were not induced by DHT (10(-8) M, 24 h). The effect of AR overexpression was abrogated by 2 bp mutations in either the cAMP response element (CRE) or activator protein 1 (AP1)/glucocorticoid response element (GRE). Gel shift analyses showed that AR overexpression increased binding to the CRE and AP1/GRE elements. Notably, the CRE-protein complexes were supershifted by phospho-CREB antibody, and CREB, c-Fos, c-Jun, and AR antibodies disrupted complex formation. The AP1/GRE-protein complexes were supershifted by c-Fos antibody, and c-Jun and AR antibodies disrupted complex formation. These studies demonstrate that AR stimulates BSP gene transcription by targeting the CRE and AP1/GRE elements in the promoter of the rat BSP gene.

  15. A Hox Gene, Antennapedia, Regulates Expression of Multiple Major Silk Protein Genes in the Silkworm Bombyx mori.

    PubMed

    Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki

    2016-03-25

    Hox genes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hox genes can also function in terminally differentiated tissue of the lepidopteran Bombyx mori. In this species, Antennapedia (Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antp can regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antp in the posterior silk gland induced ectopic expression of major silk protein genes such as sericin-3, fhxh4, and fhxh5. These genes are normally expressed specifically in the middle silk gland, as is Antp. Therefore, the evidence strongly suggests that Antp activates these silk protein genes in the middle silk gland. The putative sericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antp directly activates their expression. We also found that the pattern of gene expression was well conserved between B. mori and the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori. We suggest that Hox genes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  16. Characterization of the novel antifungal protein PgAFP and the encoding gene of Penicillium chrysogenum.

    PubMed

    Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A

    2010-04-01

    The strain RP42C from Penicillium chrysogenum produces a small protein, PgAFP, that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6494 Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70 bp was amplified by PCR from the pgafp gene. The 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70 bp segment. The complete pgafp sequence of 404 bp was obtained using primers designed from the 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279 bp coding region interrupted by two introns of 63 and 62 bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature PgAFP of 58 amino acids. The deduced amino acid sequence of the mature protein shares 79% identity with the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as a safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  17. Isolation and characterization of the gene coding for Escherichia coli arginyl-tRNA synthetase.

    PubMed Central

    Eriani, G; Dirheimer, G; Gangloff, J

    1989-01-01

    The gene coding for Escherichia coli arginyl-tRNA synthetase (argS) was isolated as a fragment of 2.4 kb after analysis and subcloning of recombinant plasmids from the Clarke and Carbon library. The clone bearing the gene overproduces arginyl-tRNA synthetase by a factor of 100. This means that the enzyme represents more than 20% of the total cellular protein content. Sequencing revealed that the fragment contains a unique open reading frame of 1734 bp flanked at its 5' and 3' ends by 247 bp and 397 bp, respectively. The length of the corresponding protein (577 aa) is consistent with the earlier Mr determination (about 70 kd). Primer extension analysis of the ArgRS mRNA by reverse transcriptase located its 5' end at 8 and 30 nucleotides downstream of a TATA and a TTGAC-like element (CTGAC), respectively, and 60 nucleotides upstream of the unusual translation initiation codon GUG; nuclease S1 analysis located the 3'-end at 48 bp downstream of the translation termination codon. argS has a codon usage pattern typical of highly expressed E. coli genes. With the exception of the presence of a HVGH sequence similar to the HIGH consensus element, ArgRS has no relevant sequence homologies with other aminoacyl-tRNA synthetases. PMID:2668891
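
The lengths quoted in this abstract can be cross-checked with simple arithmetic: the flanks plus the open reading frame should add up to the 2.4 kb fragment, and the open reading frame, less one stop codon, should give the protein length. The sketch below does this; the function name is hypothetical, and the mean residue mass of 110 Da is a common rule of thumb, not a value taken from the abstract.

```python
def orf_summary(orf_bp, upstream_bp, downstream_bp, mean_residue_da=110.0):
    """Derive fragment size, protein length and a rough mass from ORF length.

    Assumes the ORF length includes exactly one stop codon and that the
    average amino acid residue weighs about mean_residue_da daltons
    (an approximation, not a measured value).
    """
    if orf_bp % 3 != 0:
        raise ValueError("an open reading frame must be a multiple of 3 bp")
    codons = orf_bp // 3
    residues = codons - 1  # the stop codon encodes no amino acid
    return {
        "fragment_bp": upstream_bp + orf_bp + downstream_bp,
        "codons": codons,
        "residues": residues,
        "approx_mass_kda": residues * mean_residue_da / 1000.0,
    }


if __name__ == "__main__":
    # Values from the abstract: 1734 bp ORF, 247 bp 5' flank, 397 bp 3' flank.
    summary = orf_summary(1734, 247, 397)
    for key, value in summary.items():
        print(f"{key}: {value}")
```

With the abstract's numbers this gives 2378 bp for the fragment (about 2.4 kb) and 577 residues, matching the reported values. The rough mass estimate comes out somewhat below the reported "about 70 kd", which is unsurprising for a rule-of-thumb figure compared against an older experimental Mr.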

  18. [Long non-coding RNAs in the pathophysiology of atherosclerosis].

    PubMed

    Novak, Jan; Vašků, Julie Bienertová; Souček, Miroslav

    2018-01-01

    The human genome contains about 22 000 protein-coding genes that are transcribed to an even larger number of messenger RNAs (mRNA). Interestingly, the results of the ENCODE project from 2012 show that, despite up to 90% of our genome being actively transcribed, protein-coding mRNAs make up only 2-3% of the total amount of transcribed RNA. The rest of the RNA transcripts are not translated to proteins, which is why they are referred to as "non-coding RNAs". Earlier, non-coding RNA was considered "the dark matter of the genome", or "junk", whose genes have accumulated in our DNA during the course of evolution. Today we know that non-coding RNAs fulfil a variety of regulatory functions in our body: they intervene in epigenetic processes from chromatin remodelling to histone methylation, in the transcription process itself, and even in post-transcriptional processes. Long non-coding RNAs (lncRNA) are the class of non-coding RNAs that are more than 200 nucleotides in length (non-coding RNAs less than 200 nucleotides in length are called small non-coding RNAs). lncRNAs represent a large and widely varied group of molecules with diverse regulatory functions. They can be identified in all conceivable cell types and tissues, and even in the extracellular space, which includes blood, specifically plasma. Their levels change during the course of organogenesis, they are specific to different tissues, and they also change along with the development of different illnesses, including atherosclerosis. This review article aims to present the topic of lncRNAs in general and then focuses on some of their specific representatives in relation to the process of atherosclerosis (i.e. we describe lncRNA involvement in the biology of endothelial cells, vascular smooth muscle cells or immune cells), and we further describe the possible clinical potential of lncRNAs, whether in diagnostics or therapy of atherosclerosis and its clinical manifestations.

  19. Long Non-Coding RNAs (lncRNAs) of Sea Cucumber: Large-Scale Prediction, Expression Profiling, Non-Coding Network Construction, and lncRNA-microRNA-Gene Interaction Analysis of lncRNAs in Apostichopus japonicus and Holothuria glaberrima During LPS Challenge and Radial Organ Complex Regeneration.

    PubMed

    Mu, Chuang; Wang, Ruijia; Li, Tianqi; Li, Yuqiang; Tian, Meilin; Jiao, Wenqian; Huang, Xiaoting; Zhang, Lingling; Hu, Xiaoli; Wang, Shi; Bao, Zhenmin

    2016-08-01

    Long non-coding RNA (lncRNA) structurally resembles mRNA but cannot be translated into protein. Although the systematic identification and characterization of lncRNAs have been increasingly reported in model species, information concerning non-model species is still lacking. Here, we report the first systematic identification and characterization of lncRNAs in two sea cucumber species: (1) Apostichopus japonicus during lipopolysaccharide (LPS) challenge and in healthy tissues and (2) Holothuria glaberrima during radial organ complex regeneration, using RNA-seq datasets and bioinformatics analysis. We identified A. japonicus and H. glaberrima lncRNAs that were differentially expressed during LPS challenge and radial organ complex regeneration, respectively. Notably, the predicted lncRNA-microRNA-gene trinities revealed that, in addition to targeting protein-coding transcripts, miRNAs might also target lncRNAs, thereby participating in a potential novel layer of regulatory interactions among non-coding RNA classes in echinoderms. Furthermore, the constructed coding-non-coding network implied the potential involvement of lncRNA-gene interactions during the regulation of several important genes (e.g., Toll-like receptor 1 [TLR1] and transglutaminase-1 [TGM1]) in response to LPS challenge and radial organ complex regeneration in sea cucumbers. Overall, this pioneering systematic identification, annotation, and characterization of lncRNAs in echinoderms paves the way for similar studies and future genetic, genomic, and evolutionary research in non-model species.

  20. Sequencing and Characterization of Novel PII Signaling Protein Gene in Microalga Haematococcus pluvialis.

    PubMed

    Ma, Ruijuan; Li, Yan; Lu, Yinghua

    2017-10-11

    The PII signaling protein is a key protein for controlling nitrogen assimilatory reactions in most organisms, but little information has been reported on the PII proteins of the green microalga Haematococcus pluvialis. Since H. pluvialis cells can produce a large amount of astaxanthin upon nitrogen starvation, its PII protein may represent an important factor in the elevated production of Haematococcus astaxanthin. This study identified and isolated the coding gene (HpGLB1) from this microalga. The full length of HpGLB1 was 1222 bp, including a 621 bp coding sequence (CDS), a 103 bp 5' untranslated region (5' UTR), and a 498 bp 3' untranslated region (3' UTR). The CDS could encode a protein of 206 amino acids (HpPII). Its calculated molecular weight (Mw) was 22.4 kDa and the theoretical isoelectric point was 9.53. When H. pluvialis cells were exposed to nitrogen starvation, HpGLB1 expression increased 2.46 times in 48 h, concomitant with the rise in astaxanthin content. This study also used phylogenetic analysis to show that HpPII is homologous to the PII proteins of other green microalgae. The results form a fundamental basis for future study of HpPII and its potential physiological function in Haematococcus astaxanthin biosynthesis.

  1. The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.

    PubMed

    Ponce de Leon, Miguel; de Miranda, Antonio Basilio; Alvarez-Valin, Fernando; Carels, Nicolas

    2014-01-01

    For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional

  2. Receptor Activity-modifying Protein-directed G Protein Signaling Specificity for the Calcitonin Gene-related Peptide Family of Receptors.

    PubMed

    Weston, Cathryn; Winfield, Ian; Harris, Matthew; Hodgson, Rose; Shah, Archna; Dowell, Simon J; Mobarec, Juan Carlos; Woodlock, David A; Reynolds, Christopher A; Poyner, David R; Watkins, Harriet A; Ladds, Graham

    2016-10-14

    The calcitonin gene-related peptide (CGRP) family of G protein-coupled receptors (GPCRs) is formed through the association of the calcitonin receptor-like receptor (CLR) and one of three receptor activity-modifying proteins (RAMPs). Binding of one of the three peptide ligands, CGRP, adrenomedullin (AM), and intermedin/adrenomedullin 2 (AM2), is well known to result in a Gαs-mediated increase in cAMP. Here we used modified yeast strains that couple receptor activation to cell growth, via chimeric yeast/Gα subunits, and HEK-293 cells to characterize the effect of different RAMP and ligand combinations on this pathway. We not only demonstrate functional couplings to both Gαs and Gαq but also identify a Gαi component to CLR signaling in both yeast and HEK-293 cells, which is absent in HEK-293S cells. We show that the CGRP family of receptors displays both ligand- and RAMP-dependent signaling bias among the Gαs, Gαi, and Gαq/11 pathways. The results are discussed in the context of RAMP interactions probed through molecular modeling and molecular dynamics simulations of the RAMP-GPCR-G protein complexes. This study further highlights the importance of RAMPs to CLR pharmacology and to bias in general, as well as identifying the importance of choosing an appropriate model system for the study of GPCR pharmacology. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  3. Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation

    PubMed Central

    Pujar, Shashikant; O’Leary, Nuala A; Farrell, Catherine M; Mudge, Jonathan M; Wallin, Craig; Diekhans, Mark; Barnes, If; Bennett, Ruth; Berry, Andrew E; Cox, Eric; Davidson, Claire; Goldfarb, Tamara; Gonzalez, Jose M; Hunt, Toby; Jackson, John; Joardar, Vinita; Kay, Mike P; Kodali, Vamsi K; McAndrews, Monica; McGarvey, Kelly M; Murphy, Michael; Rajput, Bhanu; Rangwala, Sanjida H; Riddick, Lillian D; Seal, Ruth L; Webb, David; Zhu, Sophia; Aken, Bronwen L; Bult, Carol J; Frankish, Adam; Pruitt, Kim D

    2018-01-01

    The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. PMID:29126148

  4. Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

    PubMed

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-04-21

    To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease
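
    The back-translation step described above (protein sequence to codon-engineered gene, using a codon usage table with a minimum usage threshold) can be sketched as follows. This is a minimal illustration under stated assumptions, not Gene Composer's actual algorithm: the table is a hypothetical, truncated example with placeholder frequencies, the function name and `min_usage` parameter are invented here, and the sketch simply picks the most frequent allowed codon without handling G:C content or other sequence features.

```python
# Hypothetical, truncated codon usage table: amino acid -> {codon: relative frequency}.
# The frequencies are illustrative placeholders, not measured values for any organism.
CODON_USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.50, "TTA": 0.13, "TTG": 0.13, "CTT": 0.10, "CTC": 0.10, "CTA": 0.04},
    "*": {"TAA": 0.60, "TGA": 0.30, "TAG": 0.10},
}


def back_translate(protein: str, usage=CODON_USAGE, min_usage: float = 0.10) -> str:
    """Back-translate a protein into DNA using the most frequent codon
    whose relative usage is at least `min_usage` (assumed selection rule)."""
    codons = []
    for aa in protein.upper():
        if aa not in usage:
            raise KeyError(f"no codon usage data for residue {aa!r}")
        # Discard rare codons that fall below the usage threshold.
        allowed = {c: f for c, f in usage[aa].items() if f >= min_usage}
        if not allowed:
            raise ValueError(f"no codon for {aa!r} meets min_usage={min_usage}")
        codons.append(max(allowed, key=allowed.get))
    return "".join(codons)


gene = back_translate("MKL*")
print(gene)
```

    Always choosing the single most frequent codon is the simplest possible rule; a tool that also targets G:C content or avoids unwanted motifs would need to choose among the allowed synonymous codons instead.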

  5. Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

    PubMed Central

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-01-01

    Background: To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results: An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion: We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis

  6. Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression.

    PubMed

    Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

    2006-08-08

    The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (P(gpdh-1)::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using P(gpdh-1)::GFP expression as a phenotype, we screened approximately 16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function.

  7. Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression

    PubMed Central

    Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

    2006-01-01

    The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (Pgpdh-1::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using Pgpdh-1::GFP expression as a phenotype, we screened ≈16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function. PMID:16880390

  8. New Insights into Protein Kinase B/Akt Signaling: Role of Localized Akt Activation and Compartment-Specific Target Proteins for the Cellular Radiation Response.

    PubMed

    Szymonowicz, Klaudia; Oeck, Sebastian; Malewicz, Nathalie M; Jendrossek, Verena

    2018-03-18

    Genetic alterations driving aberrant activation of the survival kinase Protein Kinase B (Akt) are observed with high frequency during malignant transformation and cancer progression. Oncogenic gene mutations coding for the upstream regulators of Akt, e.g., growth factor receptors, RAS and phosphatidylinositol-3-kinase (PI3K), or for one of the three Akt isoforms, as well as loss of the tumor suppressor Phosphatase and Tensin Homolog on Chromosome Ten (PTEN), lead to constitutive activation of Akt. By activating Akt, these genetic alterations not only promote growth, proliferation and malignant behavior of cancer cells by phosphorylation of various downstream signaling molecules and signaling nodes but can also contribute to chemo- and radioresistance in many types of tumors. Here we review current knowledge on the mechanisms dictating Akt's activation and target selection, including the involvement of miRNAs, with a focus on compartmentalization of the signaling network. Moreover, we discuss recent advances in the cross-talk with the DNA damage response, highlighting nuclear Akt target proteins with potential involvement in the regulation of DNA double strand break repair.

  9. Radiation increases the activity of oncolytic adenovirus cancer gene therapy vectors that overexpress the ADP (E3-11.6K) protein.

    PubMed

    Toth, Karoly; Tarakanova, Vera; Doronin, Konstantin; Ward, Peter; Kuppuswamy, Mohan; Locke, Jacob E; Dawson, Julie E; Kim, Han J; Wold, William S M

    2003-03-01

    We have described three potential adenovirus type 5 (Ad5)-based replication-competent cancer gene therapy vectors named KD1, KD3, and VRX-007. All three vectors overexpress an Ad5 protein named Adenovirus Death Protein (ADP, also named E3-11.6 K protein). ADP is required for efficient lysis of Ad5-infected cells and spread of virus from cell to cell, and thus its overexpression increases the oncolytic activity of the vectors. KD1 and KD3 contain mutations in the Ad5 E1A gene that knock out binding of the E1A proteins to cellular p300/CBP and pRB; these mutations allow KD1 and KD3 to grow well in cancer cells but not in normal cells. VRX-007 has wild-type E1A. Here we report that radiation increases the oncolytic activity of KD1, KD3, and VRX-007. This increased activity was observed in cultured cells, and it was not because of radiation-induced replication of the vectors. The combination of radiation plus KD3 suppressed the growth of A549 lung adenocarcinoma xenografts in nude mice more efficiently than radiation alone or KD3 alone. The combination of ADP-overexpressing vectors and radiation may have potential in treating cancer.

  10. Phylogenetic analysis of mitochondrial protein coding genes confirms the reciprocal paraphyly of Hexapoda and Crustacea

    PubMed Central

    Carapelli, Antonio; Liò, Pietro; Nardi, Francesco; van der Wath, Elizabeth; Frati, Francesco

    2007-01-01

    Background: The phylogeny of Arthropoda is still a matter of harsh debate among systematists, and significant disagreement exists between morphological and molecular studies. In particular, while the taxon joining hexapods and crustaceans (the Pancrustacea) is now widely accepted among zoologists, the relationships among its basal lineages, and particularly the supposed reciprocal paraphyly of Crustacea and Hexapoda, continue to represent a challenge. Several genes, as well as different molecular markers, have been used to tackle this problem in molecular phylogenetic studies, with the mitochondrial DNA being one of the molecules of choice. In this study, we have assembled the largest data set available so far for Pancrustacea, consisting of 100 complete (or almost complete) sequences of mitochondrial genomes. After removal of unalignable sequence regions and highly rearranged genomes, we used nucleotide and inferred amino acid sequences of the 13 protein coding genes to reconstruct the phylogenetic relationships among major lineages of Pancrustacea. The analysis was performed with Bayesian inference, and for the amino acid sequences a new, Pancrustacea-specific, matrix of amino acid replacement was developed and used in this study. Results: Two largely congruent trees were obtained from the analysis of nucleotide and amino acid datasets. In particular, the best tree obtained based on the new matrix of amino acid replacement (MtPan) was preferred over those obtained using previously available matrices (MtArt and MtRev) because of its higher likelihood score. The most remarkable result is the reciprocal paraphyly of Hexapoda and Crustacea, with some lineages of crustaceans (namely the Malacostraca, Cephalocarida and, possibly, the Branchiopoda) being more closely related to the Insecta s.s. (Ectognatha) than two orders of basal hexapods, Collembola and Diplura. Our results confirm that the mitochondrial genome, unlike analyses based on morphological data or nuclear

  11. Protein and gene structure of a blue laccase from Pleurotus ostreatus.

    PubMed Central

    Giardina, P; Palmieri, G; Scaloni, A; Fontanella, B; Faraco, V; Cennamo, G; Sannia, G

    1999-01-01

    A new laccase isoenzyme (POXA1b, where POX is phenol oxidase), produced by Pleurotus ostreatus in cultures supplemented with copper sulphate, has been purified and fully characterized. The main characteristics of this protein (molecular mass in native and denaturing conditions, pI and catalytic properties) are almost identical to the previously studied laccase POXA1w. However, POXA1b contains four copper atoms per molecule instead of one copper, two zinc and one iron atom per molecule of POXA1w. Furthermore, POXA1b shows an unusually high stability at alkaline pH. The gene and cDNA coding for POXA1b have been cloned and sequenced. The gene coding sequence contains 1599 bp, interrupted by 15 introns. Comparison of the structure of the poxa1b gene with the two previously studied P. ostreatus laccase genes (pox1 and poxc) suggests that these genes belong to two different subfamilies. The amino acid sequence of POXA1b deduced from the cDNA sequence has been almost completely verified by means of matrix-assisted laser desorption ionization MS. It has been demonstrated that three out of six putative glycosylation sites are post-translationally modified and the structure of the bound glycosidic moieties has been determined, whereas two other putative glycosylation sites are unmodified. PMID:10417329

  12. Tissue-specific expression of the gene coding for human Clara cell 10-kD protein, a phospholipase A2-inhibitory protein.

    PubMed Central

    Peri, A; Cordella-Miele, E; Miele, L; Mukherjee, A B

    1993-01-01

    Clara cell 10-kD protein (cc10kD), a secretory phospholipase A2 inhibitor, is suggested to be the human counterpart of rabbit uteroglobin (UG). Because cc10kD is expressed constitutively at a very high level in the human respiratory epithelium, the 5' region of its gene may be useful in achieving organ-specific expression of recombinant DNA in gene therapy of diseases such as cystic fibrosis. However, it is important to establish the tissue-specific expression of this gene before designing gene transfer experiments. Since the UG gene in the rabbit is expressed in many other organs besides the lung and the endometrium, we investigated the organ and tissue specificity of human cc10kD gene expression using polymerase chain reaction, nucleotide sequence analysis, immunofluorescence, and Northern blotting. Our results indicate that, in addition to the lung, cc10kD is expressed in several nonrespiratory organs, with a distribution pattern very similar, if not identical, to that of UG in the rabbit. These results underscore the necessity for more detailed analyses of the 5' region of the human cc10kD gene before its usefulness in gene therapy could be fully assessed. These data also suggest that cc10kD and UG may have similar physiological function(s). PMID:8227325

  13. Genome-Wide Identification and Comprehensive Expression Profiling of Ribosomal Protein Small Subunit (RPS) Genes and their Comparative Analysis with the Large Subunit (RPL) Genes in Rice

    PubMed Central

    Saha, Anusree; Das, Shubhajit; Moin, Mazahar; Dutta, Mouboni; Bakshi, Achala; Madhav, M. S.; Kirti, P. B.

    2017-01-01

    Ribosomal proteins (RPs) are indispensable in ribosome biogenesis and protein synthesis, and play a crucial role in diverse developmental processes. Our previous studies on Ribosomal Protein Large subunit (RPL) genes provided insights into their stress responsive roles in rice. In the present study, we have explored the developmental and stress regulated expression patterns of Ribosomal Protein Small (RPS) subunit genes for their differential expression in a spatiotemporal and stress dependent manner. We have also performed an in silico analysis of gene structure, cis-elements in upstream regulatory regions, protein properties and phylogeny. Expression studies of the 34 RPS genes in 13 different tissues of rice covering major growth and developmental stages revealed that their expression was substantially elevated, mostly in shoots and leaves, indicating their possible involvement in the development of vegetative organs. The majority of the RPS genes have manifested significant expression under all abiotic stress treatments with ABA, PEG, NaCl, and H2O2. Infection with important rice pathogens, Xanthomonas oryzae pv. oryzae (Xoo) and Rhizoctonia solani, also induced the up-regulation of several of the RPS genes. RPS4, 13a, 18a, and 4a have shown higher transcript levels under all the abiotic stresses, whereas RPS4 is up-regulated in both the biotic stress treatments. The information obtained from the present investigation would be useful in appreciating the possible stress-regulatory attributes of the genes coding for rice ribosomal small subunit proteins apart from their functions as house-keeping proteins. A detailed functional analysis of independent genes is required to study their roles in stress tolerance and generating stress-tolerant crops. PMID:28966624

  14. Mutational analysis of genes coding for cell surface proteins in colorectal cancer cell lines reveal novel altered pathways, druggable mutations and mutated epitopes for targeted therapy

    PubMed Central

    Correa, Bruna R.; Bettoni, Fabiana; Koyama, Fernanda C.; Navarro, Fabio C.P.; Perez, Rodrigo O.; Mariadason, John; Sieber, Oliver M.; Strausberg, Robert L.; Simpson, Andrew J.G.; Jardim, Denis L.F.; Reis, Luiz Fernando L.; Parmigiani, Raphael B.; Galante, Pedro A.F.; Camargo, Anamaria A.

    2014-01-01

    We carried out a mutational analysis of 3,594 genes coding for cell surface proteins (Surfaceome) in 23 colorectal cancer cell lines, searching for new altered pathways, druggable mutations and mutated epitopes for targeted therapy in colorectal cancer. A total of 3,944 somatic non-synonymous substitutions and 595 InDels, occurring in 2,061 (57%) Surfaceome genes, were catalogued. We identified 48 genes not previously described as mutated in colorectal tumors in the TCGA database, including genes that are mutated and expressed in >10% of the cell lines (SEMA4C, FGFRL1, PKD1, FAM38A, WDR81, TMEM136, SLC36A1, SLC26A6, IGFLR1). Analysis of these genes uncovered important roles for FGF and SEMA4 signaling in colorectal cancer with possible therapeutic implications. We also found that cell lines express on average 11 druggable mutations, including frequent mutations (>20%) in the receptor tyrosine kinases AXL and EPHA2, which have not been previously considered as potential targets for colorectal cancer. Finally, we identified 82 cell surface mutated epitopes; however, expression of only 30% of these epitopes was detected in our cell lines. Notwithstanding, 92% of these epitopes were expressed in cell lines with the mutator phenotype, opening new avenues for the use of “general” immune checkpoint drugs in this subset of patients. PMID:25193853

  15. Bioinformatics identification and transcript profile analysis of the mitogen-activated protein kinase gene family in the diploid woodland strawberry Fragaria vesca

    PubMed Central

    Wei, Wei; Chai, Zhuangzhuang; Xie, Yinge; Gao, Kuan; Cui, Mengyuan; Jiang, Ying

    2017-01-01

    Mitogen-activated protein kinases (MAPKs) play essential roles in mediating biotic and abiotic stress responses in plants. However, the MAPK gene family in strawberry has not been systematically characterized. Here, we performed a genome-wide survey and identified 12 MAPK genes in the Fragaria vesca genome. Protein domain analysis indicated that all FvMAPKs have typical protein kinase domains. Sequence alignments and phylogenetic analysis classified the FvMAPK genes into four different groups. Conserved motif and exon-intron organization supported the evolutionary relationships inferred from the phylogenetic analysis. Analysis of the stress-related cis-regulatory element in the promoters and subcellular localization predictions of FvMAPKs were also performed. Gene transcript profile analysis showed that the majority of the FvMAPK genes were ubiquitously transcribed in strawberry leaves after Podosphaera aphanis inoculation and after treatment with cold, heat, drought, salt and the exogenous hormones abscisic acid, ethephon, methyl jasmonate, and salicylic acid. RT-qPCR showed that six selected FvMAPK genes comprehensively responded to various stimuli. Additionally, interaction networks revealed that the crucial signaling transduction controlled by FvMAPKs may be involved in the biotic and abiotic stress responses. Our results may provide useful information for future research on the function of the MAPK gene family and the genetic improvement of strawberry resistance to environmental stresses. PMID:28562633

  16. IGF-1 modulates gene expression of proteins involved in inflammation, cytoskeleton, and liver architecture.

    PubMed

    Lara-Diaz, V J; Castilla-Cortazar, I; Martín-Estal, I; García-Magariño, M; Aguirre, G A; Puche, J E; de la Garza, R G; Morales, L A; Muñoz, U

    2017-05-01

    Even though the liver synthesizes most of circulating IGF-1, it lacks its receptor under physiological conditions. However, according to previous studies, a damaged liver expresses the receptor. For this reason, herein, we examine hepatic histology and expression of genes encoding proteins of the cytoskeleton, extracellular matrix, and cell-cell molecules and inflammation-related proteins. A partial IGF-1 deficiency murine model was used to investigate IGF-1's effects on liver by comparing wild-type controls, heterozygous igf1+/-, and heterozygous mice treated with IGF-1 for 10 days. Histology, microarray for mRNA gene expression, RT-qPCR, and lipid peroxidation were assessed. Microarray analyses revealed significant underexpression of igf1 in heterozygous mice compared to control mice, restoring normal liver expression after treatment, which then normalized its circulating levels. IGF-1 receptor mRNA was overexpressed in Hz mice liver, while treated mice displayed a similar expression to that of the controls. Heterozygous mice showed overexpression of several genes encoding proteins related to inflammatory and acute-phase proteins and underexpression or overexpression of genes which coded for extracellular matrix, cytoskeleton, and cell junction components. Histology revealed an altered hepatic architecture. In addition, liver oxidative damage was found increased in the heterozygous group. The mere IGF-1 partial deficiency is associated with relevant alterations of the hepatic architecture and expression of genes involved in cytoskeleton, hepatocyte polarity, cell junctions, and extracellular matrix proteins. Moreover, it induces hepatic expression of the IGF-1 receptor and elevated acute-phase and inflammation mediators, which all resulted in liver oxidative damage.

  17. The rat alpha-tropomyosin gene generates a minimum of six different mRNAs coding for striated, smooth, and nonmuscle isoforms by alternative splicing.

    PubMed Central

    Wieczorek, D F; Smith, C W; Nadal-Ginard, B

    1988-01-01

    Tropomyosin (TM), a ubiquitous protein, is a component of the contractile apparatus of all cells. In nonmuscle cells, it is found in stress fibers, while in sarcomeric and nonsarcomeric muscle, it is a component of the thin filament. Several different TM isoforms specific for nonmuscle cells and different types of muscle cell have been described. As for other contractile proteins, it was assumed that smooth, striated, and nonmuscle isoforms were each encoded by different sets of genes. Through the use of S1 nuclease mapping, RNA blots, and 5' extension analyses, we showed that the rat alpha-TM gene, whose expression was until now considered to be restricted to muscle cells, generates many different tissue-specific isoforms. The promoter of the gene appears to be very similar to other housekeeping promoters in both its pattern of utilization, being active in most cell types, and its lack of any canonical sequence elements. The rat alpha-TM gene is split into at least 13 exons, 7 of which are alternatively spliced in a tissue-specific manner. This gene arrangement, which also includes two different 3' ends, generates a minimum of six different mRNAs each with the capacity to code for a different protein. These distinct TM isoforms are expressed specifically in nonmuscle and smooth and striated (cardiac and skeletal) muscle cells. The tissue-specific expression and developmental regulation of these isoforms is, therefore, produced by alternative mRNA processing. Moreover, structural and sequence comparisons among TM genes from different phyla suggest that alternative splicing is evolutionarily a very old event that played an important role in gene evolution and might have appeared concomitantly with or even before constitutive splicing. PMID:3352602

  18. Saccharomyces cerevisiae ribosomal protein L37 is encoded by duplicate genes that are differentially expressed.

    PubMed

    Tornow, J; Santangelo, G M

    1994-06-01

    A duplicate copy of the RPL37A gene (encoding ribosomal protein L37) was cloned and sequenced. The coding region of RPL37B is very similar to that of RPL37A, with only one conservative amino-acid difference. However, the intron and flanking sequences of the two genes are extremely dissimilar. Disruption experiments indicate that the two loci are not functionally equivalent: disruption of RPL37B was insignificant, but disruption of RPL37A severely impaired the growth rate of the cell. When both RPL37 loci are disrupted, the cell is unable to grow at all, indicating that rpL37 is an essential protein. The functional disparity between the two RPL37 loci could be explained by differential gene expression. The results of two experiments support this idea: gene fusion of RPL37A to a reporter gene resulted in six-fold higher mRNA levels than were generated by the same reporter gene fused to RPL37B, and a modest increase in gene dosage of RPL37B overcame the lack of a functional RPL37A gene.

  19. Requirements for selective recruitment of Ets proteins and activation of mb-1/Ig-α gene transcription by Pax-5 (BSAP)

    PubMed Central

    Maier, Holly; Ostraat, Rachel; Parenti, Sarah; Fitzsimmons, Daniel; Abraham, Lawrence J.; Garvie, Colin W.; Hagman, James

    2003-01-01

    Pax-5, a member of the paired domain family of transcription factors, is a key regulator of B lymphocyte-specific transcription and differentiation. A major target of Pax-5-mediated activation is the mb-1 gene, which encodes the essential transmembrane signaling protein Ig-α. Pax-5 recruits three members of the Ets family of transcription factors: Ets-1, Fli-1 and GABPα (with GABPβ1), to assemble ternary complexes on the mb-1 promoter in vitro. Using the Pax-5:Ets-1:DNA crystal structure as a guide, we defined amino acid requirements for transcriptional activation of endogenous mb-1 genes using a novel cell-based assay. Mutations in the β-hairpin/β-turn of the DNA-binding domain of Pax-5 demonstrated its importance for DNA sequence recognition and activation of mb-1 transcription. Mutations of amino acids contacting Ets-1 in the crystal structure reduced or blocked mb-1 promoter activation. One of these mutations, Q22A, resulted in greatly reduced mb-1 gene transcript levels, concurrent with the loss of its ability to recruit Fli-1 to bind the promoter in vitro. In contrast, the mutation had no effect on recruitment of the related Ets protein GABPα (with GABPβ1). These data further define requirements for Pax-5 function in vivo and reveal the complexity of interactions required for cooperative partnerships between transcription factors. PMID:14500810

  20. A Glycine Riboswitch in Streptococcus pyogenes Controls Expression of a Sodium:Alanine Symporter Family Protein Gene.

    PubMed

    Khani, Afsaneh; Popp, Nicole; Kreikemeyer, Bernd; Patenge, Nadja

    2018-01-01

    Regulatory RNAs play important roles in the control of bacterial gene expression. In this study, we investigated gene expression regulation by a putative glycine riboswitch located in the 5'-untranslated region of a sodium:alanine symporter family (SAF) protein gene in the group A Streptococcus pyogenes serotype M49 strain 591. Glycine-dependent gene expression mediated by riboswitch activity was studied using a luciferase reporter gene system. Maximal reporter gene expression was observed in the absence of glycine and in the presence of low glycine concentrations. Differences in glycine-dependent gene expression were not based on differential promoter activity. Expression of the SAF protein gene and the downstream putative cation efflux protein gene was investigated in wild-type bacteria by RT-qPCR transcript analyses. During growth in the presence of glycine (≥1 mM), expression of both genes was downregulated. Northern blot analyses revealed premature transcription termination in the presence of high glycine concentrations. Growth in the presence of 0.1 mM glycine led to the production of a full-length transcript. Furthermore, stability of the SAF protein gene transcript was drastically reduced in the presence of glycine. We conclude that the putative glycine riboswitch in S. pyogenes serotype M49 strain 591 represses expression of the SAF protein gene and the downstream putative cation efflux protein gene in the presence of high glycine concentrations. Sequence and secondary structure comparisons indicated that the streptococcal riboswitch belongs to the class of tandem aptamer glycine riboswitches.

  1. Combining random gene fission and rational gene fusion to discover near-infrared fluorescent protein fragments that report on protein-protein interactions.

    PubMed

    Pandey, Naresh; Nobles, Christopher L; Zechiedrich, Lynn; Maresso, Anthony W; Silberg, Jonathan J

    2015-05-15

    Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein-protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein-protein interactions within whole animals.

  2. FunGene: the functional gene pipeline and repository.

    PubMed

    Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

    2013-01-01

    Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.

  3. Differential conservation of transcriptional domains of mammalian Prophet of Pit-1 proteins revealed by structural studies of the bovine gene and comparative functional analysis of the protein.

    PubMed

    Showalter, Aaron D; Smith, Timothy P L; Bennett, Gary L; Sloop, Kyle W; Whitsett, Julie A; Rhodes, Simon J

    2002-05-29

    The Prophet of Pit-1 (PROP1) gene encodes a paired class homeodomain transcription factor that is exclusively expressed in the developing mammalian pituitary gland. PROP1 function is essential for anterior pituitary organogenesis, and heritable mutations in the gene are associated with combined pituitary hormone deficiency in human patients and animals. By cloning the bovine PROP1 gene and by comparative analysis, we demonstrate that the homeodomains and carboxyl termini of mammalian PROP1 proteins are highly conserved while the amino termini are diverged. Whereas the carboxyl termini of the human and bovine PROP1 proteins contain potent transcriptional activation domains, the amino termini and homeodomains have repressive activities. The bovine PROP1 gene has four exons and three introns and maps to a region of chromosome seven carrying a quantitative trait locus affecting ovulation rate. Two alleles of the bovine gene were found that encode distinct protein products with different DNA binding and transcriptional activities. These experiments demonstrate that mammalian PROP1 genes encode proteins with complex regulatory capacities and that modest changes in protein sequence can significantly alter the activity of this pituitary developmental transcription factor.

  4. Opposite nuclear level and binding activity of STAT5B and STAT3 proteins with rat haptoglobin gene under normal and turpentine induced acute phase conditions.

    PubMed

    Grigorov, I; Lazić, T; Cvetković, I; Milosavljević, T; Petrović, M

    2001-01-01

    Transcription of the rat gene encoding haptoglobin (Hp) is highly induced during the acute phase (AP) response, which has previously been shown to be mediated by the inducible STAT3 member of the Signal Transducer and Activator of Transcription (STAT) protein family. In this study, we observed that under normal, but not turpentine-induced AP, conditions, another member of the STAT protein family, STAT5b, is expressed and binds to the hormone regulatory element (HRE) of the rat Hp gene. We found that the nuclear amounts of constitutively active STAT5b in rat liver decreased significantly with time of turpentine treatment, as opposed to those of cytosolic STAT5b, suggesting possible export of constitutive STAT5b from the nucleus. Nuclear accumulation and binding of inducible STAT3 proteins to the rat Hp gene HRE following turpentine treatment implied that STAT5b negatively regulates Hp gene expression under normal conditions.

  5. Biotin protein ligase from Corynebacterium glutamicum: role for growth and L-lysine production.

    PubMed

    Peters-Wendisch, P; Stansen, K C; Götker, S; Wendisch, V F

    2012-03-01

    Corynebacterium glutamicum is a biotin auxotrophic Gram-positive bacterium that is used for large-scale production of amino acids, especially of L-glutamate and L-lysine. It is known that biotin limitation triggers L-glutamate production and that L-lysine production can be increased by enhancing the activity of pyruvate carboxylase, one of two biotin-dependent proteins of C. glutamicum. The gene cg0814 (accession number YP_225000) has been annotated to code for putative biotin protein ligase BirA, but the protein has not yet been characterized. A discontinuous enzyme assay of biotin protein ligase activity was established using a 105aa peptide corresponding to the carboxyterminus of the biotin carboxylase/biotin carboxyl carrier protein subunit AccBC of the acetyl CoA carboxylase from C. glutamicum as acceptor substrate. Biotinylation of this biotin acceptor peptide was revealed with crude extracts of a strain overexpressing the birA gene and was shown to be ATP dependent. Thus, birA from C. glutamicum codes for a functional biotin protein ligase (EC 6.3.4.15). The gene birA from C. glutamicum was overexpressed and the transcriptome was compared with the control strain revealing no significant gene expression changes of the bio-genes. However, biotin protein ligase overproduction increased the level of the biotin-containing protein pyruvate carboxylase and entailed a significant growth advantage in glucose minimal medium. Moreover, birA overexpression resulted in a twofold higher L-lysine yield on glucose as compared with the control strain.

  6. Matrix-specific protein kinase A signaling regulates p21 activated kinase activation by flow in endothelial cells

    PubMed Central

    Funk, Steven Daniel; Yurdagul, Arif; Green, Jonette M.; Jhaveri, Krishna A.; Schwartz, Martin Alexander; Orr, A. Wayne

    2010-01-01

    Rationale: Atherosclerosis is initiated by blood flow patterns that activate inflammatory pathways in endothelial cells. Activation of inflammatory signaling by fluid shear stress is highly dependent on the composition of the subendothelial extracellular matrix. The basement membrane proteins laminin and collagen found in normal vessels suppress flow-induced p21 activated kinase (PAK) and NF-κB activation. By contrast, the provisional matrix proteins fibronectin and fibrinogen found in wounded or inflamed vessels support flow-induced PAK and NF-κB activation. PAK mediates both flow-induced permeability and matrix-specific activation of NF-κB. Objective: To elucidate the mechanisms regulating matrix-specific PAK activation. Methods and Results: We now show that matrix composition does not affect the upstream pathway by which flow activates PAK (integrin activation, Rac). Instead, basement membrane proteins enhance flow-induced protein kinase A (PKA) activation, which suppresses PAK. Inhibiting PKA restored flow-induced PAK and NF-κB activation in cells on basement membrane proteins, whereas stimulating PKA inhibited flow-induced activation of inflammatory signaling in cells on fibronectin. PKA suppressed inflammatory signaling through PAK inhibition. Activating PKA by injection of the PGI2 analog iloprost reduced PAK activation and inflammatory gene expression at sites of disturbed flow in vivo, whereas inhibiting PKA by PKI injection enhanced PAK activation and inflammatory gene expression. Inhibiting PAK prevented the enhancement of inflammatory gene expression by PKI. Conclusions: Basement membrane proteins inhibit inflammatory signaling in endothelial cells via PKA-dependent inhibition of PAK. PMID:20224042

  7. Network of proteins, enzymes and genes linked to biomass degradation shared by Trichoderma species.

    PubMed

    Horta, Maria Augusta Crivelente; Filho, Jaire Alves Ferreira; Murad, Natália Faraj; de Oliveira Santos, Eidy; Dos Santos, Clelton Aparecido; Mendes, Juliano Sales; Brandão, Marcelo Mendes; Azzoni, Sindelia Freitas; de Souza, Anete Pereira

    2018-01-22

    Understanding relationships between genes responsible for enzymatic hydrolysis of cellulose and synergistic reactions is fundamental for improving biomass biodegradation technologies. To reveal synergistic reactions, the transcriptome, exoproteome, and enzymatic activities of extracts from Trichoderma harzianum, Trichoderma reesei and Trichoderma atroviride under biodegradation conditions were examined. This work revealed co-regulatory networks across carbohydrate-active enzyme (CAZy) genes and secreted proteins in extracts. A set of 80 proteins and respective genes that might correspond to a common system for biodegradation from the studied species were evaluated to elucidate new co-regulated genes. Differences such as one unique base pair between fungal genomes might influence enzyme-substrate binding sites and alter fungal gene expression responses, explaining the enzymatic activities specific to each species observed in the corresponding extracts. These differences are also responsible for the different architectures observed in the co-expression networks.

  8. Novel Gal3 proteins showing altered Gal80p binding cause constitutive transcription of Gal4p-activated genes in Saccharomyces cerevisiae.

    PubMed Central

    Blank, T E; Woods, M P; Lebo, C M; Xin, P; Hopper, J E

    1997-01-01

    Gal4p-mediated activation of galactose gene expression in Saccharomyces cerevisiae normally requires both galactose and the activity of Gal3p. Recent evidence suggests that in cells exposed to galactose, Gal3p binds to and inhibits Gal80p, an inhibitor of the transcriptional activator Gal4p. Here, we report on the isolation and characterization of novel mutant forms of Gal3p that can induce Gal4p activity independently of galactose. Five mutant GAL3(c) alleles were isolated by using a selection demanding constitutive expression of a GAL1 promoter-driven HIS3 gene. This constitutive effect is not due to overproduction of Gal3p. The level of constitutive GAL gene expression in cells bearing different GAL3(c) alleles varies over more than a fourfold range and increases in response to galactose. Utilizing glutathione S-transferase-Gal3p fusions, we determined that the mutant Gal3p proteins show altered Gal80p-binding characteristics. The Gal3p mutant proteins differ in their requirements for galactose and ATP for their Gal80p-binding ability. The behavior of the novel Gal3p proteins provides strong support for a model wherein galactose causes an alteration in Gal3p that increases either its ability to bind to Gal80p or its access to Gal80p. With the Gal3p-Gal80p interaction being a critical step in the induction process, the Gal3p proteins constitute an important new reagent for studying the induction mechanism through both in vivo and in vitro methods. PMID:9111326

  9. Detecting Adaptation in Protein-Coding Genes Using a Bayesian Site-Heterogeneous Mutation-Selection Codon Substitution Model.

    PubMed

    Rodrigue, Nicolas; Lartillot, Nicolas

    2017-01-01

    Codon substitution models have traditionally attempted to uncover signatures of adaptation within protein-coding genes by contrasting the rates of synonymous and non-synonymous substitutions. Another modeling approach, known as the mutation-selection framework, attempts to explicitly account for selective patterns at the amino acid level, with some approaches allowing for heterogeneity in these patterns across codon sites. Under such a model, substitutions at a given position occur at the neutral or nearly neutral rate when they are synonymous, or when they correspond to replacements between amino acids of similar fitness; substitutions from high to low (low to high) fitness amino acids have comparatively low (high) rates. Here, we study the use of such a mutation-selection framework as a null model for the detection of adaptation. Following previous works in this direction, we include a deviation parameter that has the effect of capturing the surplus, or deficit, in non-synonymous rates, relative to what would be expected under a mutation-selection modeling framework that includes a Dirichlet process approach to account for across-codon-site variation in amino acid fitness profiles. We use simulations, along with a few real data sets, to study the behavior of the approach, and find it to have good power with a low false-positive rate. Altogether, we emphasize the potential of recent mutation-selection models in the detection of adaptation, calling for further model refinements as well as large-scale applications. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
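    The rate structure described in this abstract can be illustrated with a minimal sketch. It assumes a Halpern-Bruno-style fixation factor S / (1 - exp(-S)), where S is the difference in scaled log-fitness between the target and source amino acids; this is a common formulation of mutation-selection models, not necessarily the authors' exact parameterization. The names fixation_factor, substitution_rate, and omega_star are hypothetical and chosen only for this illustration.

```python
import math

def fixation_factor(s):
    """Relative fixation rate S / (1 - exp(-S)); equals 1 in the neutral limit S -> 0.

    Assumption: S is a scaled selection coefficient (difference in log-fitness).
    Intended for moderate |S|; very negative S would overflow math.exp.
    """
    if abs(s) < 1e-8:
        return 1.0
    return s / (1.0 - math.exp(-s))

def substitution_rate(mu, fitness_from, fitness_to, synonymous, omega_star=1.0):
    """Illustrative codon substitution rate under a mutation-selection model.

    mu          -- mutational (neutral) rate between the two codons
    fitness_*   -- scaled log-fitness of the source and target amino acids
    synonymous  -- True if both codons encode the same amino acid
    omega_star  -- hypothetical deviation parameter: values above 1 indicate a
                   surplus of non-synonymous substitutions relative to the null
    """
    if synonymous:
        return mu  # synonymous changes proceed at the neutral rate
    s = fitness_to - fitness_from
    return mu * omega_star * fixation_factor(s)

# Low-to-high fitness replacements are accelerated, high-to-low are slowed.
up = substitution_rate(1.0, 0.0, 2.0, synonymous=False)
down = substitution_rate(1.0, 2.0, 0.0, synonymous=False)
```

    Under this sketch, a deviation parameter of 1 recovers the pure mutation-selection null, so an estimate clearly above 1 would be read as a signal of adaptation beyond what the fitness profiles explain.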

  10. Mechanosensitive channels of Escherichia coli: the MscL gene, protein, and activities

    NASA Technical Reports Server (NTRS)

    Sukharev, S. I.; Blount, P.; Martinac, B.; Kung, C.

    1997-01-01

    Although mechanosensory responses are ubiquitous and diverse, the molecular bases of mechanosensation in most cases remain mysterious. MscL, a mechanosensitive channel of large conductance of Escherichia coli, and its bacterial homologues are the first and currently only channel molecules shown to directly sense mechanical stretch of the membrane. In response to the tension conveyed via the lipid bilayer, MscL increases its open probability by several orders of magnitude. In the present review we describe the identification, cloning, and first sets of biophysical and structural data on this simplest mechanosensory molecule. We discovered a 2.5-nS mechanosensitive conductance in giant E. coli spheroplasts. Using chromatographies to enrich the target and patch clamp to assay the channel activity in liposome-reconstituted fractions, we identified the MscL protein and cloned the mscL gene. MscL comprises 136 amino acid residues (15 kDa), with two highly hydrophobic regions, and resides in the inner membrane of the bacterium. PhoA-fusion experiments indicate that the protein spans the membrane twice with both termini in the cytoplasm. Spectroscopic techniques show that it is highly helical. Expression of MscL tandems and covalent cross-linking suggest that the active channel complex is a homo-hexamer. We have identified several residues, which when deleted or substituted, affect channel kinetics or mechanosensitivity. Although unique when discovered, highly conserved MscL homologues in both gram-negative and gram-positive bacteria have been found, suggesting their ubiquitous importance among bacteria.

  11. Enhancement of protein production via the strong DIT1 terminator and two RNA-binding proteins in Saccharomyces cerevisiae.

    PubMed

    Ito, Yoichiro; Kitagawa, Takao; Yamanishi, Mamoru; Katahira, Satoshi; Izawa, Shingo; Irie, Kenji; Furutani-Seiki, Makoto; Matsuyama, Takashi

    2016-11-15

    Post-transcriptional upregulation is an effective way to increase the expression of transgenes and thus maximize the yields of target chemicals from metabolically engineered organisms. Refractory elements in the 3' untranslated region (UTR) that increase mRNA half-life might be available. In Saccharomyces cerevisiae, several terminator regions have shown activity in increasing the production of proteins by upstream coding genes; among these terminators the DIT1 terminator has the highest activity. Here, we found in Saccharomyces cerevisiae that two resident trans-acting RNA-binding proteins (Nab6p and Pap1p) enhance the activity of the DIT1 terminator through the cis element GUUCG/U within the 3'-UTR. These two RNA-binding proteins could upregulate a battery of cell-wall-related genes. Mutagenesis of the DIT1 terminator improved its activity by a maximum of 500% of that of the standard PGK1 terminator. Further understanding and improvement of this system will facilitate inexpensive and stable production of complicated organism-derived drugs worldwide.

  12. Penicillin-binding protein 4 of Escherichia coli: molecular cloning of the dacB gene, controlled overexpression, and alterations in murein composition.

    PubMed

    Korat, B; Mottl, H; Keck, W

    1991-03-01

    The penicillin-binding protein 4 (PBP4) from Escherichia coli, a DD-carboxypeptidase/DD-endopeptidase, was purified in an enzymatically active form to homogeneity by affinity chromatography on 6-aminopenicillanic acid/Sepharose and heparin/Sepharose. Polyclonal antibodies raised against the pure protein were used to identify and isolate PBP4 overproducing clones from an E. coli expression library, which was established on the basis of a temperature-inducible runaway replication plasmid. Three positive clones were isolated, one of which carried the intact structural gene dacB that codes for PBP4, on a 1.9-kb SmaI-EcoRI fragment, whereas the other two carried truncated forms of this gene. The direction of transcription was determined. The PBP4 overproducing strain, when grown in rich medium, tolerated 160-fold overexpression. After disrupting cells by sonication, the majority (80%) of the overproduced PBP4 was detected in the 100,000 × g supernatant. Southern blotting analysis using the cloned dacB gene as a probe revealed that, in contrast to that described by Takeda et al. (1981), the plasmid pLC18-38 of the Clarke-Carbon collection does not code for PBP4. The overall composition of murein, synthesized in vitro or in vivo by the PBP4 overproducing strain, as determined by high-performance liquid chromatography analysis, suggests that PBP4 is not involved in transpeptidation but exclusively catalyses a DD-carboxypeptidase and DD-endopeptidase reaction.

  13. The secondary structure of the ets domain of human Fli-1 resembles that of the helix-turn-helix DNA-binding motif of the Escherichia coli catabolite gene activator protein.

    PubMed Central

    Liang, H; Olejniczak, E T; Mao, X; Nettesheim, D G; Yu, L; Thompson, C B; Fesik, S W

    1994-01-01

    The ets family of eukaryotic transcription factors is characterized by a conserved DNA-binding domain of approximately 85 amino acids for which the three-dimensional structure is not known. By using multidimensional NMR spectroscopy, we have determined the secondary structure of the ets domain of one member of this gene family, human Fli-1, both in the free form and in a complex with a 16-bp cognate DNA site. The secondary structure of the Fli-1 ets domain consists of three alpha-helices and a short four-stranded antiparallel beta-sheet. This secondary structure arrangement resembles that of the DNA-binding domain of the catabolite gene activator protein of Escherichia coli, as well as those of several eukaryotic DNA-binding proteins including histone H5, HNF-3/fork head, and the heat shock transcription factor. Differences in chemical shifts of backbone resonances and amide exchange rates between the DNA-bound and free forms of the Fli-1 ets domain suggest that the third helix is the DNA recognition helix, as in the catabolite gene activator protein and other structurally related proteins. These results suggest that the ets domain is structurally similar to the catabolite gene activator protein family of helix-turn-helix DNA-binding proteins. PMID:7972119

  14. RNF17 blocks promiscuous activity of PIWI proteins in mouse testes

    PubMed Central

    Wasik, Kaja A.; Tam, Oliver H.; Knott, Simon R.; Falciatori, Ilaria; Hammell, Molly; Vagin, Vasily V.; Hannon, Gregory J.

    2015-01-01

    PIWI proteins and their associated piRNAs protect germ cells from the activity of mobile genetic elements. Two classes of piRNAs—primary and secondary—are defined by their mechanisms of biogenesis. Primary piRNAs are processed directly from transcripts of piRNA cluster loci, whereas secondary piRNAs are generated in an adaptive amplification loop, termed the ping-pong cycle. In mammals, piRNA populations are dynamic, shifting as male germ cells develop. Embryonic piRNAs consist of both primary and secondary species and are mainly directed toward transposons. In meiotic cells, the piRNA population is transposon-poor and largely restricted to primary piRNAs derived from pachytene piRNA clusters. The transition from the embryonic to the adult piRNA pathway is not well understood. Here we show that RNF17 shapes adult meiotic piRNA content by suppressing the production of secondary piRNAs. In the absence of RNF17, ping-pong occurs inappropriately in meiotic cells. Ping-pong initiates piRNA responses against not only transposons but also protein-coding genes and long noncoding RNAs, including genes essential for germ cell development. Thus, the sterility of Rnf17 mutants may be a manifestation of a small RNA-based autoimmune reaction. PMID:26115953

  15. Antiaging Gene Klotho Deficiency Promoted High-Fat Diet-Induced Arterial Stiffening via Inactivation of AMP-Activated Protein Kinase.

    PubMed

    Lin, Yi; Chen, Jianglei; Sun, Zhongjie

    2016-03-01

    Klotho was originally discovered as an aging-suppressor gene. The objective of this study is to investigate whether klotho gene deficiency affects high-fat diet (HFD)-induced arterial stiffening. Heterozygous Klotho-deficient (KL(+/-)) mice and wild-type (WT) littermates were fed on HFD or normal diet. HFD increased pulse wave velocity within 5 weeks in KL(+/-) mice but not in WT mice, indicating that klotho deficiency accelerates and exacerbates HFD-induced arterial stiffening. A greater increase in blood pressure was found in KL(+/-) mice fed on HFD. Protein expressions of phosphorylated AMP-activated protein kinase-α (AMPKα), phosphorylated endothelial nitric oxide synthase (eNOS), and manganese-dependent superoxide dismutase (Mn-SOD) were decreased, whereas protein expressions of collagen I, transforming growth factor-β1, and Runx2 were increased in aortas of KL(+/-) mice fed on HFD. Interestingly, daily injections of an AMPKα activator, 5-aminoimidazole-4-carboxamide-3-ribonucleoside, abolished the increases in pulse wave velocity, blood pressure, and blood glucose in KL(+/-) mice fed on HFD. Treatment with 5-aminoimidazole-4-carboxamide-3-ribonucleoside for 2 weeks not only abolished the downregulation of phosphorylated AMPKα, phosphorylated eNOS, and Mn-SOD levels but also attenuated the increased levels of collagen I, transforming growth factor-β1, Runx2, superoxide, elastic lamellae breaks, and calcification in aortas of KL(+/-) mice fed on HFD. In cultured mouse aortic smooth muscle cells, cholesterol plus KL-deficient serum decreased phosphorylation levels of AMPKα and LKB1 (an important upstream regulator of AMPKα activity) but increased collagen I synthesis, effects that can be eliminated by activation of AMPKα by 5-aminoimidazole-4-carboxamide-3-ribonucleoside. In conclusion, Klotho deficiency promoted HFD-induced arterial stiffening and hypertension via downregulation of AMPKα activity. © 2016 American Heart Association, Inc.

  16. SR proteins in Vertical Integration of Gene Expression from Transcription to RNA Processing to Translation

    PubMed Central

    Zhong, Xiang-Yang; Wang, Pingping; Han, Joonhee; Rosenfeld, Michael G.; Fu, Xiang-Dong

    2009-01-01

    Summary: SR proteins have been studied extensively as a family of RNA binding proteins that participate in both constitutive and regulated pre-mRNA splicing in mammalian cells. However, SR proteins were first discovered as factors that interact with transcriptionally active chromatin. Recent studies have now uncovered properties that connect these once apparently disparate functions, showing that a subset of SR proteins seem to bind directly to the histone 3 tail, play an active role in transcriptional elongation, and co-localize with genes that are engaged in specific intra- and inter-chromosome interactions for coordinated regulation of gene expression in the nucleus. These transcription-related activities are also coupled with a further expansion of putative functions of specific SR protein family members in RNA metabolism downstream of mRNA splicing, from RNA export to stability control to translation. These findings therefore highlight the broader roles of SR proteins in vertical integration of gene expression and provide mechanistic insights into their contributions to genome stability and proper cell cycle progression in higher eukaryotic cells. PMID:19595711

  17. SR proteins in vertical integration of gene expression from transcription to RNA processing to translation.

    PubMed

    Zhong, Xiang-Yang; Wang, Pingping; Han, Joonhee; Rosenfeld, Michael G; Fu, Xiang-Dong

    2009-07-10

    SR proteins have been studied extensively as a family of RNA-binding proteins that participate in both constitutive and regulated pre-mRNA splicing in mammalian cells. However, SR proteins were first discovered as factors that interact with transcriptionally active chromatin. Recent studies have now uncovered properties that connect these once apparently disparate functions, showing that a subset of SR proteins seem to bind directly to the histone 3 tail, play an active role in transcriptional elongation, and colocalize with genes that are engaged in specific intra- and interchromosome interactions for coordinated regulation of gene expression in the nucleus. These transcription-related activities are also coupled with a further expansion of putative functions of specific SR protein family members in RNA metabolism downstream of mRNA splicing, from RNA export to stability control to translation. These findings, therefore, highlight the broader roles of SR proteins in vertical integration of gene expression and provide mechanistic insights into their contributions to genome stability and proper cell-cycle progression in higher eukaryotic cells.

  18. Virus-Like Particles Derived from HIV-1 for Delivery of Nuclear Proteins: Improvement of Production and Activity by Protein Engineering.

    PubMed

    Robert, Marc-André; Lytvyn, Viktoria; Deforet, Francis; Gilbert, Rénald; Gaillet, Bruno

    2017-01-01

    Virus-like particles (VLPs) derived from retroviruses and lentiviruses can be used to deliver recombinant proteins without the fear of causing insertional mutagenesis to the host cell genome. In this study we evaluate the potential of an inducible lentiviral vector packaging cell line for VLP production. The Gag gene from HIV-1 was fused to a gene encoding a selected protein and it was transfected into the packaging cells. Three proteins served as models: the green fluorescent protein and two transcription factors, the cumate transactivator (cTA) of the inducible CR5 promoter and the human Krüppel-like factor 4 (KLF4). The sizes of the VLPs were 120-150 nm in diameter and they were resistant to freeze/thaw cycles. Protein delivery by the VLPs reached up to 100% efficacy in human cells and was well tolerated. Gag-cTA triggered up to 1100-fold gene activation of the reporter gene in comparison to the negative control. Protein engineering was required to detect Gag-KLF4 activity. Thus, insertion of the VP16 transactivation domain increased the activity of the VLPs by eightfold. An additional 2.4-fold enhancement was obtained by inserting a nuclear export signal. In conclusion, our platform produced VLPs capable of efficient protein transfer, and it was shown that protein engineering can be used to improve the activity of the delivered proteins as well as VLP production.

  19. The Homeodomain of PDX-1 Mediates Multiple Protein-Protein Interactions in the Formation of a Transcriptional Activation Complex on the Insulin Promoter

    PubMed Central

    Ohneda, Kinuko; Mirmira, Raghavendra G.; Wang, Juehu; Johnson, Jeffrey D.; German, Michael S.

    2000-01-01

    Activation of insulin gene transcription specifically in the pancreatic β cells depends on multiple nuclear proteins that interact with each other and with sequences on the insulin gene promoter to build a transcriptional activation complex. The homeodomain protein PDX-1 exemplifies such interactions by binding to the A3/4 region of the rat insulin I promoter and activating insulin gene transcription by cooperating with the basic-helix-loop-helix (bHLH) protein E47/Pan1, which binds to the adjacent E2 site. The present study provides evidence that the homeodomain of PDX-1 acts as a protein-protein interaction domain to recruit multiple proteins, including E47/Pan1, BETA2/NeuroD1, and high-mobility group protein I(Y), to an activation complex on the E2A3/4 minienhancer. The transcriptional activity of this complex results from the clustering of multiple activation domains capable of interacting with coactivators and the basal transcriptional machinery. These interactions are not common to all homeodomain proteins: the LIM homeodomain protein Lmx1.1 can also activate the E2A3/4 minienhancer in cooperation with E47/Pan1 but does so through different interactions. Cooperation between Lmx1.1 and E47/Pan1 results not only in the aggregation of multiple activation domains but also in the unmasking of a potent activation domain on E47/Pan1 that is normally silent in non-β cells. While more than one activation complex may be capable of activating insulin gene transcription through the E2A3/4 minienhancer, each is dependent on multiple specific interactions among a unique set of nuclear proteins. PMID:10629047

  20. Identification of a cis-regulatory region of a gene in Arabidopsis thaliana whose induction by dehydration is mediated by abscisic acid and requires protein synthesis.

    PubMed

    Iwasaki, T; Yamaguchi-Shinozaki, K; Shinozaki, K

    1995-05-20

    In Arabidopsis thaliana, the induction of a dehydration-responsive gene, rd22, is mediated by abscisic acid (ABA) but the gene does not include any sequence corresponding to the consensus ABA-responsive element (ABRE), RYACGTGGYR, in its promoter region. The cis-regulatory region of the rd22 promoter was identified by monitoring the expression of beta-glucuronidase (GUS) activity in leaves of transgenic tobacco plants transformed with chimeric gene fusions constructed between 5'-deleted promoters of rd22 and the coding region of the GUS reporter gene. A 67-bp nucleotide fragment corresponding to positions -207 to -141 of the rd22 promoter conferred responsiveness to dehydration and ABA on a non-responsive promoter. The 67-bp fragment contains the sequences of the recognition sites for some transcription factors, such as MYC, MYB, and GT-1. The fact that accumulation of rd22 mRNA requires protein synthesis raises the possibility that the expression of rd22 might be regulated by one of these trans-acting protein factors whose de novo synthesis is induced by dehydration or ABA. Although the structure of the RD22 protein is very similar to that of a non-storage seed protein, USP, of Vicia faba, the expression of the GUS gene driven by the rd22 promoter in non-stressed transgenic Arabidopsis plants was found mainly in flowers and bolted stems rather than in seeds.
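    As a minimal sketch of how a degenerate consensus such as RYACGTGGYR can be searched for in a promoter sequence, the Python snippet below expands the IUPAC codes (R = A or G, Y = C or T) into a regular expression and scans one strand. The function names and the toy sequences are illustrative assumptions, not material from the study.

```python
import re

# Subset of the standard IUPAC nucleotide codes needed for this consensus.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any base
}

def consensus_to_regex(consensus: str) -> "re.Pattern[str]":
    """Turn an IUPAC consensus string into a compiled regex.

    A lookahead is used so that overlapping sites are reported as well.
    """
    body = "".join(IUPAC[base] for base in consensus.upper())
    return re.compile(f"(?=({body}))")

def find_sites(sequence: str, consensus: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for every site on the given strand."""
    pattern = consensus_to_regex(consensus)
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence.upper())]

ABRE = "RYACGTGGYR"  # consensus quoted in the abstract

# Hypothetical sequences, made up for illustration only.
toy_with_site = "TTTTGCACGTGGCATTTT"
toy_without_site = "TTTTGCACGTTTTTT"

print(find_sites(toy_with_site, ABRE))
print(find_sites(toy_without_site, ABRE))
```

    Only the given strand is scanned here; a fuller search would presumably also scan the reverse complement.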

  1. The past and presence of gene targeting: from chemicals and DNA via proteins to RNA.

    PubMed

    Geel, T M; Ruiters, M H J; Cool, R H; Halby, L; Voshart, D C; Andrade Ruiz, L; Niezen-Koning, K E; Arimondo, P B; Rots, M G

    2018-06-05

    The ability to target DNA specifically at any given position within the genome allows many intriguing possibilities and has inspired scientists for decades. Early gene-targeting efforts exploited chemicals or DNA oligonucleotides to interfere with the DNA at a given location in order to inactivate a gene or to correct mutations. We here describe an example towards correcting a genetic mutation underlying Pompe's disease using a nucleotide-fused nuclease (TFO-MunI). In addition to the promise of gene correction, scientists soon realized that genes could be inactivated or even re-activated without inducing potentially harmful DNA damage by targeting transcriptional modulators to a particular gene. However, it proved difficult to fuse protein effector domains to the first generation of programmable DNA-binding agents. The engineering of gene-targeting proteins (zinc finger proteins (ZFPs), transcription activator-like effectors (TALEs)) circumvented this problem. The disadvantage of protein-based gene targeting is that a fusion protein needs to be engineered for every locus. The recent introduction of CRISPR/Cas offers a flexible approach to target a (fusion) protein to the locus of interest using cheap designer RNA molecules. Many research groups now exploit this platform and the first human clinical trials have been initiated: CRISPR/Cas has kicked off a new era of gene targeting and is revolutionizing biomedical sciences. This article is part of a discussion meeting issue 'Frontiers in epigenetic chemical biology'. © 2018 The Author(s).

  2. Screening and association testing of common coding variation in steroid hormone receptor co-activator and co-repressor genes in relation to breast cancer risk: the Multiethnic Cohort.

    PubMed

    Haiman, Christopher A; Garcia, Rachel R; Hsu, Chris; Xia, Lucy; Ha, Helen; Sheng, Xin; Le Marchand, Loic; Kolonel, Laurence N; Henderson, Brian E; Stallcup, Michael R; Greene, Geoffrey L; Press, Michael F

    2009-01-30

    Only a limited number of studies have performed comprehensive investigations of coding variation in relation to breast cancer risk. Given the established role of estrogens in breast cancer, we hypothesized that coding variation in steroid receptor coactivator and corepressor genes may alter inter-individual response to estrogen and serve as markers of breast cancer risk. We sequenced the coding exons of 17 genes (EP300, CCND1, NME1, NCOA1, NCOA2, NCOA3, SMARCA4, SMARCA2, CARM1, FOXA1, MPG, NCOR1, NCOR2, CALCOCO1, PRMT1, PPARBP and CREBBP) suggested to influence transcriptional activation by steroid hormone receptors in a multiethnic panel of women with advanced breast cancer (n = 95): African Americans, Latinos, Japanese, Native Hawaiians and European Americans. Association testing of validated coding variants was conducted in a breast cancer case-control study (1,612 invasive cases and 1,961 controls) nested in the Multiethnic Cohort. We used logistic regression to estimate odds ratios for allelic effects in ethnic-pooled analyses as well as in subgroups defined by disease stage and steroid hormone receptor status. We also investigated effect modification by established breast cancer risk factors that are associated with steroid hormone exposure. We identified 45 coding variants with frequencies ≥ 1% in any one ethnic group (43 non-synonymous variants). We observed nominally significant positive associations with two coding variants in ethnic-pooled analyses (NCOR2: His52Arg, OR = 1.79; 95% CI, 1.05-3.05; CALCOCO1: Arg12His, OR = 2.29; 95% CI, 1.00-5.26). A small number of variants were associated with risk in disease subgroup analyses and we observed no strong evidence of effect modification by breast cancer risk factors. Based on the large number of statistical tests conducted in this study, the nominally significant associations that we observed may be due to chance, and will need to be confirmed in other studies. Our findings suggest that common coding
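    To illustrate how an odds ratio and its 95% confidence interval relate to one another, the following Python sketch computes an unadjusted odds ratio from a 2x2 table using the standard log-scale (Woolf) interval. This is a simplification and an assumption on our part: the study itself used logistic regression, and the counts below are hypothetical, not taken from the Multiethnic Cohort.

```python
import math

def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Unadjusted odds ratio with a Woolf (log-scale) confidence interval.

    a = carrier cases,    b = non-carrier cases,
    c = carrier controls, d = non-carrier controls.
    z = 1.96 gives an approximate 95% interval.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) for a 2x2 table.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts, chosen only to make the arithmetic easy to follow.
or_, lo, hi = odds_ratio_ci(a=20, b=80, c=10, d=90)
print(f"OR = {or_:.2f}; 95% CI, {lo:.2f}-{hi:.2f}")
```

    An interval whose lower bound sits at or just below 1, as in this toy table, corresponds to an association that is at best nominally significant.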

  3. Intact coding region of the serotonin transporter gene in obsessive-compulsive disorder

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Altemus, M.; Murphy, D.L.; Greenberg, B.

    1996-07-26

    Epidemiologic studies indicate that obsessive-compulsive disorder is genetically transmitted in some families, although no genetic abnormalities have been identified in individuals with this disorder. The selective response of obsessive-compulsive disorder to treatment with agents which block serotonin reuptake suggests the gene coding for the serotonin transporter as a candidate gene. The primary structure of the serotonin-transporter coding region was sequenced in 22 patients with obsessive-compulsive disorder, using direct PCR sequencing of cDNA synthesized from platelet serotonin-transporter mRNA. No variations in amino acid sequence were found among the obsessive-compulsive disorder patients or healthy controls. These results do not support a role for alteration in the primary structure of the coding region of the serotonin-transporter gene in the pathogenesis of obsessive-compulsive disorder. 27 refs.

  4. Downregulation of BRAF-activated non-protein coding RNA in patients with hepatitis B virus-associated hepatocellular carcinoma.

    PubMed

    Zhao, Na-Na; Wang, Cheng; Lai, Cheng-Cai; Cheng, Si-Jie; Yan, Jin; Hong, Zhi-Xian; Yu, Lin-Xiang; Zhu, Zhen-Yu; Zhang, Pei-Rui; Wang, Zhao-Hai; Wang, Xi-Liang; Zhang, Shao-Geng; Yang, Peng-Hui

    2018-05-01

    Long non-coding RNAs (lncRNAs) have been investigated as a novel class of regulators of cellular processes, including cell growth, apoptosis and carcinogenesis. lncRNA BRAF-activated non-protein coding RNA (BANCR) has recently been revealed to be involved in tumorigenesis of numerous types of cancer, including papillary thyroid carcinoma, melanoma, non-small cell lung cancer and colorectal cancer. However, the expression profiles and biological relevance of lncRNA BANCR in hepatocellular carcinoma (HCC) have not yet been reported. In the present study, the expression level of BANCR in tumor tissues and para-cancerous tissues was determined by reverse transcription-quantitative polymerase chain reaction in patients with hepatitis B virus (HBV)-associated HCC, and its association with clinicopathological characteristics of patients was analyzed. The results demonstrated that the expression level of BANCR was significantly reduced in tumor tissues in comparison with para-cancerous tissues (P<0.001). Furthermore, the present study demonstrated that BANCR expression level was closely associated with serum α-fetoprotein levels (P<0.01) and HCC tumor number (P<0.05). To the best of our knowledge, these results revealed for the first time that BANCR is downregulated in patients with HBV-associated HCC and that BANCR expression level may be a potentially valuable diagnostic and therapeutic biomarker in HCC.

  5. The Maximal C³ Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses.

    PubMed

    Michel, Christian J

    2017-04-18

    In 1996, a set X of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set X has an interesting mathematical property as X is a maximal C³ self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code X was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. We extend here this definition at the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code X. As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated to each gene. At the gene level, the circular code X is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code X. Finally, by studying viral genes, the circular code X was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes.
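    The idea of comparing trinucleotide occurrence across the three frames of a single gene can be sketched in a few lines of Python. The 20 trinucleotides below are the set X as it is usually listed in the circular code literature; they are reproduced here from memory and should be checked against the paper. The toy gene is made up, and the simple count shown is only an assumption about the general idea, not the paper's actual statistic.

```python
# Set X as commonly listed in the circular code literature
# (an assumption to verify against the original publication).
X = frozenset("""AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC
GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC""".split())

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string over the alphabet A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]

def frame_counts(gene: str, code: frozenset = X) -> list[int]:
    """Count trinucleotides belonging to `code` in frames 0, 1 and 2."""
    gene = gene.upper()
    counts = []
    for shift in range(3):
        # Non-overlapping trinucleotides read from position `shift`.
        codons = [gene[i:i + 3] for i in range(shift, len(gene) - 2, 3)]
        counts.append(sum(codon in code for codon in codons))
    return counts

# Hypothetical 15-nt sequence built from words of X, for illustration only.
toy_gene = "GAAGACCTGGTTAAC"
print(frame_counts(toy_gene))
```

    On such a sequence the reading frame stands out by having the most trinucleotides from X, which is the intuition behind reading-frame retrieval.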

  6. Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

    PubMed

    Pujar, Shashikant; O'Leary, Nuala A; Farrell, Catherine M; Loveland, Jane E; Mudge, Jonathan M; Wallin, Craig; Girón, Carlos G; Diekhans, Mark; Barnes, If; Bennett, Ruth; Berry, Andrew E; Cox, Eric; Davidson, Claire; Goldfarb, Tamara; Gonzalez, Jose M; Hunt, Toby; Jackson, John; Joardar, Vinita; Kay, Mike P; Kodali, Vamsi K; Martin, Fergal J; McAndrews, Monica; McGarvey, Kelly M; Murphy, Michael; Rajput, Bhanu; Rangwala, Sanjida H; Riddick, Lillian D; Seal, Ruth L; Suner, Marie-Marthe; Webb, David; Zhu, Sophia; Aken, Bronwen L; Bruford, Elspeth A; Bult, Carol J; Frankish, Adam; Murphy, Terence; Pruitt, Kim D

    2018-01-04

    The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.

  7. Bioinsecticidal activity of Talisia esculenta reserve protein on growth and serine digestive enzymes during larval development of Anticarsia gemmatalis.

    PubMed

    Macedo, Maria Lígia R; Freire, Maria das Graças M; Kubo, Carlos Eduardo G; Parra, José Roberto P

    2011-01-01

    Plants synthesize a variety of molecules to defend themselves against an attack by insects. Talisin is a reserve protein from Talisia esculenta seeds, the first to be characterized from the family Sapindaceae. In this study, the insecticidal activity of Talisin was tested by incorporating the reserve protein into an artificial diet fed to the velvetbean caterpillar Anticarsia gemmatalis, the major pest of soybean crops in Brazil. At 1.5% (w/w) of the dietary protein, Talisin affected larval growth, pupal weight, development and mortality, adult fertility and longevity, and produced malformations in pupae and adult insects. Talisin inhibited the trypsin-like activity of larval midgut homogenates. The trypsin activity in Talisin-fed larvae was sensitive to Talisin, indicating that no novel protease resistant to Talisin was induced in Talisin-fed larvae. Affinity chromatography showed that Talisin bound to midgut proteinases of the insect A. gemmatalis, but was resistant to enzymatic digestion by these larval proteinases. The transformation of genes coding for this reserve protein could be useful for developing insect-resistant crops. Copyright © 2010 Elsevier Inc. All rights reserved.

  8. Isolated Fungal Promoters and Gene Transcription Terminators and Methods of Protein and Chemical Production in a Fungus

    DOEpatents

    Dai, Ziyu; Lasure, Linda L.; Magnuson, Jon K.

    2008-11-11

    The present invention encompasses isolated gene regulatory elements and gene transcription terminators that are differentially expressed in a native fungus exhibiting a first morphology relative to the native fungus exhibiting a second morphology. The invention also encompasses a method of utilizing a fungus for protein or chemical production. A transformed fungus is produced by transforming a fungus with a recombinant polynucleotide molecule. The recombinant polynucleotide molecule contains an isolated polynucleotide sequence linked operably to another molecule comprising a coding region of a gene of interest. The gene regulatory element and gene transcription terminator may temporally and spatially regulate expression of particular genes for optimum production of compounds of interest in a transgenic fungus.

  9. Isolated fungal promoters and gene transcription terminators and methods of protein and chemical production in a fungus

    DOEpatents

    Dai, Ziyu; Lasure, Linda L.; Magnuson, Jon K.

    2008-11-11

    The present invention encompasses isolated gene regulatory elements and gene transcription terminators that are differentially expressed in a native fungus exhibiting a first morphology relative to the native fungus exhibiting a second morphology. The invention also encompasses a method of utilizing a fungus for protein or chemical production. A transformed fungus is produced by transforming a fungus with a recombinant polynucleotide molecule. The recombinant polynucleotide molecule contains an isolated polynucleotide sequence linked operably to another molecule comprising a coding region of a gene of interest. The gene regulatory element and gene transcription terminator may temporally and spatially regulate expression of particular genes for optimum production of compounds of interest in a transgenic fungus.

  10. Isolated fungal promoters and gene transcription terminators and methods of protein and chemical production in a fungus

    DOEpatents

    Dai, Ziyu; Lasure, Linda L; Magnuson, Jon K

    2014-05-27

    The present invention encompasses isolated gene regulatory elements and gene transcription terminators that are differentially expressed in a native fungus exhibiting a first morphology relative to the native fungus exhibiting a second morphology. The invention also encompasses a method of utilizing a fungus for protein or chemical production. A transformed fungus is produced by transforming a fungus with a recombinant polynucleotide molecule. The recombinant polynucleotide molecule contains an isolated polynucleotide sequence linked operably to another molecule comprising a coding region of a gene of interest. The gene regulatory element and gene transcription terminator may temporally and spatially regulate expression of particular genes for optimum production of compounds of interest in a transgenic fungus.

  11. Discovering disease-associated genes in weighted protein-protein interaction networks

    NASA Astrophysics Data System (ADS)

    Cui, Ying; Cai, Meng; Stanley, H. Eugene

    2018-04-01

    Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weight - which quantifies their relative strength - into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.
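    The difference between unweighted and weighted views of a PPI network can be illustrated with a minimal Python sketch that compares node degree (number of partners) with node strength (sum of edge weights). The protein names and weights are hypothetical, and the sketch does not reproduce the topological features or the random forest classifier used in the paper.

```python
from collections import defaultdict

# Hypothetical weighted PPI edges: (protein, protein, interaction weight).
edges = [
    ("P1", "P2", 0.9),
    ("P1", "P3", 0.2),
    ("P2", "P3", 0.4),
    ("P3", "P4", 0.1),
]

def degree_and_strength(edge_list):
    """Return (degree, strength) dictionaries for an undirected weighted graph.

    degree[n]   = number of edges touching n (unweighted view)
    strength[n] = sum of the weights of those edges (weighted view)
    """
    degree = defaultdict(int)
    strength = defaultdict(float)
    for u, v, weight in edge_list:
        for node in (u, v):
            degree[node] += 1
            strength[node] += weight
    return dict(degree), dict(strength)

degree, strength = degree_and_strength(edges)
for node in sorted(degree):
    print(node, degree[node], round(strength[node], 2))
```

    In this toy network the node with the most partners (P3) is not the node with the greatest total interaction weight (P2), which is the kind of information an unweighted analysis discards.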

  12. Role of nuclear factor of activated T-cells and activator protein-1 in the inhibition of interleukin-2 gene transcription by cannabinol in EL4 T-cells.

    PubMed

    Yea, S S; Yang, K H; Kaminski, N E

    2000-02-01

    We previously reported that immunosuppressive cannabinoids inhibited interleukin (IL)-2 steady-state mRNA expression and secretion by phorbol-12-myristate-13-acetate plus ionomycin-activated mouse splenocytes and EL4 murine T-cells. Here we show that inhibition of IL-2 production by cannabinol, a modest central nervous system-active cannabinoid, is mediated through the inhibition of IL-2 gene transcription. Moreover, electrophoretic mobility shift assays demonstrated that cannabinol markedly inhibited the DNA binding activity of nuclear factor of activated T-cells (NF-AT) and activator protein-1 (AP-1) in a time- and concentration-dependent manner in activated EL4 cells. The inhibitory effects produced by cannabinol on AP-1 DNA binding were quite transient, showing partial recovery by 240 min after cell activation and no effect on the activity of a reporter gene under the control of AP-1. Conversely, cannabinol-mediated inhibition of NF-AT was robust and sustained as demonstrated by an NF-AT-regulated reporter gene. Collectively, these results suggest that decreased IL-2 production by cannabinol in EL4 cells is due to the inhibition of transcriptional activation of the IL-2 gene and is mediated, at least in part, through a transient inhibition of AP-1 and a sustained inhibition of NF-AT.

  13. Co-expression of the Thermotoga neapolitana aglB gene with an upstream 3'-coding fragment of the malG gene improves enzymatic characteristics of recombinant AglB cyclomaltodextrinase.

    PubMed

    Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A

    2007-07-01

    A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.

  14. Direct protein interaction underlies gene-for-gene specificity and coevolution of the flax resistance genes and flax rust avirulence genes

    PubMed Central

    Dodds, Peter N.; Lawrence, Gregory J.; Catanzariti, Ann-Maree; Teh, Trazel; Wang, Ching-I. A.; Ayliffe, Michael A.; Kobe, Bostjan; Ellis, Jeffrey G.

    2006-01-01

    Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R–Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R–Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R–Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant–pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes. PMID:16731621

  15. Ubiquitous and gene-specific regulatory 5' sequences in a sea urchin histone DNA clone coding for histone protein variants.

    PubMed Central

    Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L

    1980-01-01

    The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes is identical to that in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs; a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified version, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clone h19 and h22 are compared areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone genes. PMID:7443547

  16. The Bacillus thuringiensis cyt Genes for Hemolytic Endotoxins Constitute a Gene Family

    PubMed Central

    Guerchicoff, Alejandra; Delécluse, Armelle; Rubinstein, Clara P.

    2001-01-01

    In the same way that cry genes, coding for larvicidal delta endotoxins, constitute a large and diverse gene family, the cyt genes for hemolytic toxins seem to compose another set of highly related genes in Bacillus thuringiensis. Although the occurrence of Cyt hemolytic factors in B. thuringiensis has been typically associated with mosquitocidal strains, we have recently shown that cyt genes are also present in strains with different pathotypes; this is the case for the morrisoni subspecies, which includes strains biologically active against dipteran, lepidopteran, and coleopteran larvae. In addition, while one Cyt type of protein has been described in all of the mosquitocidal strains studied so far, the present study confirms that at least two Cyt toxins coexist in the more toxic antidipteran strains, such as B. thuringiensis subsp. israelensis and subsp. morrisoni PG14, and that this could also be the case for many others. In fact, PCR screening and Western blot analysis of 50 B. thuringiensis strains revealed that cyt2-related genes are present in all strains with known antidipteran activity, as well as in some others with different or unknown host ranges. Partial DNA sequences for several of these genes were determined, and protein sequence alignments revealed a high degree of conservation of the structural domains. These findings point to an important biological role for Cyt toxins in the final in vivo toxic activity of many B. thuringiensis strains. PMID:11229896

  17. Cloning & sequence identification of Hsp27 gene and expression analysis of the protein on thermal stress in Lucilia cuprina.

    PubMed

    Singh, Manish K; Tiwari, Pramod K

    2016-08-01

    Hsp27, a highly conserved small molecular weight heat shock protein, is widely known to be developmentally regulated and heat inducible. Its role in thermotolerance is also implicated. This study is a sequel to our earlier studies to understand the molecular organization of heat shock genes/proteins and their role in development and thermal adaptation in a sheep pest, Lucilia cuprina (blowfly), which exhibits unusually high adaptability to a variety of environmental stresses, including heat and chemicals. In this report our aim was to understand the evolutionary relationship of Lucilia hsp27 gene/protein with those of other species and its role in thermal adaptation. We sequence-characterized the Lchsp27 gene (coding region) and analyzed its expression in various larval and adult tissues under normal as well as heat shock conditions. The nucleotide sequence analysis of the 678-bp coding region of Lchsp27 exhibited closest evolutionary proximity with Drosophila (90.09%), which belongs to the same order, Diptera. Heat shock caused significant enhancement in the expression of Lchsp27 gene in all the larval and adult tissues examined, although in a tissue-specific manner. Significantly, in Malpighian tubules, while the heat-induced level of hsp27 transcript (mRNA) appeared increased as compared to control, the protein level remained unaltered and nuclear localized. We infer that Lchsp27 may have a significant role in the maintenance of cellular homeostasis, particularly during summer months, when the fly remains exposed to high heat in its natural habitat. © 2015 Institute of Zoology, Chinese Academy of Sciences.

  18. Gene Trapping Using Gal4 in Zebrafish

    PubMed Central

    Balciuniene, Jorune; Balciunas, Darius

    2013-01-01

    Large clutch size and external development of optically transparent embryos make zebrafish an exceptional vertebrate model system for in vivo insertional mutagenesis using fluorescent reporters to tag expression of mutated genes. Several laboratories have constructed and tested enhancer- and gene-trap vectors in zebrafish, using fluorescent proteins and Gal4- and lexA-based transcriptional activators as reporters 1-7. These vectors had two potential drawbacks: suboptimal stringency (e.g. lack of ability to differentiate between enhancer- and gene-trap events) and low mutagenicity (e.g. integrations into genes rarely produced null alleles). Gene Breaking Transposons (GBTs) were developed to address these drawbacks 8-10. We have modified one of the first GBT vectors, GBT-R15, for use with Gal4-VP16 as the primary gene trap reporter and added UAS:eGFP as the secondary reporter for direct detection of gene trap events. Application of Gal4-VP16 as the primary gene trap reporter provides two main advantages. First, it increases sensitivity for genes expressed at low levels. Second, it enables researchers to use gene trap lines as Gal4 drivers to direct expression of other transgenes in very specific tissues. This is especially pertinent for genes with non-essential or redundant functions, where gene trap integration may not result in overt phenotypes. The disadvantage of using Gal4-VP16 as the primary gene trap reporter is that genes coding for proteins with N-terminal signal sequences are not amenable to trapping, as the resulting Gal4-VP16 fusion proteins are unlikely to be able to enter the nucleus and activate transcription. Importantly, the use of Gal4-VP16 does not pre-select for nuclear proteins: we recovered gene trap mutations in genes encoding proteins which function in the nucleus, the cytoplasm and the plasma membrane. PMID:24121167

  19. Robust, synergistic regulation of human gene expression using TALE activators.

    PubMed

    Maeder, Morgan L; Linder, Samantha J; Reyon, Deepak; Angstman, James F; Fu, Yanfang; Sander, Jeffry D; Joung, J Keith

    2013-03-01

    Artificial activators designed using transcription activator-like effector (TALE) technology have broad utility, but previous studies suggest that these monomeric proteins often exhibit low activities. Here we demonstrate that TALE activators can robustly function individually or in synergistic combinations to increase expression of endogenous human genes over wide dynamic ranges. These findings will encourage applications of TALE activators for research and therapy, and guide design of monomeric TALE-based fusion proteins.

  20. The heat-shock protein Apg-2 binds to the tight junction protein ZO-1 and regulates transcriptional activity of ZONAB.

    PubMed

    Tsapara, Anna; Matter, Karl; Balda, Maria S

    2006-03-01

    The tight junction adaptor protein ZO-1 regulates intracellular signaling and cell proliferation. Its Src homology 3 (SH3) domain is required for the regulation of proliferation and binds to the Y-box transcription factor ZO-1-associated nucleic acid binding protein (ZONAB). Binding of ZO-1 to ZONAB results in cytoplasmic sequestration and hence inhibition of ZONAB's transcriptional activity. Here, we identify a new binding partner of the SH3 domain that modulates ZO-1-ZONAB signaling. Expression screening of a cDNA library with a fusion protein containing the SH3 domain yielded a cDNA coding for Apg-2, a member of the heat-shock protein 110 (Hsp 110) subfamily of Hsp70 heat-shock proteins, which is overexpressed in carcinomas. Regulated depletion of Apg-2 in Madin-Darby canine kidney cells inhibits G(1)/S phase progression. Apg-2 coimmunoprecipitates with ZO-1 and partially localizes to intercellular junctions. Junctional recruitment and coimmunoprecipitation with ZO-1 are stimulated by heat shock. Apg-2 competes with ZONAB for binding to the SH3 domain in vitro and regulates ZONAB's transcriptional activity in reporter gene assays. Our data hence support a model in which Apg-2 regulates ZONAB function by competing for binding to the SH3 domain of ZO-1 and suggest that Apg-2 functions as a regulator of ZO-1-ZONAB signaling in epithelial cells in response to cellular stress.

  1. The Heat-Shock Protein Apg-2 Binds to the Tight Junction Protein ZO-1 and Regulates Transcriptional Activity of ZONAB

    PubMed Central

    Tsapara, Anna; Matter, Karl; Balda, Maria S.

    2006-01-01

    The tight junction adaptor protein ZO-1 regulates intracellular signaling and cell proliferation. Its Src homology 3 (SH3) domain is required for the regulation of proliferation and binds to the Y-box transcription factor ZO-1-associated nucleic acid binding protein (ZONAB). Binding of ZO-1 to ZONAB results in cytoplasmic sequestration and hence inhibition of ZONAB's transcriptional activity. Here, we identify a new binding partner of the SH3 domain that modulates ZO-1–ZONAB signaling. Expression screening of a cDNA library with a fusion protein containing the SH3 domain yielded a cDNA coding for Apg-2, a member of the heat-shock protein 110 (Hsp 110) subfamily of Hsp70 heat-shock proteins, which is overexpressed in carcinomas. Regulated depletion of Apg-2 in Madin-Darby canine kidney cells inhibits G1/S phase progression. Apg-2 coimmunoprecipitates with ZO-1 and partially localizes to intercellular junctions. Junctional recruitment and coimmunoprecipitation with ZO-1 are stimulated by heat shock. Apg-2 competes with ZONAB for binding to the SH3 domain in vitro and regulates ZONAB's transcriptional activity in reporter gene assays. Our data hence support a model in which Apg-2 regulates ZONAB function by competing for binding to the SH3 domain of ZO-1 and suggest that Apg-2 functions as a regulator of ZO-1–ZONAB signaling in epithelial cells in response to cellular stress. PMID:16407410

  2. The Arabidopsis At1g30680 gene encodes a homologue to the phage T7 gp4 protein that has both DNA primase and DNA helicase activities.

    PubMed

    Diray-Arce, Joann; Liu, Bin; Cupp, John D; Hunt, Travis; Nielsen, Brent L

    2013-03-04

    The Arabidopsis thaliana genome encodes a homologue of the full-length bacteriophage T7 gp4 protein, which is also homologous to the eukaryotic Twinkle protein. While the phage protein has both DNA primase and DNA helicase activities, in animal cells Twinkle is localized to mitochondria and has only DNA helicase activity due to sequence changes in the DNA primase domain. However, Arabidopsis and other plant Twinkle homologues retain sequence homology for both functional domains of the phage protein. The Arabidopsis Twinkle homologue has been shown by others to be dual targeted to mitochondria and chloroplasts. To determine the functional activity of the Arabidopsis protein we obtained the gene for the full-length Arabidopsis protein and expressed it in bacteria. The purified protein was shown to have both DNA primase and DNA helicase activities. Western blot and qRT-PCR analysis indicated that the Arabidopsis gene is expressed most abundantly in young leaves and shoot apex tissue, as expected if this protein plays a role in organelle DNA replication. This expression is closely correlated with the expression of organelle-localized DNA polymerase in the same tissues. Homologues from other plant species show close similarity by phylogenetic analysis. The results presented here indicate that the Arabidopsis phage T7 gp4/Twinkle homologue has both DNA primase and DNA helicase activities and may provide these functions for organelle DNA replication.

  3. Cloning, characterization and sequence comparison of the gene coding for IMP dehydrogenase from Pyrococcus furiosus.

    PubMed

    Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E

    1996-10-03

    We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Pyrococcus furiosus (Pf), a hyperthermophilic archaeon. Sequence analysis of the Pf gene indicated an open reading frame specifying a protein of 485 amino acids (aa) with a calculated M(r) of 52900. Canonical Archaea promoter elements, Box A and Box B, are located -49 and -17 nucleotides (nt), respectively, upstream of the putative start codon. The sequence of the putative active-site region conforms to the IMPDH signature motif and contains a putative active-site cysteine. Phylogenetic relationships derived by using all available IMPDH sequences are consistent with trees developed for other molecules; they do not precisely resolve the history of Pf IMPDH but indicate a close similarity to bacterial IMPDH proteins. The phylogenetic analysis indicates that a gene duplication occurred prior to the division between rodents and humans, accounting for the Type I and II isoforms identified in mice and humans.

  4. Nuclear localization of the C2H2 zinc finger protein Msn2p is regulated by stress and protein kinase A activity

    PubMed Central

    Görner, Wolfram; Durchschlag, Erich; Martinez-Pastor, Maria Teresa; Estruch, Francisco; Ammerer, Gustav; Hamilton, Barbara; Ruis, Helmut; Schüller, Christoph

    1998-01-01

    Msn2p and the partially redundant factor Msn4p are key regulators of stress-responsive gene expression in Saccharomyces cerevisiae. They are required for the transcription of a number of genes coding for proteins with stress-protective functions. Both Msn2p and Msn4p are Cys2His2 zinc finger proteins and bind to the stress response element (STRE). In vivo footprinting studies show that the occupation of STREs is enhanced in stressed cells and dependent on the presence of Msn2p and Msn4p. Both factors accumulate in the nucleus under stress conditions, such as heat shock, osmotic stress, carbon-source starvation, and in the presence of ethanol or sorbate. Stress-induced nuclear localization was found to be rapid, reversible, and independent of protein synthesis. Nuclear localization of Msn2p and Msn4p was shown to be correlated inversely to cAMP levels and protein kinase A (PKA) activity. A region with significant homologies shared between Msn2p and Msn4p is sufficient to confer stress-regulated localization to a SV40–NLS–GFP fusion protein. Serine to alanine or aspartate substitutions in a conserved PKA consensus site abolished cAMP-driven nuclear export and cytoplasmic localization in unstressed cells. We propose stress and cAMP-regulated intracellular localization of Msn2p to be a key step in STRE-dependent transcription and in the general stress response. PMID:9472026

  5. Orpinomyces cellulase celf protein and coding sequences

    DOEpatents

    Li, Xin-Liang; Chen, Huizhong; Ljungdahl, Lars G.

    2000-09-05

    A cDNA (1,520 bp), designated celF, consisting of an open reading frame (ORF) encoding a polypeptide (CelF) of 432 amino acids was isolated from a cDNA library of the anaerobic rumen fungus Orpinomyces PC-2 constructed in Escherichia coli. Analysis of the deduced amino acid sequence showed that, starting from the N-terminus, CelF consists of a signal peptide and a cellulose binding domain (CBD), followed by an extremely Asn-rich linker region that separates the CBD from the catalytic domain. The latter is located at the C-terminus. The catalytic domain of CelF is highly homologous to CelA and CelC of Orpinomyces PC-2, to CelA of Neocallimastix patriciarum and also to cellobiohydrolase IIs (CBHIIs) from aerobic fungi. However, like CelA of Neocallimastix patriciarum, CelF does not have the noncatalytic repeated peptide domain (NCRPD) found in CelA and CelC from the same organism. The recombinant protein CelF hydrolyzes cellooligosaccharides in the pattern of CBHII, yielding only cellobiose as product with cellotetraose as the substrate. The genomic celF is interrupted by a 111 bp intron, located within the region coding for the CBD. The intron of celF has features in common with genes from aerobic filamentous fungi.

  6. Heterochromatin protein 1 gamma and IκB kinase alpha interdependence during tumour necrosis factor gene transcription elongation in activated macrophages.

    PubMed

    Thorne, James L; Ouboussad, Lylia; Lefevre, Pascal F

    2012-09-01

    IκB kinase α (IKKα) is part of the cytoplasmic IKK complex regulating nuclear factor-κB (NF-κB) release and translocation into the nucleus in response to pro-inflammatory signals. IKKα can also be recruited directly to the promoter of NF-κB-dependent genes by NF-κB where it phosphorylates histone H3 at serine 10, triggering recruitment of the bromodomain-containing protein 4 and the positive transcription elongation factor b. Herein, we report that IKKα travels with the elongating form of ribonucleic acid polymerase II together with heterochromatin protein 1 gamma (HP1γ) at NF-κB-dependent genes in activated macrophages. IKKα binds to and phosphorylates HP1γ, which in turn controls IKKα binding to chromatin and phosphorylation of the histone variant H3.3 at serine 31 within transcribing regions. Downstream of transcription end sites, IKKα accumulates with its inhibitor the CUE-domain containing protein 2, suggesting a link between IKKα inactivation and transcription termination.

  7. Selective activation of human heat shock gene transcription by nitrosourea antitumor drugs mediated by isocyanate-induced damage and activation of heat shock transcription factor.

    PubMed Central

    Kroes, R A; Abravaya, K; Seidenfeld, J; Morimoto, R I

    1991-01-01

    Treatment of cultured human tumor cells with the chloroethylnitrosourea antitumor drug 1,3-bis(2-chloroethyl)-1-nitrosourea (BCNU) selectively induces transcription and protein synthesis of a subset of the human heat shock or stress-induced genes (HSP90 and HSP70) with little effect on other stress genes or on expression of the c-fos, c-myc, or beta-actin genes. The active component of BCNU and related compounds appears to be the isocyanate moiety that causes carbamoylation of proteins and nucleic acids. Transcriptional activation of the human HSP70 gene by BCNU is dependent on the heat shock element and correlates with the level of heat shock transcription factor and its binding to the heat shock element in vivo. Unlike activation by heat or heavy metals, BCNU-mediated activation is strongly dependent upon new protein synthesis. This suggests that BCNU-induced, isocyanate-mediated damage to newly synthesized protein(s) may be responsible for activation of the heat shock transcription factor and increased transcription of the HSP90 and HSP70 genes. PMID:2052560

  8. Tissue plasminogen activator (tPA) as a reporter gene in transient gene expression.

    PubMed

    Cheng, S M; Lee, S G; Kalyan, N K; McCloud, S; Levner, M; Hung, P P

    1987-01-01

    Using the gene coding for tissue plasminogen activator (tPA) as a reporter gene, a transient gene expression system has been established. Vectors containing the full-length cDNA of tPA with its signal sequences were introduced into mammalian recipient cells by a modified gene transfer procedure. Thirty hours after transfection, the secreted tPA was found in serum-free medium and measured by a fibrin-agarose plate assay (FAPA). In this assay, tPA converts plasminogen into plasmin which then degrades high-Mr fibrin to produce cleared zones. The sizes of these zones correspond to quantities of tPA. The combination of transient tPA expression system and the FAPA provides a quick, sensitive, quantitative and non-destructive method to examine the strength of eukaryotic regulatory elements in tissue-culture cells.

  9. Genome-Wide Identification of Mitogen-Activated Protein Kinase Gene Family across Fungal Lineage Shows Presence of Novel and Diverse Activation Loop Motifs

    PubMed Central

    Mohanta, Tapan Kumar; Mohanta, Nibedita; Parida, Pratap; Panda, Sujogya Kumar; Ponpandian, Lakshmi Narayanan; Bae, Hanhong

    2016-01-01

    The mitogen-activated protein kinase (MAPK) is characterized by the presence of the T-E-Y, T-D-Y, and T-G-Y motifs in its activation loop region and plays a significant role in regulating diverse cellular responses in eukaryotic organisms. Availability of large-scale genome data in the fungal kingdom encouraged us to identify and analyse the fungal MAPK gene family across 173 fungal species. The analysis of the MAPK gene family resulted in the discovery of several novel activation loop motifs (T-T-Y, T-I-Y, T-N-Y, T-H-Y, T-S-Y, K-G-Y, T-Q-Y, S-E-Y and S-D-Y) in fungal MAPKs. The phylogenetic analysis suggests that fungal MAPKs are non-polymorphic, evolved from their common ancestors around 1500 million years ago, and are distantly related to plant MAPKs. We are the first to report the presence of nine novel activation loop motifs in fungal MAPKs. The specificity of the activation loop motif plays a significant role in controlling different growth and stress related pathways in fungi. Hence, the presence of these nine novel activation loop motifs in fungi is of special interest. PMID:26918378

  10. HCV core protein induces hepatic lipid accumulation by activating SREBP1 and PPARγ

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Kook Hwan; Hong, Sung Pyo; Kim, KyeongJin

    2007-04-20

    Hepatic steatosis is a common feature in patients with chronic hepatitis C virus (HCV) infection. HCV core protein plays an important role in the development of hepatic steatosis in HCV infection. Because SREBP1 (sterol regulatory element binding protein 1) and PPARγ (peroxisome proliferator-activated receptor γ) are involved in the regulation of hepatocyte lipid metabolism, we sought to determine whether HCV core protein may impair the expression and activity of SREBP1 and PPARγ. In this study, it was demonstrated that HCV core protein increases the gene expression of SREBP1 not only in Chang liver, Huh7, and HepG2 cells transiently transfected with HCV core protein expression plasmid, but also in Chang liver-core stable cells. Furthermore, HCV core protein enhanced the transcriptional activity of SREBP1. In addition, HCV core protein elevated PPARγ transcriptional activity. However, HCV core protein had no effect on PPARγ gene expression. Finally, we showed that HCV core protein stimulates the gene expression of lipogenic enzymes and fatty acid uptake-associated proteins. Therefore, our findings provide new insight into the mechanism of hepatic steatosis in HCV infection.

  11. De Novo ORFs in Drosophila Are Important to Organismal Fitness and Evolved Rapidly from Previously Non-coding Sequences

    PubMed Central

    Reinhardt, Josephine A.; Wanjiru, Betty M.; Brant, Alicia T.; Saelao, Perot; Begun, David J.; Jones, Corbin D.

    2013-01-01

    How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important. PMID:24146629

  12. Junk DNA and the long non-coding RNA twist in cancer genetics

    PubMed Central

    Ling, Hui; Vincent, Kimberly; Pichler, Martin; Fodde, Riccardo; Berindan-Neagoe, Ioana; Slack, Frank J.; Calin, George A

    2015-01-01

    The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs (lncRNAs) have attracted much attention due to their large number and biological significance. Many lncRNAs have been identified as mapping to regulatory elements including gene promoters and enhancers, ultraconserved regions, and intergenic regions of protein-coding genes. Yet, the biological function and molecular mechanisms of lncRNAs in human diseases in general and cancer in particular remain largely unknown. Data from the literature suggest that lncRNAs, often via interaction with proteins, function in specific genomic loci or use their own transcription loci for regulatory activity. In this review, we summarize recent findings supporting the importance of DNA loci in lncRNA function, and the underlying molecular mechanisms via cis or trans regulation, and discuss their implications in cancer. In addition, we use the 8q24 genomic locus, a region containing interactive SNPs, DNA regulatory elements and lncRNAs, as an example to illustrate how single nucleotide polymorphisms (SNPs) located within lncRNAs may be functionally associated with the individual’s susceptibility to cancer. PMID:25619839

  13. A genome-wide identification and analysis of the DYW-deaminase genes in the pentatricopeptide repeat gene family in cotton (Gossypium spp.)

    PubMed Central

    Liu, Guoyuan; Li, Xue; Guo, Liping; Zhang, Xuexian; Qi, Tingxiang; Wang, Hailin; Tang, Huini; Qiao, Xiuqin; Zhang, Jinfa; Xing, Chaozhu; Wu, Jianyong

    2017-01-01

    The RNA editing occurring in plant organellar genomes mainly involves the change of cytidine to uridine. This process involves a deamination reaction, with cytidine deaminase as the catalyst. Pentatricopeptide repeat (PPR) proteins with a C-terminal DYW domain are reportedly associated with cytidine deamination, similar to members of the deaminase superfamily. PPR genes are involved in many cellular functions and biological processes including fertility restoration to cytoplasmic male sterility (CMS) in plants. In this study, we identified 227 and 211 DYW deaminase-coding PPR genes for the cultivated tetraploid cotton species G. hirsutum and G. barbadense (2n = 4x = 52), respectively, as well as 126 and 97 DYW deaminase-coding PPR genes in the ancestral diploid species G. raimondii and G. arboreum (2n = 26), respectively. The 227 G. hirsutum PPR genes were predicted to encode 52–2016 amino acids, 203 of which were mapped onto 26 chromosomes. Most DYW deaminase genes lacked introns, and their proteins were predicted to target the mitochondria or chloroplasts. Additionally, the DYW domain differed from the complete DYW deaminase domain, which contained part of the E domain and the entire E+ domain. The types and number of DYW tripeptides may have been influenced by evolutionary processes, with some tripeptides being lost. Furthermore, a gene ontology analysis revealed that DYW deaminase functions were mainly related to binding as well as hydrolase and transferase activities. The G. hirsutum DYW deaminase expression profiles varied among different cotton tissues and developmental stages, and no differentially expressed DYW deaminase-coding PPRs were directly associated with the male sterility and restoration in the CMS-D2 system. Our current study provides an important piece of information regarding the structural and evolutionary characteristics of Gossypium DYW-containing PPR genes coding for deaminases and will be useful for characterizing the DYW deaminase gene

  14. The Maximal C3 Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses

    PubMed Central

    Michel, Christian J.

    2017-01-01

    In 1996, a set X of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set X has an interesting mathematical property as X is a maximal C3 self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code X was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. Here we extend this definition to the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code X. As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated with each gene. At the gene level, the circular code X is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code X. Finally, by studying viral genes, the circular code X was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes. PMID:28420220
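
    The frame comparison described in this abstract can be illustrated with a minimal sketch. It is not the author's statistical method: it only counts how often each trinucleotide occurs in the reading frame (shift 0) and in the two shifted frames of a single gene sequence. The function names frame_counts and preferred_frame are hypothetical, and the set X itself is not reproduced here because the abstract does not list its 20 members.

```python
from collections import Counter


def frame_counts(seq):
    """Return three Counters of trinucleotides, one per frame (shift 0, 1, 2)."""
    seq = seq.upper()
    counts = []
    for shift in range(3):
        # Step through the sequence in non-overlapping triplets from `shift`,
        # keeping only complete trinucleotides.
        triplets = [seq[i:i + 3] for i in range(shift, len(seq) - 2, 3)]
        counts.append(Counter(triplets))
    return counts


def preferred_frame(seq, trinucleotide):
    """Return (frame index with the highest count, per-frame counts).

    Ties are resolved in favour of the lowest frame index.
    """
    per_frame = [c[trinucleotide.upper()] for c in frame_counts(seq)]
    return per_frame.index(max(per_frame)), per_frame


if __name__ == "__main__":
    toy_gene = "GAAGAAGAAGAA"  # toy sequence, not real data
    print(preferred_frame(toy_gene, "GAA"))
```

    A trinucleotide would count as preferentially in frame for a gene when its frame-0 count exceeds its counts in both shifted frames; how such per-gene results are weighted and tested statistically is left to the original paper.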

  15. Genes and proteins of urea transporters.

    PubMed

    Sands, Jeff M; Blount, Mitsi A

    2014-01-01

    A urea transporter protein in the kidney was first proposed in 1987. The first urea transporter cDNA was cloned in 1993. The SLC14a urea transporter family contains two major subgroups: SLC14a1, the UT-B urea transporter originally isolated from erythrocytes; and SLC14a2, the UT-A group originally isolated from kidney inner medulla. Slc14a1, the human UT-B gene, arises from a single locus located on chromosome 18q12.1-q21.1, which is located close to Slc14a2. Slc14a1 includes 11 exons, with the coding region extending from exon 4 to exon 11, and is approximately 30 kb in length. The Slc14a2 gene is a very large gene with 24 exons, is approximately 300 kb in length, and encodes 6 different isoforms. Slc14a2 contains two promoter elements: promoter I is located in the typical position, upstream of exon 1, and drives the transcription of UT-A1, UT-A1b, UT-A3, UT-A3b, and UT-A4; while promoter II is located within intron 12 and drives the transcription of UT-A2 and UT-A2b. UT-A1 and UT-A3 are located in the inner medullary collecting duct, UT-A2 in the thin descending limb and liver, UT-A5 in testis, UT-A6 in colon, UT-B1 primarily in descending vasa recta and erythrocytes, and UT-B2 in rumen.

  16. Using protein-protein interactions for refining gene networks estimated from microarray data by Bayesian networks.

    PubMed

    Nariai, N; Kim, S; Imoto, S; Miyano, S

    2004-01-01

    We propose a statistical method to estimate gene networks from DNA microarray data and protein-protein interactions. Because physical interactions between proteins or multiprotein complexes are likely to regulate biological processes, using only mRNA expression data is not sufficient for estimating a gene network accurately. Our method adds knowledge about protein-protein interactions to the estimation method of gene networks under a Bayesian statistical framework. In the estimated gene network, a protein complex is modeled as a virtual node based on principal component analysis. We show the effectiveness of the proposed method through the analysis of Saccharomyces cerevisiae cell cycle data. The proposed method improves the accuracy of the estimated gene networks, and successfully identifies some biological facts.

  17. Experimental annotation of post-translational features and translated coding regions in the pathogen Salmonella Typhimurium

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ansong, Charles; Tolic, Nikola; Purvine, Samuel O.

    Complete and accurate genome annotation is crucial for comprehensive and systematic studies of biological systems. For example, systems biology-oriented genome scale modeling efforts greatly benefit from accurate annotation of protein-coding genes to develop proper functioning models. However, determining protein-coding genes for most new genomes is almost completely performed by inference, using computational predictions with significant documented error rates (> 15%). Furthermore, gene prediction programs provide no information on biologically important post-translational processing events critical for protein function. With the ability to directly measure peptides arising from expressed proteins, mass spectrometry-based proteomics approaches can be used to augment and verify coding regions of a genomic sequence and, importantly, detect post-translational processing events. In this study we utilized “shotgun” proteomics to guide accurate primary genome annotation of the bacterial pathogen Salmonella Typhimurium 14028 to facilitate a systems-level understanding of Salmonella biology. The data provide protein-level experimental confirmation for 44% of predicted protein-coding genes, suggest revisions to 48 genes assigned incorrect translational start sites, and uncover 13 non-annotated genes missed by gene prediction programs. We also present a comprehensive analysis of post-translational processing events in Salmonella, revealing a wide range of complex chemical modifications (70 distinct modifications) and confirming more than 130 signal peptide and N-terminal methionine cleavage events in Salmonella. This study highlights several ways in which proteomics data applied during the primary stages of annotation can improve the quality of genome annotations, especially with regard to the annotation of mature protein products.

  18. Haplotype combination of the bovine INSIG1 gene sequence variants and association with growth traits in Nanyang cattle.

    PubMed

    Sun, Jiajie; Gao, Yuan; Liu, Dong; Ma, Wei; Xue, Jing; Zhang, Chunlei; Lan, Xianyong; Lei, Chuzhao; Chen, Hong

    2012-06-01

    The insulin-induced gene 1 (INSIG1) gene encodes a protein that blocks proteolytic activation of sterol regulatory element binding proteins, which are transcription factors that activate genes that regulate cholesterol, fatty acid, and glucose metabolism. However, similar research for the bovine INSIG1 gene is lacking. Therefore, in this study, polymorphisms of the bovine INSIG1 gene were detected in 643 individuals from four cattle breeds by DNA pooling, forced PCR-RFLP, PCR-SSCP, and DNA sequencing methods. Only 10 novel SNPs were identified, which included four mutations in the coding region and the others in the introns. In Nanyang individuals, seven common haplotypes were identified based on four coding region SNPs. The haplotype GACT, with a frequency of 75.4%, was the most prevalent haplotype, and the SNPs formed two linkage disequilibrium blocks with strong multi-allelic D' (D' = 1). Additionally, association analysis between mutations of the bovine INSIG1 gene and growth traits in Nanyang cattle at 6, 12, 18, and 24 months old was performed, and the results indicated that the polymorphisms were not significantly associated with body mass.

  19. Detection of a large duplication mutation in the myosin-binding protein C3 gene in a case of hypertrophic cardiomyopathy.

    PubMed

    Meyer, Thomas; Pankuweit, Sabine; Richter, Anette; Maisch, Bernhard; Ruppert, Volker

    2013-09-15

    Hypertrophic cardiomyopathy (HCM) is a cardiovascular disease with autosomal dominant inheritance caused by mutations in genes coding for sarcomeric and/or regulatory proteins expressed in cardiomyocytes. In a small cohort of HCM patients (n=8), we searched for mutations in the two most common genes responsible for HCM and found four missense mutations in the MYH7 gene encoding cardiac β-myosin heavy chain (R204H, M493V, R719W, and R870H) and three mutations in the myosin-binding protein C3 gene (MYBPC3) including one missense (A848V) and two frameshift mutations (c.3713delTG and c.702ins26bp). The c.702ins26bp insertion resulted from the duplication of a 26-bp fragment in a 54-year-old female HCM patient presenting with clinical signs of heart failure due to diastolic dysfunction. Although such large duplications (>10 bp) in the MYBPC3 gene are very rare and have been identified only in 4 families reported so far, the identical duplication mutation was found earlier in a Dutch patient, demonstrating that it may constitute a hitherto unknown founder mutation in central European populations. This observation underscores the significance of insertions into the coding sequence of the MYBPC3 gene for the development and pathogenesis of HCM. © 2013 Elsevier B.V. All rights reserved.

  20. [Nuclease activity of the recombinant plancitoxin-1-like proteins with mutations in the active site from Trichinella spiralis].

    PubMed

    Liao, Chengshui; Wang, Xiaoli; Tian, Wenjing; Zhang, Mengke; Zhang, Chunjie; Li, Yinju; Wu, Tingcai; Cheng, Xiangchao

    2017-08-25

    Although there are 125 predicted DNase Ⅱ-like family genes in the Trichinella spiralis genome, plancitoxin-1-like (Ts-Pt) contains the HKD motif, a typical conserved region of DNase Ⅱ, in its N- and C-termini. It is generally believed that histidine is the active site in DNase Ⅱ. To study the nuclease activity of recombinant Ts-Pt with mutations in the active site from T. spiralis, different fragments of the mutated Ts-Pt genes were cloned using the overlap PCR technique, inserted into the expression vector pET-28a(+), and transformed into Escherichia coli Rosetta (DE3). The fusion proteins were purified by Ni-NTA affinity chromatography and SDS-PAGE. Nuclease activity of the recombinant proteins was detected by agarose gel electrophoresis and nuclease-zymography. The recombinant plasmids harboring the mutated Ts-Pt genes were constructed and expressed as inclusion bodies in a prokaryotic expression system. After renaturation in vitro, the recombinant proteins had no nuclease activity according to agarose gel electrophoresis. However, the proteins expressed as inclusion bodies displayed the ability to degrade DNA after renaturation in gel, and the nuclease activity was not affected by mutation of the active site in the N- and C-termini of Ts-Pt. These results provide the basis to study the relationship between the DNase Ⅱ-like protein family and infection of T. spiralis.

  1. Up-regulation of glutathione-related genes, enzyme activities and transport proteins in human cervical cancer cells treated with doxorubicin.

    PubMed

    Drozd, Ewa; Krzysztoń-Russjan, Jolanta; Marczewska, Jadwiga; Drozd, Janina; Bubko, Irena; Bielak, Magda; Lubelska, Katarzyna; Wiktorska, Katarzyna; Chilmonczyk, Zdzisław; Anuszewska, Elżbieta; Gruber-Bzura, Beata

    2016-10-01

    Doxorubicin (DOX), one of the most effective anticancer drugs, acts in a variety of ways including DNA damage, enzyme inhibition and generation of reactive oxygen species. Glutathione (GSH) and glutathione-related enzymes including glutathione peroxidase (GPX), glutathione reductase (GSR) and glutathione S-transferases (GST) may play a role in adaptive detoxification processes in response to oxidative stress, thus contributing to the drug resistance phenotype. In this study, we investigated effects of DOX treatment on expression and activity of GSH-related enzymes and multidrug resistance-associated proteins in cultured human cervical cancer cells displaying different resistance against this drug (HeLa and KB-V1). Determination of expression level of genes encoding GST isoforms and MRP proteins (GCS, GPX, GSR, GSTA1-3, GSTM1, GSTP1, ABCC1-3, MGST1-3) was performed using StellARray™ Technology. Enzymatic activities of GPX and GSR were measured using biochemical methods. Expression of MRP1 was examined by immunofluorescence microscopy. This study showed that native expression levels of GSTM1 and GSTA3 were markedly higher in KB-V1 cells (2000-fold and 200-fold) compared to HeLa cells. Resistant cells have also shown significantly elevated expression of GSTA1 and GSTA2 genes (200-fold and 50-fold) as a result of DOX treatment. In HeLa cells, exposure to DOX increased expression of all genes: GSTM1 (7-fold) and GSTA1-3 (550-fold, 150-fold and 300-fold). Exposure to DOX led to a slight increase of GCS expression as well as GPX activity in KB-V1 cells, while in HeLa cells it did not. Expression of ABCC1 (MRP1) was not increased in any of the tested cell lines. Our results indicate that expression of GSTM1 and GSTA1-3 genes is up-regulated by DOX treatment and suggest that activity of these genes may be associated with drug resistance of the tested cells. At the same time, involvement of MRP1 in DOX resistance in the given experimental conditions is unlikely.

  2. Long non-coding RNA expression patterns in lung tissues of chronic cigarette smoke induced COPD mouse model.

    PubMed

    Zhang, Haiyun; Sun, Dejun; Li, Defu; Zheng, Zeguang; Xu, Jingyi; Liang, Xue; Zhang, Chenting; Wang, Sheng; Wang, Jian; Lu, Wenju

    2018-05-15

    Long non-coding RNAs (lncRNAs) have critical regulatory roles in protein-coding gene expression. Aberrant expression profiles of lncRNAs have been observed in various human diseases. In this study, we investigated transcriptome profiles in lung tissues of a chronic cigarette smoke (CS)-induced COPD mouse model. We found that 109 lncRNAs and 260 mRNAs were significantly differentially expressed in lungs of the chronic CS-induced COPD mouse model compared with control animals. GO and KEGG analyses indicated that protein-coding genes associated with differentially expressed lncRNAs were mainly involved in the protein processing in endoplasmic reticulum pathway and the taurine and hypotaurine metabolism pathway. The combination of high-throughput data analysis and the results of qRT-PCR validation in lungs of the chronic CS-induced COPD mouse model, 16HBE cells with CSE treatment and PBMC from patients with COPD revealed that NR_102714 and its associated protein-coding gene UCHL1 might be involved in the development of COPD in both mouse and human. In conclusion, our study demonstrated that aberrant expression profiles of lncRNAs and mRNAs existed in lungs of the chronic CS-induced COPD mouse model. From an animal-model perspective, these results might provide further clues to investigate biological functions of lncRNAs and their potential target protein-coding genes in the pathogenesis of COPD.

  3. Biomimetic Artificial Epigenetic Code for Targeted Acetylation of Histones.

    PubMed

    Taniguchi, Junichi; Feng, Yihong; Pandian, Ganesh N; Hashiya, Fumitaka; Hidaka, Takuya; Hashiya, Kaori; Park, Soyoung; Bando, Toshikazu; Ito, Shinji; Sugiyama, Hiroshi

    2018-06-13

    While the central role of locus-specific acetylation of histone proteins in eukaryotic gene expression is well established, the availability of designer tools to regulate acetylation at particular nucleosome sites remains limited. Here, we develop a unique strategy to introduce acetylation by constructing a bifunctional molecule designated Bi-PIP. Bi-PIP has a P300/CBP-selective bromodomain inhibitor (Bi) as a P300/CBP recruiter and a pyrrole-imidazole polyamide (PIP) as a sequence-selective DNA binder. Biochemical assays verified that Bi-PIPs recruit P300 to the nucleosomes having their target DNA sequences and extensively accelerate acetylation. Bi-PIPs also activated transcription of genes that have corresponding cognate DNA sequences inside living cells. Our results demonstrate that Bi-PIPs could act as a synthetic programmable histone code of acetylation, which emulates the bromodomain-mediated natural propagation system of histone acetylation to activate gene expression in a sequence-selective manner.

  4. Mutations in protein-binding hot-spots on the hub protein Smad3 differentially affect its protein interactions and Smad3-regulated gene expression.

    PubMed

    Schiro, Michelle M; Stauber, Sara E; Peterson, Tami L; Krueger, Chateen; Darnell, Steven J; Satyshur, Kenneth A; Drinkwater, Norman R; Newton, Michael A; Hoffmann, F Michael

    2011-01-01

    Hub proteins are connected through binding interactions to many other proteins. Smad3, a mediator of signal transduction induced by transforming growth factor beta (TGF-β), serves as a hub protein for over 50 protein-protein interactions. Different cellular responses mediated by Smad3 are the product of cell-type and context dependent Smad3-nucleated protein complexes acting in concert. Our hypothesis is that perturbation of this spectrum of protein complexes by mutation of single protein-binding hot-spots on Smad3 will have distinct consequences on Smad3-mediated responses. We mutated 28 amino acids on the surface of the Smad3 MH2 domain and identified 22 Smad3 variants with reduced binding to subsets of 17 Smad3-binding proteins including Smad4, SARA, Ski, Smurf2 and SIP1. Mutations defective in binding to Smad4, e.g., D408H, or defective in nucleocytoplasmic shuttling, e.g., W406A, were compromised in modulating the expression levels of a Smad3-dependent reporter gene or six endogenous Smad3-responsive genes: Mmp9, IL11, Tnfaip6, Fermt1, Olfm2 and Wnt11. However, the Smad3 mutants Y226A, Y297A, W326A, K341A, and E267A had distinct differences on TGF-β signaling. For example, K341A and Y226A both reduced the Smad3-mediated activation of the reporter gene by ∼50% but K341A only reduced the TGF-β inducibility of Olfm2 in contrast to Y226A which reduced the TGF-β inducibility of all six endogenous genes as severely as the W406A mutation. E267A had increased protein binding but reduced TGF-β inducibility because it caused higher basal levels of expression. Y297A had increased TGF-β inducibility because it caused lower Smad3-induced basal levels of gene expression. Mutations in protein binding hot-spots on Smad3 reduced the binding to different subsets of interacting proteins and caused a range of quantitative changes in the expression of genes induced by Smad3. This approach should be useful for unraveling which Smad3 protein complexes are critical for

  5. RNA editing of non-coding RNA and its role in gene regulation.

    PubMed

    Daniel, Chammiran; Lagergren, Jens; Öhman, Marie

    2015-10-01

    It has long been known that repetitive elements, particularly Alu sequences in human, are edited by the adenosine deaminases acting on RNA (ADAR) family. The functional interpretation of these events has been even more difficult than that of editing events in coding sequences, but today there is an emerging understanding of their downstream effects. A surprisingly large fraction of the human transcriptome contains inverted Alu repeats, often forming long double-stranded structures in RNA transcripts, typically occurring in introns and UTRs of protein-coding genes. Alu repeats are also common in other primates, and similar inverted repeats can frequently be found in non-primates, although the latter are less prone to duplex formation. In human, as many as 700,000 Alu elements have been identified as substrates for RNA editing, of which many are edited at several sites. In fact, recent advancements in transcriptome sequencing techniques and bioinformatics have revealed that the human editome comprises at least a hundred million adenosine to inosine (A-to-I) editing sites in Alu sequences. Although substantial additional efforts are required in order to map the editome, already present knowledge provides an excellent starting point for studying cis-regulation of editing. In this review, we will focus on editing of long stem loop structures in the human transcriptome and how it can affect gene expression. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.

  6. dFOXO Activates Large and Small Heat Shock Protein Genes in Response to Oxidative Stress to Maintain Proteostasis in Drosophila.

    PubMed

    Donovan, Marissa R; Marr, Michael T

    2016-09-02

    Maintaining protein homeostasis is critical for survival at the cellular and organismal level (Morimoto, R. I. (2011) Cold Spring Harb. Symp. Quant. Biol. 76, 91-99). Cells express a family of molecular chaperones, the heat shock proteins, during times of oxidative stress to protect against proteotoxicity. We have identified a second stress responsive transcription factor, dFOXO, that works alongside the heat shock transcription factor to activate transcription of both the small heat shock protein and the large heat shock protein genes. This expression likely protects cells from protein misfolding associated with oxidative stress. Here we identify the regions of the Hsp70 promoter essential for FOXO-dependent transcription using in vitro methods and find a physiological role for FOXO-dependent expression of heat shock proteins in vivo. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. RNF17 blocks promiscuous activity of PIWI proteins in mouse testes.

    PubMed

    Wasik, Kaja A; Tam, Oliver H; Knott, Simon R; Falciatori, Ilaria; Hammell, Molly; Vagin, Vasily V; Hannon, Gregory J

    2015-07-01

    PIWI proteins and their associated piRNAs protect germ cells from the activity of mobile genetic elements. Two classes of piRNAs—primary and secondary—are defined by their mechanisms of biogenesis. Primary piRNAs are processed directly from transcripts of piRNA cluster loci, whereas secondary piRNAs are generated in an adaptive amplification loop, termed the ping-pong cycle. In mammals, piRNA populations are dynamic, shifting as male germ cells develop. Embryonic piRNAs consist of both primary and secondary species and are mainly directed toward transposons. In meiotic cells, the piRNA population is transposon-poor and largely restricted to primary piRNAs derived from pachytene piRNA clusters. The transition from the embryonic to the adult piRNA pathway is not well understood. Here we show that RNF17 shapes adult meiotic piRNA content by suppressing the production of secondary piRNAs. In the absence of RNF17, ping-pong occurs inappropriately in meiotic cells. Ping-pong initiates piRNA responses against not only transposons but also protein-coding genes and long noncoding RNAs, including genes essential for germ cell development. Thus, the sterility of Rnf17 mutants may be a manifestation of a small RNA-based autoimmune reaction. © 2015 Wasik et al.; Published by Cold Spring Harbor Laboratory Press.

  8. Gene expression and activity of digestive enzymes of Daphnia pulex in response to food quality differences.

    PubMed

    Schwarzenberger, Anke; Fink, Patrick

    2018-04-01

    Food quality is an important factor influencing organisms' well-being. In freshwater ecosystems, food quality has been studied extensively for the keystone herbivore genus Daphnia, as they form the critical trophic link between primary producers and higher order consumers such as fish. For Daphnia, the edible fraction of phytoplankton in lakes (consisting mostly of unicellular algae and cyanobacteria) is extraordinarily diverse. To be able to digest different food particles, Daphnia possess a set of digestive enzymes that metabolize carbohydrates, lipids and proteins. Recent studies have found a connection between gene expression and activity of single digestive enzyme types of Daphnia, i.e. lipases and proteases, and transcriptome studies have shown that a variety of genes coding for gut enzymes are differentially expressed in response to different food algae. However, never before has a set of digestive enzymes been studied simultaneously both on the gene expression and the enzyme activity level in Daphnia. Here, we investigated several digestive enzymes of Daphnia pulex in a comparison between a high-quality (green algal) and a low-quality (cyanobacterial) diet. Diet significantly affected the expression of all investigated digestive enzyme genes and enzyme activity was altered between treatments. Furthermore, we found that gene expression and enzyme activity were significantly correlated in cellulase, triacylglycerol lipase and β-glucosidase when switched from high to low-quality food. We conclude that one of the factors causing the often observed low biomass and energy transfer efficiency from cyanobacteria to Daphnia is probably the switch to a cost-effective overall increase of gene expression and activity of digestive enzymes of this herbivore. Copyright © 2018 Elsevier Inc. All rights reserved.

  9. The active gene that encodes human High Mobility Group 1 protein (HMG1) contains introns and maps to chromosome 13

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ferrari, S.; Finelli, P.; Rocchi, M.

    The human genome contains a large number of sequences related to the cDNA for High Mobility Group 1 protein (HMG1), which so far has hampered the cloning and mapping of the active HMG1 gene. We show that the human HMG1 gene contains introns, while the HMG1-related sequences do not and most likely are retrotransposed pseudogenes. We identified eight YACs from the ICI and CEPH libraries that contain the human HMG1 gene. The HMG1 gene is similar in structure to the previously characterized murine homologue and maps to human chromosome 13q12, as determined by in situ hybridization. The mouse Hmg1 gene maps to the telomeric region of murine Chromosome 5, which is syntenic to the human 13q12 band. 18 refs., 3 figs.

  10. BRD4 assists elongation of both coding and enhancer RNAs guided by histone acetylation

    PubMed Central

    Kanno, Tomohiko; Kanno, Yuka; LeRoy, Gary; Campos, Eric; Sun, Hong-Wei; Brooks, Stephen R; Vahedi, Golnaz; Heightman, Tom D; Garcia, Benjamin A; Reinberg, Danny; Siebenlist, Ulrich; O’Shea, John J; Ozato, Keiko

    2016-01-01

    Small-molecule BET inhibitors interfere with the epigenetic interactions between acetylated histones and the bromodomains of the BET family proteins, including BRD4, and they potently inhibit growth of malignant cells by targeting cancer-promoting genes. BRD4 interacts with the pause-release factor P-TEFb, and has been proposed to release Pol II from promoter-proximal pausing. We show that BRD4 occupied widespread genomic regions in mouse cells, and directly stimulated elongation of both protein-coding transcripts and non-coding enhancer RNAs (eRNAs), dependent on the function of bromodomains. BRD4 interacted physically with elongating Pol II complexes, and assisted Pol II progression through hyper-acetylated nucleosomes by interacting with acetylated histones via bromodomains. On active enhancers, the BET inhibitor JQ1 antagonized BRD4-associated eRNA synthesis. Thus, BRD4 is involved in multiple steps of the transcription hierarchy, primarily by assisting transcript elongation both at enhancers and on gene bodies. PMID:25383670

  11. Enhancement of protein production via the strong DIT1 terminator and two RNA-binding proteins in Saccharomyces cerevisiae

    PubMed Central

    Ito, Yoichiro; Kitagawa, Takao; Yamanishi, Mamoru; Katahira, Satoshi; Izawa, Shingo; Irie, Kenji; Furutani-Seiki, Makoto; Matsuyama, Takashi

    2016-01-01

    Post-transcriptional upregulation is an effective way to increase the expression of transgenes and thus maximize the yields of target chemicals from metabolically engineered organisms. Refractory elements in the 3′ untranslated region (UTR) that increase mRNA half-life might be available. In Saccharomyces cerevisiae, several terminator regions have shown activity in increasing the production of proteins by upstream coding genes; among these terminators the DIT1 terminator has the highest activity. Here, we found in Saccharomyces cerevisiae that two resident trans-acting RNA-binding proteins (Nab6p and Pap1p) enhance the activity of the DIT1 terminator through the cis element GUUCG/U within the 3′-UTR. These two RNA-binding proteins could upregulate a battery of cell-wall–related genes. Mutagenesis of the DIT1 terminator improved its activity by a maximum of 500% of that of the standard PGK1 terminator. Further understanding and improvement of this system will facilitate inexpensive and stable production of complicated organism-derived drugs worldwide. PMID:27845367

  12. Chemical Approaches to Control Gene Expression

    PubMed Central

    Gottesfeld, Joel M.; Turner, James M.; Dervan, Peter B.

    2000-01-01

    A current goal in molecular medicine is the development of new strategies to interfere with gene expression in living cells in the hope that novel therapies for human disease will result from these efforts. This review focuses on small-molecule or chemical approaches to manipulate gene expression by modulating either transcription of messenger RNA-coding genes or protein translation. The molecules under study include natural products, designed ligands, and compounds identified through functional screens of combinatorial libraries. The cellular targets for these molecules include DNA, messenger RNA, and the protein components of the transcription, RNA processing, and translational machinery. Studies with model systems have shown promise in the inhibition of both cellular and viral gene transcription and mRNA utilization. Moreover, strategies for both repression and activation of gene transcription have been described. These studies offer promise for treatment of diseases of pathogenic (viral, bacterial, etc.) and cellular origin (cancer, genetic diseases, etc.). PMID:11097426

  13. SGDB: a database of synthetic genes re-designed for optimizing protein over-expression.

    PubMed

    Wu, Gang; Zheng, Yuanpu; Qureshi, Imran; Zin, Htar Thant; Beck, Tyler; Bulka, Blazej; Freeland, Stephen J

    2007-01-01

    Here we present the Synthetic Gene Database (SGDB): a relational database that houses sequences and associated experimental information on synthetic (artificially engineered) genes from all peer-reviewed studies published to date. At present, the database comprises information from more than 200 published experiments. This resource not only provides reference material to guide experimentalists in designing new genes that improve protein expression, but also offers a dataset for analysis by bioinformaticians who seek to test ideas regarding the underlying factors that influence gene expression. The SGDB was built on the MySQL database management system. We also offer an XML schema for standardized data description of synthetic genes. Users can access the database at http://www.evolvingcode.net/codon/sgdb/index.php, or batch download all information through XML files. Moreover, users may visually compare the coding sequences of a synthetic gene and its natural counterpart with an integrated web tool at http://www.evolvingcode.net/codon/sgdb/aligner.php, and discuss questions, findings and related information on an associated e-forum at http://www.evolvingcode.net/forum/viewforum.php?f=27.

  14. Nmf9 Encodes a Highly Conserved Protein Important to Neurological Function in Mice and Flies.

    PubMed

    Zhang, Shuxiao; Ross, Kevin D; Seidner, Glen A; Gorman, Michael R; Poon, Tiffany H; Wang, Xiaobo; Keithley, Elizabeth M; Lee, Patricia N; Martindale, Mark Q; Joiner, William J; Hamilton, Bruce A

    2015-07-01

    Many protein-coding genes identified by genome sequencing remain without functional annotation or biological context. Here we define a novel protein-coding gene, Nmf9, based on a forward genetic screen for neurological function. ENU-induced and genome-edited null mutations in mice produce deficits in vestibular function, fear learning and circadian behavior, which correlated with Nmf9 expression in inner ear, amygdala, and suprachiasmatic nuclei. Homologous genes from unicellular organisms and invertebrate animals predict interactions with small GTPases, but the corresponding domains are absent in mammalian Nmf9. Intriguingly, homozygotes for null mutations in the Drosophila homolog, CG45058, show profound locomotor defects and premature death, while heterozygotes show striking effects on sleep and activity phenotypes. These results link a novel gene orthology group to discrete neurological functions, and show conserved requirement across wide phylogenetic distance and domain level structural changes.

  15. Mutation analysis of aryl hydrocarbon receptor interacting protein (AIP) gene in colorectal, breast, and prostate cancers

    PubMed Central

    Georgitsi, M; Karhu, A; Winqvist, R; Visakorpi, T; Waltering, K; Vahteristo, P; Launonen, V; Aaltonen, L A

    2007-01-01

    Germline mutations in the aryl hydrocarbon receptor interacting protein (AIP) gene were recently identified in individuals with pituitary adenoma predisposition (PAP). These patients have prolactin (PRL) or growth hormone (GH) oversecreting pituitary adenomas, the latter exhibiting acromegaly or gigantism. Loss-of-heterozygosity (LOH) analysis revealed that AIP is lost in PAP tumours, suggesting that it acts as a tumour-suppressor gene. Aryl hydrocarbon receptor interacting protein is involved in several pathways, but it is best characterised as a cytoplasmic partner of the aryl hydrocarbon receptor (AHR). To examine the possible role of AIP in the genesis of common cancers, we performed somatic mutation screening in a series of 373 colorectal cancers (CRCs), 82 breast cancers, and 44 prostate tumour samples. A missense R16H (47G>A) change was identified in two CRC samples, as well as in the respective normal tissues, but was absent in 209 healthy controls. The remaining findings were silent, previously unreported, changes of the coding, non-coding, or untranslated regions of AIP. These results suggest that somatic AIP mutations are not common in CRC, breast, and prostate cancers. PMID:17242703

  16. Influence of the stringent control system on the transcription of ribosomal ribonucleic acid and ribosomal protein genes in Escherichia coli.

    PubMed Central

    Dennis, P P

    1977-01-01

    The fraction of the total ribonucleic acid (RNA) synthesis rate that is messenger RNA (mRNA) for ribosomal protein (r-protein) and ribosomal RNA (rRNA) has been estimated in valS(Ts) rel+ stringent and valS(Ts) relA1 relaxed strains of Escherichia coli during a partial inhibition of valyl-transfer RNA aminoacylation. The partial inhibition was accomplished by shifting the strains from the permissive growth temperature of 29.5 degrees C to the semipermissive temperature of 35.5 degrees C. The RNA synthesized at the elevated temperature was pulse labeled with [3H]uracil. The fraction of the total incorporated 3H radioactivity in r-protein mRNA or in rRNA was estimated by specific hybridization to the transducing phages gammaspc1, which carries about 15 r-protein genes, and lambdailv5, which carries an rRNA transcription unit. The results clearly demonstrate that the rel gene influences the fraction of the total RNA synthesis rate that is r-protein mRNA and rRNA; in the rel+ strain they are significantly increased relative to control cultures. This indicates that the expression of the genes coding for the RNA and protein components of the ribosome is most likely regulated at the level of transcription. Furthermore, it appears that the distribution of functioning RNA polymerase between rRNA genes, r-protein genes, and other types of genes is influenced by the rel gene control system; presumably this influence is mediated through the unusual nucleotide guanosine tetraphosphate. PMID:320185

  17. Chimeric Plant Calcium/Calmodulin-Dependent Protein Kinase Gene with a Neural Visinin-Like Calcium-Binding Domain

    NASA Technical Reports Server (NTRS)

    Patil, Shameekumar; Takezawa, D.; Poovaiah, B. W.

    1995-01-01

    Calcium, a universal second messenger, regulates diverse cellular processes in eukaryotes. Ca2+ and Ca2+/calmodulin-regulated protein phosphorylation play a pivotal role in amplifying and diversifying the action of Ca2+-mediated signals. A chimeric Ca2+/calmodulin-dependent protein kinase (CCaMK) gene with a visinin-like Ca2+-binding domain was cloned and characterized from lily. The cDNA clone contains an open reading frame coding for a protein of 520 amino acids. The predicted structure of CCaMK contains a catalytic domain followed by two regulatory domains, a calmodulin-binding domain and a visinin-like Ca2+-binding domain. The amino-terminal region of CCaMK contains all 11 conserved subdomains characteristic of serine/threonine protein kinases. The calmodulin-binding region of CCaMK has high homology (79%) to alpha subunit of mammalian Ca2+/calmodulin-dependent protein kinase. The calmodulin-binding region is fused to a neural visinin-like domain that contains three Ca2+-binding EF-hand motifs and a biotin-binding site. The Escherichia coli-expressed protein (approx. 56 kDa) binds calmodulin in a Ca2+-dependent manner. Furthermore, 45Ca-binding assays revealed that CCaMK directly binds Ca2+. The CCaMK gene is preferentially expressed in developing anthers. Southern blot analysis revealed that CCaMK is encoded by a single gene. The structural features of the gene suggest that it has multiple regulatory controls and could play a unique role in Ca2+ signaling in plants.

  18. Rice Ribosomal Protein Large Subunit Genes and Their Spatio-temporal and Stress Regulation

    PubMed Central

    Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Madhav, Sheshu M.; Kirti, P. B.

    2016-01-01

    Ribosomal proteins (RPs) are well-known for their role in mediating protein synthesis and maintaining the stability of the ribosomal complex, which includes small and large subunits. In the present investigation, in a genome-wide survey, we predicted that the large subunit of rice ribosomes is encoded by at least 123 genes including individual gene copies, distributed throughout the 12 chromosomes. We selected 34 candidate genes, each having 2–3 identical copies, for a detailed characterization of their gene structures, protein properties, cis-regulatory elements and comprehensive expression analysis. RPL proteins appear to be involved in interactions with other RP and non-RP proteins and their encoded RNAs have a higher content of alpha-helices in their predicted secondary structures. The majority of RPs have binding sites for metal and non-metal ligands. Native expression profiling of 34 ribosomal protein large (RPL) subunit genes in tissues covering the major stages of rice growth shows that they are predominantly expressed in vegetative tissues and seedlings followed by meiotically active tissues like flowers. The putative promoter regions of these genes also carry cis-elements that respond specifically to stress and signaling molecules. All the 34 genes responded differentially to the abiotic stress treatments. Phytohormone and cold treatments induced significant up-regulation of several RPL genes, while heat and H2O2 treatments down-regulated a majority of them. Furthermore, infection with a bacterial pathogen, Xanthomonas oryzae, which causes leaf blight also induced the expression of 80% of the RPL genes in leaves. Although the expression of RPL genes was detected in all the tissues studied, they are highly responsive to stress and signaling molecules indicating that their encoded proteins appear to have roles in stress amelioration besides house-keeping. This shows that the RPL gene family is a valuable resource for manipulation of stress tolerance in

  19. Xenopus laevis ribosomal protein genes: isolation of recombinant cDNA clones and study of the genomic organization.

    PubMed Central

    Bozzoni, I; Beccari, E; Luo, Z X; Amaldi, F

    1981-01-01

    Poly-A+ mRNA from Xenopus laevis oocytes, partially enriched for r-protein coding capacity, has been used as starting material for preparing a cDNA bank in plasmid pBR322. The clones containing sequences specific for r-proteins have been selected by translation of the complementary mRNAs. Clones for six different r-proteins have been identified and utilized as probes for studying their genomic organization. Two gene copies per haploid genome were found for r-proteins L1, L14 and S19, and four to five for proteins S1, S8 and L32. Moreover, a population polymorphism has been observed for the genomic regions containing sequences for r-proteins S1, S8 and L14. PMID:6112733

  20. Capturing novel mouse genes encoding chromosomal and other nuclear proteins.

    PubMed

    Tate, P; Lee, M; Tweedie, S; Skarnes, W C; Bickmore, W A

    1998-09-01

    The burgeoning wealth of gene sequences contrasts with our ignorance of gene function. One route to assigning function is by determining the sub-cellular location of proteins. We describe the identification of mouse genes encoding proteins that are confined to nuclear compartments by splicing endogenous gene sequences to a promoterless betageo reporter, using a gene trap approach. Mouse ES (embryonic stem) cell lines were identified that express betageo fusions located within sub-nuclear compartments, including chromosomes, the nucleolus and foci containing splicing factors. The sequences of 11 trapped genes were ascertained, and characterisation of endogenous protein distribution in two cases confirmed the validity of the approach. Three novel proteins concentrated within distinct chromosomal domains were identified, one of which appears to be a serine/threonine kinase. The sequence of a gene whose product co-localises with spliceosome components suggests that this protein may be an E3 ubiquitin-protein ligase. The majority of the other genes isolated represent novel genes. This approach is shown to be a powerful tool for identifying genes encoding novel proteins with specific sub-nuclear localisations and exposes our ignorance of the protein composition of the nucleus. Motifs in two of the isolated genes suggest new links between cellular regulatory mechanisms (ubiquitination and phosphorylation) and mRNA splicing and chromosome structure/function.

  1. An Introductory Bioinformatics Exercise to Reinforce Gene Structure and Expression and Analyze the Relationship between Gene and Protein Sequences

    ERIC Educational Resources Information Center

    Almeida, Craig A.; Tardiff, Daniel F.; De Luca, Jane P.

    2004-01-01

    We have developed an introductory bioinformatics exercise for sophomore biology and biochemistry students that reinforces the understanding of the structure of a gene and the principles and events involved in its expression. In addition, the activity illustrates the severe effect mutations in a gene sequence can have on the protein product.…

  2. [HMGA proteins and their genes as a potential neoplastic biomarkers].

    PubMed

    Balcerczak, Ewa; Balcerczak, Mariusz; Mirowski, Marek

    2005-01-01

    HMGA proteins and their genes are described in this article. HMGA proteins bind DNA in AT-rich regions, which are characteristic of gene promoter sequences. This interaction leads to gene silencing or overexpression. In normal tissue the level of HMGA proteins is low or even undetectable; during embryogenesis it increases. A high level of HMGA proteins is characteristic of the tumor phenotype of spontaneous and experimental malignant neoplasms. High HMGA protein expression correlates with poor prognostic factors and with metastasis formation. HMGA gene expression can be used as a marker of tumor progression. Current studies of tumor gene therapy based on inhibiting HMGA protein synthesis with viral vectors that carry the genes encoding these proteins in antisense orientation, as well as studies of new potential anticancer drugs that act as crosslinkers between DNA and HMGA proteins, suggest that these proteins are useful targets in cancer therapy.

  3. Aym1, a mouse meiotic gene identified by virtue of its ability to activate early meiotic genes in the yeast Saccharomyces cerevisiae.

    PubMed

    Malcov, Mira; Cesarkas, Karen; Stelzer, Gil; Shalom, Sarah; Dicken, Yosef; Naor, Yaniv; Goldstein, Ronald S; Sagee, Shira; Kassir, Yona; Don, Jeremy

    2004-12-01

    Our understanding of the molecular mechanisms that operate during differentiation of mitotically dividing spermatogonia cells into spermatocytes lags way behind what is known about other differentiating systems. Given the evolutionary conservation of the meiotic process, we screened for mouse proteins that could specifically activate early meiotic promoters in Saccharomyces cerevisiae yeast cells, when fused to the Gal4 activation domain (Gal4AD). Our screen yielded the Aym1 gene that encodes a short peptide of 45 amino acids. We show that a Gal4AD-AYM1 fusion protein activates expression of reporter genes through the promoters of the early meiosis-specific genes IME2 and HOP1, and that this activation is dependent on the DNA-binding protein Ume6. Aym1 is transcribed predominantly in mouse primary spermatocytes and in gonads of female embryos undergoing the corresponding meiotic divisions. Aym1 immunolocalized to nuclei of primary spermatocytes and oocytes and to specific type A spermatogonia cells, suggesting it might play a role in the processes leading to meiotic competence. The potential functional relationship between AYM1 and yeast proteins that regulate expression of early meiotic genes is discussed.

  4. Systematic analysis of human kinase genes: a large number of genes and alternative splicing events result in functional and structural diversity

    PubMed Central

    Milanesi, Luciano; Petrillo, Mauro; Sepe, Leandra; Boccia, Angelo; D'Agostino, Nunzio; Passamano, Myriam; Di Nardo, Salvatore; Tasco, Gianluca; Casadio, Rita; Paolella, Giovanni

    2005-01-01

    Background Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape, and apoptosis. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinases only after computational studies. Results A systematic in silico study performed on the human genome led to the identification of 5 genes, on chromosomes 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases by NCBI but are absent from other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicate that a large number of kinases may code for alternatively spliced forms or be incorrectly annotated. An InterProScan automated analysis was performed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices and the propensity of cysteines to participate in a disulfide bridge. Conclusion The predicted human kinome was extended by identifying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level. Structural analysis of kinase protein domains as defined in multiple sources, together with transmembrane alpha helix and signal peptide prediction, provides hints for function assignment.

  5. Severe Acute Respiratory Syndrome Coronavirus Envelope Protein Ion Channel Activity Promotes Virus Fitness and Pathogenesis

    PubMed Central

    Nieto-Torres, Jose L.; DeDiego, Marta L.; Verdiá-Báguena, Carmina; Jimenez-Guardeño, Jose M.; Regla-Nava, Jose A.; Fernandez-Delgado, Raul; Castaño-Rodriguez, Carlos; Alcaraz, Antonio; Torres, Jaume; Aguilella, Vicente M.; Enjuanes, Luis

    2014-01-01

    Deletion of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) envelope (E) gene attenuates the virus. E gene encodes a small multifunctional protein that possesses ion channel (IC) activity, an important function in virus-host interaction. To test the contribution of E protein IC activity in virus pathogenesis, two recombinant mouse-adapted SARS-CoVs, each containing one single amino acid mutation that suppressed ion conductivity, were engineered. After serial infections, mutant viruses, in general, incorporated compensatory mutations within E gene that rendered active ion channels. Furthermore, IC activity conferred better fitness in competition assays, suggesting that ion conductivity represents an advantage for the virus. Interestingly, mice infected with viruses displaying E protein IC activity, either with the wild-type E protein sequence or with the revertants that restored ion transport, rapidly lost weight and died. In contrast, mice infected with mutants lacking IC activity, which did not incorporate mutations within E gene during the experiment, recovered from disease and most survived. Knocking down E protein IC activity did not significantly affect virus growth in infected mice but decreased edema accumulation, the major determinant of acute respiratory distress syndrome (ARDS) leading to death. Reduced edema correlated with lung epithelia integrity and proper localization of Na+/K+ ATPase, which participates in edema resolution. Levels of inflammasome-activated IL-1β were reduced in the lung airways of the animals infected with viruses lacking E protein IC activity, indicating that E protein IC function is required for inflammasome activation. Reduction of IL-1β was accompanied by diminished amounts of TNF and IL-6 in the absence of E protein ion conductivity. All these key cytokines promote the progression of lung damage and ARDS pathology. In conclusion, E protein IC activity represents a new determinant for SARS-CoV virulence. PMID:24788150

  6. Protein Poly(ADP-ribosyl)ation Regulates Arabidopsis Immune Gene Expression and Defense Responses

    PubMed Central

    Feng, Baomin; Liu, Chenglong; de Oliveira, Marcos V. V.; Intorne, Aline C.; Li, Bo; Babilonia, Kevin; de Souza Filho, Gonçalo A.; Shan, Libo; He, Ping

    2015-01-01

    Perception of microbe-associated molecular patterns (MAMPs) elicits transcriptional reprogramming in hosts and activates defense to pathogen attacks. The molecular mechanisms underlying plant pattern-triggered immunity remain elusive. A genetic screen identified Arabidopsis poly(ADP-ribose) glycohydrolase 1 (atparg1) mutant with elevated immune gene expression upon multiple MAMP and pathogen treatments. Poly(ADP-ribose) glycohydrolase (PARG) is predicted to remove poly(ADP-ribose) polymers on acceptor proteins modified by poly(ADP-ribose) polymerases (PARPs) with three PARPs and two PARGs in Arabidopsis genome. AtPARP1 and AtPARP2 possess poly(ADP-ribose) polymerase activity, and the activity of AtPARP2 was enhanced by MAMP treatment. AtPARG1, but not AtPARG2, carries glycohydrolase activity in vivo and in vitro. Importantly, mutation (G450R) in atparg1 blocks its activity and the corresponding residue is highly conserved and essential for human HsPARG activity. Consistently, mutant atparp1atparp2 plants exhibited compromised immune gene activation and enhanced susceptibility to pathogen infections. Our study indicates that protein poly(ADP-ribosyl)ation plays critical roles in plant immune gene expression and defense to pathogen attacks. PMID:25569773

  7. The structural genes for three Drosophila glue proteins reside at a single polytene chromosome puff locus.

    PubMed Central

    Crowley, T E; Bond, M W; Meyerowitz, E M

    1983-01-01

    The polytene chromosome puff at 68C on the Drosophila melanogaster third chromosome is thought from genetic experiments to contain the structural gene for one of the secreted salivary gland glue polypeptides, sgs-3. Previous work has demonstrated that the DNA included in this puff contains sequences that are transcribed to give three different polyadenylated RNAs that are abundant in third-larval-instar salivary glands. These have been called the group II, group III, and group IV RNAs. In the experiments reported here, we used the nucleotide sequence of the DNA coding for these RNAs to predict some of the physical and chemical properties expected of their protein products, including molecular weight, amino acid composition, and amino acid sequence. Salivary gland polypeptides with molecular weights similar to those expected for the 68C RNA translation products, and with the expected degree of incorporation of different radioactive amino acids, were purified. These proteins were shown by amino acid sequencing to correspond to the protein products of the 68C RNAs. It was further shown that each of these proteins is a part of the secreted salivary gland glue: the group IV RNA codes for the previously described sgs-3, whereas the group II and III RNAs code for the newly identified glue polypeptides sgs-8 and sgs-7. PMID:6406838

  8. Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks.

    PubMed

    Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing

    2009-03-11

    Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene
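    The three network indices named in the abstract above (degree, closeness, and betweenness centrality) can be illustrated on a toy undirected graph. The following is a minimal sketch only: the five-node network and its node labels are hypothetical, not data from the study, and the pure-Python functions (breadth-first search for closeness, Brandes-style accumulation for betweenness) are an assumed implementation rather than the authors' method.

```python
from collections import deque

def degree_centrality(adj):
    """Number of directly interacting neighbours of each node."""
    return {v: len(nbrs) for v, nbrs in adj.items()}

def closeness_centrality(adj):
    """(reachable nodes) / (sum of shortest-path distances), per node."""
    out = {}
    for s in adj:
        dist = {s: 0}
        queue = deque([s])
        while queue:  # breadth-first search from s
            v = queue.popleft()
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total = sum(dist.values())
        out[s] = (len(dist) - 1) / total if total else 0.0
    return out

def betweenness_centrality(adj):
    """Unnormalised betweenness for an undirected, unweighted graph."""
    bc = dict.fromkeys(adj, 0.0)
    for s in adj:
        stack = []
        preds = {v: [] for v in adj}      # predecessors on shortest paths
        sigma = dict.fromkeys(adj, 0)     # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            stack.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = dict.fromkeys(adj, 0.0)
        while stack:                      # accumulate in reverse BFS order
            w = stack.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2 for v, b in bc.items()}

# Hypothetical toy network: B is a hub, D bridges to E.
toy = {
    "A": {"B"},
    "B": {"A", "C", "D"},
    "C": {"B"},
    "D": {"B", "E"},
    "E": {"D"},
}
degree = degree_centrality(toy)
closeness = closeness_centrality(toy)
betweenness = betweenness_centrality(toy)
```

    In this toy graph the hub B has the highest value on all three indices, which is the kind of pattern the abstract reports for proteins encoded by house-keeping genes.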

  9. Genome-Wide Discovery of Long Non-Coding RNAs in Rainbow Trout.

    PubMed

    Al-Tobasei, Rafet; Paneru, Bam; Salem, Mohamed

    2016-01-01

    The ENCODE project revealed that ~70% of the human genome is transcribed. While only 1-2% of the RNAs encode proteins, the rest are non-coding RNAs. Long non-coding RNAs (lncRNAs) form a diverse class of non-coding RNAs that are longer than 200 nt. Emerging evidence indicates that lncRNAs play critical roles in various cellular processes including regulation of gene expression. LncRNAs show low levels of gene expression and sequence conservation, which make their computational identification in genomes difficult. In this study, more than two billion Illumina sequence reads were mapped to the genome reference using the TopHat and Cufflinks software. Transcripts shorter than 200 nt, with an ORF longer than 83-100 amino acids, or with significant homologies to the NCBI nr-protein database were removed. In addition, a computational pipeline was used to filter the remaining transcripts based on a protein-coding-score test. Depending on the filtering stringency conditions, between 31,195 and 54,503 lncRNAs were identified, with only 421 matching known lncRNAs in other species. A digital gene expression atlas revealed 2,935 tissue-specific and 3,269 ubiquitously-expressed lncRNAs. This study annotates the lncRNAs of the rainbow trout genome and provides a valuable resource for functional genomics research in salmonids.
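    The first two filters described above (minimum transcript length and maximum ORF length) can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the pipeline used in the study: the function names are hypothetical, the 100-amino-acid cutoff is one end of the 83-100 range quoted in the abstract, only forward-strand ORFs that run from ATG to an in-frame stop codon are considered, and the homology and protein-coding-score steps are omitted.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def longest_orf_aa(seq: str) -> int:
    """Length (in amino acids) of the longest ATG..stop ORF, forward strand only."""
    seq = seq.upper()
    best = 0
    for frame in range(3):
        start = None  # position of the ATG that opened the current ORF
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                # Codons from ATG up to, but not including, the stop codon.
                best = max(best, (i - start) // 3)
                start = None
    return best

def is_candidate_lncrna(seq: str, min_len: int = 200, max_orf_aa: int = 100) -> bool:
    """Keep transcripts that are long enough and lack a long ORF (assumed cutoffs)."""
    return len(seq) >= min_len and longest_orf_aa(seq) <= max_orf_aa

# Hypothetical test transcripts.
coding_like = "ATG" + "GCT" * 120 + "TAA"   # 366 nt, 121-codon ORF
noncoding_like = "GCT" * 100                # 300 nt, no ATG in any frame
too_short = "GCT" * 10                      # 30 nt
```

    With these inputs, only `noncoding_like` passes both filters; a real pipeline would also scan the reverse strand and apply the remaining homology and coding-score filters.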

  10. AtFXG1, an Arabidopsis gene encoding alpha-L-fucosidase active against fucosylated xyloglucan oligosaccharides.

    PubMed

    de La Torre, Francisco; Sampedro, Javier; Zarra, Ignacio; Revilla, Gloria

    2002-01-01

    An alpha-L-fucosidase (EC 3.2.1.51) able to release the t-fucosyl residue from the side chain of xyloglucan oligosaccharides has been detected in the leaves of Arabidopsis plants. Moreover, an alpha-L-fucosidase with similar substrate specificity was purified from cabbage (Brassica oleracea) leaves to render a single band on SDS-PAGE. Two peptide sequences were obtained from this protein band, and they were used to identify an Arabidopsis gene coding for an alpha-fucosidase that we propose to call AtFXG1. In addition, an Arabidopsis gene with homology to known alpha-L-fucosidases has also been found, and we propose to name it AtFUC1. Both AtFXG1 and AtFUC1 were heterologously expressed in Pichia pastoris cells and the alpha-L-fucosidase activities were secreted to the culture medium. The alpha-L-fucosidase encoded by AtFXG1 was active against the oligosaccharides from xyloglucan XXFG as well as against 2'-fucosyl-lactitol but not against p-nitrophenyl-alpha-L-fucopyranoside. However, the heterologously expressed AtFUC1 was active only against 2'-fucosyl-lactitol. Thus, the former must be related to xyloglucan metabolism.

  11. Fatty acid-binding protein genes of the ancient, air-breathing, ray-finned fish, spotted gar (Lepisosteus oculatus).

    PubMed

    Venkatachalam, Ananda B; Fontenot, Quenton; Farrara, Allyse; Wright, Jonathan M

    2018-03-01

    With the advent of high-throughput DNA sequencing technology, the genomic sequence of many disparate species has led to the relatively new discipline of genomics, the study of genome structure, function and evolution. Much work has been focused on the role of whole genome duplications (WGD) in the architecture of extant vertebrate genomes, particularly those of teleost fishes which underwent a WGD early in the teleost radiation >230 million years ago (mya). Our past work has focused on the fate of duplicated copies of a multigene family coding for the intracellular lipid-binding protein (iLBP) genes in the teleost fishes. To define the evolutionary processes that determined the fate of duplicated genes and generated the structure of extant fish genomes, however, requires comparative genomic analysis with a fish lineage that diverged before the teleost WGD, such as the spotted gar (Lepisosteus oculatus), an ancient, air-breathing, ray-finned fish. Here, we describe the genomic organization, chromosomal location and tissue-specific expression of a subfamily of the iLBP genes that code for fatty acid-binding proteins (Fabps) in spotted gar. Based on this work, we have defined the minimum suite of fabp genes prior to their duplication in the teleost lineages ~230-400 mya. Spotted gar, therefore, serves as an appropriate outgroup, or ancestral/ancient fish, that did not undergo the teleost-specific WGD. As such, analyses of the spatio-temporal regulation of spotted gar genes provide a foundation to determine whether the duplicated fabp genes have been retained in teleost genomes owing to either sub- or neofunctionalization.

  12. Activation of transcriptional activity of HSE by a novel mouse zinc finger protein ZNFD specifically expressed in testis.

    PubMed

    Xu, Fengqin; Wang, Weiping; Lei, Chen; Liu, Qingmei; Qiu, Hao; Muraleedharan, Vinaydhar; Zhou, Bin; Cheng, Hongxia; Huang, Zhongkai; Xu, Weian; Li, Bichun; Wang, Minghua

    2012-04-01

    Zinc finger proteins (ZFPs) that contain multiple cysteine and/or histidine residues perform important roles in various cellular functions, including transcriptional regulation, cell proliferation, differentiation, and apoptosis. The Cys-Cys-His-His (C2H2) type of ZFPs are the well-defined members of this superfamily and are the largest and most complex proteins in eukaryotic genomes. In this study, we identified a novel C2H2 type of zinc finger gene, ZNFD, from mice, which has a 1,002 bp open reading frame and encodes a protein with 333 amino acid residues. The predicted 37.4 kDa protein contains a C2H2 zinc finger domain. The ZNFD gene is located on chromosome 18qD1. RT-PCR analysis revealed that the ZNFD gene was specifically expressed in mouse testis but not in other tissues. Subcellular localization analysis demonstrated that ZNFD was localized in the nucleus. Reporter gene assays showed that overexpression of ZNFD in COS7 cells activates the transcriptional activity of the heat shock element (HSE). Overall, these results suggest that ZNFD is a member of the zinc finger transcription factor family and that it participates in the transcriptional regulation of HSE. Many heat shock proteins regulated by HSE are involved in testicular development. Therefore, our results suggest that ZNFD may participate in the development of mouse testis and function as a transcription activator in HSE-mediated gene expression and signaling pathways.
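    The relationship between the ORF length and the protein length quoted above is simple codon arithmetic: 1,002 bp is 334 codons, and one of them is the stop codon, leaving 333 residues. The short sketch below works this through. The function names are hypothetical, and the average residue mass of about 110 Da is a common rule of thumb assumed here, not a value from the abstract; it gives only a rough estimate, which is why the result differs somewhat from the reported 37.4 kDa.

```python
def orf_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acid residues encoded by an ORF of the given length."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons

def approx_mass_kda(residues: int, avg_residue_da: float = 110.0) -> float:
    """Rough molecular mass estimate; 110 Da per residue is an assumed average."""
    return residues * avg_residue_da / 1000.0

znfd_residues = orf_protein_length(1002)      # 334 codons minus the stop codon
znfd_mass = approx_mass_kda(znfd_residues)    # rule-of-thumb estimate in kDa
```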

  13. Menin-MLL inhibitors reverse oncogenic activity of MLL fusion proteins in leukemia.

    PubMed

    Grembecka, Jolanta; He, Shihan; Shi, Aibin; Purohit, Trupta; Muntean, Andrew G; Sorenson, Roderick J; Showalter, Hollis D; Murai, Marcelo J; Belcher, Amalia M; Hartley, Thomas; Hess, Jay L; Cierpicki, Tomasz

    2012-01-29

    Translocations involving the mixed lineage leukemia (MLL) gene result in human acute leukemias with very poor prognosis. The leukemogenic activity of MLL fusion proteins is critically dependent on their direct interaction with menin, a product of the multiple endocrine neoplasia (MEN1) gene. Here we present what are to our knowledge the first small-molecule inhibitors of the menin-MLL fusion protein interaction that specifically bind menin with nanomolar affinities. These compounds effectively reverse MLL fusion protein-mediated leukemic transformation by downregulating the expression of target genes required for MLL fusion protein oncogenic activity. They also selectively block proliferation and induce both apoptosis and differentiation of leukemia cells harboring MLL translocations. Identification of these compounds provides a new tool for better understanding MLL-mediated leukemogenesis and represents a new approach for studying the role of menin as an oncogenic cofactor of MLL fusion proteins. Our findings also highlight a new therapeutic strategy for aggressive leukemias with MLL rearrangements.

  14. Genomic localization of the human gene encoding Dr1, a negative modulator of transcription of class II and class III genes.

    PubMed

    Purrello, M; Di Pietro, C; Rapisarda, A; Viola, A; Corsaro, C; Motta, S; Grzeschik, K H; Sichel, G

    1996-01-01

    Dr1 is a nuclear protein of 19 kDa that exists in the nucleoplasm as a homotetramer. By binding to TBP (the DNA-binding subunit of TFIID, and also a subunit of SL1 and TFIIIB), the protein blocks class II and class III preinitiation complex assembly, thus repressing the activity of the corresponding promoters. Since transcription of class I genes is unaffected by Dr1, it has been proposed that the protein may coordinate the expression of class I, class II and class III genes. By somatic cell genetics and fluorescence in situ hybridization, we have localized the gene (DR1), present in the genome of higher eukaryotes as a single copy, to human chromosome region 1p21→p13. The nucleotide sequence conservation of the coding segment of the gene, as determined by Noah's ark blot analysis, and its ubiquitous transcription suggest that Dr1 has an important biological role, which could be related to the negative control of cell proliferation.

  15. Deficient Gene Expression in Protein Kinase Inhibitor α Null Mutant Mice

    PubMed Central

    Gangolli, Esha A.; Belyamani, Mouna; Muchinsky, Sara; Narula, Anita; Burton, Kimberly A.; McKnight, G. Stanley; Uhler, Michael D.; Idzerda, Rejean L.

    2000-01-01

    Protein kinase inhibitor (PKI) is a potent endogenous inhibitor of the cyclic AMP (cAMP)-dependent protein kinase (PKA). It functions by binding the free catalytic (C) subunit with a high affinity and is also known to export nuclear C subunit to the cytoplasm. The significance of these actions with respect to PKI's physiological role is not well understood. To address this, we have generated by homologous recombination mutant mice that are deficient in PKIα, one of the three isoforms of PKI. The mice completely lack PKI activity in skeletal muscle and, surprisingly, show decreased basal and isoproterenol-induced gene expression in muscle. Further examination revealed reduced levels of the phosphorylated (active) form of the transcription factor CREB (cAMP response element binding protein) in the knockouts. This phenomenon stems, at least in part, from lower basal PKA activity levels in the mutants, arising from a compensatory increase in the level of the RIα subunit of PKA. The deficit in gene induction, however, is not easily explained by current models of PKI function and suggests that PKI may play an as yet undescribed role in PKA signaling. PMID:10779334

  16. Amplified RLR signaling activation through an interferon-stimulated gene-endoplasmic reticulum stress-mitochondrial calcium uniporter protein loop

    PubMed Central

    Cheng, Jinbo; Liao, Yajin; Zhou, Lujun; Peng, Shengyi; Chen, Hong; Yuan, Zengqiang

    2016-01-01

    Type I interferon (IFN-I) is critical for a host against viral and bacterial infections via induction of hundreds of interferon-stimulated genes (ISGs), but the mechanism underlying the regulation of IFN-I remains largely unknown. In this study, we first demonstrate that ISG expression is required for optimal IFN-β levels, an effect that is further enhanced by endoplasmic reticulum (ER) stress. Furthermore, we identify mitochondrial calcium uniporter protein (MCU) as a mitochondrial antiviral signaling protein (MAVS)-interacting protein that is important for ER stress induction and amplified MAVS signaling activation. In addition, by performing an ectopic expression assay to screen a library of 117 human ISGs for effects on IFN-β levels, we found that tumor necrosis factor receptor 1 (TNFR1) significantly increases IFN-β levels independent of ER stress. Altogether, our findings suggest that MCU and TNFR1 are involved in the regulation of RIG-I-like receptors (RLR) signaling. PMID:26892273

  17. LincRNA-p21 activates p21 in cis to promote Polycomb target gene expression and to enforce the G1/S checkpoint

    PubMed Central

    Dimitrova, Nadya; Zamudio, Jesse R.; Jong, Robyn M.; Soukup, Dylan; Resnick, Rebecca; Sarma, Kavitha; Ward, Amanda J.; Raj, Arjun; Lee, Jeannie; Sharp, Phillip A.; Jacks, Tyler

    2014-01-01

    SUMMARY The p53-regulated long non-coding RNA lincRNA-p21 has been proposed to act in trans via several mechanisms ranging from repressing genes in the p53 transcriptional network to regulating mRNA translation and protein stability. To further examine lincRNA-p21 function we generated a conditional knockout mouse model. We find that lincRNA-p21 predominantly functions in cis to activate expression of its neighboring gene, p21. Mechanistically, we show that lincRNA-p21 acts in concert with hnRNP-K as a co-activator for p53-dependent p21 transcription. Additional phenotypes of lincRNA-p21 deficiency could be attributed to diminished p21 levels, including deregulated expression and altered chromatin state of some Polycomb target genes, defective G1/S checkpoint, increased proliferation rates, and enhanced reprogramming efficiency. These findings indicate that lincRNA-p21 affects global gene expression and influences the p53 tumor suppressor pathway by acting in cis as a locus-restricted co-activator for p53-mediated p21 expression. PMID:24857549

  18. Bacteriophage M13 gene 2 protein. Increasing its yield in infected cells, and identification and localization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lin, Norm S. -C.; Pratt, David

    M13 gene 2 protein, implicated in the introduction of single-strand nicks into double-stranded closed circular (RFI) DNA molecules, was previously found in only very small quantities in infected cells. We now find that the gene 2 protein can be readily identified and its yield can be increased manyfold if infections are carried out at high temperature with either a gene 2 temperature-sensitive mutant or with wild type M13. Mechanisms are suggested by which the increased yield could result from subnormal function of the protein in these infections. Under conditions of high yield, the gene 2 protein is found largely in a rapidly sedimenting particulate fraction of unknown nature, where it constitutes as much as 36 percent of the leucine-labeled protein. The gene 2 protein can be readily solubilized from this particulate fraction with the ionic detergent sodium dodecyl sulfate (SDS) but no satisfactory solubilization method was found which keeps the protein in its native state. Attempts to demonstrate in vitro activity of the gene 2 protein, that is, nicking of M13 RFI DNA, were not successful. On the basis of SDS-polyacrylamide gel electrophoresis, we estimate that the gene 2 polypeptide has a molecular weight of approximately 40,000. In the course of the experiments on gene 2 protein, it was observed that the gene 3, as well as the gene 8, virion protein molecules were found predominantly in the cell inner membrane, supporting the idea that virion assembly is carried out there. The gene 4, nonvirion, protein also proved to be in the inner membrane, as would be expected if this protein plays a role in virion assembly.

  19. Cloning and characterization of the gene encoding the endopolygalacturonase-inhibiting protein (PGIP) of Phaseolus vulgaris L.

    PubMed

    Toubart, P; Desiderio, A; Salvi, G; Cervone, F; Daroda, L; De Lorenzo, G

    1992-05-01

    Polygalacturonase-inhibiting protein (PGIP) is a cell wall protein purified from hypocotyls of true bean (Phaseolus vulgaris L.). PGIP inhibits fungal endopolygalacturonases and is considered to be an important factor for plant resistance to phytopathogenic fungi (Albersheim and Anderson, 1971; Cervone et al., 1987). The amino acid sequences of the N-terminus and one internal tryptic peptide of the PGIP purified from P. vulgaris cv. Pinto were used to design redundant oligonucleotides that were successfully utilized as primers in a polymerase chain reaction (PCR) with total DNA of P. vulgaris as a template. A DNA band of 758 bp (a specific PCR amplification product of part of the gene coding for PGIP) was isolated and cloned. By using the 758-bp DNA as a hybridization probe, a lambda clone containing the PGIP gene was isolated from a genomic library of P. vulgaris cv. Saxa. The coding and immediate flanking regions of the PGIP gene, contained on a subcloned 3.3 kb SalI-SalI DNA fragment, were sequenced. A single, continuous ORF of 1026 nt (342 amino acids) was present in the genomic clone. The nucleotide and deduced amino acid sequences of the PGIP gene showed no significant similarity with any known databank sequence. Northern blot analyses of poly(A)+ RNAs, isolated from various tissues of bean seedlings or from suspension-cultured bean cells, were also performed using the cloned PCR-generated DNA as a probe. A 1.2 kb transcript was detected in suspension-cultured cells and, to a lesser extent, in leaves, hypocotyls, and flowers. (ABSTRACT TRUNCATED AT 250 WORDS)

  20. Mutant phenotypes for thousands of bacterial genes of unknown function

    DOE PAGES

    Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...

    2018-05-16

    One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because they are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.

  1. Mutant phenotypes for thousands of bacterial genes of unknown function

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan

    One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because they are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.

  2. Cloning and expression pattern of a gene encoding an alpha-xylosidase active against xyloglucan oligosaccharides from Arabidopsis.

    PubMed

    Sampedro, J; Sieiro, C; Revilla, G; González-Villa, T; Zarra, I

    2001-06-01

    An alpha-xylosidase active against xyloglucan oligosaccharides was purified from cabbage (Brassica oleracea var. capitata) leaves. Two peptide sequences were obtained from this protein, the N-terminal and an internal one, and these were used to identify an Arabidopsis gene coding for an alpha-xylosidase that we propose to call AtXYL1. It has been mapped to a region of chromosome I between markers at 100.44 and 107.48 cM. AtXYL1 comprised three exons and encoded a peptide that was 915 amino acids long, with a potential signal peptide of 22 amino acids and eight possible N-glycosylation sites. The protein encoded by AtXYL1 showed the signature regions of family 31 glycosyl hydrolases, which comprises not only alpha-xylosidases, but also alpha-glucosidases. The alpha-xylosidase activity is present in apoplastic extractions from Arabidopsis seedlings, as suggested by the deduced signal peptide. The first eight leaves from Arabidopsis plants were harvested to analyze alpha-xylosidase activity and AtXYL1 expression levels. Both increased from older to younger leaves, where xyloglucan turnover is expected to be higher. When this gene was introduced in a suitable expression vector and used to transform Saccharomyces cerevisiae, significantly higher alpha-xylosidase activity was detected in the yeast cells. alpha-Glucosidase activity was also increased in the transformed cells, although to a lesser extent. These results show that AtXYL1 encodes for an apoplastic alpha-xylosidase active against xyloglucan oligosaccharides that probably also has activity against p-nitrophenyl-alpha-D-glucoside.

  3. Purification and identification of a nuclease activity in embryo axes from French bean.

    PubMed

    Lambert, Rocío; Quiles, Francisco Antonio; Cabello-Díaz, Juan Miguel; Piedras, Pedro

    2014-07-01

    Plant nucleases are involved in nucleic acid degradation associated to programmed cell death processes as well as in DNA restriction, repair and recombination processes. However, the knowledge about the function of plant nucleases is limited. A major nuclease activity was detected by in-gel assay with whole embryonic axes of common bean by using ssDNA or RNA as substrate, whereas this activity was minimal in cotyledons. The enzyme has been purified to electrophoretic homogeneity from embryonic axes. The main biochemical properties of the purified enzyme indicate that it belongs to the S1/P1 family of nucleases. This was corroborated when this protein, after SDS-electrophoresis, was excised from the gel and further analysis by MALDI TOF/TOF allowed identification of the gene (PVN1) that codes this protein. The gene that codes the purified protein was identified. The expression of PVN1 gene was induced at the specific moment of radicle protrusion. The inclusion of inorganic phosphate to the imbibition media reduced the level of expression of this gene and the nuclease activity suggesting a relationship with the phosphorous status in French bean seedlings. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  4. Intramolecular interactions in aminoacyl nucleotides: Implications regarding the origin of genetic coding and protein synthesis

    NASA Technical Reports Server (NTRS)

    Lacey, J. C., Jr.; Mullins, D. W., Jr.; Watkins, C. L.; Hall, L. M.

    1986-01-01

    Cellular organisms store information as sequences of nucleotides in double stranded DNA. This information is useless unless it can be converted into the active molecular species, protein. This is done in contemporary creatures first by transcription of one strand to give a complementary strand of mRNA. The sequence of nucleotides is then translated into a specific sequence of amino acids in a protein. Translation is made possible by a genetic coding system in which a sequence of three nucleotides codes for a specific amino acid. The origin and evolution of any chemical system can be understood through elucidation of the properties of the chemical entities which make up the system. There is an underlying logic to the coding system revealed by a correlation of the hydrophobicities of amino acids and their anticodonic nucleotides (i.e., the complement of the codon). Its importance lies in the fact that every amino acid going into protein synthesis must first be activated. This is universally accomplished with ATP. Past studies have concentrated on the chemistry of the adenylates, but more recently we have found, through the use of NMR, that we can observe intramolecular interactions even at low concentrations, between amino acid side chains and nucleotide base rings in these adenylates. The use of this type of compound thus affords a novel way of elucidating the manner in which amino acids and nucleotides interact with each other. In aqueous solution, when a hydrophobic amino acid is attached to the most hydrophobic nucleotide, AMP, a hydrophobic interaction takes place between the amino acid side chain and the adenine ring. The studies to be reported concern these hydrophobic interactions.
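
    The transcription and translation steps summarized in this abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not code from the report: the function names (`transcribe`, `translate`, `anticodon`) and the example template strand are hypothetical, and the codon table is the standard genetic code written in the usual U, C, A, G ordering. The template strand is assumed to be written 3'→5', so the mRNA is its base-by-base complement.

```python
from itertools import product

# Standard genetic code, codons enumerated in U, C, A, G order ('*' = stop).
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

DNA_TO_MRNA = str.maketrans("ACGT", "UGCA")    # DNA template base -> mRNA base
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")  # RNA base -> pairing RNA base


def transcribe(template_3_to_5: str) -> str:
    """Return the mRNA (5'->3') complementary to a template strand given 3'->5'."""
    return template_3_to_5.upper().translate(DNA_TO_MRNA)


def translate(mrna: str) -> str:
    """Read the mRNA three bases at a time until a stop codon is reached."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        aa = CODON_TABLE[mrna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def anticodon(codon: str) -> str:
    """Return the anticodon (5'->3'), i.e. the reverse complement of the codon."""
    return codon.translate(RNA_COMPLEMENT)[::-1]


mrna = transcribe("TACAAACCGATT")
print(mrna, translate(mrna), anticodon("AUG"))
```

    The `anticodon` helper is included because the abstract correlates amino acid hydrophobicity with the anticodonic nucleotides rather than with the codon itself.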

  5. RNAi mediates post-transcriptional repression of gene expression in fission yeast Schizosaccharomyces pombe

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smialowska, Agata; Djupedal, Ingela

    Highlights: • Protein coding genes accumulate anti-sense sRNAs in fission yeast S. pombe. • RNAi represses protein-coding genes in S. pombe. • RNAi-mediated gene repression is post-transcriptional. - Abstract: RNA interference (RNAi) is a gene silencing mechanism conserved from fungi to mammals. Small interfering RNAs are products and mediators of the RNAi pathway and act as specificity factors in recruiting effector complexes. The Schizosaccharomyces pombe genome encodes one of each of the core RNAi proteins, Dicer, Argonaute and RNA-dependent RNA polymerase (dcr1, ago1, rdp1). Even though the function of RNAi in heterochromatin assembly in S. pombe is established, its role in controlling gene expression is elusive. Here, we report the identification of small RNAs mapped anti-sense to protein coding genes in fission yeast. We demonstrate that these genes are up-regulated at the protein level in RNAi mutants, while their mRNA levels are not significantly changed. We show that the repression by RNAi is not a result of heterochromatin formation. Thus, we conclude that RNAi is involved in post-transcriptional gene silencing in S. pombe.

  6. Genome-wide identification and transcriptional expression analysis of mitogen-activated protein kinase and mitogen-activated protein kinase kinase genes in Capsicum annuum

    PubMed Central

    Liu, Zhiqin; Shi, Lanping; Liu, Yanyan; Tang, Qian; Shen, Lei; Yang, Sheng; Cai, Jinsen; Yu, Huanxin; Wang, Rongzhang; Wen, Jiayu; Lin, Youquan; Hu, Jiong; Liu, Cailing; Zhang, Yangwen; Mou, Shaoliang; He, Shuilin

    2015-01-01

    The tripartite mitogen-activated protein kinase (MAPK) signaling cascades have been implicated in plant growth, development, and environment adaptation, but a comprehensive understanding of MAPK signaling at genome-wide level is limited in Capsicum annuum. Herein, genome-wide identification and transcriptional expression analysis of MAPK and MAPK kinase (MAPKK) were performed in pepper. A total of 19 pepper MAPK (CaMAPKs) genes and five MAPKK (CaMAPKKs) genes were identified. Phylogenetic analysis indicated that CaMAPKs and CaMAPKKs could be classified into four groups and each group contains similar exon-intron structures. However, significant divergences were also found. Notably, five members of the pepper MAPKK family were much less conserved than those found in Arabidopsis, and 9 Arabidopsis MAPKs did not have orthologs in pepper. Additionally, 7 MAPKs in Arabidopsis had either two or three orthologs in the pepper genome, and six pepper MAPKs and one MAPKK differing in sequence were found in three pepper varieties. Quantitative real-time RT-PCR analysis showed that the majority of MAPK and MAPKK genes were ubiquitously expressed and transcriptionally modified in pepper leaves after treatments with heat, salt, and Ralstonia solanacearum inoculation as well as exogenously applied salicylic acid, methyl jasmonate, ethephon, and abscisic acid. The MAPKK-MAPK interactome was tested by yeast two-hybrid assay, the results showed that one MAPKK might interact with multiple MAPKs, one MAPK might also interact with more than one MAPKKs, constituting MAPK signaling networks which may collaborate in transmitting upstream signals into appropriate downstream cellular responses and processes. These results will facilitate future functional characterization of MAPK cascades in pepper. PMID:26442088

  7. Zinc-finger protein-targeted gene regulation: Genomewide single-gene specificity

    PubMed Central

    Tan, Siyuan; Guschin, Dmitry; Davalos, Albert; Lee, Ya-Li; Snowden, Andrew W.; Jouvenot, Yann; Zhang, H. Steven; Howes, Katherine; McNamara, Andrew R.; Lai, Albert; Ullman, Chris; Reynolds, Lindsey; Moore, Michael; Isalan, Mark; Berg, Lutz-Peter; Campos, Bradley; Qi, Hong; Spratt, S. Kaye; Case, Casey C.; Pabo, Carl O.; Campisi, Judith; Gregory, Philip D.

    2003-01-01

    Zinc-finger protein transcription factors (ZFP TFs) can be designed to control the expression of any desired target gene, and thus provide potential therapeutic tools for the study and treatment of disease. Here we report that a ZFP TF can repress target gene expression with single-gene specificity within the human genome. A ZFP TF repressor that binds an 18-bp recognition sequence within the promoter of the endogenous CHK2 gene gives a >10-fold reduction in CHK2 mRNA and protein. This level of repression was sufficient to generate a functional phenotype, as demonstrated by the loss of DNA damage-induced CHK2-dependent p53 phosphorylation. We determined the specificity of repression by using DNA microarrays and found that the ZFP TF repressed a single gene (CHK2) within the monitored genome in two different cell types. These data demonstrate the utility of ZFP TFs as precise tools for target validation, and highlight their potential as clinical therapeutics. PMID:14514889

  8. Gene essentiality and the topology of protein interaction networks

    PubMed Central

    Coulomb, Stéphane; Bauer, Michel; Bernard, Denis; Marsolier-Kergoat, Marie-Claude

    2005-01-01

    The mechanistic bases for gene essentiality and for cell mutational resistance have long been disputed. The recent availability of large protein interaction databases has fuelled the analysis of protein interaction networks and several authors have proposed that gene dispensability could be strongly related to some topological parameters of these networks. However, many results were based on protein interaction data whose biases were not taken into account. In this article, we show that the essentiality of a gene in yeast is poorly related to the number of interactants (or degree) of the corresponding protein and that the physiological consequences of gene deletions are unrelated to several other properties of proteins in the interaction networks, such as the average degrees of their nearest neighbours, their clustering coefficients or their relative distances. We also found that yeast protein interaction networks lack degree correlation, i.e. a propensity for their vertices to associate according to their degrees. Gene essentiality and more generally cell resistance against mutations thus seem largely unrelated to many parameters of protein network topology. PMID:16087428

  9. Coding and non-coding gene regulatory networks underlie the immune response in liver cirrhosis

    PubMed Central

    Zhang, Xueming; Huang, Yongming; Yang, Zhengpeng; Zhang, Yuguo; Zhang, Weihui; Gao, Zu-hua; Xue, Dongbo

    2017-01-01

    Liver cirrhosis is recognized as being the consequence of immune-mediated hepatocyte damage and repair processes. However, the regulation of these immune responses underlying liver cirrhosis has not been elucidated. In this study, we used GEO datasets and bioinformatics methods to establish coding and non-coding gene regulatory networks including transcription factor-/lncRNA-microRNA-mRNA, and competing endogenous RNA interaction networks. Our results identified 2224 mRNAs, 70 lncRNAs and 46 microRNAs that were differentially expressed in liver cirrhosis. The transcription factor-/lncRNA-microRNA-mRNA network we uncovered that results in immune-mediated liver cirrhosis is comprised of 5 core microRNAs (e.g., miR-203; miR-219-5p), 3 transcription factors (i.e., FOXP3, ETS1 and FOS) and 7 lncRNAs (e.g., ENTS00000671336, ENST00000575137). The competing endogenous RNA interaction network we identified includes a complex immune response regulatory subnetwork that controls the entire liver cirrhosis network. Additionally, we found 10 overlapping GO terms shared by both liver cirrhosis and hepatocellular carcinoma including “immune response” as well. Interestingly, the overlapping differentially expressed genes in liver cirrhosis and hepatocellular carcinoma were enriched in immune response-related functional terms. In summary, a complex gene regulatory network underlying immune response processes may play an important role in the development and progression of liver cirrhosis, and its development into hepatocellular carcinoma. PMID:28355233

  10. Coding and non-coding gene regulatory networks underlie the immune response in liver cirrhosis.

    PubMed

    Gao, Bo; Zhang, Xueming; Huang, Yongming; Yang, Zhengpeng; Zhang, Yuguo; Zhang, Weihui; Gao, Zu-Hua; Xue, Dongbo

    2017-01-01

    Liver cirrhosis is recognized as being the consequence of immune-mediated hepatocyte damage and repair processes. However, the regulation of these immune responses underlying liver cirrhosis has not been elucidated. In this study, we used GEO datasets and bioinformatics methods to establish coding and non-coding gene regulatory networks including transcription factor-/lncRNA-microRNA-mRNA, and competing endogenous RNA interaction networks. Our results identified 2224 mRNAs, 70 lncRNAs and 46 microRNAs that were differentially expressed in liver cirrhosis. The transcription factor-/lncRNA-microRNA-mRNA network we uncovered that results in immune-mediated liver cirrhosis is comprised of 5 core microRNAs (e.g., miR-203; miR-219-5p), 3 transcription factors (i.e., FOXP3, ETS1 and FOS) and 7 lncRNAs (e.g., ENTS00000671336, ENST00000575137). The competing endogenous RNA interaction network we identified includes a complex immune response regulatory subnetwork that controls the entire liver cirrhosis network. Additionally, we found 10 overlapping GO terms shared by both liver cirrhosis and hepatocellular carcinoma including "immune response" as well. Interestingly, the overlapping differentially expressed genes in liver cirrhosis and hepatocellular carcinoma were enriched in immune response-related functional terms. In summary, a complex gene regulatory network underlying immune response processes may play an important role in the development and progression of liver cirrhosis, and its development into hepatocellular carcinoma.

  11. The HOX genes are expressed, in vivo, in human tooth germs: in vitro cAMP exposure of dental pulp cells results in parallel HOX network activation and neuronal differentiation.

    PubMed

    D'Antò, Vincenzo; Cantile, Monica; D'Armiento, Maria; Schiavo, Giulia; Spagnuolo, Gianrico; Terracciano, Luigi; Vecchione, Raffaela; Cillo, Clemente

    2006-03-01

    Homeobox-containing genes play a crucial role in odontogenesis. After the detection of Dlx and Msx genes in overlapping domains along maxillary and mandibular processes, a homeobox odontogenic code has been proposed to explain the interaction between different homeobox genes during dental lamina patterning. No role has so far been assigned to the Hox gene network in the homeobox odontogenic code due to studies on specific Hox genes and evolutionary considerations. Despite its involvement in early patterning during embryonal development, the HOX gene network, the most repeat-poor regions of the human genome, controls the phenotype identity of adult eukaryotic cells. Here, according to our results, the HOX gene network appears to be active in human tooth germs between 18 and 24 weeks of development. The immunohistochemical localization of specific HOX proteins mostly concerns the epithelial tooth germ compartment. Furthermore, only a few genes of the network are active in embryonal retromolar tissues, as well as in ectomesenchymal dental pulp cells (DPC) grown in vitro from adult human molar. Exposure of DPCs to cAMP induces the expression of from three to nine total HOX genes of the network in parallel with phenotype modifications with traits of neuronal differentiation. Our observations suggest that: (i) by combining its component genes, the HOX gene network determines the phenotype identity of epithelial and ectomesenchymal cells interacting in the generation of human tooth germ; (ii) cAMP treatment activates the HOX network and induces, in parallel, a neuronal-like phenotype in human primary ectomesenchymal dental pulp cells. 2005 Wiley-Liss, Inc.

  12. The Glucuronic Acid Utilization Gene Cluster from Bacillus stearothermophilus T-6

    PubMed Central

    Shulami, Smadar; Gat, Orit; Sonenshein, Abraham L.; Shoham, Yuval

    1999-01-01

    A λ-EMBL3 genomic library of Bacillus stearothermophilus T-6 was screened for hemicellulolytic activities, and five independent clones exhibiting β-xylosidase activity were isolated. The clones overlap each other and together represent a 23.5-kb chromosomal segment. The segment contains a cluster of xylan utilization genes, which are organized in at least three transcriptional units. These include the gene for the extracellular xylanase, xylanase T-6; part of an operon coding for an intracellular xylanase and a β-xylosidase; and a putative 15.5-kb-long transcriptional unit, consisting of 12 genes involved in the utilization of α-d-glucuronic acid (GlcUA). The first four genes in the potential GlcUA operon (orf1, -2, -3, and -4) code for a putative sugar transport system with characteristic components of the binding-protein-dependent transport systems. The most likely natural substrate for this transport system is aldotetraouronic acid [2-O-α-(4-O-methyl-α-d-glucuronosyl)-xylotriose] (MeGlcUAXyl3). The following two genes code for an intracellular α-glucuronidase (aguA) and a β-xylosidase (xynB). Five more genes (kdgK, kdgA, uxaC, uxuA, and uxuB) encode proteins that are homologous to enzymes involved in galacturonate and glucuronate catabolism. The gene cluster also includes a potential regulatory gene, uxuR, the product of which resembles repressors of the GntR family. The apparent transcriptional start point of the cluster was determined by primer extension analysis and is located 349 bp from the initial ATG codon. The potential operator site is a perfect 12-bp inverted repeat located downstream from the promoter between nucleotides +170 and +181. Gel retardation assays indicated that UxuR binds specifically to this sequence and that this binding is efficiently prevented in vitro by MeGlcUAXyl3, the most likely molecular inducer. PMID:10368143

  13. Cloning, sequencing, and expression of the gene coding for bile acid 7 alpha-hydroxysteroid dehydrogenase from Eubacterium sp. strain VPI 12708.

    PubMed Central

    Baron, S F; Franklund, C V; Hylemon, P B

    1991-01-01

    Southern blot analysis indicated that the gene encoding the constitutive, NADP-linked bile acid 7 alpha-hydroxysteroid dehydrogenase of Eubacterium sp. strain VPI 12708 was located on a 6.5-kb EcoRI fragment of the chromosomal DNA. This fragment was cloned into bacteriophage lambda gt11, and a 2.9-kb piece of this insert was subcloned into pUC19, yielding the recombinant plasmid pBH51. DNA sequence analysis of the 7 alpha-hydroxysteroid dehydrogenase gene in pBH51 revealed a 798-bp open reading frame, coding for a protein with a calculated molecular weight of 28,500. A putative promoter sequence and ribosome binding site were identified. The 7 alpha-hydroxysteroid dehydrogenase mRNA transcript in Eubacterium sp. strain VPI 12708 was about 0.94 kb in length, suggesting that it is monocistronic. An Escherichia coli DH5 alpha transformant harboring pBH51 had approximately 30-fold greater levels of 7 alpha-hydroxysteroid dehydrogenase mRNA, immunoreactive protein, and specific activity than Eubacterium sp. strain VPI 12708. The 7 alpha-hydroxysteroid dehydrogenase purified from the pBH51 transformant was similar in subunit molecular weight, specific activity, and kinetic properties to that from Eubacterium sp. strain VPI 12708, and it reacted with antiserum raised against the authentic enzyme on Western immunoblots. Alignment of the amino acid sequence of the 7 alpha-hydroxysteroid dehydrogenase with those of 10 other pyridine nucleotide-linked alcohol/polyol dehydrogenases revealed six conserved amino acid residues in the N-terminal regions thought to function in coenzyme binding. PMID:1856160

  14. Mutations in Protein-Binding Hot-Spots on the Hub Protein Smad3 Differentially Affect Its Protein Interactions and Smad3-Regulated Gene Expression

    PubMed Central

    Schiro, Michelle M.; Stauber, Sara E.; Peterson, Tami L.; Krueger, Chateen; Darnell, Steven J.; Satyshur, Kenneth A.; Drinkwater, Norman R.; Newton, Michael A.; Hoffmann, F. Michael

    2011-01-01

    Background: Hub proteins are connected through binding interactions to many other proteins. Smad3, a mediator of signal transduction induced by transforming growth factor beta (TGF-β), serves as a hub protein for over 50 protein-protein interactions. Different cellular responses mediated by Smad3 are the product of cell-type and context dependent Smad3-nucleated protein complexes acting in concert. Our hypothesis is that perturbation of this spectrum of protein complexes by mutation of single protein-binding hot-spots on Smad3 will have distinct consequences on Smad3-mediated responses. Methodology/Principal Findings: We mutated 28 amino acids on the surface of the Smad3 MH2 domain and identified 22 Smad3 variants with reduced binding to subsets of 17 Smad3-binding proteins including Smad4, SARA, Ski, Smurf2 and SIP1. Mutations defective in binding to Smad4, e.g., D408H, or defective in nucleocytoplasmic shuttling, e.g., W406A, were compromised in modulating the expression levels of a Smad3-dependent reporter gene or six endogenous Smad3-responsive genes: Mmp9, IL11, Tnfaip6, Fermt1, Olfm2 and Wnt11. However, the Smad3 mutants Y226A, Y297A, W326A, K341A, and E267A had distinct differences on TGF-β signaling. For example, K341A and Y226A both reduced the Smad3-mediated activation of the reporter gene by ∼50% but K341A only reduced the TGF-β inducibility of Olfm2 in contrast to Y226A which reduced the TGF-β inducibility of all six endogenous genes as severely as the W406A mutation. E267A had increased protein binding but reduced TGF-β inducibility because it caused higher basal levels of expression. Y297A had increased TGF-β inducibility because it caused lower Smad3-induced basal levels of gene expression. Conclusions/Significance: Mutations in protein binding hot-spots on Smad3 reduced the binding to different subsets of interacting proteins and caused a range of quantitative changes in the expression of genes induced by Smad3. This approach should be useful

  15. DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

    PubMed

    Mistry, Divya; Wise, Roger P; Dickerson, Julie A

    2017-01-01

    Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. 
The python package can be

  16. From Gene Mutation to Protein Characterization

    ERIC Educational Resources Information Center

    Moffet, David A.

    2009-01-01

    A seven-week "gene to protein" laboratory sequence is described for an undergraduate biochemistry laboratory course. Student pairs were given the task of introducing a point mutation of their choosing into the well studied protein, enhanced green fluorescent protein (EGFP). After conducting literature searches, each student group chose the…

  17. Search for protein partners of mitochondrial single-stranded DNA-binding protein Rim1p using a yeast two-hybrid system.

    PubMed

    Kucejová, B; Foury, F

    2003-01-01

    RIM1 is a nuclear gene of the yeast Saccharomyces cerevisiae coding for a protein with single-stranded DNA-binding activity that is essential for mitochondrial genome maintenance. No protein partners of Rim1p have been described so far in yeast. To better understand the role of this protein in mitochondrial DNA replication and recombination, a search for protein interactors by the yeast two-hybrid system was performed. This approach led to the identification of several candidates, including a putative transcription factor, Azf1p, and Mph1p, a protein with an RNA helicase domain which is known to influence the mutation rate of nuclear and mitochondrial genomes.

  18. The chemical basis for the origin of the genetic code and the process of protein synthesis

    NASA Technical Reports Server (NTRS)

    1982-01-01

    The major thrust is to understand just how the process of protein synthesis, including that very important aspect, genetic coding, came to be. Two aspects of the problem: the chemistry of active aminoacyl species; and affinities between amino acids and nucleotides, and specifically, how these affinities might affect the chemistry between the two are stressed.

  19. Insecticidal properties of a crystal protein gene product isolated from Bacillus thuringiensis subsp. kenyae.

    PubMed

    Masson, L; Moar, W J; van Frankenhuyzen, K; Bossé, M; Brousseau, R

    1992-02-01

    A protoxin gene, localized to a high-molecular-weight plasmid from Bacillus thuringiensis subsp. kenyae, was cloned on a 19-kb BamHI DNA fragment into Escherichia coli. Characterization of the gene revealed it to be a member of the CryIE toxin subclass which has been reported to be as toxic as the CryIC subclass to larvae from Spodoptera exigua in assays with crude E. coli extracts. To directly test the purified recombinant gene product, the gene was subcloned as a 4.8-kb fragment into an expression vector resulting in the overexpression of a 134-kDa protein in the form of phase-bright inclusions in E. coli. Treatment of solubilized inclusion bodies with either trypsin or gut juice from the silkworm Bombyx mori resulted in the appearance of a protease-resistant 65-kDa protein. In force-feeding bioassays, the purified activated protein was highly toxic to larvae of B. mori but not to larvae of Choristoneura fumiferana. In diet bioassays with larvae from S. exigua, the purified protoxin was nontoxic. However, prior activation of the protoxin by tryptic digestion resulted in the appearance of some toxic activity. These results demonstrate that this new subclass of protein toxin may not be useful for the control of Spodoptera species as previously reported. Hierarchical clustering of the nine known lepidopteran-specific CryI toxin subclasses through multiple sequence alignment suggests that the toxins fall into four possible subgroups or clusters.

  20. Mitochondrial genetic codes evolve to match amino acid requirements of proteins.

    PubMed

    Swire, Jonathan; Judson, Olivia P; Burt, Austin

    2005-01-01

    Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
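
    To make the notion of a codon reassignment concrete, the following minimal sketch translates the same short sequence under the standard genetic code and under the vertebrate mitochondrial code, in which TGA encodes Trp, ATA encodes Met, and AGA/AGG act as stops. It is only an illustration of what a variant code is, not the authors' analysis; the example sequence and the helper name `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order (last base varies fastest).
CODONS = ["".join(c) for c in product("TCAG", repeat=3)]

# Amino acid strings in the same codon order ('*' marks a stop codon).
STANDARD = dict(zip(
    CODONS,
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"))
VERT_MITO = dict(zip(
    CODONS,
    "FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSS**VVVVAAAADDEEGGGG"))


def translate(seq: str, table: dict) -> str:
    """Translate a DNA sequence codon by codon; trailing partial codons are ignored."""
    seq = seq.upper()
    return "".join(table[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# Hypothetical sequence chosen to contain the three reassigned codon types.
example = "ATGATATGAAGA"
print("standard:     ", translate(example, STANDARD))
print("vert. mito.:  ", translate(example, VERT_MITO))
```

    Comparing the two outputs shows how one stop-to-amino-acid change (TGA), one amino-acid-to-amino-acid change (ATA) and one amino-acid-to-stop change (AGA) alter the translated product.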

  1. Regulation of contractile protein gene expression in unloaded mouse skeletal muscle

    NASA Technical Reports Server (NTRS)

    Criswell, D. S.; Carson, J. A.; Booth, F. W.

    1996-01-01

    Hindlimb unloading was performed on mice in an effort to study the regulation of contractile protein genes. In particular, the regulation of myosin heavy chain IIb was examined. During unloading, muscle fibers undergo a type conversion. Preliminary data from this study does not support the hypothesis that the fiber type conversion is due to an increase in promoter activity of fast isoform genes, such as myosin heavy chain IIb. The consequences of this finding are examined, with particular focus on other factors controlling gene regulation.

  2. Uncovering the functional constraints underlying the genomic organization of the odorant-binding protein genes.

    PubMed

    Librado, Pablo; Rozas, Julio

    2013-01-01

    Animal olfactory systems have a critical role for the survival and reproduction of individuals. In insects, the odorant-binding proteins (OBPs) are encoded by a moderately sized gene family, and mediate the first steps of the olfactory processing. Most OBPs are organized in clusters of a few paralogs, which are conserved over time. Currently, the biological mechanism explaining the close physical proximity among OBPs is not yet established. Here, we conducted a comprehensive study aiming to gain insights into the mechanisms underlying the OBP genomic organization. We found that the OBP clusters are embedded within large conserved arrangements. These organizations also include other non-OBP genes, which often encode proteins integral to plasma membrane. Moreover, the conservation degree of such large clusters is related to the following: 1) the promoter architecture of the confined genes, 2) a characteristic transcriptional environment, and 3) the chromatin conformation of the chromosomal region. Our results suggest that chromatin domains may restrict the location of OBP genes to regions having the appropriate transcriptional environment, leading to the OBP cluster structure. However, the appropriate transcriptional environment for OBP and the other neighbor genes is not dominated by reduced levels of expression noise. Indeed, the stochastic fluctuations in the OBP transcript abundance may have a critical role in the combinatorial nature of the olfactory coding process.

  3. MERP1: a mammalian ependymin-related protein gene differentially expressed in hematopoietic cells.

    PubMed

    Gregorio-King, Claudia C; McLeod, Janet L; Collier, Fiona McL; Collier, Gregory R; Bolton, Karyn A; Van Der Meer, Gavin J; Apostolopoulos, Jim; Kirkland, Mark A

    2002-03-20

    We have utilized differential display polymerase chain reaction to investigate the gene expression of hematopoietic progenitor cells from adult bone marrow and umbilical cord blood. A differentially expressed gene was identified in CD34+ hematopoietic progenitor cells, with low expression in CD34- cells. We have obtained the full coding sequence of this gene, which we designated human mammalian ependymin-related protein 1 (MERP1). Expression of MERP1 was found in a variety of normal human tissues, and is 4- and 10-fold higher in adult bone marrow and umbilical cord blood CD34+ cells, respectively, compared to CD34- cells. Additionally, MERP1 expression in a hematopoietic stem cell enriched population was down-regulated with proliferation and differentiation. Conceptual translation of the MERP1 open reading frame reveals significant homology to two families of glycoprotein calcium-dependent cell adhesion molecules: ependymins and protocadherins.

  4. RNA- and protein-mediated control of Listeria monocytogenes virulence gene expression

    PubMed Central

    Lebreton, Alice; Cossart, Pascale

    2017-01-01

    ABSTRACT The model opportunistic pathogen Listeria monocytogenes has been the object of extensive research, aiming at understanding its ability to colonize diverse environmental niches and animal hosts. Bacterial transcriptomes in various conditions reflect this efficient adaptability. We review here our current knowledge of the mechanisms allowing L. monocytogenes to respond to environmental changes and trigger pathogenicity, with a special focus on RNA-mediated control of gene expression. We highlight how these studies have brought novel concepts in prokaryotic gene regulation, such as the ‘excludon’ where the 5′-UTR of a messenger also acts as an antisense regulator of an operon transcribed in opposite orientation, or the notion that riboswitches can regulate non-coding RNAs to integrate complex metabolic stimuli into regulatory networks. Overall, the Listeria model exemplifies that fine RNA tuners act together with master regulatory proteins to orchestrate appropriate transcriptional programmes. PMID:27217337

  5. PdSlt2 Penicillium digitatum mitogen-activated-protein kinase controls sporulation and virulence during citrus fruit infection.

    PubMed

    de Ramón-Carbonell, Marta; Sánchez-Torres, Paloma

    2017-12-01

    The Slt2 mitogen-activated protein (MAP) kinase homologue of Penicillium digitatum, the most relevant pathogen-producing citrus green mould decay during postharvest, was identified and explored. The P. digitatum Slt2-MAPK coding gene (PdSlt2) was functionally characterized by homologous gene elimination and transcriptomic evaluation. The absence of PdSlt2 gene resulted in significantly reduced virulence during citrus infection. The ΔPdSlt2 mutants were also defective in asexual reproduction, showing impairment of sporulation during citrus infection. Gene expression analysis revealed that PdSlt2 was highly induced during citrus fruit infection at early stages (1 dpi). Moreover, PdSlt2 deletion altered gene expression profiles. The relative gene expression (RGE) of fungicide resistance- and fungal virulence-related genes showed that PdSlt2 acts as negative regulator of several transporter encoding genes (ABC and MFS transporters) and a positive regulator of two sterol demethylases. This study indicates that PdSlt2 MAPK is functionally preserved in P. digitatum and highlights the relevant role of the PdSlt2 MAP kinase-mediated signalling pathway in regulating diverse genes crucial for infection and asexual reproduction. Copyright © 2017 British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  6. A library of MiMICs allows tagging of genes and reversible, spatial and temporal knockdown of proteins in Drosophila

    DOE PAGES

    Nagarkar-Jaiswal, Sonal; Lee, Pei-Tseng; Campbell, Megan E.; ...

    2015-03-31

    Here, we document a collection of ~7434 MiMIC (Minos Mediated Integration Cassette) insertions of which 2854 are inserted in coding introns. They allowed us to create a library of 400 GFP-tagged genes. We show that 72% of internally tagged proteins are functional, and that more than 90% can be imaged in unfixed tissues. Moreover, the tagged mRNAs can be knocked down by RNAi against GFP (iGFPi), and the tagged proteins can be efficiently knocked down by deGradFP technology. The phenotypes associated with RNA and protein knockdown typically correspond to severe loss of function or null mutant phenotypes. Finally, we demonstrate reversible, spatial, and temporal knockdown of tagged proteins in larvae and adult flies. This new strategy and collection of strains allows unprecedented in vivo manipulations in flies for many genes. These strategies will likely extend to vertebrates.

  7. Distribution in microbial genomes of genes similar to lodA and goxA which encode a novel family of quinoproteins with amino acid oxidase activity.

    PubMed

    Campillo-Brocal, Jonatan C; Chacón-Verdú, María Dolores; Lucas-Elío, Patricia; Sánchez-Amat, Antonio

    2015-03-24

    L-Amino acid oxidases (LAOs) have been generally described as flavoproteins that oxidize amino acids releasing the corresponding ketoacid, ammonium and hydrogen peroxide. The generation of hydrogen peroxide gives these enzymes antimicrobial characteristics. They are involved in processes such as biofilm development and microbial competition. LAOs are of great biotechnological interest in different applications such as the design of biosensors, biotransformations and biomedicine. The marine bacterium Marinomonas mediterranea synthesizes LodA, the first known LAO that contains a quinone cofactor. LodA is encoded in an operon that contains a second gene coding for LodB, a protein required for the post-translational modification generating the cofactor. Recently, GoxA, a quinoprotein with sequence similarity to LodA but with a different enzymatic activity (glycine oxidase instead of lysine-ε-oxidase) has been described. The aim of this work has been to study the distribution of genes similar to lodA and/or goxA in sequenced microbial genomes and to get insight into the evolution of this novel family of proteins through phylogenetic analysis. Genes encoding LodA-like proteins have been detected in several bacterial classes. However, they are absent in Archaea and detected only in a small group of fungi of the class Agaricomycetes. The vast majority of the genes detected are in a genome region with a nearby lodB-like gene, suggesting a specific interaction between both partner proteins. Sequence alignment of the LodA-like proteins allowed the detection of several conserved residues. All of them showed a Cys and a Trp that aligned with the residues that form part of the cysteine tryptophylquinone (CTQ) cofactor in LodA. Phylogenetic analysis revealed that LodA-like proteins can be clustered in different groups. Interestingly, LodA and GoxA are in different groups, indicating that those groups are related to the enzymatic activity of the proteins detected. Genome

  8. Characterization of mitochondrial genome of sea cucumber Stichopus horrens: a novel gene arrangement in Holothuroidea.

    PubMed

    Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing

    2011-05-01

    The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16,257 bp long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA(Ser(UCN)), tRNA(Gln), tRNA(Ala), tRNA(Val), tRNA(Asp)) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew = 0.025; GC skew = -0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein-coding genes except nad3 and nad4, which use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.
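
    The skew values quoted in this abstract can be checked from the reported base composition. The sketch below assumes the conventional definitions, AT skew = (A - T)/(A + T) and GC skew = (G - C)/(G + C); the abstract does not state which formula was used, so this is a consistency check under that assumption, and the function name `skew` is illustrative.

```python
def skew(x: float, y: float) -> float:
    """Return the compositional skew (x - y) / (x + y) for two base frequencies."""
    return (x - y) / (x + y)


# Heavy-strand base composition in percent, as reported in the abstract.
composition = {"A": 30.8, "C": 23.7, "G": 16.2, "T": 29.3}

at_skew = skew(composition["A"], composition["T"])  # (30.8 - 29.3) / 60.1
gc_skew = skew(composition["G"], composition["C"])  # (16.2 - 23.7) / 39.9

print(f"AT skew = {at_skew:.3f}")
print(f"GC skew = {gc_skew:.3f}")
```

    Rounded to three decimals, both values agree with the figures given in the abstract.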

  9. An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

    PubMed

    Omasits, Ulrich; Varadarajan, Adithi R; Schmid, Michael; Goetze, Sandra; Melidis, Damianos; Bourqui, Marc; Nikolayeva, Olga; Québatte, Maxime; Patrignani, Andrea; Dehio, Christoph; Frey, Juerg E; Robinson, Mark D; Wollscheid, Bernd; Ahrens, Christian H

    2017-12-01

    Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote. © 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.
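
    The idea of enumerating in silico ORFs across six reading frames with alternative start codons can be sketched as follows. This is a generic illustration, not the iPtgxDB software: the abstract does not specify the modifications it makes, so the start codon set (ATG, GTG, TTG), the minimum length, the longest-ORF-per-stop rule, and all function names are assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STARTS = {"ATG", "GTG", "TTG"}   # assumed alternative start codons
STOPS = {"TAA", "TAG", "TGA"}    # standard stop codons


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def orfs_in_frame(seq: str, frame: int, min_codons: int):
    """Yield (start, end) of ORFs in one frame, using the first start before each stop."""
    start = None
    for i in range(frame, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if start is None and codon in STARTS:
            start = i
        elif start is not None and codon in STOPS:
            # Count coding codons only (the stop codon is excluded).
            if (i - start) // 3 >= min_codons:
                yield start, i + 3
            start = None


def six_frame_orfs(seq: str, min_codons: int = 2):
    """Collect (strand, frame, ORF sequence) over three frames on each strand."""
    seq = seq.upper()
    found = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            for start, end in orfs_in_frame(s, frame, min_codons):
                found.append((strand, frame, s[start:end]))
    return found


# Hypothetical toy sequence containing a single short ORF on the forward strand.
for strand, frame, orf in six_frame_orfs("CCATGAAATTTTAGCC"):
    print(strand, frame, orf)
```

    In practice the minimum length would be set far higher than in this toy example, and candidate ORFs would be merged with existing annotations rather than reported on their own.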

  10. Defended to the Nines: 25 Years of Resistance Gene Cloning Identifies Nine Mechanisms for R Protein Function.

    PubMed

    Kourelis, Jiorgos; van der Hoorn, Renier A L

    2018-02-01

    Plants have many, highly variable resistance (R) gene loci, which provide resistance to a variety of pathogens. The first R gene to be cloned, maize (Zea mays) Hm1, was published over 25 years ago, and since then, many different R genes have been identified and isolated. The encoded proteins have provided clues to the diverse molecular mechanisms underlying immunity. Here, we present a meta-analysis of 314 cloned R genes. The majority of R genes encode cell surface or intracellular receptors, and we distinguish nine molecular mechanisms by which R proteins can elevate or trigger disease resistance: direct (1) or indirect (2) perception of pathogen-derived molecules on the cell surface by receptor-like proteins and receptor-like kinases; direct (3) or indirect (4) intracellular detection of pathogen-derived molecules by nucleotide binding, leucine-rich repeat receptors, or detection through integrated domains (5); perception of transcription activator-like effectors through activation of executor genes (6); and active (7), passive (8), or host reprogramming-mediated (9) loss of susceptibility. Although the molecular mechanisms underlying the functions of R genes are only understood for a small proportion of known R genes, a clearer understanding of mechanisms is emerging and will be crucial for rational engineering and deployment of novel R genes. © 2018 American Society of Plant Biologists. All rights reserved.

  11. Cellular reprogramming through mitogen-activated protein kinases.

    PubMed

    Lee, Justin; Eschen-Lippold, Lennart; Lassowskat, Ines; Böttcher, Christoph; Scheel, Dierk

    2015-01-01

    Mitogen-activated protein kinase (MAPK) cascades are conserved eukaryote signaling modules where MAPKs, as the final kinases in the cascade, phosphorylate protein substrates to regulate cellular processes. While some progress in the identification of MAPK substrates has been made in plants, the knowledge on the spectrum of substrates and their mechanistic action is still fragmentary. In this focused review, we discuss the biological implications of the data in our original paper (Sustained mitogen-activated protein kinase activation reprograms defense metabolism and phosphoprotein profile in Arabidopsis thaliana; Frontiers in Plant Science 5: 554) in the context of related research. In our work, we mimicked in vivo activation of two stress-activated MAPKs, MPK3 and MPK6, through transgenic manipulation of Arabidopsis thaliana and used phosphoproteomics analysis to identify potential novel MAPK substrates. Here, we plotted the identified putative MAPK substrates (and downstream phosphoproteins) as a global protein clustering network. Based on a highly stringent selection confidence level, the core networks highlighted a MAPK-induced cellular reprogramming at multiple levels of gene and protein expression, including transcriptional, post-transcriptional, translational, and post-translational (such as protein modification, folding, and degradation) steps, as well as protein re-compartmentalization. Additionally, the increase in putative substrates/phosphoproteins of energy metabolism and various secondary metabolite biosynthesis pathways coincides with the observed accumulation of defense antimicrobial substances as detected by metabolome analysis. Furthermore, detection of protein networks in phospholipid or redox elements suggests activation of downstream signaling events. Taken in context with other studies, MAPKs are key regulators that reprogram cellular events to orchestrate defense signaling in eukaryotes.

  12. Using the 2A Protein Coexpression System: Multicistronic 2A Vectors Expressing Gene(s) of Interest and Reporter Proteins.

    PubMed

    Luke, Garry A; Ryan, Martin D

    2018-01-01

    To date, a huge range of different proteins (many with cotranslational and posttranslational subcellular localization signals) have been coexpressed together with various reporter proteins in vitro and in vivo using 2A peptides. The pros and cons of 2A co-expression technology are considered below, followed by a simple example of a "how to" protocol to concatenate multiple genes of interest, together with a reporter gene, into a single gene linked via 2As for easy identification or selection of transduced cells.

  13. Genomic assessment of the evolution of the prion protein gene family in vertebrates.

    PubMed

    Harrison, Paul M; Khachane, Amit; Kumar, Manish

    2010-05-01

    Prion diseases are devastating neurological disorders caused by the propagation of particles containing an alternative beta-sheet-rich form of the prion protein (PrP). Genes paralogous to PrP, called Doppel and Shadoo, have been identified, that also have neuropathological relevance. To aid in the further functional characterization of PrP and its relatives, we annotated completely the PrP gene family (PrP-GF), in the genomes of 42 vertebrates, through combined strategic application of gene prediction programs and advanced remote homology detection techniques (such as HMMs, PSI-TBLASTN and pGenThreader). We have uncovered several previously undescribed paralogous genes and pseudogenes. We find that current high-quality genomic evidence indicates that the PrP relative Doppel, was likely present in the last common ancestor of present-day Tetrapoda, but was lost in the bird lineage, since its divergence from reptiles. Using the new gene annotations, we have defined the consensus of structural features that are characteristic of the PrP and Doppel structures, across diverse Tetrapoda clades. Furthermore, we describe in detail a transcribed pseudogene derived from Shadoo that is conserved across primates, and that overlaps the meiosis gene, SYCE1, thus possibly regulating its expression. In addition, we analysed the locus of PRNP/PRND for significant conservation across the genomic DNA of eleven mammals, and determined the phylogenetic penetration of non-coding exons. The genomic evidence indicates that the second PRNP non-coding exon found in even-toed ungulates and rodents, is conserved in all high-coverage genome assemblies of primates (human, chimp, orang utan and macaque), and is, at least, likely to have fallen out of use during primate speciation. Furthermore, we have demonstrated that the PRNT gene (at the PRNP human locus) is conserved across at least sixteen mammals, and evolves like a long non-coding RNA, fashioned from fragments of ancient, long

  14. Evolutionary Characteristics of Missing Proteins: Insights into the Evolution of Human Chromosomes Related to Missing-Protein-Encoding Genes.

    PubMed

    Xu, Aishi; Li, Guang; Yang, Dong; Wu, Songfeng; Ouyang, Hongsheng; Xu, Ping; He, Fuchu

    2015-12-04

    Although the "missing protein" is a temporary concept in C-HPP, the biological information for their "missing" could be an important clue in evolutionary studies. Here we classified missing-protein-encoding genes into two groups, the genes encoding PE2 proteins (with transcript evidence) and the genes encoding PE3/4 proteins (with no transcript evidence). These missing-protein-encoding genes distribute unevenly among different chromosomes, chromosomal regions, or gene clusters. In the view of evolutionary features, PE3/4 genes tend to be young, spreading at the nonhomology chromosomal regions and evolving at higher rates. Interestingly, there is a higher proportion of singletons in PE3/4 genes than the proportion of singletons in all genes (background) and OTCSGs (organ, tissue, cell type-specific genes). More importantly, most of the paralogous PE3/4 genes belong to the newly duplicated members of the paralogous gene groups, which mainly contribute to special biological functions, such as "smell perception". These functions are heavily restricted into specific type of cells, tissues, or specific developmental stages, acting as the new functional requirements that facilitated the emergence of the missing-protein-encoding genes during evolution. In addition, the criteria for the extremely special physical-chemical proteins were first set up based on the properties of PE2 proteins, and the evolutionary characteristics of those proteins were explored. Overall, the evolutionary analyses of missing-protein-encoding genes are expected to be highly instructive for proteomics and functional studies in the future.

  15. A T-DNA gene required for agropine biosynthesis by transformed plants is functionally and evolutionarily related to a Ti plasmid gene required for catabolism of agropine by Agrobacterium strains.

    PubMed Central

    Hong, S B; Hwang, I; Dessaux, Y; Guyon, P; Kim, K S; Farrand, S K

    1997-01-01

    The mechanisms that ensure that Ti plasmid T-DNA genes encoding proteins involved in the biosynthesis of opines in crown gall tumors are always matched by Ti plasmid genes conferring the ability to catabolize that set of opines on the inducing Agrobacterium strains are unknown. The pathway for the biosynthesis of the opine agropine is thought to require an enzyme, mannopine cyclase, coded for by the ags gene located in the T(R) region of octopine-type Ti plasmids. Extracts prepared from agropine-type tumors contained an activity that cyclized mannopine to agropine. Tumor cells containing a T region in which ags was mutated lacked this activity and did not contain agropine. Expression of ags from the lac promoter conferred mannopine-lactonizing activity on Escherichia coli. Agrobacterium tumefaciens strains harboring an octopine-type Ti plasmid exhibit a similar activity which is not coded for by ags. Analysis of the DNA sequence of the gene encoding this activity, called agcA, showed it to be about 60% identical to T-DNA ags genes. Relatedness decreased abruptly in the 5' and 3' untranslated regions of the genes. ags is preceded by a promoter that functions only in the plant. Expression analysis showed that agcA also is preceded by its own promoter, which is active in the bacterium. Translation of agcA yielded a protein of about 45 kDa, consistent with the size predicted from the DNA sequence. Antibodies raised against the agcA product cross-reacted with the anabolic enzyme. These results indicate that the agropine system arose by a duplication of a progenitor gene, one copy of which became associated with the T-DNA and the other copy of which remained associated with the bacterium. PMID:9244272

  16. A novel familial mutation in the PCSK1 gene that alters the oxyanion hole residue of proprotein convertase 1/3 and impairs its enzymatic activity.

    PubMed

    Wilschanski, Michael; Abbasi, Montaser; Blanco, Elias; Lindberg, Iris; Yourshaw, Michael; Zangen, David; Berger, Itai; Shteyer, Eyal; Pappo, Orit; Bar-Oz, Benjamin; Martín, Martin G; Elpeleg, Orly

    2014-01-01

    Four siblings presented with congenital diarrhea and various endocrinopathies. Exome sequencing and homozygosity mapping identified five regions, comprising 337 protein-coding genes that were shared by three affected siblings. Exome sequencing identified a novel homozygous N309K mutation in the proprotein convertase subtilisin/kexin type 1 (PCSK1) gene, encoding the neuroendocrine convertase 1 precursor (PC1/3) which was recently reported as a cause of Congenital Diarrhea Disorder (CDD). The PCSK1 mutation affected the oxyanion hole transition state-stabilizing amino acid within the active site, which is critical for appropriate proprotein maturation and enzyme activity. Unexpectedly, the N309K mutant protein exhibited normal, though slowed, prodomain removal and was secreted from both HEK293 and Neuro2A cells. However, the secreted enzyme showed no catalytic activity, and was not processed into the 66 kDa form. We conclude that the N309K enzyme is able to cleave its own propeptide but is catalytically inert against in trans substrates, and that this variant accounts for the enteric and systemic endocrinopathies seen in this large consanguineous kindred.

  17. A Novel Familial Mutation in the PCSK1 Gene That Alters the Oxyanion Hole Residue of Proprotein Convertase 1/3 and Impairs Its Enzymatic Activity

    PubMed Central

    Wilschanski, Michael; Abbasi, Montaser; Blanco, Elias; Lindberg, Iris; Yourshaw, Michael; Zangen, David; Berger, Itai; Shteyer, Eyal; Pappo, Orit; Bar-Oz, Benjamin; Martín, Martin G.; Elpeleg, Orly

    2014-01-01

    Four siblings presented with congenital diarrhea and various endocrinopathies. Exome sequencing and homozygosity mapping identified five regions, comprising 337 protein-coding genes that were shared by three affected siblings. Exome sequencing identified a novel homozygous N309K mutation in the proprotein convertase subtilisin/kexin type 1 (PCSK1) gene, encoding the neuroendocrine convertase 1 precursor (PC1/3) which was recently reported as a cause of Congenital Diarrhea Disorder (CDD). The PCSK1 mutation affected the oxyanion hole transition state-stabilizing amino acid within the active site, which is critical for appropriate proprotein maturation and enzyme activity. Unexpectedly, the N309K mutant protein exhibited normal, though slowed, prodomain removal and was secreted from both HEK293 and Neuro2A cells. However, the secreted enzyme showed no catalytic activity, and was not processed into the 66 kDa form. We conclude that the N309K enzyme is able to cleave its own propeptide but is catalytically inert against in trans substrates, and that this variant accounts for the enteric and systemic endocrinopathies seen in this large consanguineous kindred. PMID:25272002

  18. Sequence variations of the bovine prion protein gene (PRNP) in native Korean Hanwoo cattle

    PubMed Central

    Choi, Sangho

    2012-01-01

    Bovine spongiform encephalopathy (BSE) is one of the fatal neurodegenerative diseases known as transmissible spongiform encephalopathies (TSEs) caused by infectious prion proteins. Genetic variations correlated with susceptibility or resistance to TSE in humans and sheep have not been reported for bovine strains including those from Holstein, Jersey, and Japanese Black cattle. Here, we investigated bovine prion protein gene (PRNP) variations in Hanwoo cattle [Bos (B.) taurus coreanae], a native breed in Korea. We identified mutations and polymorphisms in the coding region of PRNP, determined their frequency, and evaluated their significance. We identified four synonymous polymorphisms and two non-synonymous mutations in PRNP, but found no novel polymorphisms. The sequence and number of octapeptide repeats were completely conserved, and the haplotype frequency of the coding region was similar to that of other B. taurus strains. When we examined the 23-bp and 12-bp insertion/deletion (indel) polymorphisms in the non-coding region of PRNP, Hanwoo cattle had a lower deletion allele and 23-bp del/12-bp del haplotype frequency than healthy and BSE-affected animals of other strains. Thus, Hanwoo are seemingly less susceptible to BSE than other strains due to the 23-bp and 12-bp indel polymorphisms. PMID:22705734

  19. Refolding techniques for recovering biologically active recombinant proteins from inclusion bodies.

    PubMed

    Yamaguchi, Hiroshi; Miyazaki, Masaya

    2014-02-20

    Biologically active proteins are useful for studying the biological functions of genes and for the development of therapeutic drugs and biomaterials in the biotechnology industry. Overexpression of recombinant proteins in bacteria, such as Escherichia coli, often results in the formation of inclusion bodies, which are protein aggregates with non-native conformations. As inclusion bodies contain relatively pure and intact proteins, protein refolding is an important process to obtain active recombinant proteins from inclusion bodies. However, conventional refolding methods, such as dialysis and dilution, are time consuming, recovered yields of active proteins are often low, and a trial-and-error process is required to achieve success. Recently, several approaches have been reported to refold these aggregated proteins into an active form. The strategies largely aim at reducing protein aggregation during the refolding procedure. This review focuses on protein refolding techniques using chemical additives and laminar flow in microfluidic chips for the efficient recovery of active proteins from inclusion bodies.

  20. Defended to the Nines: 25 Years of Resistance Gene Cloning Identifies Nine Mechanisms for R Protein Function

    PubMed Central

    2018-01-01

    Plants have many, highly variable resistance (R) gene loci, which provide resistance to a variety of pathogens. The first R gene to be cloned, maize (Zea mays) Hm1, was published over 25 years ago, and since then, many different R genes have been identified and isolated. The encoded proteins have provided clues to the diverse molecular mechanisms underlying immunity. Here, we present a meta-analysis of 314 cloned R genes. The majority of R genes encode cell surface or intracellular receptors, and we distinguish nine molecular mechanisms by which R proteins can elevate or trigger disease resistance: direct (1) or indirect (2) perception of pathogen-derived molecules on the cell surface by receptor-like proteins and receptor-like kinases; direct (3) or indirect (4) intracellular detection of pathogen-derived molecules by nucleotide binding, leucine-rich repeat receptors, or detection through integrated domains (5); perception of transcription activator-like effectors through activation of executor genes (6); and active (7), passive (8), or host reprogramming-mediated (9) loss of susceptibility. Although the molecular mechanisms underlying the functions of R genes are only understood for a small proportion of known R genes, a clearer understanding of mechanisms is emerging and will be crucial for rational engineering and deployment of novel R genes. PMID:29382771

  1. Transimulation - protein biosynthesis web service.

    PubMed

    Siwiak, Marlena; Zielenkiewicz, Piotr

    2013-01-01

    Although translation is the key step during gene expression, it remains poorly characterized at the level of individual genes. For this reason, we developed Transimulation - a web service measuring translational activity of genes in three model organisms: Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The calculations are based on our previous computational model of translation and experimental data sets. Transimulation quantifies mean translation initiation and elongation time (expressed in SI units), and the number of proteins produced per transcript. It also approximates the number of ribosomes that typically occupy a transcript during translation, and simulates their propagation. The simulation of ribosomes' movement is interactive and allows modifying the coding sequence on the fly. It also enables uploading any coding sequence and simulating its translation in one of three model organisms. In such a case, ribosomes propagate according to mean codon elongation times of the host organism, which may prove useful for heterologous expression. Transimulation was used to examine evolutionary conservation of translational parameters of orthologous genes. Transimulation may be accessed at http://nexus.ibb.waw.pl/Transimulation (requires Java version 1.7 or higher). Its manual and source code, distributed under the GPL-2.0 license, are freely available at the website.
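    As a rough illustration of how a total elongation time could follow from mean codon elongation times, the sketch below sums per-codon times over a coding sequence. It is not the Transimulation model: the table values are hypothetical placeholders, and the names `MEAN_CODON_TIME_S`, `DEFAULT_TIME_S`, `split_codons` and `elongation_time` are assumptions made for this example only.

```python
# Hypothetical mean elongation times per codon, in seconds.
# These are placeholder values, NOT data from Transimulation.
MEAN_CODON_TIME_S = {"ATG": 0.10, "GCT": 0.05, "AAA": 0.08, "TAA": 0.0}
DEFAULT_TIME_S = 0.07  # assumed fallback for codons missing from the table


def split_codons(cds):
    """Split a coding sequence into consecutive three-base codons."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def elongation_time(cds, table=MEAN_CODON_TIME_S, default=DEFAULT_TIME_S):
    """Sum the mean elongation time of every codon in the sequence."""
    return sum(table.get(codon, default) for codon in split_codons(cds.upper()))


if __name__ == "__main__":
    print(elongation_time("ATGGCTAAATAA"))
```

    Swapping in a different table for each host organism would be one simple way to compare the same uploaded sequence across hosts, under the assumption that codon times are independent of sequence context.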

  2. Pollen specific expression of maize genes encoding actin depolymerizing factor-like proteins.

    PubMed Central

    Lopez, I; Anthony, R G; Maciver, S K; Jiang, C J; Khan, S; Weeds, A G; Hussey, P J

    1996-01-01

    In pollen development, a dramatic reorganization of the actin cytoskeleton takes place during the passage of the pollen grain into dormancy and on activation of pollen tube growth. A role for actin-binding proteins is implicated and we report here the identification of a small gene family in maize that encodes actin depolymerizing factor (ADF)-like proteins. The ADF group of proteins are believed to control actin polymerization and depolymerization in response to both intracellular and extracellular signals. Two of the maize genes, ZmABP1 and ZmABP2, are expressed specifically in pollen and germinating pollen, suggesting that the protein products may be involved in pollen actin reorganization. A third gene, ZmABP3, encodes a protein only 56% and 58% identical to ZmABP1 and ZmABP2, respectively, and its expression is suppressed in pollen and germinated pollen. The fundamental biochemical characteristics of the ZmABP proteins have been elucidated using bacterially expressed ZmABP3 protein. This has the ability to bind monomeric actin (G-actin) and filamentous actin (F-actin). Moreover, it decreases the viscosity of polymerized actin solutions, consistent with an ability to depolymerize filaments. These biochemical characteristics, taken together with the sequence comparisons, support the inclusion of the ZmABP proteins in the ADF group. PMID:8693008

  3. Diversity and impact of rare variants in genes encoding the platelet G protein-coupled receptors.

    PubMed

    Jones, Matthew L; Norman, Jane E; Morgan, Neil V; Mundell, Stuart J; Lordkipanidzé, Marie; Lowe, Gillian C; Daly, Martina E; Simpson, Michael A; Drake, Sian; Watson, Steve P; Mumford, Andrew D

    2015-04-01

    Platelet responses to activating agonists are influenced by common population variants within or near G protein-coupled receptor (GPCR) genes that affect receptor activity. However, the impact of rare GPCR gene variants is unknown. We describe the rare single nucleotide variants (SNVs) in the coding and splice regions of 18 GPCR genes in 7,595 exomes from the 1,000-genomes and Exome Sequencing Project databases and in 31 cases with inherited platelet function disorders (IPFDs). In the population databases, the GPCR gene target regions contained 740 SNVs (318 synonymous, 410 missense, 7 stop gain and 6 splice region) of which 70 % had global minor allele frequency (MAF) < 0.05 %. Functional annotation using six computational algorithms, experimental evidence and structural data identified 156/740 (21 %) SNVs as potentially damaging to GPCR function, most commonly in regions encoding the transmembrane and C-terminal intracellular receptor domains. In 31 index cases with IPFDs (Gi-pathway defect n=15; secretion defect n=11; thromboxane pathway defect n=3 and complex defect n=2) there were 256 SNVs in the target regions of 15 stimulatory platelet GPCRs (34 unique; 12 with MAF< 1 % and 22 with MAF≥ 1 %). These included rare variants predicting R122H, P258T and V207A substitutions in the P2Y12 receptor that were annotated as potentially damaging, but only partially explained the platelet function defects in each case. Our data highlight that potentially damaging variants in platelet GPCR genes have low individual frequencies, but are collectively abundant in the population. Potentially damaging variants are also present in pedigrees with IPFDs and may contribute to complex laboratory phenotypes.

  4. AtFXG1, an Arabidopsis Gene Encoding α-l-Fucosidase Active against Fucosylated Xyloglucan Oligosaccharides

    PubMed Central

    de la Torre, Francisco; Sampedro, Javier; Zarra, Ignacio; Revilla, Gloria

    2002-01-01

    An α-l-fucosidase (EC 3.2.1.51) able to release the t-fucosyl residue from the side chain of xyloglucan oligosaccharides has been detected in the leaves of Arabidopsis plants. Moreover, an α-l-fucosidase with similar substrate specificity was purified from cabbage (Brassica oleracea) leaves to render a single band on SDS-PAGE. Two peptide sequences were obtained from this protein band, and they were used to identify an Arabidopsis gene coding for an α-fucosidase that we propose to call AtFXG1. In addition, an Arabidopsis gene with homology to known α-l-fucosidases has also been found, and we propose to name it AtFUC1. Both AtFXG1 and AtFUC1 were heterologously expressed in Pichia pastoris cells and the α-l-fucosidase activities were secreted into the culture medium. The α-l-fucosidase encoded by AtFXG1 was active against the oligosaccharides from xyloglucan XXFG as well as against 2′-fucosyl-lactitol but not against p-nitrophenyl-α-l-fucopyranoside. However, the heterologously expressed AtFUC1 was active only against 2′-fucosyl-lactitol. Thus, the former must be related to xyloglucan metabolism. PMID:11788770

  5. Molecular cloning, recombinant expression, and antifungal functional characterization of the lipid transfer protein from Panax ginseng.

    PubMed

    Cai, Kexin; Wang, Jiawen; Wang, Min; Zhang, Hui; Wang, Siming; Zhao, Yu

    2016-07-01

    To establish an efficient expression system for a fusion protein GST-pgLTP (Lipid Transfer Protein) and to test its antifungal activity. The nucleotide sequence of the LTP gene was obtained from Panax ginseng using RT-PCR. The ORF of the cDNA is 363 bp, coding for a protein of 120 amino acids with a calculated MW of 12.09 kDa. The pgLTP gene with a His6-tag at the C-terminus was cloned into the pGEX-6p1 vector to generate a GST-fusion pgLTP protein construct that was expressed in Escherichia coli Rosetta. Following purification by Ni-NTA, the fusion protein exhibited antifungal activity against five fungi found in ginseng. The fusion protein GST-pgLTP has activity against a broad spectrum of phytopathogenic fungi, and can potentially be adapted for production to combat fungal diseases that affect P. ginseng.
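    The relation between the 363 bp ORF and the 120-residue protein can be checked with a short calculation: 363 bp is 121 codons, and the last codon is the stop codon, which adds no residue. The sketch below assumes the reported ORF length includes the stop codon; `protein_length_from_orf` is a name chosen for this example.

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Return the number of amino acids encoded by an ORF of orf_bp base pairs.

    Assumes three bases per codon; if the ORF length includes the stop codon,
    that codon is subtracted because it does not encode a residue.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 363 bp -> 121 codons -> 120 amino acids plus one stop codon
    print(protein_length_from_orf(363))
```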

  6. Molecular cloning of MSSP-2, a c-myc gene single-strand binding protein: characterization of binding specificity and DNA replication activity.

    PubMed Central

    Takai, T; Nishita, Y; Iguchi-Ariga, S M; Ariga, H

    1994-01-01

    We have previously reported the human cDNA encoding MSSP-1, a sequence-specific double- and single-stranded DNA binding protein [Negishi, Nishita, Saëgusa, Kakizaki, Galli, Kihara, Tamai, Miyajima, Iguchi-Ariga and Ariga (1994) Oncogene, 9, 1133-1143]. MSSP-1 binds to a DNA replication origin/transcriptional enhancer of the human c-myc gene and has turned out to be identical with Scr2, a human protein which complements the defect of cdc2 kinase in S. pombe [Kataoka and Nojima (1994) Nucleic Acids Res., 22, 2687-2693]. We have cloned the cDNA for MSSP-2, another member of the MSSP family of proteins. The MSSP-2 cDNA shares highly homologous sequences with MSSP-1 cDNA, except for the insertion of 48 bp coding for 16 amino acids near the C-terminus. Like MSSP-1, MSSP-2 has RNP-1 consensus sequences. The results of the experiments using bacterially expressed MSSP-2, and its deletion mutants, as histidine fusion proteins suggested that the binding specificity of MSSP-2 to double- and single-stranded DNA is the same as that of MSSP-1, and that the RNP consensus sequences are required for the DNA binding of the protein. MSSP-2 stimulated the DNA replication of an SV40-derived plasmid containing the binding sequence for MSSP-1 or -2. MSSP-2 is hence suggested to play an important role in regulation of DNA replication. PMID:7838710

  7. Genome-wide identification of long non-coding RNA genes and their association with insecticide resistance and metamorphosis in diamondback moth, Plutella xylostella.

    PubMed

    Liu, Feiling; Guo, Dianhao; Yuan, Zhuting; Chen, Chen; Xiao, Huamei

    2017-11-20

    Long non-coding RNA (lncRNA) is a class of noncoding RNA >200 bp in length that has essential roles in regulating a variety of biological processes. Here, we constructed a computational pipeline to identify lncRNA genes in the diamondback moth (Plutella xylostella), a major insect pest of cruciferous vegetables. In total, 3,324 lncRNAs corresponding to 2,475 loci were identified from 13 RNA-Seq datasets, including samples from parasitized, insecticide-resistant strains and different developmental stages. The identified P. xylostella lncRNAs had shorter transcripts and fewer exons than protein-coding genes. Seven out of nine randomly selected lncRNAs were validated by strand-specific RT-PCR. In total, 54-172 lncRNAs were specifically expressed in the insecticide resistant strains, among which one lncRNA was located adjacent to the sodium channel gene. In addition, 63-135 lncRNAs were specifically expressed in different developmental stages, among which three lncRNAs overlapped or were located adjacent to the metamorphosis-associated genes. These lncRNAs were either strongly or weakly co-expressed with their overlapping or neighboring mRNA genes. In summary, we identified thousands of lncRNAs and presented evidence that lncRNAs might have key roles in conferring insecticide resistance and regulating the metamorphosis development in P. xylostella.

  8. Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins

    PubMed Central

    Delcourt, Vivian; Lucier, Jean-François; Gagnon, Jules; Beaudoin, Maxime C; Vanderperre, Benoît; Breton, Marc-André; Motard, Julie; Jacques, Jean-François; Brunelle, Mylène; Gagnon-Arsenault, Isabelle; Fournier, Isabelle; Ouangraoua, Aida; Hunting, Darel J; Cohen, Alan A; Landry, Christian R; Scott, Michelle S

    2017-01-01

    Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins. PMID:29083303
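    To make the idea of an altORF concrete, the sketch below scans the three forward reading frames of a transcript for ATG-to-stop open reading frames. It is a simplified illustration, not the annotation pipeline used in the study: the function name `find_orfs`, the `min_codons` threshold and the toy sequence are assumptions, and the reverse strand and non-ATG starts are ignored.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Find ATG..stop ORFs in the three forward frames of a sequence.

    Returns a sorted list of (start, end, frame) tuples, where end is
    exclusive and includes the stop codon, and frame is start % 3.
    min_codons counts coding codons (the stop codon is excluded).
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # look for the next ORF in this frame
    return sorted(orfs)


if __name__ == "__main__":
    # Toy transcript: a frame-0 ORF with a shorter ORF nested in frame 2.
    for orf in find_orfs("ATGAAATGCCCTAGGTAA"):
        print(orf)
```

    In this toy sequence the frame-0 ORF plays the role of the annotated CDS and the overlapping frame-2 ORF plays the role of an altORF.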

  9. Petunia nectar proteins have ribonuclease activity.

    PubMed

    Hillwig, Melissa S; Liu, Xiaoteng; Liu, Guangyu; Thornburg, Robert W; Macintosh, Gustavo C

    2010-06-01

    Plants requiring an insect pollinator often produce nectar as a reward for the pollinator's visitations. This rich secretion needs mechanisms to inhibit microbial growth. In Nicotiana spp. nectar, anti-microbial activity is due to the production of hydrogen peroxide. In a close relative, Petunia hybrida, limited production of hydrogen peroxide was found; yet petunia nectar still has anti-bacterial properties, suggesting that a different mechanism may exist for this inhibition. The nectar proteins of petunia plants were compared with those of ornamental tobacco and significant differences were found in protein profiles and function between these two closely related species. Among those proteins, RNase activities unique to petunia nectar were identified. The genes corresponding to four RNase T2 proteins from Petunia hybrida that show unique expression patterns in different plant tissues were cloned. Two of these enzymes, RNase Phy3 and RNase Phy4, are unique among the T2 family and contain characteristics similar to both S- and S-like RNases. Analysis of amino acid patterns suggests that these proteins are an intermediate between S- and S-like RNases, and supports the hypothesis that S-RNases evolved from defence RNases expressed in floral parts. This is the first report of RNase activities in nectar.

  10. Petunia nectar proteins have ribonuclease activity

    PubMed Central

    Hillwig, Melissa S.; Liu, Xiaoteng; Liu, Guangyu; Thornburg, Robert W.; MacIntosh, Gustavo C.

    2010-01-01

    Plants requiring an insect pollinator often produce nectar as a reward for the pollinator's visitations. This rich secretion needs mechanisms to inhibit microbial growth. In Nicotiana spp. nectar, anti-microbial activity is due to the production of hydrogen peroxide. In a close relative, Petunia hybrida, limited production of hydrogen peroxide was found; yet petunia nectar still has anti-bacterial properties, suggesting that a different mechanism may exist for this inhibition. The nectar proteins of petunia plants were compared with those of ornamental tobacco and significant differences were found in protein profiles and function between these two closely related species. Among those proteins, RNase activities unique to petunia nectar were identified. The genes corresponding to four RNase T2 proteins from Petunia hybrida that show unique expression patterns in different plant tissues were cloned. Two of these enzymes, RNase Phy3 and RNase Phy4, are unique among the T2 family and contain characteristics similar to both S- and S-like RNases. Analysis of amino acid patterns suggests that these proteins are an intermediate between S- and S-like RNases, and supports the hypothesis that S-RNases evolved from defence RNases expressed in floral parts. This is the first report of RNase activities in nectar. PMID:20460362

  11. Heterologous expression of the immunomodulatory protein gene from Ganoderma sinense in the basidiomycete Coprinopsis cinerea.

    PubMed

    Han, F; Liu, Y; Guo, L Q; Zeng, X L; Liu, Z M; Lin, J F

    2010-11-01

    FIP-gsi, a fungal immunomodulatory protein found in Ganoderma sinense, has antitumour, anti-allergy and immunomodulatory activities and is regulated by the fip-gsi gene. In this study, we aimed to express the fip-gsi gene from G. sinense in Coprinopsis cinerea to increase the yield of FIP-gsi. A fungal expression vector, pBfip-gsi, containing the gpd promoter from Agaricus bisporus and the fip-gsi gene from G. sinense was constructed and transformed into C. cinerea. PCR and Southern blotting analysis verified the successful integration of the exogenous gene fip-gsi into the genome of C. cinerea. RT-PCR and Northern blotting analysis confirmed that the fip-gsi gene was transcribed in C. cinerea. The yield of the FIP-gsi protein reached 314 mg kg⁻¹ fresh mycelia. The molecular weight of FIP-gsi was 13 kDa, and FIP-gsi was capable of hemagglutinating mouse red blood cells, but no such activity was observed towards human red blood cells in vitro. The fip-gsi from G. sinense has been successfully translated in C. cinerea, and the yield of bioactive FIP-gsi protein was high. This is the first report using C. cinerea for the heterologous expression of FIP-gsi protein and it might supply a basis for large-scale production of the protein. © 2010 The Authors. Journal of Applied Microbiology © 2010 The Society for Applied Microbiology.

  12. Protein annotation from protein interaction networks and Gene Ontology.

    PubMed

    Nguyen, Cao D; Gardiner, Katheleen J; Cios, Krzysztof J

    2011-10-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precision and 60% recall versus 45% and 26% for Majority and 24% and 61% for χ²-statistics, respectively. Copyright © 2011 Elsevier Inc. All rights reserved.
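    One way to compare the three precision and recall pairs quoted above on a single scale is the F1 score, the harmonic mean of precision and recall. The abstract does not report F1; the sketch below simply applies the standard formula to the quoted figures, and the function name `f1_score` and the dictionary layout are assumptions made for illustration.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (both given as fractions)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall) pairs as quoted in the abstract
REPORTED = {
    "proposed method": (0.51, 0.60),
    "Majority": (0.45, 0.26),
    "chi-squared statistics": (0.24, 0.61),
}

if __name__ == "__main__":
    for name, (p, r) in REPORTED.items():
        print(f"{name}: F1 = {f1_score(p, r):.3f}")
```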

  13. Gene networks in the synthesis and deposition of protein polymers during grain development of wheat.

    PubMed

    She, Maoyun; Ye, Xingguo; Yan, Yueming; Howit, C; Belgard, M; Ma, Wujun

    2011-03-01

    As the amino acid storing organelle, the protein bodies provide nutrients for embryo development, seed germination and early seedling growth through storage proteolysis in cereal plants, such as wheat and rice. In protein bodies, the monomeric and polymeric prolamins, i.e. gliadins and glutenins, form gluten and play a key role in determining dough functionality and end-product quality of wheat. The formation of intra- and intermolecular bonds, including disulphide and tyrosine bonds, in and between prolamins confers cohesivity, viscosity, elasticity and extensibility to wheat dough during mixing and processing. In this review, we summarize recent progress in wheat gluten research with a focus on the fundamental molecular biological aspects, including transcriptional regulation of genes coding for prolamin components, biosynthesis, deposition and secretion of protein polymers, formation of protein bodies, genetic control of seed storage proteins, the transportation of the protein bodies and key enzymes for determining the formation of disulphide bonds of prolamin polymers.

  14. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    PubMed Central

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results: We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion: Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs.
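    The step of scoring promoters with discovered motifs can be illustrated with a deliberately crude stand-in: counting exact, possibly overlapping matches of a few motif strings and ranking promoters by the total. The study itself used a discriminative motif discovery algorithm, not this method. The motif strings and promoter sequences below are hypothetical examples chosen for the sketch, and `count_motif` and `score_promoters` are illustrative names.

```python
import re


def count_motif(seq, motif):
    """Count occurrences of motif in seq, including overlapping matches."""
    # A zero-width lookahead lets the regex engine report overlapping hits.
    return len(re.findall(f"(?={re.escape(motif)})", seq.upper()))


def score_promoters(promoters, motifs):
    """Rank promoters by their total number of motif matches, highest first."""
    scores = {
        name: sum(count_motif(seq, motif) for motif in motifs)
        for name, seq in promoters.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical motif strings and toy promoter sequences (illustration only).
    motifs = ["CATGCA", "ACGT"]
    promoters = {
        "geneA": "TTCATGCATGCAACGT",
        "geneB": "GGGGACGTGGGG",
        "geneC": "TTTTTTTT",
    }
    for name, score in score_promoters(promoters, motifs):
        print(name, score)
```

    A position weight matrix would tolerate the degenerate sites the abstract mentions; exact matching is used here only to keep the example short.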

  15. The DNA-mimic antirestriction proteins ArdA ColIB-P9, Arn T4, and Ocr T7 as activators of H-NS-dependent gene transcription.

    PubMed

    Melkina, Olga E; Goryanin, Ignatiy I; Zavilgelsky, Gennadii B

    2016-11-01

    The antirestriction proteins ArdA ColIb-P9, Arn T4 and Ocr T7 specifically inhibit type I and type IV restriction enzymes and belong to the family of DNA-mimic proteins because their three-dimensional structure is similar to the double-helical B-form DNA. It is proposed that the DNA-mimic proteins are able to bind nucleoid protein H-NS and alleviate H-NS-silencing of the transcription of bacterial genes. Escherichia coli lux biosensors were constructed by inserting H-NS-dependent promoters into a vector, thereby placing each fragment upstream of the promoterless Photorhabdus luminescens luxCDABE operon. It was demonstrated that the DNA-mimic proteins ArdA, Arn and Ocr activate the transcription of H-NS-dependent promoters of the lux operon of marine luminescent bacteria (mesophilic Aliivibrio fischeri and psychrophilic Aliivibrio logei), and the dps gene from E. coli. It was also demonstrated that the ArdA antirestriction protein, the genes of which are located on transmissive plasmids ColIb-P9, R64, PK101, decreases levels of H-NS silencing of the PluxC promoter during conjugation in the recipient bacteria. Copyright © 2016 Elsevier GmbH. All rights reserved.

  16. The Saccharomyces cerevisiae enolase-related regions encode proteins that are active enolases.

    PubMed

    Kornblatt, M J; Richard Albert, J; Mattie, S; Zakaib, J; Dayanandan, S; Hanic-Joyce, P J; Joyce, P B M

    2013-02-01

    In addition to two genes (ENO1 and ENO2) known to code for enolase (EC 4.2.1.11), the Saccharomyces cerevisiae genome contains three enolase-related regions (ERR1, ERR2 and ERR3) which could potentially encode proteins with enolase function. Here, we show that products of these genes (Err2p and Err3p) have secondary and quaternary structures similar to those of yeast enolase (Eno1p). In addition, Err2p and Err3p can convert 2-phosphoglycerate to phosphoenolpyruvate, with kinetic parameters similar to those of Eno1p, suggesting that these proteins could function as enolases in vivo. To address this possibility, we overexpressed the ERR2 and ERR3 genes individually in a double-null yeast strain lacking ENO1 and ENO2, and showed that either ERR2 or ERR3 could complement the growth defect in this strain when cells are grown in medium with glucose as the carbon source. Taken together, these data suggest that the ERR genes in Saccharomyces cerevisiae encode a protein that could function in glycolysis as enolase. The presence of these enolase-related regions in Saccharomyces cerevisiae and their absence in other related yeasts suggests that these genes may play some unique role in Saccharomyces cerevisiae. Further experiments will be required to determine whether these functions are related to glycolysis or other cellular processes. Copyright © 2012 John Wiley & Sons, Ltd.

  17. Control of cellular morphogenesis by the Ipl2/Bem2 GTPase-activating protein: possible role of protein phosphorylation

    PubMed Central

    1994-01-01

    The IPL2 gene is known to be required for normal polarized cell growth in the budding yeast Saccharomyces cerevisiae. We now show that IPL2 is identical to the previously identified BEM2 gene. bem2 mutants are defective in bud site selection at 26 degrees C and localized cell surface growth and organization of the actin cytoskeleton at 37 degrees C. BEM2 encodes a protein with a COOH-terminal domain homologous to sequences found in several GTPase-activating proteins, including human Bcr. The GTPase-activating protein-domain from the Bem2 protein (Bem2p) or human Bcr can functionally substitute for Bem2p. The Rho1 and Rho2 GTPases are the likely in vivo targets of Bem2p because bem2 mutant phenotypes can be partially suppressed by increasing the gene dosage of RHO1 or RHO2. CDC55 encodes the putative regulatory B subunit of protein phosphatase 2A, and mutations in BEM2 have previously been identified as suppressors of the cdc55-1 mutation. We show here that mutations in the previously identified GRR1 gene can suppress bem2 mutations. grr1 and cdc55 mutants are both elongated in shape and cold-sensitive for growth, and cells lacking both GRR1 and CDC55 exhibit a synthetic lethal phenotype. bem2 mutant phenotypes also can be suppressed by the SSD1-v1 (also known as SRK1) mutation, which was shown previously to suppress mutations in the protein phosphatase-encoding SIT4 gene. Cells lacking both BEM2 and SIT4 exhibit a synthetic lethal phenotype even in the presence of the SSD1-v1 suppressor. These genetic interactions together suggest that protein phosphorylation and dephosphorylation play an important role in the BEM2-mediated process of polarized cell growth. PMID:7962097

  18. Control of cellular morphogenesis by the Ipl2/Bem2 GTPase-activating protein: possible role of protein phosphorylation.

    PubMed

    Kim, Y J; Francisco, L; Chen, G C; Marcotte, E; Chan, C S

    1994-12-01

    The IPL2 gene is known to be required for normal polarized cell growth in the budding yeast Saccharomyces cerevisiae. We now show that IPL2 is identical to the previously identified BEM2 gene. bem2 mutants are defective in bud site selection at 26 degrees C and localized cell surface growth and organization of the actin cytoskeleton at 37 degrees C. BEM2 encodes a protein with a COOH-terminal domain homologous to sequences found in several GTPase-activating proteins, including human Bcr. The GTPase-activating protein-domain from the Bem2 protein (Bem2p) or human Bcr can functionally substitute for Bem2p. The Rho1 and Rho2 GTPases are the likely in vivo targets of Bem2p because bem2 mutant phenotypes can be partially suppressed by increasing the gene dosage of RHO1 or RHO2. CDC55 encodes the putative regulatory B subunit of protein phosphatase 2A, and mutations in BEM2 have previously been identified as suppressors of the cdc55-1 mutation. We show here that mutations in the previously identified GRR1 gene can suppress bem2 mutations. grr1 and cdc55 mutants are both elongated in shape and cold-sensitive for growth, and cells lacking both GRR1 and CDC55 exhibit a synthetic lethal phenotype. bem2 mutant phenotypes also can be suppressed by the SSD1-v1 (also known as SRK1) mutation, which was shown previously to suppress mutations in the protein phosphatase-encoding SIT4 gene. Cells lacking both BEM2 and SIT4 exhibit a synthetic lethal phenotype even in the presence of the SSD1-v1 suppressor. These genetic interactions together suggest that protein phosphorylation and dephosphorylation play an important role in the BEM2-mediated process of polarized cell growth.

  19. XBP-1 Regulates a Subset of Endoplasmic Reticulum Resident Chaperone Genes in the Unfolded Protein Response

    PubMed Central

    Lee, Ann-Hwee; Iwakoshi, Neal N.; Glimcher, Laurie H.

    2003-01-01

    The mammalian unfolded protein response (UPR) protects the cell against the stress of misfolded proteins in the endoplasmic reticulum (ER). We have investigated here the contribution of the UPR transcription factors XBP-1, ATF6α, and ATF6β to UPR target gene expression. Gene profiling of cell lines lacking these factors yielded several XBP-1-dependent UPR target genes, all of which appear to act in the ER. These included the DnaJ/Hsp40-like genes, p58IPK, ERdj4, and HEDJ, as well as EDEM, protein disulfide isomerase-P5, and ribosome-associated membrane protein 4 (RAMP4), whereas expression of BiP was only modestly dependent on XBP-1. Surprisingly, given previous reports that enforced expression of ATF6α induced a subset of UPR target genes, cells deficient in ATF6α, ATF6β, or both had minimal defects in upregulating UPR target genes by gene profiling analysis, suggesting the presence of compensatory mechanism(s) for ATF6 in the UPR. Since cells lacking both XBP-1 and ATF6α had significantly impaired induction of select UPR target genes and ERSE reporter activation, XBP-1 and ATF6α may serve partially redundant functions. No UPR target genes that required ATF6β were identified, nor, in contrast to XBP-1 and ATF6α, did the activity of the UPRE or ERSE promoters require ATF6β, suggesting a minor role for it during the UPR. Collectively, these results suggest that the IRE1/XBP-1 pathway is required for efficient protein folding, maturation, and degradation in the ER and imply the existence of subsets of UPR target genes as defined by their dependence on XBP-1. Further, our observations suggest the existence of additional, as-yet-unknown, key regulators of the UPR. PMID:14559994

  20. Cloning and Expression Pattern of a Gene Encoding an α-Xylosidase Active against Xyloglucan Oligosaccharides from Arabidopsis

    PubMed Central

    Sampedro, Javier; Sieiro, Carmen; Revilla, Gloria; González-Villa, Tomás; Zarra, Ignacio

    2001-01-01

    An α-xylosidase active against xyloglucan oligosaccharides was purified from cabbage (Brassica oleracea var. capitata) leaves. Two peptide sequences were obtained from this protein, the N-terminal and an internal one, and these were used to identify an Arabidopsis gene coding for an α-xylosidase that we propose to call AtXYL1. It has been mapped to a region of chromosome I between markers at 100.44 and 107.48 cM. AtXYL1 comprised three exons and encoded a peptide that was 915 amino acids long, with a potential signal peptide of 22 amino acids and eight possible N-glycosylation sites. The protein encoded by AtXYL1 showed the signature regions of family 31 glycosyl hydrolases, which comprises not only α-xylosidases, but also α-glucosidases. The α-xylosidase activity is present in apoplastic extractions from Arabidopsis seedlings, as suggested by the deduced signal peptide. The first eight leaves from Arabidopsis plants were harvested to analyze α-xylosidase activity and AtXYL1 expression levels. Both increased from older to younger leaves, where xyloglucan turnover is expected to be higher. When this gene was introduced in a suitable expression vector and used to transform Saccharomyces cerevisiae, significantly higher α-xylosidase activity was detected in the yeast cells. α-Glucosidase activity was also increased in the transformed cells, although to a lesser extent. These results show that AtXYL1 encodes an apoplastic α-xylosidase active against xyloglucan oligosaccharides that probably also has activity against p-nitrophenyl-α-d-glucoside. PMID:11402218