Science.gov

Sample records for amp-binding protein gene

  1. Ethanol-induced loss of brain cyclic AMP binding proteins: correlation with growth suppression

    SciTech Connect

    Pennington, S.; Kalmus, G.

    1987-05-01

    Brain hypoplasia secondary to maternal ethanol consumption is a common fetal defect observed in all models of fetal alcohol syndrome. The molecular mechanism by which ethanol inhibits growth is unknown but has been hypothesized to involve ethanol-induced changes in the activity of cyclic-AMP stimulated protein kinase. Acute and chronic alcohol exposure elevate cyclic AMP level in many tissues, including brain. This increase in cyclic AMP should increase the phosphorylating activity of kinase by increasing the amount of dissociated (active) kinase catalytic subunit. In 7-day embryonic chick brains, ethanol-induced growth suppression was correlated with increased brain cyclic AMP content but neither basal nor cyclic AMP stimulated kinase catalytic activity was increased. However, the levels of cyclic AMP binding protein (kinase regulatory subunit) were significantly lowered by ethanol exposure. Measured as either /sup 3/H cyclic AMP binding or as 8-azido cyclic AM/sup 32/P labeling, ethanol-exposed brains had significantly less cyclic AMP binding activity (51 +/- 14 versus 29 +/- 10 units/..mu..g protein for 8-azido cyclic AMP binding). These findings suggest that ethanol's effect on kinase activity may involve more than ethanol-induced activation of adenylate cyclase.

  2. Functional characterization of the cAMP-binding proteins Epac in cardiac myocytes.

    PubMed

    Métrich, Mélanie; Morel, Eric; Berthouze, Magali; Pereira, Laetitia; Charron, Philippe; Gomez, Ana-Maria; Lezoualc'h, Frank

    2009-01-01

    The cyclic AMP (cAMP)-binding proteins, Epac, are guanine nucleotide exchange factors for the Ras-like small GTPases. Since their discovery in 1998 and with the development of specific Epac agonists, many data in the literature have illustrated their critical role in multiple cellular events mediated by the second messenger cAMP. Given the importance of cAMP in cardiovascular physiology and physiopathology, there is a growing interest to delineate the role of these multi-domain Epac in the cardiovascular system. This review will focus on recent pharmacological and biochemical studies aiming at understanding the role of Epac in cardiomyocyte signaling and hypertrophy.

  3. The Cyclic AMP-binding protein CbpB in Brucella melitensis and its role in cell envelope integrity, resistance to detergent and virulence.

    PubMed

    Liu, Wen-Juan; Dong, Hao; Peng, Xiao-Wei; Wu, Qing-Min

    2014-07-01

    Brucella melitensis possesses an operon with two components: the response regulator OtpR and a putative cAMP-dependent protein kinase regulatory subunit encoded by the BMEI0067 gene. In the previous study, the function of OtpR has been studied, while little is known about the function of the BMEI0067 gene. Using a bioinformatics approach, we showed that the BMEI0067 gene encodes an additional putative cAMP-binding protein, which we refer to as CbpB. Structural modeling predicted that CbpB has a cAMP-binding protein (CAP) domain and is structurally similar to eukaryotic protein kinase A regulatory subunits. Here, we report the characterization of CbpB, a cAMP-binding protein in Brucella melitensis, showed to be involved in mouse persistent infections. ∆cbpB::km possessed cell elongation, bubble-like protrusions on cell surface and its resistance to environmental stresses (temperature, osmotic stress and detergent). Interestingly, comparative real-time qPCR assays, the cbpB mutation resulted in significantly different expression of aqpX and several penicillin-binding proteins and cell division proteins in Brucella. Combined, these results demonstrated characterization of CbpB in B. melitensis and its key role for intracellular multiplication.

  4. The cAMP binding protein Epac regulates cardiac myofilament function

    PubMed Central

    Cazorla, Olivier; Lucas, Alexandre; Poirier, Florence; Lacampagne, Alain; Lezoualc'h, Frank

    2009-01-01

    In the heart, cAMP is a key regulator of excitation–contraction coupling and its biological effects are mainly associated with the activity of protein kinase A (PKA). The aim of this study was to investigate the contribution of the cAMP-binding protein Epac (Exchange protein directly activated by cAMP) in the regulation of the contractile properties of rat ventricular cardiac myocytes. We report that both PKA and Epac increased cardiac sarcomere contraction but through opposite mechanisms. Differently from PKA, selective Epac activation by the cAMP analog 8-(4-chlorophenylthio)-2′-O-methyl-cAMP (8-pCPT) reduced Ca2+ transient amplitude and increased cell shortening in intact cardiomyocytes and myofilament Ca2+ sensitivity in permeabilized cardiomyocytes. Moreover, ventricular myocytes, which were infected in vivo with a constitutively active form of Epac, showed enhanced myofilament Ca2+ sensitivity compared to control cells infected with green fluorescent protein (GFP) alone. At the molecular level, Epac increased phosphorylation of 2 key sarcomeric proteins, cardiac Troponin I (cTnI) and cardiac Myosin Binding Protein-C (cMyBP-C). The effects of Epac activation on myofilament Ca2+ sensitivity and on cTnI and cMyBP-C phosphorylation were independent of PKA and were blocked by protein kinase C (PKC) and Ca2+ calmodulin kinase II (CaMKII) inhibitors. Altogether these findings identify Epac as a new regulator of myofilament function. PMID:19666481

  5. Activity of cAMP-dependent protein kinases and cAMP-binding proteins of rat kidney cytosol during dehydration

    SciTech Connect

    Zelenina, M.N.; Solenov, E.I.; Ivanova, L.N.

    1985-09-20

    The activity of cAMP-dependent protein kinases, the binding of cAMP, and the spectrum of cAMP-binding proteins in the cytosol of the renal papilla was studied in intact rats and in rats after 24 h on a water-deprived diet. It was found that the activation of protein kinases by 10/sup -6/ M cAMP is significantly higher in the experimental animals than in the intact animals. In chromatography on DEAE-cellulose, the positions of the peaks of specific reception of cAMP corresponded to the peaks of the regulatory subunits of cAMP-dependent protein kinases of types I and II. In this case, in intact animals more than 80% of the binding activity was detected in peaks II, whereas in rats subjected to water deprivation, more than 60% of the binding was observed in peak I. The general regulatory activity of the cytosol was unchanged in the experimental animals in comparison with intact animals. It is suggested that during dehydration there is an induction of the synthesis of the regulatory subunit of type I cAMP-dependent protein kinase in the renal papilla.

  6. Detection of cyclic di-AMP using a competitive ELISA with a unique pneumococcal cyclic di-AMP binding protein

    PubMed Central

    Underwood, Adam J.; Zhang, Yang; Metzger, Dennis W.; Bai, Guangchun

    2014-01-01

    Cyclic di-AMP (c-di-AMP) is a signaling molecule that has been shown to play important roles in bacterial physiology and infections. Currently, c-di-AMP detection and quantification relies mostly on the use of high-performance liquid chromatography (HPLC) or liquid chromatography-mass spectrometry (LC-MS). In this study, a competitive enzyme-linked immunosorbent assay (ELISA) for the quantification of c-di-AMP was developed, which utilizes a novel pneumococcal c-di-AMP binding protein (CabP) and a newly commercialized c-di-AMP derivative. With this new method, c-di-AMP concentrations in biological samples can be quickly and accurately quantified. Furthermore, this assay is much more efficient than current methods as it requires less overall cost and training while processing many samples at once. Therefore, this assay can be extensively used in research into c-di-AMP signaling. PMID:25239824

  7. Identification of c-di-AMP-Binding Proteins Using Magnetic Beads.

    PubMed

    Kampf, Jan; Gundlach, Jan; Herzberg, Christina; Treffon, Katrin; Stülke, Jörg

    2017-01-01

    To identify cytosolic proteins that bind to cyclic di-AMP, a biotinylated analog of the nucleotide is used for protein pull-down experiments. In this approach, biotinylated c-di-AMP is coupled to Streptactin-covered beads. After protein separation using standard SDS-PAGE, the protein(s) of interest are identified by mass spectrometric analyses.

  8. Cyclic Di-AMP Impairs Potassium Uptake Mediated by a Cyclic Di-AMP Binding Protein in Streptococcus pneumoniae

    PubMed Central

    Bai, Yinlan; Yang, Jun; Zarrella, Tiffany M.; Zhang, Yang; Metzger, Dennis W.

    2014-01-01

    Cyclic di-AMP (c-di-AMP) has been shown to play important roles as a second messenger in bacterial physiology and infections. However, understanding of how the signal is transduced is still limited. Previously, we have characterized a diadenylate cyclase and two c-di-AMP phosphodiesterases in Streptococcus pneumoniae, a Gram-positive pathogen. In this study, we identified a c-di-AMP binding protein (CabP) in S. pneumoniae using c-di-AMP affinity chromatography. We demonstrated that CabP specifically bound c-di-AMP and that this interaction could not be interrupted by competition with other nucleotides, including ATP, cAMP, AMP, phosphoadenylyl adenosine (pApA), and cyclic di-GMP (c-di-GMP). By using a bacterial two-hybrid system and genetic mutagenesis, we showed that CabP directly interacted with a potassium transporter (SPD_0076) and that both proteins were required for pneumococcal growth in media with low concentrations of potassium. Interestingly, the interaction between CabP and SPD_0076 and the efficiency of potassium uptake were impaired by elevated c-di-AMP in pneumococci. These results establish a direct c-di-AMP-mediated signaling pathway that regulates pneumococcal potassium uptake. PMID:24272783

  9. Identification of a Tetrahydroquinoline Analog as a Pharmacological Inhibitor of the cAMP-binding Protein Epac*

    PubMed Central

    Courilleau, Delphine; Bisserier, Malik; Jullian, Jean-Christophe; Lucas, Alexandre; Bouyssou, Pascal; Fischmeister, Rodolphe; Blondeau, Jean-Paul; Lezoualc'h, Frank

    2012-01-01

    The cAMP-binding protein Epac is a therapeutic target for the treatment of various diseases such as cardiac hypertrophy and tumor invasion. This points out the importance to develop Epac inhibitors to better understand the involvement of these cAMP sensors in physiology and pathophysiology. Here, we have developed a functional fluorescence-based high-throughput assay with a Z′ value around 0.7 for screening Epac-specific antagonists. We identified an Epac1 inhibitor compound named CE3F4 that blocked Epac1 guanine nucleotide exchange activity toward its effector Rap1 both in cell-free systems and in intact cells. CE3F4 is a tetrahydroquinoline analog that fails to influence protein kinase A holoenzyme activity. CE3F4 inhibited neither the interaction of Rap1 with Epac1 nor directly the GDP exchange on Rap1. The kinetics of inhibition by CE3F4 indicated that this compound did not compete for binding of agonists to Epac1 and suggested an uncompetitive inhibition mechanism with respect to Epac1 agonists. A structure-activity study showed that the formyl group on position 1 and the bromine atom on position 5 of the tetrahydroquinoline skeleton were important for CE3F4 to exert its inhibitory activity. Finally, CE3F4 inhibited Rap1 activation in living cultured cells, following Epac activation by either 8-(4-chlorophenylthio)-2′-O-methyl-cAMP, an Epac-selective agonist, or isoprenaline, a non-selective β-adrenergic receptor agonist. Our study shows that CE3F4 and related compounds may serve as a basis for the development of new therapeutic drugs. PMID:23139415

  10. Site-directed mutagenesis of the cAMP-binding sites of the recombinant type I regulatory subunit of cAMP-dependent protein kinase.

    PubMed

    Kuno, T; Shuntoh, H; Sakaue, M; Saijoh, K; Takeda, T; Fukuda, K; Tanaka, C

    1988-06-30

    The type I regulatory subunit (R-I) of rat brain cAMP-dependent protein kinase was expressed in E. coli and site-directed mutagenesis was used to substitute amino acids in the putative cAMP-binding sites. The wild-type recombinant R-I bound 2 mol of cAMP/mol subunit, while two mutant R-Is with a single amino acid substitution in one of the two intrachain cAMP-binding sites (clone N153:a glutamate for Gly-200, and clone C254:an aspartate for Gly-324) bound 1 mol of cAMP/mol subunit. When these two substitutions were made in one mutant, cAMP did not bind to this mutant, indicating that binding of cAMP to N153 or C254 was to their nonmutated sites. Competition experiments with site-selective analogs and dissociation of bound cAMP from mutant R-Is provided evidence for strong intrachain interactions between the two classes of cAMP-binding sites in R-I.

  11. Complex structure and biochemical characterization of the Staphylococcus aureus cyclic diadenylate monophosphate (c-di-AMP)-binding protein PstA, the founding member of a new signal transduction protein family.

    PubMed

    Campeotto, Ivan; Zhang, Yong; Mladenov, Miroslav G; Freemont, Paul S; Gründling, Angelika

    2015-01-30

    Signaling nucleotides are integral parts of signal transduction systems allowing bacteria to cope with and rapidly respond to changes in the environment. The Staphylococcus aureus PII-like signal transduction protein PstA was recently identified as a cyclic diadenylate monophosphate (c-di-AMP)-binding protein. Here, we present the crystal structures of the apo- and c-di-AMP-bound PstA protein, which is trimeric in solution as well as in the crystals. The structures combined with detailed bioinformatics analysis revealed that the protein belongs to a new family of proteins with a similar core fold but with distinct features to classical PII proteins, which usually function in nitrogen metabolism pathways in bacteria. The complex structure revealed three identical c-di-AMP-binding sites per trimer with each binding site at a monomer-monomer interface. Although distinctly different from other cyclic-di-nucleotide-binding sites, as the half-binding sites are not symmetrical, the complex structure also highlighted common features for c-di-AMP-binding sites. A comparison between the apo and complex structures revealed a series of conformational changes that result in the ordering of two anti-parallel β-strands that protrude from each monomer and allowed us to propose a mechanism on how the PstA protein functions as a signaling transduction protein.

  12. CRIS—A Novel cAMP-Binding Protein Controlling Spermiogenesis and the Development of Flagellar Bending

    PubMed Central

    Krähling, Anke Miriam; Alvarez, Luis; Debowski, Katharina; Van, Qui; Gunkel, Monika; Irsen, Stephan; Al-Amoudi, Ashraf; Strünker, Timo; Kremmer, Elisabeth; Krause, Eberhard; Voigt, Ingo; Wörtge, Simone; Waisman, Ari; Weyand, Ingo; Seifert, Reinhard; Kaupp, Ulrich Benjamin; Wachten, Dagmar

    2013-01-01

    The second messengers cAMP and cGMP activate their target proteins by binding to a conserved cyclic nucleotide-binding domain (CNBD). Here, we identify and characterize an entirely novel CNBD-containing protein called CRIS (cyclic nucleotide receptor involved in sperm function) that is unrelated to any of the other members of this protein family. CRIS is exclusively expressed in sperm precursor cells. Cris-deficient male mice are either infertile due to a lack of sperm resulting from spermatogenic arrest, or subfertile due to impaired sperm motility. The motility defect is caused by altered Ca2+ regulation of flagellar beat asymmetry, leading to a beating pattern that is reminiscent of sperm hyperactivation. Our results suggest that CRIS interacts during spermiogenesis with Ca2+-regulated proteins that—in mature sperm—are involved in flagellar bending. PMID:24339785

  13. Active site conformational changes upon reaction intermediate biotinyl-5'-AMP binding in biotin protein ligase from Mycobacterium tuberculosis.

    PubMed

    Ma, Qingjun; Akhter, Yusuf; Wilmanns, Matthias; Ehebauer, Matthias T

    2014-07-01

    Protein biotinylation, a rare form of post-translational modification, is found in enzymes required for lipid biosynthesis. In mycobacteria, this process is essential for the formation of their complex and distinct cell wall and has become a focal point of drug discovery approaches. The enzyme responsible for this process, biotin protein ligase, substantially varies in different species in terms of overall structural organization, regulation of function and substrate specificity. To advance the understanding of the molecular mechanism of biotinylation in Mycobacterium tuberculosis we have biochemically and structurally characterized the corresponding enzyme. We report the high-resolution crystal structures of the apo-form and reaction intermediate biotinyl-5'-AMP-bound form of M. tuberculosis biotin protein ligase. Binding of the reaction intermediate leads to clear disorder-to-order transitions. We show that a conserved lysine, Lys138, in the active site is essential for biotinylation. © 2014 The Protein Society.

  14. Mycobacterium tuberculosis cAMP Receptor Protein (Rv3676) Differs from the Escherichia coli Paradigm in Its cAMP Binding and DNA Binding Properties and Transcription Activation Properties*

    PubMed Central

    Stapleton, Melanie; Haq, Ihtshamul; Hunt, Debbie M.; Arnvig, Kristine B.; Artymiuk, Peter J.; Buxton, Roger S.; Green, Jeffrey

    2010-01-01

    The pathogen Mycobacterium tuberculosis produces a burst of cAMP upon infection of macrophages. Bacterial cyclic AMP receptor proteins (CRP) are transcription factors that respond to cAMP by binding at target promoters when cAMP concentrations increase. Rv3676 (CRPMt) is a CRP family protein that regulates expression of genes (rpfA and whiB1) that are potentially involved in M. tuberculosis persistence and/or emergence from the dormant state. Here, the CRPMt homodimer is shown to bind two molecules of cAMP (one per protomer) at noninteracting sites. Furthermore, cAMP binding by CRPMt was relatively weak, entropy driven, and resulted in a relatively small enhancement in DNA binding. Tandem CRPMt-binding sites (CRP1 at −58.5 and CRP2 at −37.5) were identified at the whiB1 promoter (PwhiB1). In vitro transcription reactions showed that CRP1 is an activating site and that CRP2, which was only occupied in the presence of cAMP or at high CRPMt concentrations in the absence of cAMP, is a repressing site. Binding of CRPMt to CRP1 was not essential for open complex formation but was required for transcription activation. Thus, these data suggest that binding of CRPMt to the PwhiB1 CRP1 site activates transcription at a step after open complex formation. In contrast, high cAMP concentrations allowed occupation of both CRP1 and CRP2 sites, resulting in inhibition of open complex formation. Thus, M. tuberculosis CRP has evolved several distinct characteristics, compared with the Escherichia coli CRP paradigm, to allow it to regulate gene expression against a background of high concentrations of cAMP. PMID:20028978

  15. Allostery and Conformational Dynamics in cAMP-binding Acyltransferases*

    PubMed Central

    Podobnik, Marjetka; Siddiqui, Nida; Rebolj, Katja; Nambi, Subhalaxmi; Merzel, Franci; Visweswariah, Sandhya S.

    2014-01-01

    Mycobacteria harbor unique proteins that regulate protein lysine acylation in a cAMP-regulated manner. These lysine acyltransferases from Mycobacterium smegmatis (KATms) and Mycobacterium tuberculosis (KATmt) show distinctive biochemical properties in terms of cAMP binding affinity to the N-terminal cyclic nucleotide binding domain and allosteric activation of the C-terminal acyltransferase domain. Here we provide evidence for structural features in KATms that account for high affinity cAMP binding and elevated acyltransferase activity in the absence of cAMP. Structure-guided mutational analysis converted KATms from a cAMP-regulated to a cAMP-dependent acyltransferase and identified a unique asparagine residue in the acyltransferase domain of KATms that assists in the enzymatic reaction in the absence of a highly conserved glutamate residue seen in Gcn5-related N-acetyltransferase-like acyltransferases. Thus, we have identified mechanisms by which properties of similar proteins have diverged in two species of mycobacteria by modifications in amino acid sequence, which can dramatically alter the abundance of conformational states adopted by a protein. PMID:24748621

  16. Developmentally regulated enzymes and cyclic AMP-binding sites in Dictyostelium discoideum cells blocked during development by alpha-chymotrypsin.

    PubMed Central

    Schmidt, J A; Stirling, J L

    1982-01-01

    When cells of the slime mould Dictyostelium discoideum are allowed to starve in the presence of alpha-chymotrypsin, they are blocked in development at the stage where tight aggregates form tips. Analysis of developmentally regulated enzymes has shown that alpha-mannosidase, beta-N-acetylglucosaminidase, threonine deaminase, tyrosine aminotransferase, beta-glucosidase and the carbohydrate-binding protein discoidin are unaffected, but enzymes that show an increase in specific activity during post-aggregative development, namely glycogen phosphorylase, UDP-glucose pyrophosphorylase, UDP-galactose 4-epimerase, UDP-galactose polysaccharide transferase and alkaline phosphatase, did not show the characteristic increase when development was blocked by alpha-chymotrypsin. Recovery of cells from the effects of alpha-chymotrypsin was accompanied by the formation of fruiting bodies and a concomitant increase in the specific activity of UDP-glucose pyrophosphorylase. Uptake or efflux of 45Ca2+ was not altered in the presence of alpha-chymotrypsin. Cells allowed to develop in alpha-chymotrypsin, or treated with the enzyme for 15 min, had a markedly reduced ability to bind cyclic AMP with low affinity; high-affinity binding was unaffected. Pronase had a similar effect on cyclic AMP binding, but trypsin, which does not alter developmental processes, has no effect on cyclic AMP binding to D. discoideum cells. PMID:7150239

  17. Mapping the Free Energy Landscape of PKA Inhibition and Activation: A Double-Conformational Selection Model for the Tandem cAMP-Binding Domains of PKA RIα

    PubMed Central

    Akimoto, Madoka; McNicholl, Eric Tyler; Ramkissoon, Avinash; Moleschi, Kody; Taylor, Susan S.; Melacini, Giuseppe

    2015-01-01

    Protein Kinase A (PKA) is the major receptor for the cyclic adenosine monophosphate (cAMP) secondary messenger in eukaryotes. cAMP binds to two tandem cAMP-binding domains (CBD-A and -B) within the regulatory subunit of PKA (R), unleashing the activity of the catalytic subunit (C). While CBD-A in RIα is required for PKA inhibition and activation, CBD-B functions as a “gatekeeper” domain that modulates the control exerted by CBD-A. Preliminary evidence suggests that CBD-B dynamics are critical for its gatekeeper function. To test this hypothesis, here we investigate by Nuclear Magnetic Resonance (NMR) the two-domain construct RIα (91–379) in its apo, cAMP2, and C-bound forms. Our comparative NMR analyses lead to a double conformational selection model in which each apo CBD dynamically samples both active and inactive states independently of the adjacent CBD within a nearly degenerate free energy landscape. Such degeneracy is critical to explain the sensitivity of CBD-B to weak interactions with C and its high affinity for cAMP. Binding of cAMP eliminates this degeneracy, as it selectively stabilizes the active conformation within each CBD and inter-CBD contacts, which require both cAMP and W260. The latter is contributed by CBD-B and mediates capping of the cAMP bound to CBD-A. The inter-CBD interface is dispensable for intra-CBD conformational selection, but is indispensable for full activation of PKA as it occludes C-subunit recognition sites within CBD-A. In addition, the two structurally homologous cAMP-bound CBDs exhibit marked differences in their residual dynamics profiles, supporting the notion that conservation of structure does not necessarily imply conservation of dynamics. PMID:26618408

  18. Mapping the Free Energy Landscape of PKA Inhibition and Activation: A Double-Conformational Selection Model for the Tandem cAMP-Binding Domains of PKA RIα.

    PubMed

    Akimoto, Madoka; McNicholl, Eric Tyler; Ramkissoon, Avinash; Moleschi, Kody; Taylor, Susan S; Melacini, Giuseppe

    2015-01-01

    Protein Kinase A (PKA) is the major receptor for the cyclic adenosine monophosphate (cAMP) secondary messenger in eukaryotes. cAMP binds to two tandem cAMP-binding domains (CBD-A and -B) within the regulatory subunit of PKA (R), unleashing the activity of the catalytic subunit (C). While CBD-A in RIα is required for PKA inhibition and activation, CBD-B functions as a "gatekeeper" domain that modulates the control exerted by CBD-A. Preliminary evidence suggests that CBD-B dynamics are critical for its gatekeeper function. To test this hypothesis, here we investigate by Nuclear Magnetic Resonance (NMR) the two-domain construct RIα (91-379) in its apo, cAMP2, and C-bound forms. Our comparative NMR analyses lead to a double conformational selection model in which each apo CBD dynamically samples both active and inactive states independently of the adjacent CBD within a nearly degenerate free energy landscape. Such degeneracy is critical to explain the sensitivity of CBD-B to weak interactions with C and its high affinity for cAMP. Binding of cAMP eliminates this degeneracy, as it selectively stabilizes the active conformation within each CBD and inter-CBD contacts, which require both cAMP and W260. The latter is contributed by CBD-B and mediates capping of the cAMP bound to CBD-A. The inter-CBD interface is dispensable for intra-CBD conformational selection, but is indispensable for full activation of PKA as it occludes C-subunit recognition sites within CBD-A. In addition, the two structurally homologous cAMP-bound CBDs exhibit marked differences in their residual dynamics profiles, supporting the notion that conservation of structure does not necessarily imply conservation of dynamics.

  19. The Crystal Structures of Apo and cAMP-Bound GlxR from Corynebacterium glutamicum Reveal Structural and Dynamic Changes upon cAMP Binding in CRP/FNR Family Transcription Factors

    PubMed Central

    Townsend, Philip D.; Jungwirth, Britta; Pojer, Florence; Bußmann, Michael; Money, Victoria A.; Cole, Stewart T.; Pühler, Alfred; Tauch, Andreas; Bott, Michael; Cann, Martin J.; Pohl, Ehmke

    2014-01-01

    The cyclic AMP-dependent transcriptional regulator GlxR from Corynebacterium glutamicum is a member of the super-family of CRP/FNR (cyclic AMP receptor protein/fumarate and nitrate reduction regulator) transcriptional regulators that play central roles in bacterial metabolic regulatory networks. In C. glutamicum, which is widely used for the industrial production of amino acids and serves as a non-pathogenic model organism for members of the Corynebacteriales including Mycobacterium tuberculosis, the GlxR homodimer controls the transcription of a large number of genes involved in carbon metabolism. GlxR therefore represents a key target for understanding the regulation and coordination of C. glutamicum metabolism. Here we investigate cylic AMP and DNA binding of GlxR from C. glutamicum and describe the crystal structures of apo GlxR determined at a resolution of 2.5 Å, and two crystal forms of holo GlxR at resolutions of 2.38 and 1.82 Å, respectively. The detailed structural analysis and comparison of GlxR with CRP reveals that the protein undergoes a distinctive conformational change upon cyclic AMP binding leading to a dimer structure more compatible to DNA-binding. As the two binding sites in the GlxR homodimer are structurally identical dynamic changes upon binding of the first ligand are responsible for the allosteric behavior. The results presented here show how dynamic and structural changes in GlxR lead to optimization of orientation and distance of its two DNA-binding helices for optimal DNA recognition. PMID:25469635

  20. The crystal structures of apo and cAMP-bound GlxR from Corynebacterium glutamicum reveal structural and dynamic changes upon cAMP binding in CRP/FNR family transcription factors.

    PubMed

    Townsend, Philip D; Jungwirth, Britta; Pojer, Florence; Bußmann, Michael; Money, Victoria A; Cole, Stewart T; Pühler, Alfred; Tauch, Andreas; Bott, Michael; Cann, Martin J; Pohl, Ehmke

    2014-01-01

    The cyclic AMP-dependent transcriptional regulator GlxR from Corynebacterium glutamicum is a member of the super-family of CRP/FNR (cyclic AMP receptor protein/fumarate and nitrate reduction regulator) transcriptional regulators that play central roles in bacterial metabolic regulatory networks. In C. glutamicum, which is widely used for the industrial production of amino acids and serves as a non-pathogenic model organism for members of the Corynebacteriales including Mycobacterium tuberculosis, the GlxR homodimer controls the transcription of a large number of genes involved in carbon metabolism. GlxR therefore represents a key target for understanding the regulation and coordination of C. glutamicum metabolism. Here we investigate cylic AMP and DNA binding of GlxR from C. glutamicum and describe the crystal structures of apo GlxR determined at a resolution of 2.5 Å, and two crystal forms of holo GlxR at resolutions of 2.38 and 1.82 Å, respectively. The detailed structural analysis and comparison of GlxR with CRP reveals that the protein undergoes a distinctive conformational change upon cyclic AMP binding leading to a dimer structure more compatible to DNA-binding. As the two binding sites in the GlxR homodimer are structurally identical dynamic changes upon binding of the first ligand are responsible for the allosteric behavior. The results presented here show how dynamic and structural changes in GlxR lead to optimization of orientation and distance of its two DNA-binding helices for optimal DNA recognition.

  1. NMR studies of the AMP-binding site and mechanism of adenylate kinase.

    PubMed

    Fry, D C; Kuby, S A; Mildvan, A S

    1987-03-24

    NMR has previously been used to determine the conformation of enzyme-bound MgATP and to locate the MgATP-binding site on adenylate kinase [Fry, D. C., Kuby, S. A., & Mildvan, A. S. (1985) Biochemistry 24, 4680-4694]. To determine the conformation and location of the other substrate, AMP, distances have been measured from Cr3+AMPPCP, a linear competitive inhibitor with respect to MgATP, to six protons and to the phosphorus atom of AMP on adenylate kinase, with the paramagnetic probe-T1 method. Time-dependent nuclear Overhauser effects (NOEs) have been used to measure five interproton distances on enzyme-bound AMP. These distances were used to determine the conformation of bound AMP in addition to its position with respect to metal-ATP. Enzyme-bound AMP exhibits a high anti-glycosyl torsional angle (chi = 110 +/- 10 degrees), a 3'-endo,2'-exo ribose pucker (delta = 105 +/- 10 degrees), and gauche-trans orientations about the C4'-C5' bond (gamma = 180 +/- 10 degrees) and the C5'-O5' bond (beta = 170 +/- 20 degrees). The distance from Cr3+ to the phosphorus of AMP is 5.9 +/- 0.3 A, indicating a reaction coordinate distance of approximately 3 A, which is consistent with an associative SN2 mechanism for the phosphoryl transfer. Ten intermolecular NOEs, from protons of the enzyme to those of AMP, were detected, indicating the proximity of at least three hydrophobic amino acids to bound AMP. These constraints, together with the conformation of AMP and the intersubstrate distances, were used to position AMP into the X-ray structure of adenylate kinase. The AMP binding site is found to be near (less than or equal to 4 A from) Leu-116, Arg-171, Val-173, Val-182, and Leu-190; all of these residues have been found to be invariant in muscle-type rabbit, calf, human, porcine [Kuby, S. A., Palmieri, R. H., Frischat, A., Fischer, A. H., Wu, L. H., Maland, L., & Manship, M. (1984) Biochemistry 23, 2393-2399], and chicken adenylate kinase [Kishi, F., Maruyama, M., Tanizawa, Y

  2. Protein splicing: selfish genes invade cellular proteins.

    PubMed

    Neff, N F

    1993-12-01

    Protein splicing is a series of enzymatic events involving intramolecular protein breakage, rejoining and intron homing, in which introns are able to promote the recombinative transposition of their own coding sequences. Eukaryotic and prokaryotic spliced proteins have conserved similar gene structure, but little amino acid identity. The genes coding for these spliced proteins contain internal in-frame introns that encode polypeptides that apparently self-excise from the resulting host protein sequences. Excision of the 'protein intron' is coupled with joining of the two flanking protein regions encoded by exons of the host gene. Some introns of this type encode DNA endonucleases, related to Group I RNA intron gene products, that stimulate gene conversion and self-transmission.

  3. c-di-AMP binds the ydaO riboswitch in two pseudo-symmetry-related pockets.

    PubMed

    Ren, Aiming; Patel, Dinshaw J

    2014-09-01

    The ydaO riboswitch, involved in sporulation, osmotic stress responses and cell wall metabolism, targets the second messenger cyclic-di-AMP with subnanomolar affinity. We have solved the structure of c-di-AMP bound to the Thermoanaerobacter tengcongensis ydaO riboswitch, thereby identifying a five-helical scaffold containing a zippered-up bubble, a pseudoknot and long-range tertiary base pairs. Highlights include the identification of two c-di-AMP binding pockets on the same face of the riboswitch, related by pseudo-two-fold symmetry, with potential for cross-talk between sites mediated by adjacently positioned base-stacking alignments connecting pockets. The adenine rings of bound c-di-AMP molecules are wedged between bases and stabilized by stacking, base-sugar and sugar-sugar intermolecular hydrogen bonding interactions. The structural studies are complemented by isothermal titration calorimetry-based binding studies of mutants mediating key tertiary intermolecular contacts. The T. tengcongensis ydaO riboswitch, like its Bacillus subtilis counterpart, most likely functions through a transcription termination mechanism, with the c-di-AMP bound state representing an 'off' switch.

  4. Random insertion of GFP into the cAMP-dependent protein kinase regulatory subunit from Dictyostelium discoideum.

    PubMed Central

    Biondi, R M; Baehler, P J; Reymond, C D; Véron, M

    1998-01-01

    The green fluorescent protein (GFP) is currently being used for diverse cellular biology approaches, mainly as a protein tag or to monitor gene expression. Recently it has been shown that GFP can also be used to monitor the activation of second messenger pathways by the use of fluorescence resonance energy transfer (FRET) between two different GFP mutants fused to a Ca2+sensor. We show here that GFP fusions can also be used to obtain information on regions essential for protein function. As FRET requires the two GFPs to be very close, N- or C-terminal fusion proteins will not generally produce FRET between two interacting proteins. In order to increase the probability of FRET, we decided to study the effect of random insertion of two GFP mutants into a protein of interest. We describe here a methodology for random insertion of GFP into the cAMP-dependent protein kinase regulatory subunit using a bacterial expression vector. The selection and analysis of 120 green fluorescent colonies revealed that the insertions were distributed throughout the R coding region. 14 R/GFP fusion proteins were partially purified and characterized for cAMP binding, fluorescence and ability to inhibit PKA catalytic activity. This study reveals that GFP insertion only moderately disturbed the overall folding of the protein or the proper folding of another domain of the protein, as tested by cAMP binding capacity. Furthermore, three R subunits out of 14, which harbour a GFP inserted in the cAMP binding site B, inhibit PKA catalytic subunit in a cAMP-dependent manner. Random insertion of GFP within the R subunit sets the path to develop two-component FRET with the C subunit. PMID:9776758

  5. Random insertion of GFP into the cAMP-dependent protein kinase regulatory subunit from Dictyostelium discoideum.

    PubMed

    Biondi, R M; Baehler, P J; Reymond, C D; Véron, M

    1998-11-01

    The green fluorescent protein (GFP) is currently being used for diverse cellular biology approaches, mainly as a protein tag or to monitor gene expression. Recently it has been shown that GFP can also be used to monitor the activation of second messenger pathways by the use of fluorescence resonance energy transfer (FRET) between two different GFP mutants fused to a Ca2+sensor. We show here that GFP fusions can also be used to obtain information on regions essential for protein function. As FRET requires the two GFPs to be very close, N- or C-terminal fusion proteins will not generally produce FRET between two interacting proteins. In order to increase the probability of FRET, we decided to study the effect of random insertion of two GFP mutants into a protein of interest. We describe here a methodology for random insertion of GFP into the cAMP-dependent protein kinase regulatory subunit using a bacterial expression vector. The selection and analysis of 120 green fluorescent colonies revealed that the insertions were distributed throughout the R coding region. 14 R/GFP fusion proteins were partially purified and characterized for cAMP binding, fluorescence and ability to inhibit PKA catalytic activity. This study reveals that GFP insertion only moderately disturbed the overall folding of the protein or the proper folding of another domain of the protein, as tested by cAMP binding capacity. Furthermore, three R subunits out of 14, which harbour a GFP inserted in the cAMP binding site B, inhibit PKA catalytic subunit in a cAMP-dependent manner. Random insertion of GFP within the R subunit sets the path to develop two-component FRET with the C subunit.

  6. Protein polymer: Gene libraries open up

    NASA Astrophysics Data System (ADS)

    Ding, Sheng; Wang, Xiaoxiao; Barron, Annelise E.

    2011-02-01

    By combining gene cloning and amplification techniques, a new one-pot, parallel synthesis method for the generation of long, repetitive genes is realized. The method promises to open up the discovery of protein polymer biomaterials.

  7. From Gene Mutation to Protein Characterization

    ERIC Educational Resources Information Center

    Moffet, David A.

    2009-01-01

    A seven-week "gene to protein" laboratory sequence is described for an undergraduate biochemistry laboratory course. Student pairs were given the task of introducing a point mutation of their choosing into the well studied protein, enhanced green fluorescent protein (EGFP). After conducting literature searches, each student group chose the…

  8. From Gene Mutation to Protein Characterization

    ERIC Educational Resources Information Center

    Moffet, David A.

    2009-01-01

    A seven-week "gene to protein" laboratory sequence is described for an undergraduate biochemistry laboratory course. Student pairs were given the task of introducing a point mutation of their choosing into the well studied protein, enhanced green fluorescent protein (EGFP). After conducting literature searches, each student group chose the…

  9. cAMP-dependent activation of protein kinase A as a therapeutic target of skin hyperpigmentation by diphenylmethylene hydrazinecarbothioamide

    PubMed Central

    Shin, Hyoeun; Hong, Seung Deok; Roh, Eunmiri; Jung, Sang-Hun; Cho, Won-Jea; Hong Park, Sun; Yoon, Da Young; Ko, Seon Mi; Hwang, Bang Yeon; Hong, Jin Tae; Heo, Tae-Young; Han, Sang-Bae; Kim, Youngsoo

    2015-01-01

    Background and Purpose cAMP as a second messenger stimulates expression of microphthalmia-associated transcription factor (MITF) or the tyrosinase gene in UVB-induced skin pigmentation. Diphenylmethylene hydrazinecarbothioamide (QNT 3-80) inhibits α-melanocyte-stimulating hormone (α-MSH)-induced melanin production in B16 murine melanoma cells but its molecular basis remains to be defined. Here, we investigated the mechanism underlying the amelioration of skin hyperpigmentation by QNT 3-80. Experimental Approach We used melanocyte cultures with raised levels of cAMP and UVB-irradiated dorsal skin of guinea pigs for pigmentation assays. Immunoprecipitation, kemptide phosphorylation, fluorescence analysis and docking simulation were applied to elucidate a molecular mechanism of QNT 3-80. Key Results QNT 3-80 inhibited melanin production in melanocyte cultures with elevated levels of cAMP, including those from human foreskin. This compound also ameliorated hyperpigmentation in vivo in UVB-irradiated dorsal skin of guinea pigs. As a mechanism, QNT 3-80 directly antagonized cAMP binding to the regulatory subunit of PKA, nullified the dissociation and activation of inactive PKA holoenzyme in melanocytes and fitted into the cAMP-binding site on the crystal structure of human PKA under the most energetically favourable simulation. QNT 3-80 consequently inhibited cAMP- or UVB-induced phosphorylation (activation) of cAMP-responsive element-binding protein in vitro and in vivo, thus down-regulating expression of genes for MITF or tyrosinase in the melanogenic process. Conclusions and Implications Our data suggested that QNT 3-80 could contribute significantly to the treatment of skin disorders with hyperpigmented patches with the cAMP-binding site of PKA as its molecular target. PMID:25766244

  10. Popeye domain containing proteins are essential for stress-mediated modulation of cardiac pacemaking in mice

    PubMed Central

    Froese, Alexander; Breher, Stephanie S.; Waldeyer, Christoph; Schindler, Roland F.R.; Nikolaev, Viacheslav O.; Rinné, Susanne; Wischmeyer, Erhard; Schlueter, Jan; Becher, Jan; Simrick, Subreena; Vauti, Franz; Kuhtz, Juliane; Meister, Patrick; Kreissl, Sonja; Torlopp, Angela; Liebig, Sonja K.; Laakmann, Sandra; Müller, Thomas D.; Neumann, Joachim; Stieber, Juliane; Ludwig, Andreas; Maier, Sebastian K.; Decher, Niels; Arnold, Hans-Henning; Kirchhof, Paulus; Fabritz, Larissa; Brand, Thomas

    2012-01-01

    Cardiac pacemaker cells create rhythmic pulses that control heart rate; pacemaker dysfunction is a prevalent disorder in the elderly, but little is known about the underlying molecular causes. Popeye domain containing (Popdc) genes encode membrane proteins with high expression levels in cardiac myocytes and specifically in the cardiac pacemaking and conduction system. Here, we report the phenotypic analysis of mice deficient in Popdc1 or Popdc2. ECG analysis revealed severe sinus node dysfunction when freely roaming mutant animals were subjected to physical or mental stress. In both mutants, bradyarrhythmia developed in an age-dependent manner. Furthermore, we found that the conserved Popeye domain functioned as a high-affinity cAMP-binding site. Popdc proteins interacted with the potassium channel TREK-1, which led to increased cell surface expression and enhanced current density, both of which were negatively modulated by cAMP. These data indicate that Popdc proteins have an important regulatory function in heart rate dynamics that is mediated, at least in part, through cAMP binding. Mice with mutant Popdc1 and Popdc2 alleles are therefore useful models for the dissection of the mechanisms causing pacemaker dysfunction and could aid in the development of strategies for therapeutic intervention. PMID:22354168

  11. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

    SciTech Connect

    Raymond, Amy; Lovell, Scott; Lorimer, Don; Walchli, John; Mixon, Mark; Wallace, Ellen; Thompkins, Kaitlin; Archer, Kimberly; Burgin, Alex; Stewart, Lance

    2009-12-01

    With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38{alpha}), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.

  12. Integrated protein function prediction by mining function associations, sequences, and protein-protein and gene-gene interaction networks.

    PubMed

    Cao, Renzhi; Cheng, Jianlin

    2016-01-15

    Protein function prediction is an important and challenging problem in bioinformatics and computational biology. Functionally relevant biological information such as protein sequences, gene expression, and protein-protein interactions has been used mostly separately for protein function prediction. One of the major challenges is how to effectively integrate multiple sources of both traditional and new information such as spatial gene-gene interaction networks generated from chromosomal conformation data together to improve protein function prediction. In this work, we developed three different probabilistic scores (MIS, SEQ, and NET score) to combine protein sequence, function associations, and protein-protein interaction and spatial gene-gene interaction networks for protein function prediction. The MIS score is mainly generated from homologous proteins found by PSI-BLAST search, and also association rules between Gene Ontology terms, which are learned by mining the Swiss-Prot database. The SEQ score is generated from protein sequences. The NET score is generated from protein-protein interaction and spatial gene-gene interaction networks. These three scores were combined in a new Statistical Multiple Integrative Scoring System (SMISS) to predict protein function. We tested SMISS on the data set of 2011 Critical Assessment of Function Annotation (CAFA). The method performed substantially better than three base-line methods and an advanced method based on protein profile-sequence comparison, profile-profile comparison, and domain co-occurrence networks according to the maximum F-measure. Copyright © 2015 Elsevier Inc. All rights reserved.

  13. Protein structure protection commits gene expression patterns.

    PubMed

    Chen, Jianping; Liang, Han; Fernández, Ariel

    2008-01-01

    Gene co-expressions often determine module-defining spatial and temporal concurrences of proteins. Yet, little effort has been devoted to tracing coordinating signals for expression correlations to the three-dimensional structures of gene products. We performed a global structure-based analysis of the yeast and human proteomes and contrasted this information against their respective transcriptome organizations obtained from comprehensive microarray data. We show that protein vulnerability quantifies dosage sensitivity for metabolic adaptation phases and tissue-specific patterns of mRNA expression, determining the extent of co-expression similarity of binding partners. The role of protein intrinsic disorder in transcriptome organization is also delineated by interrelating vulnerability, disorder propensity and co-expression patterns. Extremely vulnerable human proteins are shown to be subject to severe post-transcriptional regulation of their expression through significant micro-RNA targeting, making mRNA levels poor surrogates for protein-expression levels. By contrast, in yeast the expression of extremely under-wrapped proteins is likely regulated through protein aggregation. Thus, the 85 most vulnerable proteins in yeast include the five confirmed prions, while in human, the genes encoding extremely vulnerable proteins are predicted to be targeted by microRNAs. Hence, in both vastly different organisms protein vulnerability emerges as a structure-encoded signal for post-transcriptional regulation. Vulnerability of protein structure and the concurrent need to maintain structural integrity are shown to quantify dosage sensitivity, compelling gene expression patterns across tissue types and temporal adaptation phases in a quantifiable manner. Extremely vulnerable proteins impose additional constraints on gene expression: They are subject to high levels of regulation at the post-transcriptional level.

  14. Making the Chromosome-Gene-Protein Connection.

    ERIC Educational Resources Information Center

    Mulvihill, Charlotte

    1996-01-01

    Presents an exercise that demonstrates the chromosome-gene-protein connection using sickle-cell anemia, a genetic disease with a well-characterized molecular basis. Involves connecting changes in DNA to protein outcomes and tying them into the next generation by meiosis and gamete formation with genetic crosses. Motivates students to integrate…

  15. Making the Chromosome-Gene-Protein Connection.

    ERIC Educational Resources Information Center

    Mulvihill, Charlotte

    1996-01-01

    Presents an exercise that demonstrates the chromosome-gene-protein connection using sickle-cell anemia, a genetic disease with a well-characterized molecular basis. Involves connecting changes in DNA to protein outcomes and tying them into the next generation by meiosis and gamete formation with genetic crosses. Motivates students to integrate…

  16. Coevolution of gene expression among interacting proteins

    SciTech Connect

    Fraser, Hunter B.; Hirsh, Aaron E.; Wall, Dennis P.; Eisen,Michael B.

    2004-03-01

    Physically interacting proteins or parts of proteins are expected to evolve in a coordinated manner that preserves proper interactions. Such coevolution at the amino acid-sequence level is well documented and has been used to predict interacting proteins, domains, and amino acids. Interacting proteins are also often precisely coexpressed with one another, presumably to maintain proper stoichiometry among interacting components. Here, we show that the expression levels of physically interacting proteins coevolve. We estimate average expression levels of genes from four closely related fungi of the genus Saccharomyces using the codon adaptation index and show that expression levels of interacting proteins exhibit coordinated changes in these different species. We find that this coevolution of expression is a more powerful predictor of physical interaction than is coevolution of amino acid sequence. These results demonstrate previously uncharacterized coevolution of gene expression, adding a different dimension to the study of the coevolution of interacting proteins and underscoring the importance of maintaining coexpression of interacting proteins over evolutionary time. Our results also suggest that expression coevolution can be used for computational prediction of protein protein interactions.

  17. Major Sperm Protein Genes from Globodera rostochiensis

    PubMed Central

    Novitski, Charles E.; Brown, Shiela; Chen, Ru; Corner, Adam S.; Atkinson, Howard J.; McPherson, Michael J.

    1993-01-01

    Three genes in the major sperm protein (MSP) gene family from the potato cyst nematode Globodera rostochiensis were cloned and sequenced. In contrast to the absence of introns in Caenorhabditis elegans MSP genes, these genes in G. rostochiensis contained a 57 nucleotide intron, with normal exon-intron boundaries, in the same relative location as the intron in Onchocerca volvulus. The MSP genes of G. rostochiensis had putative CAAT, TATA, and polyadenylation signals. The predicted G. rostochiensis MSP gene product is 126 amino acids long, one residue shorter than the products in the other species. The comparison of MSP amino acid sequences from four diverse nematode species suggests that O. volvulus, Ascaris suum, and C. elegans may be more closely related to each other than they are to G. rostochiensis. PMID:19279808

  18. Transcriptional enhancer from milk protein genes

    DOEpatents

    Casperson, Gerald F.; Schmidhauser, Christian T.; Bissell, Mina J.

    1999-01-01

    The invention relates to novel enhancer nucleotide sequences which stimulate transcription of heterologous DNA in cells in culture. The enhancers are derived from major milk protein genes by the process of deletion mapping and functional analysis. The invention also relates to expression vectors containing the novel enhancers.

  19. Transcriptional enhancer from milk protein genes

    SciTech Connect

    Casperson, G.F.; Schmidhauser, C.T.; Bissell, M.J.

    1999-12-21

    The invention relates to novel enhancer nucleotide sequences which stimulate transcription of heterologous DNA in cells in culture. The enhancers are derived from major milk protein genes by the process of deletion mapping and functional analysis. The invention also relates to expression vectors containing the novel enhancers.

  20. RPG: the Ribosomal Protein Gene database

    PubMed Central

    Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

    2004-01-01

    RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes. PMID:14681386

  1. RPG: the Ribosomal Protein Gene database.

    PubMed

    Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

    2004-01-01

    RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes.

  2. Amelogenesis Imperfecta; Genes, Proteins, and Pathways

    PubMed Central

    Smith, Claire E. L.; Poulter, James A.; Antanaviciute, Agne; Kirkham, Jennifer; Brookes, Steven J.; Inglehearn, Chris F.; Mighell, Alan J.

    2017-01-01

    Amelogenesis imperfecta (AI) is the name given to a heterogeneous group of conditions characterized by inherited developmental enamel defects. AI enamel is abnormally thin, soft, fragile, pitted and/or badly discolored, with poor function and aesthetics, causing patients problems such as early tooth loss, severe embarrassment, eating difficulties, and pain. It was first described separately from diseases of dentine nearly 80 years ago, but the underlying genetic and mechanistic basis of the condition is only now coming to light. Mutations in the gene AMELX, encoding an extracellular matrix protein secreted by ameloblasts during enamel formation, were first identified as a cause of AI in 1991. Since then, mutations in at least eighteen genes have been shown to cause AI presenting in isolation of other health problems, with many more implicated in syndromic AI. Some of the encoded proteins have well documented roles in amelogenesis, acting as enamel matrix proteins or the proteases that degrade them, cell adhesion molecules or regulators of calcium homeostasis. However, for others, function is less clear and further research is needed to understand the pathways and processes essential for the development of healthy enamel. Here, we review the genes and mutations underlying AI presenting in isolation of other health problems, the proteins they encode and knowledge of their roles in amelogenesis, combining evidence from human phenotypes, inheritance patterns, mouse models, and in vitro studies. An LOVD resource (http://dna2.leeds.ac.uk/LOVD/) containing all published gene mutations for AI presenting in isolation of other health problems is described. We use this resource to identify trends in the genes and mutations reported to cause AI in the 270 families for which molecular diagnoses have been reported by 23rd May 2017. Finally we discuss the potential value of the translation of AI genetics to clinical care with improved patient pathways and speculate on the

  3. Stress Genes and Proteins in the Archaea

    PubMed Central

    Macario, Alberto J. L.; Lange, Marianne; Ahring, Birgitte K.; De Macario, Everly Conway

    1999-01-01

    The field covered in this review is new; the first sequence of a gene encoding the molecular chaperone Hsp70 and the first description of a chaperonin in the archaea were reported in 1991. These findings boosted research in other areas beyond the archaea that were directly relevant to bacteria and eukaryotes, for example, stress gene regulation, the structure-function relationship of the chaperonin complex, protein-based molecular phylogeny of organisms and eukaryotic-cell organelles, molecular biology and biochemistry of life in extreme environments, and stress tolerance at the cellular and molecular levels. In the last 8 years, archaeal stress genes and proteins belonging to the families Hsp70, Hsp60 (chaperonins), Hsp40(DnaJ), and small heat-shock proteins (sHsp) have been studied. The hsp70(dnaK), hsp40(dnaJ), and grpE genes (the chaperone machine) have been sequenced in seven, four, and two species, respectively, but their expression has been examined in detail only in the mesophilic methanogen Methanosarcina mazei S-6. The proteins possess markers typical of bacterial homologs but none of the signatures distinctive of eukaryotes. In contrast, gene expression and transcription initiation signals and factors are of the eucaryal type, which suggests a hybrid archaeal-bacterial complexion for the Hsp70 system. Another remarkable feature is that several archaeal species in different phylogenetic branches do not have the gene hsp70(dnaK), an evolutionary puzzle that raises the important question of what replaces the product of this gene, Hsp70(DnaK), in protein biogenesis and refolding and for stress resistance. Although archaea are prokaryotes like bacteria, their Hsp60 (chaperonin) family is of type (group) II, similar to that of the eukaryotic cytosol; however, unlike the latter, which has several different members, the archaeal chaperonin system usually includes only two (in some species one and in others possibly three) related subunits of ∼60 kDa. These

  4. Specific interactions between DNA and regulatory protein controlled by ligand-binding: Ab initio molecular simulation

    SciTech Connect

    Matsushita, Y. Murakawa, T. Shimamura, K. Oishi, M. Ohyama, T. Kurita, N.

    2015-02-27

    The catabolite activator protein (CAP) is one of the regulatory proteins controlling the transcription mechanism of gene. Biochemical experiments elucidated that the complex of CAP with cyclic AMP (cAMP) is indispensable for controlling the mechanism, while previous molecular simulations for the monomer of CAP+cAMP complex revealed the specific interactions between CAP and cAMP. However, the effect of cAMP-binding to CAP on the specific interactions between CAP and DNA is not elucidated at atomic and electronic levels. We here considered the ternary complex of CAP, cAMP and DNA in solvating water molecules and investigated the specific interactions between them at atomic and electronic levels using ab initio molecular simulations based on classical molecular dynamics and ab initio fragment molecular orbital methods. The results highlight the important amino acid residues of CAP for the interactions between CAP and cAMP and between CAP and DNA.

  5. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    PubMed

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.

  6. Engineering Genes for Predictable Protein Expression

    PubMed Central

    Gustafsson, Claes; Minshull, Jeremy; Govindarajan, Sridhar; Ness, Jon; Villalobos, Alan; Welch, Mark

    2013-01-01

    The DNA sequence used to encode a polypeptide can have dramatic effects on its expression. Lack of readily available tools has until recently inhibited meaningful experimental investigation of this phenomenon. Advances in synthetic biology and the application of modern engineering approaches now provide the tools for systematic analysis of the sequence variables affecting heterologous expression of recombinant proteins. We here discuss how these new tools are being applied and how they circumvent the constraints of previous approaches, highlighting some of the surprising and promising results emerging from the developing field of gene engineering. PMID:22425659

  7. Engineering genes for predictable protein expression.

    PubMed

    Gustafsson, Claes; Minshull, Jeremy; Govindarajan, Sridhar; Ness, Jon; Villalobos, Alan; Welch, Mark

    2012-05-01

    The DNA sequence used to encode a polypeptide can have dramatic effects on its expression. Lack of readily available tools has until recently inhibited meaningful experimental investigation of this phenomenon. Advances in synthetic biology and the application of modern engineering approaches now provide the tools for systematic analysis of the sequence variables affecting heterologous expression of recombinant proteins. We here discuss how these new tools are being applied and how they circumvent the constraints of previous approaches, highlighting some of the surprising and promising results emerging from the developing field of gene engineering.

  8. Recombinant Brucella abortus gene expressing immunogenic protein

    SciTech Connect

    Mayfield, J.E.; Tabatabai, L.B.

    1991-06-11

    This patent describes a synthetic recombinant DNA molecule containing a DNA sequence. It comprises a gene of Brucella abortus encoding an immunogenic protein having a molecular weight of approximately 31,000 daltons as determined by sodium dodecyl sulfate polyacrylamide gel electrophoresis under denaturing conditions, the protein having an isoelectric point around 4.9, and containing a twenty-five amino acid sequence from its amino terminal end consisting of Gln-Ala-Pro-Thr-Phe-Phe-Arg-Ile-Gly-Thr-Gly-Gly-Thr-Ala-Gly-Thr-Tyr-Tyr-Pro-Ile-Gly-Gly-Leu-Ile-Ala, wherein Gln, Ala, Pro, Thr, Phe, Arg, Ile, Gly, Tyr, and Leu, respectively, represent glutamine, alanine, proline, threonine, phenylalanine, arginine, isolecuine, glycine, tyrosine, and leucine.

  9. Predicting gene ontology annotations of orphan GWAS genes using protein-protein interactions.

    PubMed

    Kuppuswamy, Usha; Ananthasubramanian, Seshan; Wang, Yanli; Balakrishnan, Narayanaswamy; Ganapathiraju, Madhavi K

    2014-04-03

    The number of genome-wide association studies (GWAS) has increased rapidly in the past couple of years, resulting in the identification of genes associated with different diseases. The next step in translating these findings into biomedically useful information is to find out the mechanism of the action of these genes. However, GWAS studies often implicate genes whose functions are currently unknown; for example, MYEOV, ANKLE1, TMEM45B and ORAOV1 are found to be associated with breast cancer, but their molecular function is unknown. We carried out Bayesian inference of Gene Ontology (GO) term annotations of genes by employing the directed acyclic graph structure of GO and the network of protein-protein interactions (PPIs). The approach is designed based on the fact that two proteins that interact biophysically would be in physical proximity of each other, would possess complementary molecular function, and play role in related biological processes. Predicted GO terms were ranked according to their relative association scores and the approach was evaluated quantitatively by plotting the precision versus recall values and F-scores (the harmonic mean of precision and recall) versus varying thresholds. Precisions of ~58% and ~ 40% for localization and functions respectively of proteins were determined at a threshold of ~30 (top 30 GO terms in the ranked list). Comparison with function prediction based on semantic similarity among nodes in an ontology and incorporation of those similarities in a k-nearest neighbor classifier confirmed that our results compared favorably. This approach was applied to predict the cellular component and molecular function GO terms of all human proteins that have interacting partners possessing at least one known GO annotation. The list of predictions is available at http://severus.dbmi.pitt.edu/engo/GOPRED.html. We present the algorithm, evaluations and the results of the computational predictions, especially for genes identified in

  10. Molecular mechanisms of ribosomal protein gene coregulation

    PubMed Central

    Reja, Rohit; Vinayachandran, Vinesh; Ghosh, Sujana; Pugh, B. Franklin

    2015-01-01

    The 137 ribosomal protein genes (RPGs) of Saccharomyces provide a model for gene coregulation. We examined the positional and functional organization of their regulators (Rap1 [repressor activator protein 1], Fhl1, Ifh1, Sfp1, and Hmo1), the transcription machinery (TFIIB, TFIID, and RNA polymerase II), and chromatin at near-base-pair resolution using ChIP-exo, as RPGs are coordinately reprogrammed. Where Hmo1 is enriched, Fhl1, Ifh1, Sfp1, and Hmo1 cross-linked broadly to promoter DNA in an RPG-specific manner and demarcated by general minor groove widening. Importantly, Hmo1 extended 20–50 base pairs (bp) downstream from Fhl1. Upon RPG repression, Fhl1 remained in place. Hmo1 dissociated, which was coupled to an upstream shift of the +1 nucleosome, as reflected by the Hmo1 extension and core promoter region. Fhl1 and Hmo1 may create two regulatable and positionally distinct barriers, against which chromatin remodelers position the +1 nucleosome into either an activating or a repressive state. Consistent with in vitro studies, we found that specific TFIID subunits, in addition to cross-linking at the core promoter, made precise cross-links at Rap1 sites, which we interpret to reflect native Rap1–TFIID interactions. Our findings suggest how sequence-specific DNA binding regulates nucleosome positioning and transcription complex assembly >300 bp away and how coregulation coevolved with coding sequences. PMID:26385964

  11. Molecular mechanisms of ribosomal protein gene coregulation.

    PubMed

    Reja, Rohit; Vinayachandran, Vinesh; Ghosh, Sujana; Pugh, B Franklin

    2015-09-15

    The 137 ribosomal protein genes (RPGs) of Saccharomyces provide a model for gene coregulation. We examined the positional and functional organization of their regulators (Rap1 [repressor activator protein 1], Fhl1, Ifh1, Sfp1, and Hmo1), the transcription machinery (TFIIB, TFIID, and RNA polymerase II), and chromatin at near-base-pair resolution using ChIP-exo, as RPGs are coordinately reprogrammed. Where Hmo1 is enriched, Fhl1, Ifh1, Sfp1, and Hmo1 cross-linked broadly to promoter DNA in an RPG-specific manner and demarcated by general minor groove widening. Importantly, Hmo1 extended 20-50 base pairs (bp) downstream from Fhl1. Upon RPG repression, Fhl1 remained in place. Hmo1 dissociated, which was coupled to an upstream shift of the +1 nucleosome, as reflected by the Hmo1 extension and core promoter region. Fhl1 and Hmo1 may create two regulatable and positionally distinct barriers, against which chromatin remodelers position the +1 nucleosome into either an activating or a repressive state. Consistent with in vitro studies, we found that specific TFIID subunits, in addition to cross-linking at the core promoter, made precise cross-links at Rap1 sites, which we interpret to reflect native Rap1-TFIID interactions. Our findings suggest how sequence-specific DNA binding regulates nucleosome positioning and transcription complex assembly >300 bp away and how coregulation coevolved with coding sequences. © 2015 Reja et al.; Published by Cold Spring Harbor Laboratory Press.

  12. Studies on the Orientation of Cyclic AMP-Dependent Protein Kinase in Human Erythrocyte Membranes

    PubMed Central

    Rubin, Charles S.; Rosenfeld, Robert D.; Rosen, Ora M.

    1973-01-01

    The topographic location of cyclic AMP-dependent protein kinase has been studied in preparations of permeable and sealed membranes derived from human erythrocytes. Both the catalytic and cyclic AMP-binding components of protein kinase are localized on the inner, cytoplasmic, surface of the plasma membrane. PMID:4359486

  13. Protein-Protein Interaction Network and Gene Ontology

    NASA Astrophysics Data System (ADS)

    Choi, Yunkyu; Kim, Seok; Yi, Gwan-Su; Park, Jinah

    Evolution of computer technologies makes it possible to access a large amount and various kinds of biological data via internet such as DNA sequences, proteomics data and information discovered about them. It is expected that the combination of various data could help researchers find further knowledge about them. Roles of a visualization system are to invoke human abilities to integrate information and to recognize certain patterns in the data. Thus, when the various kinds of data are examined and analyzed manually, an effective visualization system is an essential part. One instance of these integrated visualizations can be combination of protein-protein interaction (PPI) data and Gene Ontology (GO) which could help enhance the analysis of PPI network. We introduce a simple but comprehensive visualization system that integrates GO and PPI data where GO and PPI graphs are visualized side-by-side and supports quick reference functions between them. Furthermore, the proposed system provides several interactive visualization methods for efficiently analyzing the PPI network and GO directedacyclic- graph such as context-based browsing and common ancestors finding.

  14. What's that gene (or protein)? Online resources for exploring functions of genes, transcripts, and proteins

    PubMed Central

    Hutchins, James R. A.

    2014-01-01

    The genomic era has enabled research projects that use approaches including genome-scale screens, microarray analysis, next-generation sequencing, and mass spectrometry–based proteomics to discover genes and proteins involved in biological processes. Such methods generate data sets of gene, transcript, or protein hits that researchers wish to explore to understand their properties and functions and thus their possible roles in biological systems of interest. Recent years have seen a profusion of Internet-based resources to aid this process. This review takes the viewpoint of the curious biologist wishing to explore the properties of protein-coding genes and their products, identified using genome-based technologies. Ten key questions are asked about each hit, addressing functions, phenotypes, expression, evolutionary conservation, disease association, protein structure, interactors, posttranslational modifications, and inhibitors. Answers are provided by presenting the latest publicly available resources, together with methods for hit-specific and data set–wide information retrieval, suited to any genome-based analytical technique and experimental species. The utility of these resources is demonstrated for 20 factors regulating cell proliferation. Results obtained using some of these are discussed in more depth using the p53 tumor suppressor as an example. This flexible and universally applicable approach for characterizing experimental hits helps researchers to maximize the potential of their projects for biological discovery. PMID:24723265

  15. Binding of Cyclic Di-AMP to the Staphylococcus aureus Sensor Kinase KdpD Occurs via the Universal Stress Protein Domain and Downregulates the Expression of the Kdp Potassium Transporter

    PubMed Central

    Moscoso, Joana A.; Schramke, Hannah; Tosi, Tommaso; Dehbi, Amina; Jung, Kirsten

    2015-01-01

    ABSTRACT Nucleotide signaling molecules are important intracellular messengers that regulate a wide range of biological functions. The human pathogen Staphylococcus aureus produces the signaling nucleotide cyclic di-AMP (c-di-AMP). This molecule is common among Gram-positive bacteria and in many organisms is essential for survival under standard laboratory growth conditions. In this study, we investigated the interaction of c-di-AMP with the S. aureus KdpD protein. The sensor kinase KdpD forms a two-component signaling system with the response regulator KdpE and regulates the expression of the kdpDE genes and the kdpFABC operon coding for the Kdp potassium transporter components. Here we show that the S. aureus KdpD protein binds c-di-AMP specifically and with an affinity in the micromolar range through its universal stress protein (USP) domain. This domain is located within the N-terminal cytoplasmic region of KdpD, and amino acids of a conserved SXS-X20-FTAXY motif are important for this binding. We further show that KdpD2, a second KdpD protein found in some S. aureus strains, also binds c-di-AMP, and our bioinformatics analysis indicates that a subclass of KdpD proteins in c-di-AMP-producing bacteria has evolved to bind this signaling nucleotide. Finally, we show that c-di-AMP binding to KdpD inhibits the upregulation of the kdpFABC operon under salt stress, thus indicating that c-di-AMP is a negative regulator of potassium uptake in S. aureus. IMPORTANCE Staphylococcus aureus is an important human pathogen and a major cause of food poisoning in Western countries. A common method for food preservation is the use of salt to drive dehydration. This study sheds light on the regulation of potassium uptake in Staphylococcus aureus, an important aspect of this bacterium's ability to tolerate high levels of salt. We show that the signaling nucleotide c-di-AMP binds to a regulatory component of the Kdp potassium uptake system and that this binding has an inhibitory

  16. Binding of Cyclic Di-AMP to the Staphylococcus aureus Sensor Kinase KdpD Occurs via the Universal Stress Protein Domain and Downregulates the Expression of the Kdp Potassium Transporter.

    PubMed

    Moscoso, Joana A; Schramke, Hannah; Zhang, Yong; Tosi, Tommaso; Dehbi, Amina; Jung, Kirsten; Gründling, Angelika

    2015-07-20

    Nucleotide signaling molecules are important intracellular messengers that regulate a wide range of biological functions. The human pathogen Staphylococcus aureus produces the signaling nucleotide cyclic di-AMP (c-di-AMP). This molecule is common among Gram-positive bacteria and in many organisms is essential for survival under standard laboratory growth conditions. In this study, we investigated the interaction of c-di-AMP with the S. aureus KdpD protein. The sensor kinase KdpD forms a two-component signaling system with the response regulator KdpE and regulates the expression of the kdpDE genes and the kdpFABC operon coding for the Kdp potassium transporter components. Here we show that the S. aureus KdpD protein binds c-di-AMP specifically and with an affinity in the micromolar range through its universal stress protein (USP) domain. This domain is located within the N-terminal cytoplasmic region of KdpD, and amino acids of a conserved SXS-X20-FTAXY motif are important for this binding. We further show that KdpD2, a second KdpD protein found in some S. aureus strains, also binds c-di-AMP, and our bioinformatics analysis indicates that a subclass of KdpD proteins in c-di-AMP-producing bacteria has evolved to bind this signaling nucleotide. Finally, we show that c-di-AMP binding to KdpD inhibits the upregulation of the kdpFABC operon under salt stress, thus indicating that c-di-AMP is a negative regulator of potassium uptake in S. aureus. Staphylococcus aureus is an important human pathogen and a major cause of food poisoning in Western countries. A common method for food preservation is the use of salt to drive dehydration. This study sheds light on the regulation of potassium uptake in Staphylococcus aureus, an important aspect of this bacterium's ability to tolerate high levels of salt. We show that the signaling nucleotide c-di-AMP binds to a regulatory component of the Kdp potassium uptake system and that this binding has an inhibitory effect on the

  17. The sulfolobicin genes of Sulfolobus acidocaldarius encode novel antimicrobial proteins.

    PubMed

    Ellen, Albert F; Rohulya, Olha V; Fusetti, Fabrizia; Wagner, Michaela; Albers, Sonja-Verena; Driessen, Arnold J M

    2011-09-01

    Crenarchaea, such as Sulfolobus acidocaldarius and Sulfolobus tokodaii, produce antimicrobial proteins called sulfolobicins. These antimicrobial proteins inhibit the growth of closely related species. Here we report the identification of the sulfolobicin-encoding genes in S. acidocaldarius. The active sulfolobicin comprises two proteins that are equipped with a classical signal sequence. These proteins are secreted by the cells and found to be membrane vesicle associated. Gene inactivation studies demonstrate that both proteins are required for the bacteriostatic antimicrobial activity. Sulfolobicins constitute a novel class of antimicrobial proteins without detectable homology to any other protein.

  18. Identification and characterization of a Dictyostelium discoideum ribosomal protein gene.

    PubMed Central

    Szymkowski, D E; Deering, R A

    1990-01-01

    We have identified a developmentally repressed large-subunit ribosomal protein gene of Dictyostelium discoideum based on sequence similarity to other ribosomal proteins. Protein rpl7 is homologous to large subunit ribosomal proteins from the rat and possibly to Mycoplasma capricolum and Escherichia coli, but is not similar to three sequenced ribosomal proteins in Dictyostelium. The rpl7 gene is present at one copy per genome, as are six other cloned Dictyostelium ribosomal proteins. Restriction fragment length polymorphisms exist for ribosomal protein genes rpl7, rp1024, and rp110 in strain HU182; most Dictyostelium ribosomal protein genes examined are linked no closer than 30-100 kb to each other in the genome. Dictyostelium ribosomal proteins are known to be developmentally regulated, and levels of rpl7 transcript gradually decrease during the 24-hour development cycle. This drop correlates with that of rp1024, indicating these and other ribosomal protein genes may be coordinately regulated. To determine the cellular location of the protein, we raised antibodies to an rpl7-derived branched synthetic peptide. These antibodies cross-reacted with one protein of the expected size in a ribosomal protein fraction of Dictyostelium, indicating that the product of gene rpl7 is localized in the ribosome. Images PMID:1975664

  19. 4-Coumarate-CoA Ligase-Like Gene OsAAE3 Negatively Mediates the Rice Blast Resistance, Floret Development and Lignin Biosynthesis

    PubMed Central

    Liu, Hao; Guo, Zhenhua; Gu, Fengwei; Ke, Shanwen; Sun, Dayuan; Dong, Shuangyu; Liu, Wei; Huang, Ming; Xiao, Wuming; Yang, Guili; Liu, Yongzhu; Guo, Tao; Wang, Hui; Wang, Jiafeng; Chen, Zhiqiang

    2017-01-01

    Although adenosine monophosphate (AMP) binding domain is widely distributed in multiple plant species, detailed molecular functions of AMP binding proteins (AMPBPs) in plant development and plant-pathogen interaction remain unclear. In the present study, we identified an AMPBP OsAAE3 from a previous analysis of early responsive genes in rice during Magnaporthe oryzae infection. OsAAE3 is a homolog of Arabidopsis AAE3 in rice, which encodes a 4-coumarate-Co-A ligase (4CL) like protein. A phylogenetic analysis showed that OsAAE3 was most likely 4CL-like 10 in an independent group. OsAAE3 was localized to cytoplasm, and it could be expressed in various tissues. Histochemical staining of transgenic plants carrying OsAAE3 promoter-driven GUS (β-glucuronidase) reporter gene suggested that OsAAE3 was expressed in all tissues of rice. Furthermore, OsAAE3-OX plants showed increased susceptibility to M. Oryzae, and this finding was attributable to decreased expression of pathogen-related 1a (PR1) and low level of peroxidase (POD) activity. Moreover, OsAAE3 over-expression resulted in increased content of H2O2, leading to programmed cell-death induced by reactive oxygen species (ROS). In addition, OsAAE3 over-expression repressed the floret development, exhibiting dramatically twisted glume and decreased fertility rate of anther. Meanwhile, the expressions of lignin biosynthesis genes were significantly decreased in OsAAE3-OX plants, thereby leading to reduced lignin content. Taken together, OsAAE3 functioned as a negative regulator in rice blast resistance, floret development, and lignin biosynthesis. Our findings further expanded the knowledge in functions of AMBPs in plant floret development and the regulation of rice-fungus interaction. PMID:28119718

  20. Nucleotide sequence of the coat protein gene of canine parvovirus.

    PubMed Central

    Rhode, S L

    1985-01-01

    The nucleotide sequence of the canine parvovirus (CPV2) from map units 33 to 95 has been determined. This includes the entire coat protein gene and noncoding sequences at the 3' end of the gene, exclusive of the terminal inverted repeat. The predicted capsid protein structures are discussed and compared with those of the rodent parvoviruses H-1 and MVM. PMID:3989914

  1. Extracting synonymous gene and protein terms from biological literature.

    PubMed

    Yu, Hong; Agichtein, Eugene

    2003-01-01

    Genes and proteins are often associated with multiple names. More names are added as new functional or structural information is discovered. Because authors can use any one of the known names for a gene or protein, information retrieval and extraction would benefit from identifying the gene and protein terms that are synonyms of the same substance. We have explored four complementary approaches for extracting gene and protein synonyms from text, namely the unsupervised, partially supervised, and supervised machine-learning techniques, as well as the manual knowledge-based approach. We report results of a large scale evaluation of these alternatives over an archive of biological journal articles. Our evaluation shows that our extraction techniques could be a valuable supplement to resources such as SWISSPROT, as our systems were able to capture gene and protein synonyms not listed in the SWISSPROT database.

  2. Mosaic tetracycline resistance genes encoding ribosomal protection proteins

    PubMed Central

    Warburton, Philip J.; Amodeo, Nina; Roberts, Adam P.

    2016-01-01

    First reported in 2003, mosaic tetracycline resistance genes are a subgroup of the genes encoding ribosomal protection proteins (RPPs). They are formed when two or more RPP-encoding genes recombine resulting in a functional chimera. To date, the majority of mosaic genes are derived from sections of three RPP genes, tet(O), tet(W) and tet(32), with others comprising tet(M) and tet(S). In this first review of mosaic genes, we report on their structure, diversity and prevalence, and suggest that these genes may be responsible for an under-reported contribution to tetracycline resistance in bacteria. PMID:27494928

  3. Mosaic tetracycline resistance genes encoding ribosomal protection proteins.

    PubMed

    Warburton, Philip J; Amodeo, Nina; Roberts, Adam P

    2016-12-01

    First reported in 2003, mosaic tetracycline resistance genes are a subgroup of the genes encoding ribosomal protection proteins (RPPs). They are formed when two or more RPP-encoding genes recombine resulting in a functional chimera. To date, the majority of mosaic genes are derived from sections of three RPP genes, tet(O), tet(W) and tet(32), with others comprising tet(M) and tet(S). In this first review of mosaic genes, we report on their structure, diversity and prevalence, and suggest that these genes may be responsible for an under-reported contribution to tetracycline resistance in bacteria.

  4. Operon Gene Order Is Optimized for Ordered Protein Complex Assembly.

    PubMed

    Wells, Jonathan N; Bergendahl, L Therese; Marsh, Joseph A

    2016-02-02

    The assembly of heteromeric protein complexes is an inherently stochastic process in which multiple genes are expressed separately into proteins, which must then somehow find each other within the cell. Here, we considered one of the ways by which prokaryotic organisms have attempted to maximize the efficiency of protein complex assembly: the organization of subunit-encoding genes into operons. Using structure-based assembly predictions, we show that operon gene order has been optimized to match the order in which protein subunits assemble. Exceptions to this are almost entirely highly expressed proteins for which assembly is less stochastic and for which precisely ordered translation offers less benefit. Overall, these results show that ordered protein complex assembly pathways are of significant biological importance and represent a major evolutionary constraint on operon gene organization.

  5. Targeting protein kinase A in cancer therapy: an update

    PubMed Central

    Sapio, Luigi; Di Maiolo, Francesca; Illiano, Michela; Esposito, Antonietta; Chiosi, Emilio; Spina, Annamaria; Naviglio, Silvio

    2014-01-01

    Protein Kinase A (PKA) is a well known member of the serine-threonin protein kinase superfamily. PKA, also known as cAMP-dependent protein kinase, is a multi-unit protein kinase that mediates signal transduction of G-protein coupled receptors through its activation upon cAMP binding. The widespread expression of PKA subunit genes, and the myriad of mechanisms by which cAMP is regulated within a cell suggest that PKA signaling is one of extreme importance to cellular function. It is involved in the control of a wide variety of cellular processes from metabolism to ion channel activation, cell growth and differentiation, gene expression and apoptosis. Importantly, since it has been implicated in the initiation and progression of many tumors, PKA has been proposed as a novel biomarker for cancer detection, and as a potential molecular target for cancer therapy. Here, we highlight some features of cAMP/PKA signaling that are relevant to cancer biology and present an update on targeting PKA in cancer therapy. PMID:26417307

  6. Automatically identifying gene/protein terms in MEDLINE abstracts.

    PubMed

    Yu, Hong; Hatzivassiloglou, Vasileios; Rzhetsky, Andrey; Wilbur, W John

    2002-01-01

    Natural language processing (NLP) techniques are used to extract information automatically from computer-readable literature. In biology, the identification of terms corresponding to biological substances (e.g., genes and proteins) is a necessary step that precedes the application of other NLP systems that extract biological information (e.g., protein-protein interactions, gene regulation events, and biochemical pathways). We have developed GPmarkup (for "gene/protein-full name mark up"), a software system that automatically identifies gene/protein terms (i.e., symbols or full names) in MEDLINE abstracts. As a part of marking up process, we also generated automatically a knowledge source of paired gene/protein symbols and full names (e.g., LARD for lymphocyte associated receptor of death) from MEDLINE. We found that many of the pairs in our knowledge source do not appear in the current GenBank database. Therefore our methods may also be used for automatic lexicon generation. GPmarkup has 73% recall and 93% precision in identifying and marking up gene/protein terms in MEDLINE abstracts. A random sample of gene/protein symbols and full names and a sample set of marked up abstracts can be viewed at http://www.cpmc.columbia.edu/homepages/yuh9001/GPmarkup/. Contact. hy52@columbia.edu. Voice: 212-939-7028; fax: 212-666-0140.

  7. Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

    PubMed

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-04-21

    To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease

  8. Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

    PubMed Central

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-01-01

    Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis

  9. Measles virus P gene codes for two proteins.

    PubMed Central

    Bellini, W J; Englund, G; Rozenblatt, S; Arnheiter, H; Richardson, C D

    1985-01-01

    The entirety of the phosphoprotein gene of measles virus has been sequenced. The gene is composed of 1,657 nucleotides and specifies a 507-amino-acid protein (P). A second overlapping reading frame was predicted from the sequence and specifies a 186-amino-acid protein (C). Through the use of antisynthetic peptide antibodies, we show that both proteins are expressed in virally infected cells. Both proteins are expressed from a functionally bicistronic mRNA through independent initiation of ribosomes at the respective AUG codons. Using immunofluorescent microscopy, we localized the C protein in the nucleus and in cytoplasmic inclusions within the infected cells. Images PMID:3882996

  10. The Classical Arabinogalactan Protein Gene Family of Arabidopsis

    PubMed Central

    Schultz, Carolyn J.; Johnson, Kim L.; Currie, Graeme; Bacic, Antony

    2000-01-01

    Arabinogalactan proteins (AGPs) are extracellular proteoglycans implicated in plant growth and development. We searched for classical AGPs in Arabidopsis by identifying expressed sequence tags based on the conserved domain structure of the predicted protein backbone. To confirm that these genes encoded bona fide AGPs, we purified native AGPs and then deglycosylated and deblocked them for N-terminal protein sequencing. In total, we identified 15 genes encoding the protein backbones of classical AGPs, including genes for AG peptides—AGPs with very short backbones (10 to 13 amino acid residues). Seven of the AGPs were verified as AGPs by protein sequencing. A gene encoding a putative cell adhesion molecule with AGP-like domains was also identified. This work provides a firm foundation for beginning functional analysis by using a genetic approach. PMID:11006345

  11. Evolutionary conservation and diversification of Rh family genes and proteins

    PubMed Central

    Huang, Cheng-Han; Peng, Jianbin

    2005-01-01

    Rhesus (Rh) proteins were first identified in human erythroid cells and recently in other tissues. Like ammonia transporter (Amt) proteins, their only homologues, Rh proteins have the 12 transmembrane-spanning segments characteristic of transporters. Many think Rh and Amt proteins transport the same substrate, \\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} \\begin{equation*}{\\mathrm{NH}}_{3}/{\\mathrm{NH}}_{4}^{+}\\end{equation*}\\end{document}, whereas others think that Rh proteins transport CO2 and Amt proteins NH3. In the latter view, Rh and Amt are different biological gas channels. To reconstruct the phylogeny of the Rh family and study its coexistence with and relationship to Amt in depth, we analyzed 111 Rh genes and 260 Amt genes. Although Rh and Amt are found together in organisms as diverse as unicellular eukaryotes and sea squirts, Rh genes apparently arose later, because they are rare in prokaryotes. However, Rh genes are prominent in vertebrates, in which Amt genes disappear. In organisms with both types of genes, Rh had apparently diverged away from Amt rapidly and then evolved slowly over a long period. Functionally divergent amino acid sites are clustered in transmembrane segments and around the gas-conducting lumen recently identified in Escherichia coli AmtB, in agreement with Rh proteins having new substrate specificity. Despite gene duplications and mutations, the Rh paralogous groups all have apparently been subject to strong purifying selection indicating functional conservation. Genes encoding the classical Rh proteins in mammalian red cells show higher nucleotide substitution rates at nonsynonymous codon positions than other Rh genes, a finding that suggests a possible role for these proteins in red cell morphogenetic evolution. PMID:16227429

  12. Evolutionary conservation and diversification of Rh family genes and proteins.

    PubMed

    Huang, Cheng-Han; Peng, Jianbin

    2005-10-25

    Rhesus (Rh) proteins were first identified in human erythroid cells and recently in other tissues. Like ammonia transporter (Amt) proteins, their only homologues, Rh proteins have the 12 transmembrane-spanning segments characteristic of transporters. Many think Rh and Amt proteins transport the same substrate, NH(3)/NH(4)(+), whereas others think that Rh proteins transport CO(2) and Amt proteins NH(3). In the latter view, Rh and Amt are different biological gas channels. To reconstruct the phylogeny of the Rh family and study its coexistence with and relationship to Amt in depth, we analyzed 111 Rh genes and 260 Amt genes. Although Rh and Amt are found together in organisms as diverse as unicellular eukaryotes and sea squirts, Rh genes apparently arose later, because they are rare in prokaryotes. However, Rh genes are prominent in vertebrates, in which Amt genes disappear. In organisms with both types of genes, Rh had apparently diverged away from Amt rapidly and then evolved slowly over a long period. Functionally divergent amino acid sites are clustered in transmembrane segments and around the gas-conducting lumen recently identified in Escherichia coli AmtB, in agreement with Rh proteins having new substrate specificity. Despite gene duplications and mutations, the Rh paralogous groups all have apparently been subject to strong purifying selection indicating functional conservation. Genes encoding the classical Rh proteins in mammalian red cells show higher nucleotide substitution rates at nonsynonymous codon positions than other Rh genes, a finding that suggests a possible role for these proteins in red cell morphogenetic evolution.

  13. Matrix Gla protein and osteocalcin: from gene duplication to neofunctionalization.

    PubMed

    Cancela, M Leonor; Laizé, Vincent; Conceição, Natércia

    2014-11-01

    Osteocalcin (OC or bone Gla protein, BGP) and matrix Gla protein (MGP) are two members of the growing family of vitamin K-dependent (VKD) proteins. They were the first VKD proteins found not to be involved in coagulation and synthesized outside the liver. Both proteins were isolated from bone although it is now known that only OC is synthesized by bone cells under normal physiological conditions, but since both proteins can bind calcium and hydroxyapatite, they can also accumulate in bone. Both OC and MGP share similar structural features, both in terms of protein domains and gene organization. OC gene is likely to have appeared from MGP through a tandem gene duplication that occurred concomitantly with the appearance of the bony vertebrates. Despite their relatively close relationship and the fact that both can bind calcium and affect mineralization, their functions are not redundant and they also have other unrelated functions. Interestingly, these two proteins appear to have followed quite different evolutionary strategies in order to acquire novel functionalities, with OC following a gene duplication strategy while MGP variability was obtained mostly by the use of multiple promoters and alternative splicing, leading to proteins with additional functional characteristics and alternative gene regulatory pathways. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. De Novo Origin of Human Protein-Coding Genes

    PubMed Central

    Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping

    2011-01-01

    The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831

  15. Identification of oral cancer related candidate genes by integrating protein-protein interactions, gene ontology, pathway analysis and immunohistochemistry.

    PubMed

    Kumar, Ravindra; Samal, Sabindra K; Routray, Samapika; Dash, Rupesh; Dixit, Anshuman

    2017-05-30

    In the recent years, bioinformatics methods have been reported with a high degree of success for candidate gene identification. In this milieu, we have used an integrated bioinformatics approach assimilating information from gene ontologies (GO), protein-protein interaction (PPI) and network analysis to predict candidate genes related to oral squamous cell carcinoma (OSCC). A total of 40973 PPIs were considered for 4704 cancer-related genes to construct human cancer gene network (HCGN). The importance of each node was measured in HCGN by ten different centrality measures. We have shown that the top ranking genes are related to a significantly higher number of diseases as compared to other genes in HCGN. A total of 39 candidate oral cancer target genes were predicted by combining top ranked genes and the genes corresponding to significantly enriched oral cancer related GO terms. Initial verification using literature and available experimental data indicated that 29 genes were related with OSCC. A detailed pathway analysis led us to propose a role for the selected candidate genes in the invasion and metastasis in OSCC. We further validated our predictions using immunohistochemistry (IHC) and found that the gene FLNA was upregulated while the genes ARRB1 and HTT were downregulated in the OSCC tissue samples.

  16. The Popeye Domain Containing Genes and their Function in Striated Muscle

    PubMed Central

    Schindler, Roland FR; Scotton, Chiara; French, Vanessa; Ferlini, Alessandra; Brand, Thomas

    2016-01-01

    The Popeye domain containing (POPDC) genes encode a novel class of cAMP effector proteins, which are abundantly expressed in heart and skeletal muscle. Here we will review their role in striated muscle as deduced from work in cell and animal models and the recent analysis of patients carrying a missense mutation in POPDC1. Evidence suggests that POPDC proteins control membrane trafficking of interacting proteins. Furthermore, we will discuss the current catalogue of established protein-protein interactions. In recent years, the number of POPDC-interacting proteins is rising and currently includes ion channels (TREK-1), sarcolemma-associated proteins serving functions in mechanical stability (Dystrophin), compartmentalization (Caveolin 3), scaffolding (ZO-1), trafficking (NDRG4, VAMP2/3) and repair (Dysferlin), or acting as a guanine nucleotide exchange factor for Rho-family GTPases (GEFT). Recent evidence suggests that POPDC proteins might also control the cellular level of the nuclear proto-oncoprotein c-Myc. These data suggests that this family of cAMP-binding proteins probably serves multiple roles in striated muscle. PMID:27347491

  17. GeneSense: a new approach for human gene annotation integrated with protein-protein interaction networks.

    PubMed

    Chen, Zhongzhong; Zhang, Tianhong; Lin, Jun; Yan, Zidan; Wang, Yongren; Zheng, Weiqiang; Weng, Kevin C

    2014-03-26

    Virtually all cellular functions involve protein-protein interactions (PPIs). As an increasing number of PPIs are identified and vast amount of information accumulated, researchers are finding different ways to interrogate the data and understand the interactions in context. However, it is widely recognized that a significant portion of the data is scattered, redundant, not considered high quality, and not readily accessible to researchers in a systematic fashion. In addition, it is challenging to identify the optimal protein targets in the current PPI networks. The GeneSense server was developed to integrate gene annotation and PPI networks in an expandable architecture that incorporates selected databases with the aim to assemble, analyze, evaluate and disseminate protein-protein association information in a comprehensive and user-friendly manner. Three network models including nodenet, leafnet and loopnet are used to identify the optimal protein targets in the complex networks. GeneSense is freely available at www.biomedsense.org/genesense.php.

  18. Major cancer protein amplifies global gene expression

    Cancer.gov

    Scientists may have discovered why a protein called MYC can provoke a variety of cancers. Like many proteins associated with cancer, MYC helps regulate cell growth. A new study carried out by researchers at the National Institutes of Health and colleagues

  19. The KP4 killer protein gene family

    USDA-ARS?s Scientific Manuscript database

    Killer protein 4 (KP4) is a well studied toxin secreted by the maize smut fungus Ustilago maydis that kills sensitive Ustilago strains as well as inhibits Fusarium and plant root growth. This small, cysteine rich protein is encoded by a virus that depends on host survival for replication. KP4 functi...

  20. Identification of breast cancer candidate genes using gene co-expression and protein-protein interaction information.

    PubMed

    Yue, Zhenyu; Li, Hai-Tao; Yang, Yabing; Hussain, Sajid; Zheng, Chun-Hou; Xia, Junfeng; Chen, Yan

    2016-06-14

    Breast cancer (BC) is one of the most common malignancies that could threaten female health. As the molecular mechanism of BC has not yet been completely discovered, identification of related genes of this disease is an important area of research that could provide new insights into gene function as well as potential treatment targets. Here we used subnetwork extraction algorithms to identify novel BC related genes based on the known BC genes (seed genes), gene co-expression profiles and protein-protein interaction network. We computationally predicted seven key genes (EPHX2, GHRH, PPYR1, ALPP, KNG1, GSK3A and TRIT1) as putative genes of BC. Further analysis shows that six of these have been reported as breast cancer associated genes, and one (PPYR1) as cancer associated gene. Lastly, we developed an expression signature using these seven key genes which significantly stratified 1660 BC patients according to relapse free survival (hazard ratio [HR], 0.55; 95% confidence interval [CI], 0.46-0.65; Logrank p = 5.5e-13). The 7-genes signature could be established as a useful predictor of disease prognosis in BC patients. Overall, the identified seven genes might be useful prognostic and predictive molecular markers to predict the clinical outcome of BC patients.

  1. Light-Inducible Gene Regulation with Engineered Zinc Finger Proteins

    PubMed Central

    Polstein, Lauren R.; Gersbach, Charles A.

    2014-01-01

    The coupling of light-inducible protein-protein interactions with gene regulation systems has enabled the control of gene expression with light. In particular, heterodimer protein pairs from plants can be used to engineer a gene regulation system in mammalian cells that is reversible, repeatable, tunable, controllable in a spatiotemporal manner, and targetable to any DNA sequence. This system, Light-Inducible Transcription using Engineered Zinc finger proteins (LITEZ), is based on the blue light-induced interaction of GIGANTEA and the LOV domain of FKF1 that drives the localization of a transcriptional activator to the DNA-binding site of a highly customizable engineered zinc finger protein. This chapter provides methods for modifying LITEZ to target new DNA sequences, engineering a programmable LED array to illuminate cell cultures, and using the modified LITEZ system to achieve spatiotemporal control of transgene expression in mammalian cells. PMID:24718797

  2. Tagging gene and protein names in biomedical text.

    PubMed

    Tanabe, Lorraine; Wilbur, W John

    2002-08-01

    The MEDLINE database of biomedical abstracts contains scientific knowledge about thousands of interacting genes and proteins. Automated text processing can aid in the comprehension and synthesis of this valuable information. The fundamental task of identifying gene and protein names is a necessary first step towards making full use of the information encoded in biomedical text. This remains a challenging task due to the irregularities and ambiguities in gene and protein nomenclature. We propose to approach the detection of gene and protein names in scientific abstracts as part-of-speech tagging, the most basic form of linguistic corpus annotation. We present a method for tagging gene and protein names in biomedical text using a combination of statistical and knowledge-based strategies. This method incorporates automatically generated rules from a transformation-based part-of-speech tagger, and manually generated rules from morphological clues, low frequency trigrams, indicator terms, suffixes and part-of-speech information. Results of an experiment on a test corpus of 56K MEDLINE documents demonstrate that our method to extract gene and protein names can be applied to large sets of MEDLINE abstracts, without the need for special conditions or human experts to predetermine relevant subsets. The programs are available on request from the authors.

  3. Genes and proteins of Escherichia coli K-12.

    PubMed

    Riley, M

    1998-01-01

    GenProtEC is a database of Escherichia coli genes and their gene products, classified by type of function and physiological role and with citations to the literature for each. Also present are data on sequence similarities among E.coli proteins, representing groups of paralogous genes, with PAM values, percent identity of amino acids, length of alignment and percent aligned. GenProtEC can be accessed at the URL http://www.mbl.edu/html/ecoli.html

  4. Characteristics and clustering of human ribosomal protein genes

    PubMed Central

    Ishii, Kyota; Washio, Takanori; Uechi, Tamayo; Yoshihama, Maki; Kenmochi, Naoya; Tomita, Masaru

    2006-01-01

    Background The ribosome is a central player in the translation system, which in mammals consists of four RNA species and 79 ribosomal proteins (RPs). The control mechanisms of gene expression and the functions of RPs are believed to be identical. Most RP genes have common promoters and were therefore assumed to have a unified gene expression control mechanism. Results We systematically analyzed the homogeneity and heterogeneity of RP genes on the basis of their expression profiles, promoter structures, encoded amino acid compositions, and codon compositions. The results revealed that (1) most RP genes are coordinately expressed at the mRNA level, with higher signals in the spleen, lymph node dissection (LND), and fetal brain. However, 17 genes, including the P protein genes (RPLP0, RPLP1, RPLP2), are expressed in a tissue-specific manner. (2) Most promoters have GC boxes and possible binding sites for nuclear respiratory factor 2, Yin and Yang 1, and/or activator protein 1. However, they do not have canonical TATA boxes. (3) Analysis of the amino acid composition of the encoded proteins indicated a high lysine and arginine content. (4) The major RP genes exhibit a characteristic synonymous codon composition with high rates of G or C in the third-codon position and a high content of AAG, CAG, ATC, GAG, CAC, and CTG. Conclusion Eleven of the RP genes are still identified as being unique and did not exhibit at least some of the above characteristics, indicating that they may have unknown functions not present in other RP genes. Furthermore, we found sequences conserved between human and mouse genes around the transcription start sites and in the intronic regions. This study suggests certain overall trends and characteristic features of human RP genes. PMID:16504170

  5. Calreticulin: one protein, one gene, many functions.

    PubMed Central

    Michalak, M; Corbett, E F; Mesaeli, N; Nakamura, K; Opas, M

    1999-01-01

    The endoplasmic reticulum (ER) plays a critical role in the synthesis and chaperoning of membrane-associated and secreted proteins. The membrane is also an important site of Ca(2+) storage and release. Calreticulin is a unique ER luminal resident protein. The protein affects many cellular functions, both in the ER lumen and outside of the ER environment. In the ER lumen, calreticulin performs two major functions: chaperoning and regulation of Ca(2+) homoeostasis. Calreticulin is a highly versatile lectin-like chaperone, and it participates during the synthesis of a variety of molecules, including ion channels, surface receptors, integrins and transporters. The protein also affects intracellular Ca(2+) homoeostasis by modulation of ER Ca(2+) storage and transport. Studies on the cell biology of calreticulin revealed that the ER membrane is a very dynamic intracellular compartment affecting many aspects of cell physiology. PMID:10567207

  6. Ranking Candidate Disease Genes from Gene Expression and Protein Interaction: A Katz-Centrality Based Approach

    PubMed Central

    Zhao, Jing; Yang, Ting-Hong; Huang, Yongxu; Holme, Petter

    2011-01-01

    Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions—that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders. PMID:21912686

  7. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    PubMed

    Zhao, Jing; Yang, Ting-Hong; Huang, Yongxu; Holme, Petter

    2011-01-01

    Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  8. Evolution and organization of the human protein C gene.

    PubMed Central

    Plutzky, J; Hoskins, J A; Long, G L; Crabtree, G R

    1986-01-01

    We have isolated overlapping phage genomic clones covering an area of 21 kilobases that encodes the human protein C gene. The gene is at least 11.2 kilobases long and is made up of nine exons and eight introns. Two regions homologous to epidermal growth factor and transforming growth factor are encoded by amino acids 46-91 and 92-136 and are precisely delimited by introns, as is a similar sequence in the genes for coagulation factor IX and tissue plasminogen activator. When homologous amino acids of factor IX and protein C are aligned, the positions of all eight introns correspond precisely, suggesting that these genes are the product of a relatively recent gene duplication. Nevertheless, the two genes are sufficiently distantly related that no nucleic acid homology remains in the intronic regions and that the size of the introns varies dramatically between the two genes. The similarity of the genes for factor IX and protein C suggests that they may be the most closely related members of the serine protease gene family involved in coagulation and fibrinolysis. Images PMID:3511471

  9. Green Fluorescent Protein as a Marker for Gene Expression

    NASA Astrophysics Data System (ADS)

    Chalfie, Martin; Tu, Yuan; Euskirchen, Ghia; Ward, William W.; Prasher, Douglas C.

    1994-02-01

    A complementary DNA for the Aequorea victoria green fluorescent protein (GFP) produces a fluorescent product when expressed in prokaryotic (Escherichia coli) or eukaryotic (Caenorhabditis elegans) cells. Because exogenous substrates and cofactors are not required for this fluorescence, GFP expression can be used to monitor gene expression and protein localization in living organisms.

  10. Rare disease relations through common genes and protein interactions.

    PubMed

    Fernandez-Novo, Sara; Pazos, Florencio; Chagoyen, Monica

    2016-06-01

    ODCs (Orphan Disease Connections), available at http://csbg.cnb.csic.es/odcs, is a novel resource to explore potential molecular relations between rare diseases. These molecular relations have been established through the integration of disease susceptibility genes and human protein-protein interactions. The database currently contains 54,941 relations between 3032 diseases. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Development of a New Generation of Vectors for Gene Expression, Gene Replacement, and Protein-Protein Interaction Studies in Mycobacteria

    PubMed Central

    Parikh, Amit; Kumar, Devanand; Chawla, Yogesh; Kurthkoti, Krishna; Khan, Shazia; Varshney, Umesh

    2013-01-01

    Escherichia coli-mycobacterium shuttle vectors are important tools for gene expression and gene replacement in mycobacteria. However, most of the currently available vectors are limited in their use because of the lack of extended multiple cloning sites (MCSs) and convenience of appending an epitope tag(s) to the cloned open reading frames (ORFs). Here we report a new series of vectors that allow for the constitutive and regulatable expression of proteins, appended with peptide tag sequences at their N and C termini, respectively. The applicability of these vectors is demonstrated by the constitutive and induced expression of the Mycobacterium tuberculosis pknK gene, coding for protein kinase K, a serine-threonine protein kinase. Furthermore, a suicide plasmid with expanded MCS for creating gene replacements, a plasmid for chromosomal integrations at the commonly used L5 attB site, and a hypoxia-responsive vector, for expression of a gene(s) under hypoxic conditions that mimic latency, have also been created. Additionally, we have created a vector for the coexpression of two proteins controlled by two independent promoters, with each protein being in fusion with a different tag. The shuttle vectors developed in the present study are excellent tools for the analysis of gene function in mycobacteria and are a valuable addition to the existing repertoire of vectors for mycobacterial research. PMID:23315736

  12. ApoFnr Binds as a Monomer to Promoters Regulating the Expression of Enterotoxin Genes of Bacillus cereus▿ †

    PubMed Central

    Esbelin, Julia; Jouanneau, Yves; Armengaud, Jean; Duport, Catherine

    2008-01-01

    Bacillus cereus Fnr is a member of the Crp/Fnr (cyclic AMP-binding protein/fumarate nitrate reduction regulatory protein) family of helix-turn-helix transcriptional regulators. It is essential for the expression of hbl and nhe enterotoxin genes independently of the oxygen tension in the environment. We studied aerobic Fnr binding to target sites in promoters regulating the expression of enterotoxin genes. B. cereus Fnr was overexpressed and purified as either a C-terminal His-tagged (FnrHis) fusion protein or an N-terminal fusion protein tagged with the Strep-tag (IBA BioTAGnology) (StrepFnr). Both recombinant Fnr proteins were produced as apoforms (clusterless) and occurred as mixtures of monomers and oligomers in solution. However, apoFnrHis was mainly monomeric, while apoStrepFnr was mainly oligomeric, suggesting that the His-tagged C-terminal extremity may interfere with oligomerization. The oligomeric state of apoStrepFnr was dithiothreitol sensitive, underlining the importance of a disulfide bridge for apoFnr oligomerization. Electrophoretic mobility shift assays showed that monomeric apoFnr, but not oligomeric apoFnr, bound to specific sequences located in the promoter regions of the enterotoxin regulators fnr, resDE, and plcR and the structural genes hbl and nhe. The question of whether apoFnr binding is regulated in vivo by redox-dependent oligomerization is discussed. PMID:18424517

  13. Expression Trend of Selected Ribosomal Protein Genes in Nasopharyngeal Carcinoma

    PubMed Central

    Ma, Xiang-Ru; Sim, Edmund Ui-Hang; Ling, Teck-Yee; Tiong, Thung-Sing; Subramaniam, Selva Kumar; Khoo, Alan Soo-Beng

    2012-01-01

    Background: Ribosomal proteins are traditionally associated with protein biosynthesis until recent studies that implicated their extraribosomal functions in human diseases and cancers. Our previous studies using GeneFishing™ DEG method and microarray revealed underexpression of three ribosomal protein genes, RPS26, RPS27, and RPL32 in cancer of the nasopharynx. Herein, we investigated the expression pattern and nucleotide sequence integrity of these genes in nasopharyngeal carcinoma to further delineate their involvement in tumourigenesis. The relationship of expression level with clinicopathologic factors was also statistically studied. Methods: Quantitative Polymerase Chain Reaction was performed on nasopharyngeal carcinoma and their paired normal tissues. Expression and sequence of these three genes were analysed. Results: All three ribosomal protein genes showed no significant difference in transcript expressions and no association could be established with clinicopathologic factors studied. No nucleotide aberrancy was detected in the coding regions of these genes. Conclusion: There is no early evidence to substantiate possible involvement of RPS26, RPS27, and RPL32 genes in NPC tumourigenesis. PMID:23613646

  14. Gene evolution and functions of extracellular matrix proteins in teeth

    PubMed Central

    Yoshizaki, Keigo; Yamada, Yoshihiko

    2013-01-01

    The extracellular matrix (ECM) not only provides physical support for tissues, but it is also critical for tissue development, homeostasis and disease. Over 300 ECM molecules have been defined as comprising the “core matrisome” in mammals through the analysis of whole genome sequences. During tooth development, the structure and functions of the ECM dynamically change. In the early stages, basement membranes (BMs) separate two cell layers of the dental epithelium and the mesenchyme. Later in the differentiation stages, the BM layer is replaced with the enamel matrix and the dentin matrix, which are secreted by ameloblasts and odontoblasts, respectively. The enamel matrix genes and the dentin matrix genes are each clustered in two closed regions located on human chromosome 4 (mouse chromosome 5), except for the gene coded for amelogenin, the major enamel matrix protein, which is located on the sex chromosomes. These genes for enamel and dentin matrix proteins are derived from a common ancestral gene, but as a result of evolution, they diverged in terms of their specific functions. These matrix proteins play important roles in cell adhesion, polarity, and differentiation and mineralization of enamel and dentin matrices. Mutations of these genes cause diseases such as odontogenesis imperfect (OI) and amelogenesis imperfect (AI). In this review, we discuss the recently defined terms matrisome and matrixome for ECMs, as well as focus on genes and functions of enamel and dentin matrix proteins. PMID:23539364

  15. Identification of genes involved in radioresistance of nasopharyngeal carcinoma by integrating gene ontology and protein-protein interaction networks.

    PubMed

    Guo, Ya; Zhu, Xiao-Dong; Qu, Song; Li, Ling; Su, Fang; Li, Ye; Huang, Shi-Ting; Li, Dan-Rong

    2012-01-01

    Radioresistance remains one of the important factors in relapse and metastasis of nasopharyngeal carcinoma. Thus, it is imperative to identify genes involved in radioresistance and explore the underlying biological processes in the development of radioresistance. In this study, we used cDNA microarrays to select differential genes between radioresistant CNE-2R and parental CNE-2 cell lines. One hundred and eighty-three significantly differentially expressed genes (p<0.05) were identified, of which 138 genes were upregulated and 45 genes were downregulated in CNE-2R. We further employed publicly available bioinformatics related software, such as GOEAST and STRING to examine the relationship among differentially expressed genes. The results show that these genes were involved in type I interferon-mediated signaling pathway biological processes; the nodes tended to have high connectivity with the EGFR pathway, IFN-related pathways, NF-κB. The node STAT1 has high connectivity with other nodes in the protein-protein interaction (PPI) networks. Finally, the reliability of microarray data was validated for selected genes by semi-quantitative RT-PCR and Western blotting. The results were consistent with the microarray data. Our study suggests that microarrays combined with gene ontology and protein interaction networks have great value in the identification of genes of radioresistance in nasopharyngeal carcinoma; genes involved in several biological processes and protein interaction networks may be relevant to NPC radioresistance; in particular, the verified genes CCL5, STAT1-α, STAT2 and GSTP1 may become potential biomarkers for predicting NPC response to radiotherapy.

  16. (Genetic engineering with a gene encoding a soybean storage protein)

    SciTech Connect

    Beachy, R.N.

    1985-12-18

    We have isolated and characterized a gene which encodes the alpha prime subunit of beta conglycinin. This gene was fully sequenced by DNA sequence analysis and a report of that work was prepared and submitted for publication in early November 1985. This represented the culmination of several years of research effort by several scientists. A preprint of that work is attached to this report and has been offered by Dr. J.J. Doyle, Dr. Mary A. Schuler and Dr. Jerry Slighton, as well as myself. This paper is a comparison of the alpha prime subunit gene with a similar gene from phaseolus vulgaris, the common garden bean. In this paper we compare the sequences that are 5' of the gene, and which would represent the transcriptional promoter, as well as the sequences within the structural region of the gene. The sequence paper also compares the amino acid sequence of these two genes with that of other genes from Phaseolus, peas and from soybeans. On the basis of this comparison, we predict evolutionary trends within the multigene families which encode these proteins in the various plants, as well as to look at the protein itself to try to predict regions of the protein that might have functional significance. All of this work was done on a prior DOE-BER grant and has simply been reported here for the first time.

  17. Photosynthesis genes in marine viruses yield proteins during host infection.

    PubMed

    Lindell, Debbie; Jaffe, Jacob D; Johnson, Zackary I; Church, George M; Chisholm, Sallie W

    2005-11-03

    Cyanobacteria, and the viruses (phages) that infect them, are significant contributors to the oceanic 'gene pool'. This pool is dynamic, and the transfer of genetic material between hosts and their phages probably influences the genetic and functional diversity of both. For example, photosynthesis genes of cyanobacterial origin have been found in phages that infect Prochlorococcus and Synechococcus, the numerically dominant phototrophs in ocean ecosystems. These genes include psbA, which encodes the photosystem II core reaction centre protein D1, and high-light-inducible (hli) genes. Here we show that phage psbA and hli genes are expressed during infection of Prochlorococcus and are co-transcribed with essential phage capsid genes, and that the amount of phage D1 protein increases steadily over the infective period. We also show that the expression of host photosynthesis genes declines over the course of infection and that replication of the phage genome is a function of photosynthesis. We thus propose that the phage genes are functional in photosynthesis and that they may be increasing phage fitness by supplementing the host production of these proteins.

  18. Ribosomal protein gene mapping and human chromosomal disorders

    SciTech Connect

    Kenmochi, N.; Goodman, N.; Page, D.C.

    1994-09-01

    In Drosophila, the Minute phenotype (reduced body size, diminished viability and fertility, and short, thin bristles) results from heterozygous deficiencies (deletions) at any one of 50 loci scattered about the genome. A handful of these Minute loci have been molecularly characterized, and all have been found to encode ribosomal proteins. Thus, the Minute phenotype appears to result from reduced protein synthetic capacity in flies with one rather than two copies of a given ribosomal protein (rp) gene. We are pursuing the possibility that similar reductions in protein synthetic capacity--again resulting from rp gene deficiencies--might underlie phenotypes associated with certain chromosomal disorders in humans. We and our colleagues have reported findings consistent with a role for RPS4 deficiency in the etiology of certain features of Turner syndrome, a complex human disorder classically associated with an XO karyotype. We are intrigued by the possibility that deficiencies of other human rp genes might cause phenotypic abnormalities similar to those seen in Turner syndrome--just as deficiencies of any of a number of Drosophila rp genes cause the Minute phenotype. We must first learn the chromosomal map position of each of the estimated 83 human rp genes. The task of mapping the functional (intron-containing) rp genes is complicated by the existence of processed pseudogenes elsewhere in the genome. To date, we have assigned (or confirmed the previous assignment of) 38 rp genes to individual human chromosomes by PCR analysis of human-rodent somatic cell hybrids containing subsets of human chromosomes, with all but four chromosomes carrying at least one rp gene. We have also identified more than 100 large-insert human YAC (yeast artificial chromosome) clones that contain individual rp genes. Such screening of YAC libraries will result in precise positioning of the rp genes on the emerging physical map of the human genome.

  19. Prediction of the Ebola virus infection related human genes using protein-protein interaction network.

    PubMed

    Cao, HuanHuan; Zhang, YuHang; Zhao, Jia; Zhu, Liucun; Wang, Yi; Li, JiaRui; Feng, Yuanming; Zhang, Ning

    2017-03-10

    Ebola hemorrhagic fever (EHF) is caused by Ebola virus (EBOV). It is reported that human could be infected by EBOV with a high fatality rate. However, association factors between EBOV and host still tend to be ambiguous. According to the "guilt by association" (GBA) principle, proteins interacting with each other are very likely to function similarly or the same. Based on this assumption, we tried to obtain EBOV infection-related human genes in a protein-protein interaction network using Dijkstra algorithm. We hope it could contribute to the discovery of novel effective treatments. Finally, 15 genes were selected as potential EBOV infection-related human genes.

  20. A Drosophila gene encoding a protein resembling the human beta-amyloid protein precursor.

    PubMed Central

    Rosen, D R; Martin-Morris, L; Luo, L Q; White, K

    1989-01-01

    We have isolated genomic and cDNA clones for a Drosophila gene resembling the human beta-amyloid precursor protein (APP). This gene produces a nervous system-enriched 6.5-kilobase transcript. Sequencing of cDNAs derived from the 6.5-kilobase transcript predicts an 886-amino acid polypeptide. This polypeptide contains a putative transmembrane domain and exhibits strong sequence similarity to cytoplasmic and extracellular regions of the human beta-amyloid precursor protein. There is a high probability that this Drosophila gene corresponds to the essential Drosophila locus vnd, a gene required for embryonic nervous system development. Images PMID:2494667

  1. A Drosophila gene encoding a protein resembling the human. beta. -amyloid protein precursor

    SciTech Connect

    Rosen, D.R.; Martin-Morris, L.; Luo, L.; White, K. )

    1989-04-01

    The authors have isolated genomic and cDNA clones for a Drosophila gene resembling the human {beta}-amyloid precursor protein (APP). This gene produces a nervous system-enriched 6.5-kilobase transcript. Sequencing of cDNAs derived from the 6.5-kilobase transcript predicts an 886-amino acid polypeptide. This polypeptide contains a putative transmembrane domain and exhibits strong sequence similarity to cytoplasmic and extracellular regions of the human {beta}-amyloid precursor protein. There is a high probability that this Drosophila gene corresponds to the essential Drosophila locus vnd, a gene required for embryonic nervous system development.

  2. The p53 gene and protein in human brain tumors

    SciTech Connect

    Louis, D.N. )

    1994-01-01

    Because p53 gene alterations are commonplace in human tumors and because p53 protein is involved in a number of important cellular pathways, p53 has become a topic of intensive investigation, both by basic scientists and clinicians. p53 was initially identified by two independent laboratories in 1979 as a 53 kilodalton (kD) protein that complexes with the large T antigen of SV40 virus. Shortly thereafter, it was shown that the E1B oncoprotein of adenovirus also binds p53. The binding of two different oncogenic viral tumor proteins to the same cellular protein suggested that p53 might be integral to tumorigenesis. The human p53 cDNA and gene were subsequently cloned in the mid-1980s, and analysis of p53 gene alterations in human tumors followed a few year later. During these 10 years, researchers grappling with the vagaries of p53 first characterized the gene as an oncogene, then as a tumor suppressor gene, and most recently as both a tumor suppressor gene and a so-called [open quotes]dominant negative[close quotes] oncogene. The last few years have seen an explosion in work on this single gene and its protein product. A review of a computerized medical database revealed approximately 650 articles on p53 in 1992 alone. p53 has assumed importance in neuro-oncology because p53 mutations and protein alterations are frequent in the common diffuse, fibrillary astrocytic tumors of adults. p53 mutations in astrocytomas were first described in 1989 and were followed by more extensive analyses of gene mutations and protein alterations in adult astrocytomas. The gene has also been studied in less common brain tumors. Elucidating the role of p53 in brain tumorigenesis will not only enhance understanding of brain tumor biology but may also contribute to improved diagnosis and therapy. This discussion reviews key aspects of the p53 gene and protein, and describe their emerging roles in central nervous system neoplasia. 102 refs., 6 figs., 1 tab.

  3. Identification of genes and proteins associated with anagen wool growth.

    PubMed

    Zhao, J; Liu, N; Liu, K; He, J; Yu, J; Bu, R; Cheng, M; De, W; Liu, J; Li, H

    2017-02-01

    Identifying genes of major effect for wool growth would offer strategies for improving the quality and increasing the yield of fine wool. In this study, we employed the Agilent Sheep Gene Expression Microarray and proteomic technology to investigate the gene expression patterns of body side skin (more wool growing) in Aohan fine wool sheep (a Chinese indigenous breed) in comparison with groin skin (no wool growing) at the anagen stage of the wool follicle. A microarray study revealed that 4772 probes were differentially expressed, including 2071 upregulated and 2701 downregulated probes, in the comparisons of body side skin vs. groin skin (S/G). The microarray results were verified by means of quantitative PCR. A total of 1099 probes were assigned to unique genes/transcripts. The number of distinct genes/transcripts (annotated) was 926, of which 352 were upregulated and 574 were downregulated. In S/G, 13 genes were upregulated by more than 10 fold, whereas 60 genes were downregulated by more than 10 fold. Further analysis revealed that the majority of the genes possibly related to the wool growth could be assigned to categories including regulation of cell division, intermediate filament, cytoskeletal part and growth factor activity. Several potential gene families may participate in hair growth regulation, including fibroblast growth factors, transforming growth factor-β, WNTs, insulin-like growth factor, vascular endothelial growth factors and so on. Proteomic analysis also revealed 196 differentially expressed protein points, of which 121 were identified as single protein points.

  4. Gene encoding herbicide safener binding protein

    SciTech Connect

    Walton, J.D.; Scott-Craig, J.S.

    1999-10-26

    The cDNA encoding safener binding protein (SafBP), also referred to as SBP1, is presented. The deduced amino acid sequence is provided. Methods of making and using SBP1 and SafBP to alter a plant's sensitivity to certain herbicides or a plant's responsiveness to certain safeners are also provided, as well as expression vectors, transgenic plants or other organisms transfected with vectors and seeds from the plants.

  5. The gene-protein database of Escherichia coli: edition 5.

    PubMed

    VanBogelen, R A; Sankar, P; Clark, R L; Bogan, J A; Neidhardt, F C

    1992-12-01

    The gene-protein database of Escherichia coli is both an index relating a gene to its protein product on two-dimensional gels, and a catalog of information about the function, regulation, and genetics of individual proteins obtained from two-dimensional gel analysis or collated from the literature. Edition 5 has 102 new entries--a 15% increase in the number of annotated two-dimensional gel spots. The large increase in this edition was accomplished in part by the use of a new method for expression analysis of ordered segments of the E. coli genome, which has resulted in linking 50 gel spots to their genes (or open reading frames) and another 45 to specific regions of the chromosome awaiting the availability of DNA sequence information. Communication of information from the scientific community resulted in additional identifications and regulatory information. To increase accessibility of the database it has been placed in the repository at the National Center for Biotechnology Information (NCBI) at the National Library of Medicine under the name ECO2DBASE. It will be updated twice yearly. This edition of the gene-protein database is estimated to contain entries for one-sixth of the protein-encoding genes of E. coli.

  6. Embryonic Expression and Evolution of Duplicated E-Protein Genes in Xenopus Laevis: Parallels with Ancestral E-Protein Genes

    PubMed Central

    Shain, D. H.; Neuman, T.; Zuber, M. X.

    1997-01-01

    E-proteins comprise a subfamily of helix-loop-helix transcription factors that have been identified in arthropods and several chordate taxa. In mammals, there are three classes of E-protein genes (E2A, E2-2, and HEB) that encode related, and often interchangeable, gene products. We have determined that the clawed frog Xenopus laevis contains twice the number of transcriptionally active E-protein genes when compared with other vertebrate species. Based upon genomic Southern blots and nucleotide sequence comparisons, it is likely that the additional X. laevis genes arose from tetraploidization. During embryogenesis, XE2A (homologue of mammalian E2A) transcripts were broadly expressed in anterior and posterior regions of the embryo while homologues of E2-2 (XE2.2) and HEB (XE1.2) appeared in vertebrate-specific structures including the pineal gland, olfactory bulb, and brachial arches. A phylogenetic analysis of these genes and other known metazoan E-proteins suggests that there were two periods of marked E-protein gene expansion; one that predated the radiation of vertebrates, and the other that coincided with Xenopus tetraploidization. Both of these periods were characterized by the rapid evolution of E2-2 and HEB-class genes, but not of E2A. We propose that the former genes acquired new or specialized roles during early chordate evolution and also more recently in Xenopus, as reflected by the stereotypic expression patterns of these genes during X. laevis development. PMID:9136023

  7. Detecting Essential Proteins Based on Network Topology, Gene Expression Data and Gene Ontology Information.

    PubMed

    Zhang, Wei; Xu, Jia; Li, Yuanyuan; Zou, Xiufen

    2016-10-07

    The identification of essential proteins in protein-protein interaction (PPI) networks is of great significance for understanding cellular processes. With the increasing availability of large-scale PPI data, numerous centrality measures based on network topology have been proposed to detect essential proteins from PPI networks. However, most of the current approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology annotation information. In this paper, we propose a novel centrality measure, called TEO, for identifying essential proteins by combining network topology, gene expression profiles and GO information. To evaluate the performance of the TEO method, we compare it with five other methods (degree, betweenness, NC, Pec, CowEWC) in detecting essential proteins from two different yeast PPI datasets. The simulation results show that adding GO information can effectively improve the predicted precision and that our method outperforms the others in predicting essential proteins.

  8. WWOX gene and gene product: tumor suppression through specific protein interactions.

    PubMed

    Salah, Zaidoun; Aqeilan, Rami; Huebner, Kay

    2010-02-01

    The WWOX gene, an archetypal fragile gene, encompasses a chromosomal fragile site at 16q23.2, and encodes the approximately 46-kDa Wwox protein, with WW domains that interact with a growing list of interesting proteins. If the function of a protein is defined by the company it keeps, then Wwox is involved in numerous important signal pathways for bone and germ-cell development, cellular and animal growth and death, transcriptional control and suppression of cancer development. Because alterations to genes at fragile sites are exquisitely sensitive to replication stress-induced DNA damage, there has been an ongoing scientific discussion questioning whether such gene expression alterations provide a selective advantage for clonal expansion of neoplastic cells, and a parallel discussion on why important genes would be present at sites that are susceptible to inactivation. We offer some answers through a description of known WWOX functions.

  9. Gene encoding herbicide safener binding protein

    DOEpatents

    Walton, Jonathan D.; Scott-Craig, John S.

    1999-01-01

    The cDNA encoding safener binding protein (SafBP), also referred to as SBP1, is set forth in FIG. 5 and SEQ ID No. 1. The deduced amino acid sequence is provided in FIG. 5 and SEQ ID No. 2. Methods of making and using SBP1 and SafBP to alter a plant's sensitivity to certain herbicides or a plant's responsiveness to certain safeners are also provided, as well as expression vectors, transgenic plants or other organisms transfected with said vectors and seeds from said plants.

  10. The ribosomal protein genes and Minute loci of Drosophila melanogaster

    PubMed Central

    Marygold, Steven J; Roote, John; Reuter, Gunter; Lambertsson, Andrew; Ashburner, Michael; Millburn, Gillian H; Harrison, Paul M; Yu, Zhan; Kenmochi, Naoya; Kaufman, Thomas C; Leevers, Sally J; Cook, Kevin R

    2007-01-01

    Background Mutations in genes encoding ribosomal proteins (RPs) have been shown to cause an array of cellular and developmental defects in a variety of organisms. In Drosophila melanogaster, disruption of RP genes can result in the 'Minute' syndrome of dominant, haploinsufficient phenotypes, which include prolonged development, short and thin bristles, and poor fertility and viability. While more than 50 Minute loci have been defined genetically, only 15 have so far been characterized molecularly and shown to correspond to RP genes. Results We combined bioinformatic and genetic approaches to conduct a systematic analysis of the relationship between RP genes and Minute loci. First, we identified 88 genes encoding 79 different cytoplasmic RPs (CRPs) and 75 genes encoding distinct mitochondrial RPs (MRPs). Interestingly, nine CRP genes are present as duplicates and, while all appear to be functional, one member of each gene pair has relatively limited expression. Next, we defined 65 discrete Minute loci by genetic criteria. Of these, 64 correspond to, or very likely correspond to, CRP genes; the single non-CRP-encoding Minute gene encodes a translation initiation factor subunit. Significantly, MRP genes and more than 20 CRP genes do not correspond to Minute loci. Conclusion This work answers a longstanding question about the molecular nature of Minute loci and suggests that Minute phenotypes arise from suboptimal protein synthesis resulting from reduced levels of cytoribosomes. Furthermore, by identifying the majority of haplolethal and haplosterile loci at the molecular level, our data will directly benefit efforts to attain complete deletion coverage of the D. melanogaster genome. PMID:17927810

  11. Combining random gene fission and rational gene fusion to discover near-infrared fluorescent protein fragments that report on protein-protein interactions.

    PubMed

    Pandey, Naresh; Nobles, Christopher L; Zechiedrich, Lynn; Maresso, Anthony W; Silberg, Jonathan J

    2015-05-15

    Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein-protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein-protein interactions within whole animals.

  12. Cellular unfolded protein response against viruses used in gene therapy

    PubMed Central

    Sen, Dwaipayan; Balakrishnan, Balaji; Jayandharan, Giridhara R.

    2014-01-01

    Viruses are excellent vehicles for gene therapy due to their natural ability to infect and deliver the cargo to specific tissues with high efficiency. Although such vectors are usually “gutted” and are replication defective, they are subjected to clearance by the host cells by immune recognition and destruction. Unfolded protein response (UPR) is a naturally evolved cyto-protective signaling pathway which is triggered due to endoplasmic reticulum (ER) stress caused by accumulation of unfolded/misfolded proteins in its lumen. The UPR signaling consists of three signaling pathways, namely PKR-like ER kinase, activating transcription factor 6, and inositol-requiring protein-1. Once activated, UPR triggers the production of ER molecular chaperones and stress response proteins to help reduce the protein load within the ER. This occurs by degradation of the misfolded proteins and ensues in the arrest of protein translation machinery. If the burden of protein load in ER is beyond its processing capacity, UPR can activate pro-apoptotic pathways or autophagy leading to cell death. Viruses are naturally evolved in hijacking the host cellular translation machinery to generate a large amount of proteins. This phenomenon disrupts ER homeostasis and leads to ER stress. Alternatively, in the case of gutted vectors used in gene therapy, the excess load of recombinant vectors administered and encountered by the cell can trigger UPR. Thus, in the context of gene therapy, UPR becomes a major roadblock that can potentially trigger inflammatory responses against the vectors and reduce the efficiency of gene transfer. PMID:24904562

  13. Protein-directed ribosomal frameshifting temporally regulates gene expression

    PubMed Central

    Napthine, Sawsan; Ling, Roger; Finch, Leanne K.; Jones, Joshua D.; Bell, Susanne; Brierley, Ian; Firth, Andrew E.

    2017-01-01

    Programmed −1 ribosomal frameshifting is a mechanism of gene expression, whereby specific signals within messenger RNAs direct a proportion of translating ribosomes to shift −1 nt and continue translating in the new reading frame. Such frameshifting normally occurs at a set ratio and is utilized in the expression of many viral genes and a number of cellular genes. An open question is whether proteins might function as trans-acting switches to turn frameshifting on or off in response to cellular conditions. Here we show that frameshifting in a model RNA virus, encephalomyocarditis virus, is trans-activated by viral protein 2A. As a result, the frameshifting efficiency increases from 0 to 70% (one of the highest known in a mammalian system) over the course of infection, temporally regulating the expression levels of the viral structural and enzymatic proteins. PMID:28593994

  14. The proteolipid protein gene: Double, double, . . . and trouble

    SciTech Connect

    Hodes, M.E.; Dlouhy, S.R.

    1996-07-01

    That more of a good thing may be too much has been apparent at least since the discovery that Down syndrome is caused by three copies of chromosome 21 instead of the normal two. Duplications of myelin genes also lead to trouble. An extra dose of PMP22, the gene for a protein of peripheral nervous system myelin, causes Charcot-Marie Tooth type 1A disease (CMT1A). Increased dosage of the proteolipid protein gene, PLP, which encodes the chief protein of CNS myelin, can cause Pelizaeus-Merzbacher disease (PMD). The work of Inoue et al. is of particular importance because they found the duplication in four of five families with {open_quotes}classical{close_quotes} PMD, whereas other changes in PLP, such as missense mutations, are found in no more than one in four or five patients with the disease. 27 refs.

  15. NAM-1gene polymorphism and grain protein content in Hordeum.

    PubMed

    Jamar, Catherine; Loffet, Francois; Frettinger, Patrick; Ramsay, Luke; Fauconnier, Marie-Laure; du Jardin, Patrick

    2010-04-15

    Grain protein content (GPC) is a key quality factor for malting and brewing process. In wheat, a QTL explaining a large part of GPC variation was identified, which co-localizes with a gene encoding a NAC transcription factor (TtNAM-B1). NAC transcription factors influence GPC by their role in the regulation of senescence and in protein remobilization. An orthologous gene was discovered on barley chromosome 6H where a GPC QTL was mapped. In this study, we identify allelic variation of the NAM-1 gene for three species of Hordeum representing wild and cultivated barley and we investigate the possible link with GPC. Three haplotypes were identified, one corresponds to the sequences of 11 European varieties representing H. vulgare, one corresponds to the sequence found in H. spontaneum and one represents the sequence of H. bulbosum. Three SNPs were identified between H. spontaneum sequence and H. vulgare sequence. One of the H. bulbosum polymorphisms leads to the introduction of a stop codon and a non-functional protein. Differences in GPC between the 11 varieties were found but no polymorphism in the NAM-1 gene was observed, suggesting that differences in expression of the HvNAM-1 gene or other genes should play a role in GPC regulation. Nevertheless based on published values for GPC of H. bulbosum and H. spontaneum compared to GPC measured here in H. vulgare, the non-functional protein is associated with the lower GPC, suggesting that loss of functionality of the NAM-1 gene in Hordeum is related to lower GPC. Moreover H. spontaneum GPC seems to be higher than H. vulgare GPC, suggesting also that allelic variation of the functional NAM-1 gene could be associated with GPC variation within the genus Hordeum. Copyright 2009 Elsevier GmbH. All rights reserved.

  16. Evolution of yolk protein genes in the Echinodermata.

    PubMed

    Prowse, Thomas A A; Byrne, Maria

    2012-01-01

    Vitellogenin genes (vtg) encode large lipid transfer proteins (LLTPs) that are typically female-specific, functioning as precursors to major yolk proteins (MYPs). Within the phylum Echinodermata, however, the MYP of the Echinozoa (Echinoidea + Holothuroidea) is expressed by an unrelated transferrin-like gene that has a reproductive function in both sexes. We investigated egg proteins in the Asterozoa (Asteroidea + Ophiuroidea), a sister clade to the Echinozoa, showing that eggs of the asteroid Parvulastra exigua contain a vitellogenin protein (Vtg). vtg is expressed by P. exigua, a species with large eggs and nonfeeding larvae, and by the related asterinid Patiriella regularis which has small eggs and feeding larvae. In the Asteroidea, therefore, the reproductive function of vtg is conserved despite significant life history evolution. Like the echinozoan MYP gene, asteroid vtg is expressed in both sexes and may play a role in the development of both ovaries and testes. Phylogenetic analysis indicated that a putative Vtg from the sea urchin genome, a likely pseudogene, does not clade with asteroid Vtg. We propose the following sequence as a potential pathway for the evolution of YP genes in the Echinodermata: (1) the ancestral echinoderm produced YPs derived from Vtg, (2) bisexual vtg expression subsequently evolved in the echinoderm lineage, (3) the reproductive function of vtg was assumed by a transferrin-like gene in the ancestral echinozoan, and (4) redundant echinozoan vtg was released from stabilizing selection.

  17. Identifying Novel Candidate Genes Related to Apoptosis from a Protein-Protein Interaction Network

    PubMed Central

    Wang, Baoman; Yuan, Fei; Kong, Xiangyin; Hu, Lan-Dian; Cai, Yu-Dong

    2015-01-01

    Apoptosis is the process of programmed cell death (PCD) that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature. PMID:26543496

  18. Use of Galerina marginata genes and proteins for peptide production

    DOEpatents

    Hallen-Adams, Heather E.; Scott-Craig, John S.; Walton, Jonathan D.; Luo, Hong

    2017-03-21

    The present invention relates to compositions and methods comprising genes and peptides associated with cyclic peptides and cyclic peptide production in mushrooms. In particular, the present invention relates to using genes and proteins from Galerina species encoding peptides specifically relating to amatoxins in addition to proteins involved with processing cyclic peptide toxins. In a preferred embodiment, the present invention also relates to methods for making small peptides and small cyclic peptides including peptides similar to amanitin. Further, the present inventions relate to providing kits for making small peptides.

  19. Use of Galerina marginata genes and proteins for peptide production

    DOEpatents

    Hallen-Adams, Heather E.; Scott-Craig, John S.; Walton, Jonathan D.; Luo, Hong

    2016-03-01

    The present invention relates to compositions and methods comprising genes and peptides associated with cyclic peptides and cyclic peptide production in mushrooms. In particular, the present invention relates to using genes and proteins from Galerina species encoding peptides specifically relating to amatoxins in addition to proteins involved with processing cyclic peptide toxins. In a preferred embodiment, the present invention also relates to methods for making small peptides and small cyclic peptides including peptides similar to amanitin. Further, the present inventions relate to providing kits for making small peptides.

  20. Divinyl ether synthase gene, and protein and uses thereof

    DOEpatents

    Howe, Gregg A.; Itoh, Aya

    2006-12-26

    The present invention relates to divinyl ether synthase genes, proteins, and methods of their use. The present invention encompasses both native and recombinant wild-type forms of the synthase, as well as mutants and variant forms, some of which possess altered characteristics relative to the wild-type synthase. The present invention also relates to methods of using divinyl ether synthase genes and proteins, including in their expression in transgenic organisms and in the production of divinyl ether fatty acids, and to methods of suing divinyl ether fatty acids, including in the protection of plants from pathogens.

  1. Divinyl ether synthase gene and protein, and uses thereof

    DOEpatents

    Howe, Gregg A [East Lansing, MI; Itoh, Aya [Tsuruoka, JP

    2011-09-13

    The present invention relates to divinyl ether synthase genes, proteins, and methods of their use. The present invention encompasses both native and recombinant wild-type forms of the synthase, as well as mutants and variant forms, some of which possess altered characteristics relative to the wild-type synthase. The present invention also relates to methods of using divinyl ether synthase genes and proteins, including in their expression in transgenic organisms and in the production of divinyl ether fatty acids, and to methods of suing divinyl ether fatty acids, including in the protection of plants from pathogens.

  2. Gene 5. 5 protein of bacteriophaze T7 inhibits the nucleoid protein H-NS of Escherichia coli

    SciTech Connect

    Liu, Q.; Richardson, C.C. )

    1993-03-01

    Gene 5.5 of coliphage T7 is one of the most highly expressed genes during T7 infection. Gene 5.5 protein, purified from cells overexpressing the cloned gene, purifies with the nucleoid protein H-NS of Escherichia coli during three chromatographic steps. A fusion protein of gene 5.5 protein and maltose binding protein also purifies with H-NS. The fusion protein binds to the DNA-H-NS complex and abolishes H-NS-mediated inhibition of transcription by Escherichia coli and T7 RNA polymerases in vitro. Expression of gene 5.5 also relieves the repression of the Escherichia coli proU promoter by H-NS in vivo. The change of leucine to proline at residue 30 of gene 5.5 protein abolishes the interaction between gene 5.5 protein and H-NS. 30 refs., 4 figs., 1 tab.

  3. Japanese neuropathy patients with peripheral myelin protein-22 gene aneuploidy

    SciTech Connect

    Lebo, R.V.; Li, L.Y.; Flandermeyer, R.R.

    1994-09-01

    Peripheral myelin protein (PMP-22) gene aneuploidy results in Charcot-Marie-Tooth disease Type 1A (CMT1A) and the Hereditary Neuropathy with Liability to Pressure Palsy (HNPP) in Japanese patients as well as Caucasian Americans. Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, results when expression of one of at least seven genes is defective. CMT1A, about half of all CMT mutations, is usually associated with a duplication spanning the peripheral myelin protein-22 gene on distal chromosome band 17p11.2. Autosomal dominant HNPP (hereditary pressure and sensory neuropathy, HPSN) results from a deletion of the CMT1A gene region. Multicolor in situ hybridization with PMP-22 gene region probe characterized HNPP deletion reliably and detected all different size duplications reported previously. In summary, 72% of 28 Japanese CMT1 (HMSNI) patients tested had the CMT1A duplication, while none of the CMT2 (HMSNII) or CMT3 (HMSNIII) patients had a duplication. Three cases of HNPP were identified by deletion of the CMT1A gene region on chromosome 17p. HNPP and CMT1A have been reported to result simultaneously from the same unequal recombination event. The lower frequency of HNPP compared to CMT1A suggests that HNPP patients have a lower reproductive fitness than CMT1A patients. This result, along with a CMT1A duplication found in an Asian Indian family, demonstrates the broad geographic distribution and high frequency of PMP-22 gene aneuploidy.

  4. Possible eggshell protein gene from Schistosoma mansoni.

    PubMed

    Johnson, K S; Taylor, D W; Cordingley, J S

    1987-01-02

    We have identified and sequenced a cDNA clone of a mRNA found only in mature female schistosomes. This mRNA is not detectably synthesized by female worms from single sex infections (unisexual females), by males or by the developing miracidia in the eggs. The clone hybridises to a highly abundant polyadenylated mRNA of approximately 1500 nucleotides. The nucleotide sequence of the clone predicts a polypeptide comprising two repetitive regions. A pentapeptide repeat with the consensus sequence Gly-Tyr-Asp-Lys-Tyr, and a region rich in histidine residues. Hybrid selected mRNA translated in vitro with [3H]tyrosine as labelled amino acid yields a polypeptide of 48 kDa (p48) that corresponds to the major [3H]tyrosine labelled translation product of female worm total mRNA. p48 does not label with [35S]methionine and is absent from the translation products of male and unisexual female mRNAs. The amino acid sequence of p48 has significant homologies to silk moth chorion proteins and we suggest that it is one of the major components of the schistosome eggshell probably accounting for the high level of [3H]tyrosine incorporation into the vitellaria of Schistosoma mansoni. The tyrosine content of the polypeptide suggests that it may play a role in phenol oxidase mediated cross-linking of the schistosome eggshell and in support of this we find that mushroom phenol oxidase will cause the specific cross-linking of p48 in in vitro translation products.

  5. Fe-S Proteins that Regulate Gene Expression

    PubMed Central

    Mettert, Erin L.; Kiley, Patricia J.

    2014-01-01

    Iron-sulfur (Fe-S) cluster containing proteins that regulate gene expression are present in most organisms. The innate chemistry of their Fe-S cofactors makes these regulatory proteins ideal for sensing environmental signals, such as gases (e.g. O2 and NO), levels of Fe and Fe-S clusters, reactive oxygen species, and redox cycling compounds, to subsequently mediate an adaptive response. Here we review the recent findings that have provided invaluable insight into the mechanism and function of these highly significant Fe-S regulatory proteins. PMID:25450978

  6. Challenges in biotechnology at LLNL: from genes to proteins

    SciTech Connect

    Albala, J S

    1999-03-11

    This effort has undertaken the task of developing a link between the genomics, DNA repair and structural biology efforts within the Biology and Biotechnology Research Program at LLNL. Through the advent of the I.M.A.G.E. (Integrated Molecular Analysis of Genomes and their Expression) Consortium, a world-wide effort to catalog the largest public collection of genes, accepted and maintained within BBRP, it is now possible to systematically express the protein complement of these to further elucidate novel gene function and structure. The work has ensued in four phases, outlined as follows: (1) Gene and System selection; (2) Protein expression and purification; (3) Structural analysis; and (4) biological integration. Proteins to be expressed have been those of high programmatic interest. This includes, in particular, proteins involved in the maintenance of genome integrity, particularly those involved in the repair of DNA damage, including ERCC1, ERCC4, XRCC2, XRCC3, XRCC9, HEX1, APN1, p53, RAD51B, RAD51C, and RAD51. Full-length cDNA cognates of selected genes were isolated, and cloned into baculovirus-based expression vectors. The baculoviral expression system for protein over-expression is now well-established in the Albala laboratory. Procedures have been successfully optimized for full-length cDNA clining into expression vectors for protein expression from recombinant constructs. This includes the reagents, cell lines, techniques necessary for expression of recombinant baculoviral constructs in Spodoptera frugiperda (Sf9) cells. The laboratory has also generated a high-throughput baculoviral expression paradigm for large scale expression and purification of human recombinant proteins amenable to automation.

  7. Expression of heat shock protein genes in insect stress responses

    USDA-ARS?s Scientific Manuscript database

    The heat shock proteins (HSPs) that are abundantly expressed in insects are important modulators of insect survival. Expression of HSP genes in insects is not only developmentally regulated, but also induced by various stressors in order to confer protection against such stressors. The expression o...

  8. Linking Protein and RNA Function within the Same Gene.

    PubMed

    Szempruch, Anthony; Guttman, Mitchell

    2017-02-23

    Exposure to ultraviolet light leads to a cell-wide DNA damage response that includes a global reduction in transcription. Williamson et al., identify a protein involved in this process as well as a noncoding RNA produced by alternative processing of RNA transcribed from the same gene that promotes recovery from the repressed state.

  9. Molecular evolution of the mammalian ribosomal protein gene, RPS14.

    PubMed

    Rhoads, D D; Roufa, D J

    1991-07-01

    Ribosomal protein S14 genes (RPS14) in eukaryotic species from protozoa to primates exhibit dramatically different intron-exon structures yet share homologous polypeptide-coding sequences. To recognize common features of RPS14 gene architectures in closely related mammalian species and to evaluate similarities in their noncoding DNA sequences, we isolated the intron-containing S14 locus from Chinese hamster ovary (CHO) cell DNA by using a PCR strategy and compared it with human RPS14. We found that rodent and primate S14 genes are composed of identical protein-coding exons interrupted by introns at four conserved DNA sites. However, the structures of corresponding CHO and human RPS14 introns differ significantly. Nonetheless, individual intron splice donor, splice acceptor, and upstream flanking motifs have been conserved within mammalian S14 homologues as well as within RPS14 gene fragments PCR amplified from other vertebrate genera (birds and bony fish). Our data indicate that noncoding, intronic DNA sequences within highly conserved, single-copy ribosomal protein genes are useful molecular landmarks for phylogenetic analysis of closely related vertebrate species.

  10. Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis

    PubMed Central

    Méheust, Raphaël; Zelzion, Ehud; Bhattacharya, Debashish; Lopez, Philippe; Bapteste, Eric

    2016-01-01

    The integration of foreign genetic information is central to the evolution of eukaryotes, as has been demonstrated for the origin of the Calvin cycle and of the heme and carotenoid biosynthesis pathways in algae and plants. For photosynthetic lineages, this coordination involved three genomes of divergent phylogenetic origins (the nucleus, plastid, and mitochondrion). Major hurdles overcome by the ancestor of these lineages were harnessing the oxygen-evolving organelle, optimizing the use of light, and stabilizing the partnership between the plastid endosymbiont and host through retargeting of proteins to the nascent organelle. Here we used protein similarity networks that can disentangle reticulate gene histories to explore how these significant challenges were met. We discovered a previously hidden component of algal and plant nuclear genomes that originated from the plastid endosymbiont: symbiogenetic genes (S genes). These composite proteins, exclusive to photosynthetic eukaryotes, encode a cyanobacterium-derived domain fused to one of cyanobacterial or another prokaryotic origin and have emerged multiple, independent times during evolution. Transcriptome data demonstrate the existence and expression of S genes across a wide swath of algae and plants, and functional data indicate their involvement in tolerance to oxidative stress, phototropism, and adaptation to nitrogen limitation. Our research demonstrates the “recycling” of genetic information by photosynthetic eukaryotes to generate novel composite genes, many of which function in plastid maintenance. PMID:26976593

  11. Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis.

    PubMed

    Méheust, Raphaël; Zelzion, Ehud; Bhattacharya, Debashish; Lopez, Philippe; Bapteste, Eric

    2016-03-29

    The integration of foreign genetic information is central to the evolution of eukaryotes, as has been demonstrated for the origin of the Calvin cycle and of the heme and carotenoid biosynthesis pathways in algae and plants. For photosynthetic lineages, this coordination involved three genomes of divergent phylogenetic origins (the nucleus, plastid, and mitochondrion). Major hurdles overcome by the ancestor of these lineages were harnessing the oxygen-evolving organelle, optimizing the use of light, and stabilizing the partnership between the plastid endosymbiont and host through retargeting of proteins to the nascent organelle. Here we used protein similarity networks that can disentangle reticulate gene histories to explore how these significant challenges were met. We discovered a previously hidden component of algal and plant nuclear genomes that originated from the plastid endosymbiont: symbiogenetic genes (S genes). These composite proteins, exclusive to photosynthetic eukaryotes, encode a cyanobacterium-derived domain fused to one of cyanobacterial or another prokaryotic origin and have emerged multiple, independent times during evolution. Transcriptome data demonstrate the existence and expression of S genes across a wide swath of algae and plants, and functional data indicate their involvement in tolerance to oxidative stress, phototropism, and adaptation to nitrogen limitation. Our research demonstrates the "recycling" of genetic information by photosynthetic eukaryotes to generate novel composite genes, many of which function in plastid maintenance.

  12. Bioengineered Silk Protein-Based Gene Delivery Systems

    PubMed Central

    Numata, Keiji; Subramanian, Balajikarthick; Currie, Heather A.; Kaplan, David L.

    2009-01-01

    Silk proteins self-assemble into mechanically robust material structures that are also biodegradable and non-cytotoxic, suggesting utility for gene delivery. Since silk proteins can also be tailored in terms of chemistry, molecular weight and other design features via genetic engineering, further control of this system for gene delivery can be considered. In the present study, silk-based block copolymers were bioengineered with poly(l-lysine) domains for gene delivery. Ionic complexes of these silk-polylysine based block copolymers with plasmid DNA (pDNA) were prepared for gene delivery to human embryonic kidney (HEK) cells. The material systems were characterized by agarose gel electrophoresis, atomic force microscopy, and dynamic light scattering. The polymers self-assembled in solution and complexed plasmid DNA through ionic interactions. The pDNA complexes with 30-lysine residues prepared at a polymer/nucleotide ratio of 10 and with a solution diameter of 380 nm, showed the highest efficiency for transfection. The pDNA complexes were also immobilized on silk films and demonstrated direct cell transfection from these surfaces. The results demonstrate the potential of bioengineered silk proteins as a new family of highly tailored gene delivery systems. PMID:19577803

  13. Transcriptional regulation of the uncoupling protein-1 gene.

    PubMed

    Villarroya, Francesc; Peyrou, Marion; Giralt, Marta

    2017-03-01

    Regulated transcription of the uncoupling protein-1 (UCP1) gene, and subsequent UCP1 protein synthesis, is a hallmark of the acquisition of the differentiated, thermogenically competent status of brown and beige/brite adipocytes, as well as of the responsiveness of brown and beige/brite adipocytes to adaptive regulation of thermogenic activity. The 5' non-coding region of the UCP1 gene contains regulatory elements that confer tissue specificity, differentiation dependence, and neuro-hormonal regulation to UCP1 gene transcription. Two main regions-a distal enhancer and a proximal promoter region-mediate transcriptional regulation through interactions with a plethora of transcription factors, including nuclear hormone receptors and cAMP-responsive transcription factors. Co-regulators, such as PGC-1α, play a pivotal role in the concerted regulation of UCP1 gene transcription. Multiple interactions of transcription factors and co-regulators at the promoter region of the UCP1 gene result in local chromatin remodeling, leading to activation and increased accessibility of RNA polymerase II and subsequent gene transcription. Moreover, a commonly occurring A-to-G polymorphism in close proximity to the UCP1 gene enhancer influences the extent of UCP1 gene transcription. Notably, it has been reported that specific aspects of obesity and associated metabolic diseases are associated with human population variability at this site. On another front, the unique properties of the UCP1 promoter region have been exploited to develop brown adipose tissue-specific gene delivery tools for experimental purposes. Copyright © 2016 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.

  14. Cell Cycle Programs of Gene Expression Control Morphogenetic Protein Localization

    PubMed Central

    Lord, Matthew; Yang, Melody C.; Mischke, Michelle; Chant, John

    2000-01-01

    Genomic studies in yeast have revealed that one eighth of genes are cell cycle regulated in their expression. Almost without exception, the significance of cell cycle periodic gene expression has not been tested. Given that many such genes are critical to cellular morphogenesis, we wanted to examine the importance of periodic gene expression to this process. The expression profiles of two genes required for the axial pattern of cell division, BUD3 and BUD10/AXL2/SRO4, are strongly cell cycle regulated. BUD3 is expressed close to the onset of mitosis. BUD10 is expressed in late G1. Through promotor-swap experiments, the expression profile of each gene was altered and the consequences examined. We found that an S/G2 pulse of BUD3 expression controls the timing of Bud3p localization, but that this timing is not critical to Bud3p function. In contrast, a G1 pulse of BUD10 expression plays a direct role in Bud10p localization and function. Bud10p, a membrane protein, relies on the polarized secretory machinery specific to G1 to be delivered to its proper location. Such a secretion-based targeting mechanism for membrane proteins provides cells with flexibility in remodeling their architecture or evolving new forms. PMID:11134078

  15. Patterns of soybean proline-rich protein gene expression.

    PubMed Central

    Wyatt, R E; Nagao, R T; Key, J L

    1992-01-01

    The expression patterns of three members of a gene family that encodes proline-rich proteins in soybean (SbPRPs) were examined using in situ hybridization experiments. In most instances, the expression of SbPRP genes was intense in a limited number of cell types of a particular organ. SbPRP1 RNA was localized in several cell types of soybean hypocotyls, including cells within the phloem and xylem. SbPRP1 expression increased within epidermal cells in the elongating and mature regions of the hypocotyl; expression was detected also in lignified cells surrounding the hilum of mature seeds. SbPRP2 RNA was present in cortical cells and in the vascular tissue of the hypocotyl, especially cells of the phloem. This gene was expressed also in the inner integuments of the mature seed coat. SbPRP3 RNA was localized specifically to the endodermoid layer of cells surrounding the stele in the elongating region of the hypocotyl, as well as in the epidermal cells of leaves and cotyledons. These data show that members of this gene family exhibit cell-specific expression. The members of the SbPRP gene family are expressed in different types of cells and in some cell types that also express the glycine-rich protein or hydroxyproline-rich glycoprotein classes of genes. PMID:1525563

  16. Comprehensive functional analysis of large lists of genes and proteins.

    PubMed

    Mlecnik, Bernhard; Galon, Jérôme; Bindea, Gabriela

    2017-03-22

    The interpretation of high dimensional datasets resulting from genomic and proteomic experiments in a timely and efficient manner is challenging. ClueGO software is a Cytoscape App that extracts representative functional biological information for large lists of genes or proteins. The functional enrichment analysis is based on the latest publicly available data from multiple annotation and ontology resources that can be automatically accessed through ClueGO. Predefined settings for the selection of the terms are provided to facilitate the analysis. Results are visualized as networks in which Gene Ontology (GO) terms and pathways are grouped based on their biological role. Many species are now supported by ClueGO and additional organisms are added on demand. ClueGO can be used together with the CluePedia App to enable the visualization of protein-protein interactions within or between pathways.

  17. Sequences of the recA gene and protein.

    PubMed

    Sancar, A; Stachelek, C; Konigsberg, W; Rupp, W D

    1980-05-01

    We have determined the nucleotide sequence of the recA gene of Escherichia coli; this permits the formulation of the primary structure for the recA protein. This structure is consistent with the amino acid composition of the tryptic peptides obtained from the recA protein. The coding region of the recA gene has 1059 base pairs, which specify 352 amino acids. The recA protein has alanine and phenylalanine as its NH2- and COOH-terminal amino acids, respectively, and has the following amino acid composition: Cys3 Asp20 Asn15 Met9 Thr17 Ser20 Glu30 Gln13 Pro10 Gly35 Ala38 Val22 Ile27 Leu31 Tyr7 Phe10 His2Lys27 Trp2 Arg14. Of the three cysteine residues, only two can be alkylated under reducing and denaturing conditions. The molecular weight of the recA polypeptide is 37,842.

  18. SITEX 2.0: Projections of protein functional sites on eukaryotic genes. Extension with orthologous genes.

    PubMed

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2017-04-01

    Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .

  19. cAMP receptor protein (CRP)-mediated resistance/tolerance in bacteria: mechanism and utilization in biotechnology.

    PubMed

    Geng, Hefang; Jiang, Rongrong

    2015-06-01

    Cyclic AMP receptor protein (CRP) is one of the seven global regulators in Escherichia coli, which regulates the expression of over 490 genes. It contains a cAMP binding N-terminal domain and a DNA binding C-terminal domain, connected via a short hinge region. Various stress-tolerant E. coli mutants had been obtained through transcriptional engineering of CRP. This review aims to shed some light on the possible mechanism behind these CRP variants, from the change in CRP structure, transcription profile, and DNA binding affinity. The amino acid substitutions are distributed along the protein-certain mutations have shown higher frequency than others, such as T127N and D138Y. β-Galactosidase reporter gene assay revealed that CRP mutants had lower binding affinity with all three classes of CRP-dependent promoters as compared to native CRP, which probably would change cellular transcription profile. Different CRP mutants would induce different cellular transcription profile in E. coli, but there are common genes differentially expressed in these variants, including upregulated gadAB and downregulated nontransporter genes aspA and tnaA, and transporter/poringenes malE, mglB, cstA, and lamB. We believe that transcriptional engineering of CRP can provide an alternative strain engineering method for E. coli and its detailed mechanism may need further investigations.

  20. Intron retention in the Drosophila melanogaster Rieske iron sulphur protein gene generated a new protein

    PubMed Central

    Gontijo, Alisson M.; Miguela, Veronica; Whiting, Michael F.; Woodruff, R.C.; Dominguez, Maria

    2011-01-01

    Genomes can encode a variety of proteins with unrelated architectures and activities. It is known that protein-coding genes of de novo origin have significantly contributed to this diversity. However, the molecular mechanisms and evolutionary processes behind these originations are still poorly understood. Here we show that the last 102 codons of a novel gene, Noble, assembled directly from non-coding DNA following an intronic deletion that induced alternative intron retention at the Drosophila melanogaster Rieske Iron Sulphur Protein (RFeSP) locus. A systematic analysis of the evolutionary processes behind the origin of Noble showed that its emergence was strongly biased by natural selection on and around the RFeSP locus. Noble mRNA is shown to encode a bona fide protein that lacks an iron sulphur domain and localizes to mitochondria. Together, these results demonstrate the generation of a novel protein at a naturally selected site. PMID:21610726

  1. Disease Gene Prioritization Based on Topological Similarity in Protein-Protein Interaction Networks

    NASA Astrophysics Data System (ADS)

    Erten, Sinan; Bebek, Gurkan; Koyutürk, Mehmet

    In recent years, many algorithms have been developed to narrow down the set of candidate disease genes implicated by genome wide association studies (GWAS), using knowledge on protein-protein interactions (PPIs). All of these algorithms are based on a common principle; functional association between proteins is correlated with their connectivity/proximity in the PPI network. However, recent research also reveals that networks are organized into recurrent network schemes that underlie the mechanisms of cooperation among proteins with different function, as well as the crosstalk between different cellular processes. In this paper, we hypothesize that proteins that are associated with similar diseases may exhibit patterns of "topological similarity" in PPI networks. Motivated by these observations, we introduce the notion of "topological profile", which represents the location of a protein in the network with respect to other proteins. Based on this notion, we develop a novel measure to assess the topological similarity of proteins in a PPI network. We then use this measure to develop algorithms that prioritize candidate disease genes based on the topological similarity of their products and the products of known disease genes. Systematic experimental studies using an integrated human PPI network and the Online Mendelian Inheritance (OMIM) database show that the proposed algorithm, Vavien, clearly outperforms state-of-the-art network based prioritization algorithms. Vavien is available as a web service at http://www.diseasegenes.org .

  2. The ubiquitous octamer-binding protein(s) is sufficient for transcription of immunoglobulin genes.

    PubMed Central

    Johnson, D G; Carayannopoulos, L; Capra, J D; Tucker, P W; Hanke, J H

    1990-01-01

    All immunoglobulin genes contain a conserved octanucleotide promoter element, ATGCAAAT, which has been shown to be required for their normal B-cell-specific transcription. Proteins that bind this octamer have been purified, and cDNAs encoding octamer-binding proteins have been cloned. Some of these proteins (referred to as OTF-2) are lymphoid specific, whereas at least one other, and possibly more (referred to as OTF-1), is found ubiquitously in all cell types. The exact role of these different proteins in directing the tissue-specific expression of immunoglobulin genes is unclear. We have identified two human pre-B-cell lines that contain extremely low levels of OTF-2 yet still express high levels of steady-state immunoglobulin heavy-chain mRNA in vivo and efficiently transcribe an immunoglobulin gene in vitro. Addition of a highly enriched preparation of OTF-1 made from one of these pre-B cells or from HeLa cells specifically stimulated in vitro transcription of an immunoglobulin gene. Furthermore, OFT-1 appeared to have approximately the same transactivation ability as OTF-2 when normalized for binding activity. These results suggest that OTF-1, without OTF-2, is sufficient for transcription of immunoglobulin genes and that OTF-2 alone is not responsible for the B-cell-specific regulation of immunoglobulin gene expression. Images PMID:2304473

  3. Gene3D: modelling protein structure, function and evolution.

    PubMed

    Yeats, Corin; Maibaum, Michael; Marsden, Russell; Dibley, Mark; Lee, David; Addou, Sarah; Orengo, Christine A

    2006-01-01

    The Gene3D release 4 database and web portal (http://cathwww.biochem.ucl.ac.uk:8080/Gene3D) provide a combined structural, functional and evolutionary view of the protein world. It is focussed on providing structural annotation for protein sequences without structural representatives--including the complete proteome sets of over 240 different species. The protein sequences have also been clustered into whole-chain families so as to aid functional prediction. The structural annotation is generated using HMM models based on the CATH domain families; CATH is a repository for manually deduced protein domains. Amongst the changes from the last publication are: the addition of over 100 genomes and the UniProt sequence database, domain data from Pfam, metabolic pathway and functional data from COGs, KEGG and GO, and protein-protein interaction data from MINT and BIND. The website has been rebuilt to allow more sophisticated querying and the data returned is presented in a clearer format with greater functionality. Furthermore, all data can be downloaded in a simple XML format, allowing users to carry out complex investigations at their own computers.

  4. Automatic annotation of protein motif function with Gene Ontology terms

    PubMed Central

    Lu, Xinghua; Zhai, Chengxiang; Gopalakrishnan, Vanathi; Buchanan, Bruce G

    2004-01-01

    Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO) project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paperpresents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifsis viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association isfound to be a very useful feature. We take advantageof the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correctassociation. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about thefunctions of newly discovered candidate protein motifs. PMID:15345032

  5. KEGG as a reference resource for gene and protein annotation

    PubMed Central

    Kanehisa, Minoru; Sato, Yoko; Kawashima, Masayuki; Furumichi, Miho; Tanabe, Mao

    2016-01-01

    KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an integrated database resource for biological interpretation of genome sequences and other high-throughput data. Molecular functions of genes and proteins are associated with ortholog groups and stored in the KEGG Orthology (KO) database. The KEGG pathway maps, BRITE hierarchies and KEGG modules are developed as networks of KO nodes, representing high-level functions of the cell and the organism. Currently, more than 4000 complete genomes are annotated with KOs in the KEGG GENES database, which can be used as a reference data set for KO assignment and subsequent reconstruction of KEGG pathways and other molecular networks. As an annotation resource, the following improvements have been made. First, each KO record is re-examined and associated with protein sequence data used in experiments of functional characterization. Second, the GENES database now includes viruses, plasmids, and the addendum category for functionally characterized proteins that are not represented in complete genomes. Third, new automatic annotation servers, BlastKOALA and GhostKOALA, are made available utilizing the non-redundant pangenome data set generated from the GENES database. As a resource for translational bioinformatics, various data sets are created for antimicrobial resistance and drug interaction networks. PMID:26476454

  6. KEGG as a reference resource for gene and protein annotation.

    PubMed

    Kanehisa, Minoru; Sato, Yoko; Kawashima, Masayuki; Furumichi, Miho; Tanabe, Mao

    2016-01-04

    KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an integrated database resource for biological interpretation of genome sequences and other high-throughput data. Molecular functions of genes and proteins are associated with ortholog groups and stored in the KEGG Orthology (KO) database. The KEGG pathway maps, BRITE hierarchies and KEGG modules are developed as networks of KO nodes, representing high-level functions of the cell and the organism. Currently, more than 4000 complete genomes are annotated with KOs in the KEGG GENES database, which can be used as a reference data set for KO assignment and subsequent reconstruction of KEGG pathways and other molecular networks. As an annotation resource, the following improvements have been made. First, each KO record is re-examined and associated with protein sequence data used in experiments of functional characterization. Second, the GENES database now includes viruses, plasmids, and the addendum category for functionally characterized proteins that are not represented in complete genomes. Third, new automatic annotation servers, BlastKOALA and GhostKOALA, are made available utilizing the non-redundant pangenome data set generated from the GENES database. As a resource for translational bioinformatics, various data sets are created for antimicrobial resistance and drug interaction networks. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Epigenetic engineering of ribosomal RNA genes enhances protein production.

    PubMed

    Santoro, Raffaella; Lienemann, Philipp; Fussenegger, Martin

    2009-08-14

    Selection of mammalian high-producer cell lines remains a major challenge for the biopharmaceutical manufacturing industry. Ribosomal RNA (rRNA) genes encode the major component of the ribosome but many rRNA gene copies are not transcribed due to epigenetic silencing by the nucleolar remodelling complex (NoRC) [6], which may limit the cell's full production capacity. Here we show that the knockdown of TIP5, a subunit of NoRC, decreases the number of silent rRNA genes, upregulates rRNA transcription, enhances ribosome synthesis and increases production of recombinant proteins. However, general enhancement of rRNA transcription rate did not stimulate protein synthesis. Our data demonstrates that the number of transcriptionally competent rRNA genes limits efficient ribosome synthesis. Epigenetic engineering of ribosomal RNA genes offers new possibilities for improving biopharmaceutical manufacturing and provides novel insights into the complex regulatory network which governs the translation machinery in normal cellular processes as well as in pathological conditions like cancer.

  8. Retroviral display in gene therapy, protein engineering, and vaccine development.

    PubMed

    Urban, Johannes H; Merten, Christoph A

    2011-01-21

    The display and analysis of proteins expressed on biological surfaces has become an attractive tool for the study of molecular interactions in enzymology, protein engineering, and high-throughput screening. Among the growing number of established display systems, retroviruses offer a unique and fully mammalian platform for the expression of correctly folded and post-translationally modified proteins in the context of cell plasma membrane-derived particles. This is of special interest for therapeutic applications such as gene therapy and vaccine development and also offers advantages for the engineering of mammalian proteins toward customized binding affinities and catalytic activities. This review critically summarizes the basic concepts and applications of retroviral display and analyses its benefits in comparison to other display techniques.

  9. Analysis of incomplete gene expression dataset through protein-protein interaction information.

    PubMed

    Massanet-Vila, Raimon; Padró, Teresa; Cardús, Anna; Badimon, Lina; Caminal, Pere; Perera, Alexandre

    2011-01-01

    This paper shows a graph based method to analyze proteomic expression data. The method allows the prediction of the expression of genes not measured by the gene expression technology based on the local connectivity properties of the measured differentially expressed gene set. The prediction of the expression jointly with the stability of this prediction as a function of the variation of the initial expressed set is computed. The method is able to correctly predict one third of the proteins with independence of variations on the selection of the initial set. The algorithm is validated through a Matrix-Assisted Laser Desorption/Ionization Time of Flight Mass Spectrometer (MALDI-TOF) protein expression experiment aiming the study of the protein expression patterns and post-translational modifications in human endothelial vascular cells exposed to atherosclerotic levels of Low Density Lipoproteins (LDL).

  10. Comparative Studies of Vertebrate Beta Integrin Genes and Proteins: Ancient Genes in Vertebrate Evolution

    PubMed Central

    Holmes, Roger S.; Rout, Ujjwal K.

    2011-01-01

    Intregins are heterodimeric α- and β-subunit containing membrane receptor proteins which serve various cell adhesion roles in tissue repair, hemostasis, immune response, embryogenesis and metastasis. At least 18 α- (ITA or ITGA) and 8 β-integrin subunits (ITB or ITGB) are encoded on mammalian genomes. Comparative ITB amino acid sequences and protein structures and ITB gene locations were examined using data from several vertebrate genome projects. Vertebrate ITB genes usually contained 13–16 coding exons and encoded protein subunits with ∼800 amino acids, whereas vertebrate ITB4 genes contained 36-39 coding exons and encoded larger proteins with ∼1800 amino acids. The ITB sequences exhibited several conserved domains including signal peptide, extracellular β-integrin, β-tail domain and integrin β-cytoplasmic domains. Sequence alignments of the integrin β-cytoplasmic domains revealed highly conserved regions possibly for performing essential functions and its maintenance during vertebrate evolution. With the exception of the human ITB8 sequence, the other ITB sequences shared a predicted 19 residue α-helix for this region. Potential sites for regulating human ITB gene expression were identified which included CpG islands, transcription factor binding sites and microRNA binding sites within the 3′-UTR of human ITB genes. Phylogenetic analyses examined the relationships of vertebrate beta-integrin genes which were consistent with four major groups: 1: ITB1, ITB2, ITB7; 2: ITB3, ITB5, ITB6; 3: ITB4; and 4: ITB8 and a common evolutionary origin from an ancestral gene, prior to the appearance of fish during vertebrate evolution. The phylogenetic analyses revealed that ITB4 is the most likely primordial form of the vertebrate β integrin subunit encoding genes, that is the only β subunit expressed as a constituent of the sole integrin receptor ‘α6β4’ in the hemidesmosomes of unicellular organisms. PMID:24970121

  11. Cloning and characterization of a novel CoA-ligase gene from Penicillium chrysogenum.

    PubMed

    Yu, Zhou-Liang; Liu, Jing; Wang, Fu-Qiang; Dai, Meng; Zhao, Bao-Hua; He, Jian-Gong; Zhang, Hua

    2011-05-01

    A novel phenylacetic acid (PAA)-induced CoA-ligase-encoding gene, designated as phlC, has been cloned from penicillin-producing fungus Penicillium chrysogenum. The open reading frame of phlC cDNA was 1671 bp and encoded a 556 amino acid residues protein with the consensus AMP binding site and a peroxisomal targeting signal 1 on its C terminus. The deduced amino acid sequence showed 37% and 38% identity with characterized P. chrysogenum Phl and PhlB protein, respectively. Functional recombinant PhlC protein was overexpressed in Escherichia coli. The purified recombinant enzyme was capable to convert PAA into its corresponding CoA ester with a specific activity of 129.5 ± 3.026 pmol/min per mg protein. Similar to Phl and PhlB, PhlC displayed broad substrate spectrum and showed higher activities to medium- and long-chain fatty acids. The catalytic properties of PhlC have been determined and compared to those of Phl and PhlB.

  12. Genes and proteins of Escherichia coli (GenProtEc).

    PubMed

    Riley, M; Space, D B

    1996-01-01

    GenProtEc is a database of Escherichia coli genes and their gene products, classified by type of function and physiological role and with citations to the literature for each. Also present are data on sequence similarities among E.coli proteins with PAM values, percent identity of amino acids, length of alignment and percent aligned. The database is available as a PKZip file by ftp from mbl.edu/pub/ecoli.exe. The program runs under MS-DOS on IMB-compatible machines. GenProtEc can also be accessed through the World Wide Web at URL http://mbl.edu/html/ecoli.html.

  13. Genes and proteins of Escherichia coli (GenProtEc).

    PubMed Central

    Riley, M; Space, D B

    1996-01-01

    GenProtEc is a database of Escherichia coli genes and their gene products, classified by type of function and physiological role and with citations to the literature for each. Also present are data on sequence similarities among E.coli proteins with PAM values, percent identity of amino acids, length of alignment and percent aligned. The database is available as a PKZip file by ftp from mbl.edu/pub/ecoli.exe. The program runs under MS-DOS on IMB-compatible machines. GenProtEc can also be accessed through the World Wide Web at URL http://mbl.edu/html/ecoli.html. PMID:8594596

  14. RNA Binding Proteins Posttranscriptionally Regulate Genes Involved In Oncogenesis

    DTIC Science & Technology

    2010-06-01

    Cloning and characterization of HuR, a ubiquitously expressed Elav-like protein . J Biol Chem 1996, 271(14):8144-8151. 21. Meisner NC, Hackermuller J...Hauptmann S: Expression of the ELAV-like protein HuR is associated with higher tumor grade and increased cyclooxygenase-2 expression in human breast...SH3 domain, ankyrin repeat and pH domain 3 tumor microarray reveals 47 annotated genes up regulated in the HA-HuR overexpressing tumors as compared to

  15. An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology.

    PubMed

    Jain, Shobhit; Bader, Gary D

    2010-11-15

    Semantic similarity measures are useful to assess the physiological relevance of protein-protein interactions (PPIs). They quantify similarity between proteins based on their function using annotation systems like the Gene Ontology (GO). Proteins that interact in the cell are likely to be in similar locations or involved in similar biological processes compared to proteins that do not interact. Thus the more semantically similar the gene function annotations are among the interacting proteins, more likely the interaction is physiologically relevant. However, most semantic similarity measures used for PPI confidence assessment do not consider the unequal depth of term hierarchies in different classes of cellular location, molecular function, and biological process ontologies of GO and thus may over-or under-estimate similarity. We describe an improved algorithm, Topological Clustering Semantic Similarity (TCSS), to compute semantic similarity between GO terms annotated to proteins in interaction datasets. Our algorithm, considers unequal depth of biological knowledge representation in different branches of the GO graph. The central idea is to divide the GO graph into sub-graphs and score PPIs higher if participating proteins belong to the same sub-graph as compared to if they belong to different sub-graphs. The TCSS algorithm performs better than other semantic similarity measurement techniques that we evaluated in terms of their performance on distinguishing true from false protein interactions, and correlation with gene expression and protein families. We show an average improvement of 4.6 times the F1 score over Resnik, the next best method, on our Saccharomyces cerevisiae PPI dataset and 2 times on our Homo sapiens PPI dataset using cellular component, biological process and molecular function GO annotations.

  16. An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology

    PubMed Central

    2010-01-01

    Background Semantic similarity measures are useful to assess the physiological relevance of protein-protein interactions (PPIs). They quantify similarity between proteins based on their function using annotation systems like the Gene Ontology (GO). Proteins that interact in the cell are likely to be in similar locations or involved in similar biological processes compared to proteins that do not interact. Thus the more semantically similar the gene function annotations are among the interacting proteins, more likely the interaction is physiologically relevant. However, most semantic similarity measures used for PPI confidence assessment do not consider the unequal depth of term hierarchies in different classes of cellular location, molecular function, and biological process ontologies of GO and thus may over-or under-estimate similarity. Results We describe an improved algorithm, Topological Clustering Semantic Similarity (TCSS), to compute semantic similarity between GO terms annotated to proteins in interaction datasets. Our algorithm, considers unequal depth of biological knowledge representation in different branches of the GO graph. The central idea is to divide the GO graph into sub-graphs and score PPIs higher if participating proteins belong to the same sub-graph as compared to if they belong to different sub-graphs. Conclusions The TCSS algorithm performs better than other semantic similarity measurement techniques that we evaluated in terms of their performance on distinguishing true from false protein interactions, and correlation with gene expression and protein families. We show an average improvement of 4.6 times the F1 score over Resnik, the next best method, on our Saccharomyces cerevisiae PPI dataset and 2 times on our Homo sapiens PPI dataset using cellular component, biological process and molecular function GO annotations. PMID:21078182

  17. PPISEARCHENGINE: gene ontology-based search for protein-protein interactions.

    PubMed

    Park, Byungkyu; Cui, Guangyu; Lee, Hyunjin; Huang, De-Shuang; Han, Kyungsook

    2013-01-01

    This paper presents a new search engine called PPISearchEngine which finds protein-protein interactions (PPIs) using the gene ontology (GO) and the biological relations of proteins. For efficient retrieval of PPIs, each GO term is assigned a prime number and the relation between the terms is represented by the product of prime numbers. This representation is hidden from users but facilitates the search for the interactions of a query protein by unique prime factorisation of the number that represents the query protein. For a query protein, PPISearchEngine considers not only the GO term associated with the query protein but also the GO terms at the lower level than the GO term in the GO hierarchy, and finds all the interactions of the query protein which satisfy the search condition. In contrast, the standard keyword-matching or ID-matching search method cannot find the interactions of a protein unless the interactions involve a protein with explicit annotations. To the best of our knowledge, this search engine is the first method that can process queries like 'for protein p with GO [Formula: see text], find p's interaction partners with GO [Formula: see text]'. PPISearchEngine is freely available to academics at http://search.hpid.org/.

  18. Controlling for Gene Expression Changes in Transcription Factor Protein Networks*

    PubMed Central

    Banks, Charles A. S.; Lee, Zachary T.; Boanca, Gina; Lakshminarasimhan, Mahadevan; Groppe, Brad D.; Wen, Zhihui; Hattem, Gaye L.; Seidel, Chris W.; Florens, Laurence; Washburn, Michael P.

    2014-01-01

    The development of affinity purification technologies combined with mass spectrometric analysis of purified protein mixtures has been used both to identify new protein–protein interactions and to define the subunit composition of protein complexes. Transcription factor protein interactions, however, have not been systematically analyzed using these approaches. Here, we investigated whether ectopic expression of an affinity tagged transcription factor as bait in affinity purification mass spectrometry experiments perturbs gene expression in cells, resulting in the false positive identification of bait-associated proteins when typical experimental controls are used. Using quantitative proteomics and RNA sequencing, we determined that the increase in the abundance of a set of proteins caused by overexpression of the transcription factor RelA is not sufficient for these proteins to then co-purify non-specifically and be misidentified as bait-associated proteins. Therefore, typical controls should be sufficient, and a number of different baits can be compared with a common set of controls. This is of practical interest when identifying bait interactors from a large number of different baits. As expected, we found several known RelA interactors enriched in our RelA purifications (NFκB1, NFκB2, Rel, RelB, IκBα, IκBβ, and IκBε). We also found several proteins not previously described in association with RelA, including the small mitochondrial chaperone Tim13. Using a variety of biochemical approaches, we further investigated the nature of the association between Tim13 and NFκB family transcription factors. This work therefore provides a conceptual and experimental framework for analyzing transcription factor protein interactions. PMID:24722732

  19. Ribozymes, riboswitches and beyond: regulation of gene expression without proteins

    PubMed Central

    Serganov, Alexander; Patel, Dinshaw J.

    2015-01-01

    Although various functions of RNA are carried out in conjunction with proteins, some catalytic RNAs, or ribozymes, which contribute to a range of cellular processes, require little or no assistance from proteins. Furthermore, the discovery of metabolite-sensing riboswitches and other types of RNA sensors has revealed RNA-based mechanisms that cells use to regulate gene expression in response to internal and external changes. Structural studies have shown how these RNAs can carry out a range of functions. In addition, the contribution of ribozymes and riboswitches to gene expression is being revealed as far more widespread than was previously appreciated. These findings have implications for understanding how cellular functions might have evolved from RNA-based origins. PMID:17846637

  20. Expression data on liver metabolic pathway genes and proteins

    PubMed Central

    Raja Gopal Reddy, Mooli; Pavan Kumar, Chodisetti; Mahesh, Malleswarapu; Sravan Kumar, Manchiryala; Jeyakumar, Shanmugam M.

    2016-01-01

    Here, we present the expression data on various metabolic pathways of liver with special emphasize on lipid and carbohydrate metabolism and long chain polyunsaturated fatty acid (PUFA) synthesis, both at gene and protein levels. The data were obtained to understand the effect of vitamin A deficiency on the expression status (both gene and protein levels) of some of the key factors involved in lipogenesis, fatty acid oxidation, triglyceride secretion, long chain PUFA, resolvin D1 synthesis, glucose transport and glycogen synthesis of liver, using modern biology tools, such as quantitative real-time PCR (RT-PCR) and immunoblotting techniques. This data article provides the supporting evidence to the article “Vitamin A deficiency suppresses high fructose-induced triglyceride synthesis and elevates resolvin D1 levels” [1] and therefore, these data may be referred back, for comprehensive understanding and interpretations and for future studies. PMID:26909377

  1. Common and specific signatures of gene expression and protein-protein interactions in autoimmune diseases.

    PubMed

    Tuller, T; Atar, S; Ruppin, E; Gurevich, M; Achiron, A

    2013-03-01

    The aim of this study is to understand intracellular regulatory mechanisms in peripheral blood mononuclear cells (PBMCs), which are either common to many autoimmune diseases or specific to some of them. We incorporated large-scale data such as protein-protein interactions, gene expression and demographical information of hundreds of patients and healthy subjects, related to six autoimmune diseases with available large-scale gene expression measurements: multiple sclerosis (MS), systemic lupus erythematosus (SLE), juvenile rheumatoid arthritis (JRA), Crohn's disease (CD), ulcerative colitis (UC) and type 1 diabetes (T1D). These data were analyzed concurrently by statistical and systems biology approaches tailored for this purpose. We found that chemokines such as CXCL1-3, 5, 6 and the interleukin (IL) IL8 tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In addition, the anti-apoptotic gene BCL3, interferon-γ (IFNG), and the vitamin D receptor (VDR) gene physically interact with significantly many genes that tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In general, similar cellular processes tend to be differentially expressed in PBMC in the analyzed autoimmune diseases. Specifically, the cellular processes related to cell proliferation (for example, epidermal growth factor, platelet-derived growth factor, nuclear factor-κB, Wnt/β-catenin signaling, stress-activated protein kinase c-Jun NH2-terminal kinase), inflammatory response (for example, interleukins IL2 and IL6, the cytokine granulocyte-macrophage colony-stimulating factor and the B-cell receptor), general signaling cascades (for example, mitogen-activated protein kinase, extracellular signal-regulated kinase, p38 and TRK) and apoptosis are activated in most of the analyzed autoimmune diseases. However, our results suggest that in each of the analyzed diseases, apoptosis and chemotaxis are activated via

  2. Evolution of Drosophila ribosomal protein gene core promoters

    PubMed Central

    Ma, Xiaotu; Zhang, Kangyu; Li, Xiaoman

    2011-01-01

    The coordinated expression of ribosomal protein genes (RPGs) has been well documented in many species. Previous analyses of RPG promoters focus only on Fungi and mammals. Recognizing this gap and using a comparative genomics approach, we utilize a motif-finding algorithm that incorporates cross-species conservation to identify several significant motifs in Drosophila RPG promoters. As a result, significant differences of the enriched motifs in RPG promoter are found among Drosophila, Fungi, and mammals, demonstrating the evolutionary dynamics of the ribosomal gene regulatory network. We also report a motif present in similar numbers of RPGs among Drosophila species which does not appear to be conserved at the individual RPG gene level. A module-wise stabilizing selection theory is proposed to explain this observation. Overall, our results provide significant insight into the fast-evolving nature of transcriptional regulation in the RPG module. PMID:19059316

  3. Evolution of Drosophila ribosomal protein gene core promoters.

    PubMed

    Ma, Xiaotu; Zhang, Kangyu; Li, Xiaoman

    2009-03-01

    The coordinated expression of ribosomal protein genes (RPGs) has been well documented in many species. Previous analyses of RPG promoters focus only on Fungi and mammals. Recognizing this gap and using a comparative genomics approach, we utilize a motif-finding algorithm that incorporates cross-species conservation to identify several significant motifs in Drosophila RPG promoters. As a result, significant differences of the enriched motifs in RPG promoter are found among Drosophila, Fungi, and mammals, demonstrating the evolutionary dynamics of the ribosomal gene regulatory network. We also report a motif present in similar numbers of RPGs among Drosophila species which does not appear to be conserved at the individual RPG gene level. A module-wise stabilizing selection theory is proposed to explain this observation. Overall, our results provide significant insight into the fast-evolving nature of transcriptional regulation in the RPG module.

  4. Rational design of orthogonal libraries of protein coding genes.

    PubMed

    Ryan, Daniel; Papamichail, Dimitris

    2013-05-17

    Array-based oligonucleotide synthesis technologies provide access to thousands of custom-designed sequence variants at low cost. Large-scale synthesis and high-throughput assays have become valuable experimental tools to study in detail the interplay between sequence and function. We have developed a methodology and corresponding algorithms for the design of diverse protein coding gene libraries, to exploit the potential of multiplex synthesis and help elucidate the effects of codon utilization and other factors in gene expression. Using our algorithm, we have computationally designed gene libraries with hundreds to thousands of orthogonal codon usage variants, uniformly exploring the design space of codon utilization, while demanding only a small fraction of the synthesis cost that would be required if these variants were synthesized independently.

  5. Prolactin receptor and signal transduction to milk protein genes

    SciTech Connect

    Djiane, J.; Daniel, N.; Bignon, C.

    1994-06-01

    After cloning of the mammary gland prolactin (PRL) receptor cDNA, a functional assay was established using co-transfection of PRL receptor cDNA together with a milk protein promoter/chloramphenicol acetyl transferase (CAT) construct in Chinese hamster ovary (CHO) cells. Different mutants of the PRL receptor were tested in this CAT assay to delimit the domains in the receptor necessary for signal transduction to milk protein genes. In CHO cells stably transfected with PRL receptor cDNA, high numbers of PRL receptor are expressed. By metabolic labeling and immunoprecipitation, expressed PRL receptor was identified as a single species of 100 kDa. Using these cells, we analyzed the effects of PRL on intracellular free Ca{sup ++} concentration. PRL stimulates Ca{sup ++} entry and induces secondary Ca{sup ++} mobilization. The entry of Ca{sup ++} is a result of an increase in K{sup +} conductance that hyperpolarizes the membranes. We have also analyzed tyrosine phosphorylation induced by PRL. In CHO cells stably transfected with PRL receptor cDNA, PRL induced a very rapid and transient tyrosine phosphorylation of a 100-kDa protein which is most probably the PRL receptor. The same finding was obtained in mammary membranes after PRL injection to lactating rabbits. Whereas tyrosine kinase inhibitors genistein and lavendustin were without effect, PRL stimulation of milk protein gene promoters was partially inhibited by 2 {mu}M herbimycin in CHO cells co-transfected with PRL receptor cDNA and the {Beta} lactoglobulin CAT construct. Taken together these observations indicate that the cytoplasmic domain of the PRL receptor interacts with one or several tyrosine kinases, which may represent early postreceptor events necessary for PRL signal transduction to milk protein genes. 14 refs., 4 figs.

  6. Multidrug resistance protein gene expression in Trichoplusia ni caterpillars.

    PubMed

    Simmons, Jason; D'Souza, Olivia; Rheault, Mark; Donly, Cam

    2013-02-01

    Many insect species exhibit pesticide-resistant phenotypes. One of the mechanisms capable of contributing to resistance is the overexpression of multidrug resistance (MDR) transporter proteins. Here we describe the cloning of three genes encoding MDR proteins from Trichoplusia ni: trnMDR1, trnMDR2 and trnMDR3. Real-time quantitative PCR (qPCR) detected trnMDR mRNA in the whole nervous system, midgut and Malpighian tubules of final instar T. ni caterpillars. To test whether these genes are upregulated in response to chemical challenge in this insect, qPCR was used to compare trnMDR mRNA levels in unchallenged insects with those of insects fed the synthetic pyrethroid, deltamethrin. Only limited increases were detected in a single gene, trnMDR2, which is the most weakly expressed of the three MDR genes, suggesting that increased multidrug resistance of this type is not a significant part of the response to deltamethrin exposure.

  7. Protein-protein interaction inference based on semantic similarity of Gene Ontology terms.

    PubMed

    Zhang, Shu-Bo; Tang, Qiang-Rong

    2016-07-21

    Identifying protein-protein interactions is important in molecular biology. Experimental methods to this issue have their limitations, and computational approaches have attracted more and more attentions from the biological community. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most powerful indicators for protein interaction. However, conventional methods based on GO similarity fail to take advantage of the specificity of GO terms in the ontology graph. We proposed a GO-based method to predict protein-protein interaction by integrating different kinds of similarity measures derived from the intrinsic structure of GO graph. We extended five existing methods to derive the semantic similarity measures from the descending part of two GO terms in the GO graph, then adopted a feature integration strategy to combines both the ascending and the descending similarity scores derived from the three sub-ontologies to construct various kinds of features to characterize each protein pair. Support vector machines (SVM) were employed as discriminate classifiers, and five-fold cross validation experiments were conducted on both human and yeast protein-protein interaction datasets to evaluate the performance of different kinds of integrated features, the experimental results suggest the best performance of the feature that combines information from both the ascending and the descending parts of the three ontologies. Our method is appealing for effective prediction of protein-protein interaction.

  8. Computational codon optimization of synthetic gene for protein expression

    PubMed Central

    2012-01-01

    Background The construction of customized nucleic acid sequences allows us to have greater flexibility in gene design for recombinant protein expression. Among the various parameters considered for such DNA sequence design, individual codon usage (ICU) has been implicated as one of the most crucial factors affecting mRNA translational efficiency. However, previous works have also reported the significant influence of codon pair usage, also known as codon context (CC), on the level of protein expression. Results In this study, we have developed novel computational procedures for evaluating the relative importance of optimizing ICU and CC for enhancing protein expression. By formulating appropriate mathematical expressions to quantify the ICU and CC fitness of a coding sequence, optimization procedures based on genetic algorithm were employed to maximize its ICU and/or CC fitness. Surprisingly, the in silico validation of the resultant optimized DNA sequences for Escherichia coli, Lactococcus lactis, Pichia pastoris and Saccharomyces cerevisiae suggests that CC is a more relevant design criterion than the commonly considered ICU. Conclusions The proposed CC optimization framework can complement and enhance the capabilities of current gene design tools, with potential applications to heterologous protein production and even vaccine development in synthetic biotechnology. PMID:23083100

  9. Interferon-stimulated gene 15 and the protein ISGylation system.

    PubMed

    Zhang, Dongxian; Zhang, Dong-Er

    2011-01-01

    Interferon-stimulated gene 15 (ISG15) is one of the most upregulated genes upon Type I interferon treatment or pathogen infection. Its 17  kDa protein product, ISG15, was the first ubiquitin-like modifier identified, and is similar to a ubiquitin linear dimer. As ISG15 modifies proteins in a similar manner to ubiquitylation, protein conjugation by ISG15 is termed ISGylation. Some of the primary enzymes that promote ISGylation are also involved in ubiquitin conjugation. The process to remove ISG15 from its conjugated proteins, termed de-ISGylation, is performed by a cellular ISG15-specific protease, ubiquitin-specific proteases with molecular mass 43 kDa (UBP43)/ubiquitin-specific proteases 18. Relative to ubiquitin, the biological function of ISG15 is still poorly understood, but ISG15 appears to play important roles in various biological and cellular functions. Therefore, there is growing interest in ISG15, as the study of free ISG15 and functional consequences of ISGylation/de-ISGylation may identify useful therapeutic targets. This review highlights recent discoveries and remaining questions important to understanding the biological functions of ISG15.

  10. Analysis of the prion protein gene in multiple system atrophy.

    PubMed

    Chelban, Viorica; Manole, Andreea; Pihlstrøm, Lasse; Schottlaender, Lucia; Efthymiou, Stephanie; OConnor, Emer; Meissner, Wassilios G; Holton, Janice L; Houlden, Henry

    2017-01-01

    Neurodegenerative diseases are a very diverse group of disorders but they share some common mechanisms such as abnormally misfolded proteins with prion-like propagation and aggregation. Creutzfeldt-Jakob disease (CJD) is the most prevalent prion disease in humans. In the sporadic form of CJD the only known risk factor is the codon 129 polymorphism. Recent reports suggested that α-synuclein in multiple system atrophy (MSA) has similar pathogenic mechanisms as the prion protein. Here we present 1 Italian family with MSA and prion disease. Also, cases of concurrent MSA and prion pathology in the same individual or family suggest the possibility of molecular interaction between prion protein and α-synuclein in the process of protein accumulation and neurodegeneration, warranting further investigations. We assessed the PRNP gene by whole-exome sequencing in 264 pathologically confirmed MSA cases and 462 healthy controls to determine whether the 2 diseases share similar risk factors. We then analyzed codon 129 polymorphism by Sanger sequencing and compared with previously published results in sporadic CJD. Homozygosity at codon 129 was present in 50% of pathologically confirmed MSA cases and in 58% of normal controls (odds ratio, 0.7 (95% confidence interval of 0.5-0.9)) compared with 88.2% in sporadic CJD. Our data show that the homozygous state of position 129 in the PRNP is not a risk factor for MSA. No other variants in the PRNP gene were associated with increased risk for MSA.

  11. Next Generation Delivery System for Proteins and Genes of Therapeutic Purpose: Why and How?

    PubMed Central

    Priya Doss, C. George; Lee, Sang-Soo

    2014-01-01

    Proteins and genes of therapeutic interests in conjunction with different delivery systems are growing towards new heights. “Next generation delivery systems” may provide more efficient platform for delivery of proteins and genes. In the present review, snapshots about the benefits of proteins or gene therapy, general procedures for therapeutic protein or gene delivery system, and different next generation delivery system such as liposome, PEGylation, HESylation, and nanoparticle based delivery have been depicted with their detailed explanation. PMID:25126554

  12. Genes encoding calmodulin-binding proteins in the Arabidopsis genome

    NASA Technical Reports Server (NTRS)

    Reddy, Vaka S.; Ali, Gul S.; Reddy, Anireddy S N.

    2002-01-01

    Analysis of the recently completed Arabidopsis genome sequence indicates that approximately 31% of the predicted genes could not be assigned to functional categories, as they do not show any sequence similarity with proteins of known function from other organisms. Calmodulin (CaM), a ubiquitous and multifunctional Ca(2+) sensor, interacts with a wide variety of cellular proteins and modulates their activity/function in regulating diverse cellular processes. However, the primary amino acid sequence of the CaM-binding domain in different CaM-binding proteins (CBPs) is not conserved. One way to identify most of the CBPs in the Arabidopsis genome is by protein-protein interaction-based screening of expression libraries with CaM. Here, using a mixture of radiolabeled CaM isoforms from Arabidopsis, we screened several expression libraries prepared from flower meristem, seedlings, or tissues treated with hormones, an elicitor, or a pathogen. Sequence analysis of 77 positive clones that interact with CaM in a Ca(2+)-dependent manner revealed 20 CBPs, including 14 previously unknown CBPs. In addition, by searching the Arabidopsis genome sequence with the newly identified and known plant or animal CBPs, we identified a total of 27 CBPs. Among these, 16 CBPs are represented by families with 2-20 members in each family. Gene expression analysis revealed that CBPs and CBP paralogs are expressed differentially. Our data suggest that Arabidopsis has a large number of CBPs including several plant-specific ones. Although CaM is highly conserved between plants and animals, only a few CBPs are common to both plants and animals. Analysis of Arabidopsis CBPs revealed the presence of a variety of interesting domains. Our analyses identified several hypothetical proteins in the Arabidopsis genome as CaM targets, suggesting their involvement in Ca(2+)-mediated signaling networks.

  13. Genes encoding calmodulin-binding proteins in the Arabidopsis genome

    NASA Technical Reports Server (NTRS)

    Reddy, Vaka S.; Ali, Gul S.; Reddy, Anireddy S N.

    2002-01-01

    Analysis of the recently completed Arabidopsis genome sequence indicates that approximately 31% of the predicted genes could not be assigned to functional categories, as they do not show any sequence similarity with proteins of known function from other organisms. Calmodulin (CaM), a ubiquitous and multifunctional Ca(2+) sensor, interacts with a wide variety of cellular proteins and modulates their activity/function in regulating diverse cellular processes. However, the primary amino acid sequence of the CaM-binding domain in different CaM-binding proteins (CBPs) is not conserved. One way to identify most of the CBPs in the Arabidopsis genome is by protein-protein interaction-based screening of expression libraries with CaM. Here, using a mixture of radiolabeled CaM isoforms from Arabidopsis, we screened several expression libraries prepared from flower meristem, seedlings, or tissues treated with hormones, an elicitor, or a pathogen. Sequence analysis of 77 positive clones that interact with CaM in a Ca(2+)-dependent manner revealed 20 CBPs, including 14 previously unknown CBPs. In addition, by searching the Arabidopsis genome sequence with the newly identified and known plant or animal CBPs, we identified a total of 27 CBPs. Among these, 16 CBPs are represented by families with 2-20 members in each family. Gene expression analysis revealed that CBPs and CBP paralogs are expressed differentially. Our data suggest that Arabidopsis has a large number of CBPs including several plant-specific ones. Although CaM is highly conserved between plants and animals, only a few CBPs are common to both plants and animals. Analysis of Arabidopsis CBPs revealed the presence of a variety of interesting domains. Our analyses identified several hypothetical proteins in the Arabidopsis genome as CaM targets, suggesting their involvement in Ca(2+)-mediated signaling networks.

  14. Genes encoding calmodulin-binding proteins in the Arabidopsis genome.

    PubMed

    Reddy, Vaka S; Ali, Gul S; Reddy, Anireddy S N

    2002-03-22

    Analysis of the recently completed Arabidopsis genome sequence indicates that approximately 31% of the predicted genes could not be assigned to functional categories, as they do not show any sequence similarity with proteins of known function from other organisms. Calmodulin (CaM), a ubiquitous and multifunctional Ca(2+) sensor, interacts with a wide variety of cellular proteins and modulates their activity/function in regulating diverse cellular processes. However, the primary amino acid sequence of the CaM-binding domain in different CaM-binding proteins (CBPs) is not conserved. One way to identify most of the CBPs in the Arabidopsis genome is by protein-protein interaction-based screening of expression libraries with CaM. Here, using a mixture of radiolabeled CaM isoforms from Arabidopsis, we screened several expression libraries prepared from flower meristem, seedlings, or tissues treated with hormones, an elicitor, or a pathogen. Sequence analysis of 77 positive clones that interact with CaM in a Ca(2+)-dependent manner revealed 20 CBPs, including 14 previously unknown CBPs. In addition, by searching the Arabidopsis genome sequence with the newly identified and known plant or animal CBPs, we identified a total of 27 CBPs. Among these, 16 CBPs are represented by families with 2-20 members in each family. Gene expression analysis revealed that CBPs and CBP paralogs are expressed differentially. Our data suggest that Arabidopsis has a large number of CBPs including several plant-specific ones. Although CaM is highly conserved between plants and animals, only a few CBPs are common to both plants and animals. Analysis of Arabidopsis CBPs revealed the presence of a variety of interesting domains. Our analyses identified several hypothetical proteins in the Arabidopsis genome as CaM targets, suggesting their involvement in Ca(2+)-mediated signaling networks.

  15. Molecular evolution of monotreme and marsupial whey acidic protein genes.

    PubMed

    Sharp, Julie A; Lefèvre, Christophe; Nicholas, Kevin R

    2007-01-01

    Whey acidic protein (WAP), a major whey protein present in milk of a number of mammalian species has characteristic cysteine-rich domains known as four-disulfide cores (4-DSC). Eutherian WAP, expressed in the mammary gland throughout lactation, has two 4-DSC domains, (DI-DII) whereas marsupial WAP, expressed only during mid-late lactation, contains an additional 4-DSC (DIII), and has a DIII-D1-DII configuration. We report the expression and evolution of echidna (Tachyglossus aculeatus) and platypus (Onithorhynchus anatinus) WAP cDNAs. Predicted translation of monotreme cDNAs showed echidna WAP contains two 4-DSC domains corresponding to DIII-DII, whereas platypus WAP contains an additional domain at the C-terminus with homology to DII and has the configuration DIII-DII-DII. Both monotreme WAPs represent new WAP protein configurations. We propose models for evolution of the WAP gene in the mammalian lineage either through exon loss from an ancient ancestor or by rapid evolution via the process of exon shuffling. This evolutionary outcome may reflect differences in lactation strategy between marsupials, monotremes, and eutherians, and give insight to biological function of the gene products. WAP four-disulfide core domain 2 (WFDC2) proteins were also identified in echidna, platypus and tammar wallaby (Macropus eugenii) lactating mammary cells. WFDC2 proteins are secreted proteins not previously associated with lactation. Mammary gland expression of tammar WFDC2 during the course of lactation showed WFDC2 was elevated during pregnancy, reduced in early lactation and absent in mid-late lactation.

  16. Direct protein interaction underlies gene-for-gene specificity and coevolution of the flax resistance genes and flax rust avirulence genes.

    PubMed

    Dodds, Peter N; Lawrence, Gregory J; Catanzariti, Ann-Maree; Teh, Trazel; Wang, Ching-I A; Ayliffe, Michael A; Kobe, Bostjan; Ellis, Jeffrey G

    2006-06-06

    Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R-Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R-Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R-Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant-pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes.

  17. Integrating gene synthesis and microfluidic protein analysis for rapid protein engineering

    PubMed Central

    Blackburn, Matthew C.; Petrova, Ekaterina; Correia, Bruno E.; Maerkl, Sebastian J.

    2016-01-01

    The capability to rapidly design proteins with novel functions will have a significant impact on medicine, biotechnology and synthetic biology. Synthetic genes are becoming a commodity, but integrated approaches have yet to be developed that take full advantage of gene synthesis. We developed a solid-phase gene synthesis method based on asymmetric primer extension (APE) and coupled this process directly to high-throughput, on-chip protein expression, purification and characterization (via mechanically induced trapping of molecular interactions, MITOMI). By completely circumventing molecular cloning and cell-based steps, APE-MITOMI reduces the time between protein design and quantitative characterization to 3–4 days. With APE-MITOMI we synthesized and characterized over 400 zinc-finger (ZF) transcription factors (TF), showing that although ZF TFs can be readily engineered to recognize a particular DNA sequence, engineering the precise binding energy landscape remains challenging. We also found that it is possible to engineer ZF–DNA affinity precisely and independently of sequence specificity and that in silico modeling can explain some of the observed affinity differences. APE-MITOMI is a generic approach that should facilitate fundamental studies in protein biophysics, and protein design/engineering. PMID:26704969

  18. Integrating gene synthesis and microfluidic protein analysis for rapid protein engineering.

    PubMed

    Blackburn, Matthew C; Petrova, Ekaterina; Correia, Bruno E; Maerkl, Sebastian J

    2016-04-20

    The capability to rapidly design proteins with novel functions will have a significant impact on medicine, biotechnology and synthetic biology. Synthetic genes are becoming a commodity, but integrated approaches have yet to be developed that take full advantage of gene synthesis. We developed a solid-phase gene synthesis method based on asymmetric primer extension (APE) and coupled this process directly to high-throughput, on-chip protein expression, purification and characterization (via mechanically induced trapping of molecular interactions, MITOMI). By completely circumventing molecular cloning and cell-based steps, APE-MITOMI reduces the time between protein design and quantitative characterization to 3-4 days. With APE-MITOMI we synthesized and characterized over 400 zinc-finger (ZF) transcription factors (TF), showing that although ZF TFs can be readily engineered to recognize a particular DNA sequence, engineering the precise binding energy landscape remains challenging. We also found that it is possible to engineer ZF-DNA affinity precisely and independently of sequence specificity and that in silico modeling can explain some of the observed affinity differences. APE-MITOMI is a generic approach that should facilitate fundamental studies in protein biophysics, and protein design/engineering.

  19. DAF-16/FoxO Directly Regulates an Atypical AMP-Activated Protein Kinase Gamma Isoform to Mediate the Effects of Insulin/IGF-1 Signaling on Aging in Caenorhabditis elegans

    PubMed Central

    Tullet, Jennifer M. A.; Araiz, Caroline; Sanders, Matthew J.; Au, Catherine; Benedetto, Alexandre; Papatheodorou, Irene; Clark, Emily; Schmeisser, Kathrin; Jones, Daniel; Schuster, Eugene F.; Thornton, Janet M.; Gems, David

    2014-01-01

    The DAF-16/FoxO transcription factor controls growth, metabolism and aging in Caenorhabditis elegans. The large number of genes that it regulates has been an obstacle to understanding its function. However, recent analysis of transcript and chromatin profiling implies that DAF-16 regulates relatively few genes directly, and that many of these encode other regulatory proteins. We have investigated the regulation by DAF-16 of genes encoding the AMP-activated protein kinase (AMPK), which has α, β and γ subunits. C. elegans has 5 genes encoding putative AMP-binding regulatory γ subunits, aakg-1-5. aakg-4 and aakg-5 are closely related, atypical isoforms, with orthologs throughout the Chromadorea class of nematodes. We report that ∼75% of total γ subunit mRNA encodes these 2 divergent isoforms, which lack consensus AMP-binding residues, suggesting AMP-independent kinase activity. DAF-16 directly activates expression of aakg-4, reduction of which suppresses longevity in daf-2 insulin/IGF-1 receptor mutants. This implies that an increase in the activity of AMPK containing the AAKG-4 γ subunit caused by direct activation by DAF-16 slows aging in daf-2 mutants. Knock down of aakg-4 expression caused a transient decrease in activation of expression in multiple DAF-16 target genes. This, taken together with previous evidence that AMPK promotes DAF-16 activity, implies the action of these two metabolic regulators in a positive feedback loop that accelerates the induction of DAF-16 target gene expression. The AMPK β subunit, aakb-1, also proved to be up-regulated by DAF-16, but had no effect on lifespan. These findings reveal key features of the architecture of the gene-regulatory network centered on DAF-16, and raise the possibility that activation of AMP-independent AMPK in nutritionally replete daf-2 mutant adults slows aging in C. elegans. Evidence of activation of AMPK subunits in mammals suggests that such FoxO-AMPK interactions may be evolutionarily conserved

  20. Heat shock protein 70-hom gene polymorphism and protein expression in multiple sclerosis.

    PubMed

    Boiocchi, C; Monti, M C; Osera, C; Mallucci, G; Pistono, C; Ferraro, O E; Nosari, G; Romani, A; Cuccia, M; Govoni, S; Pascale, A; Montomoli, C; Bergamaschi, R

    2016-09-15

    Immune-mediated and neurodegenerative mechanisms are involved in multiple sclerosis (MS). Growing evidences highlight the role of HSP70 genes in the susceptibility of some neurological diseases. In this explorative study we analyzed a polymorphism (i.e. HSP70-hom rs2227956) of the gene HSPA1L, which encodes for the protein hsp70-hom. We sequenced the polymorphism by polymerase chain reaction (PCR), in 191 MS patients and 365 healthy controls. The hsp70-hom protein expression was quantified by western blotting. We reported a strong association between rs2227956 polymorphism and MS risk, which is independent from the association with HSP70-2 rs1061581, and a significant link between hsp70-hom protein expression and MS severity.

  1. Prioritization of candidate genes for cattle reproductive traits, based on protein-protein interactions, gene expression, and text-mining.

    PubMed

    Hulsegge, Ina; Woelders, Henri; Smits, Mari; Schokker, Dirkjan; Jiang, Li; Sørensen, Peter

    2013-05-15

    Reproduction is of significant economic importance in dairy cattle. Improved understanding of mechanisms that control estrous behavior and other reproduction traits could help in developing strategies to improve and/or monitor these traits. The objective of this study was to predict and rank genes and processes in brain areas and pituitary involved in reproductive traits in cattle using information derived from three different data sources: gene expression, protein-protein interactions, and literature. We identified 59, 89, 53, 23, and 71 genes in bovine amygdala, dorsal hypothalamus, hippocampus, pituitary, and ventral hypothalamus, respectively, potentially involved in processes underlying estrus and estrous behavior. Functional annotation of the candidate genes points to a number of tissue-specific processes of which the "neurotransmitter/ion channel/synapse" process in the amygdala, "steroid hormone receptor activity/ion binding" in the pituitary, "extracellular region" in the ventral hypothalamus, and "positive regulation of transcription/metabolic process" in the dorsal hypothalamus are most prominent. The regulation of the functional processes in the various tissues operate at different biological levels, including transcriptional, posttranscriptional, extracellular, and intercellular signaling levels.

  2. Gene Ontology Function prediction in Mollicutes using Protein-Protein Association Networks

    PubMed Central

    2011-01-01

    Background Many complex systems can be represented and analysed as networks. The recent availability of large-scale datasets, has made it possible to elucidate some of the organisational principles and rules that govern their function, robustness and evolution. However, one of the main limitations in using protein-protein interactions for function prediction is the availability of interaction data, especially for Mollicutes. If we could harness predicted interactions, such as those from a Protein-Protein Association Networks (PPAN), combining several protein-protein network function-inference methods with semantic similarity calculations, the use of protein-protein interactions for functional inference in this species would become more potentially useful. Results In this work we show that using PPAN data combined with other approximations, such as functional module detection, orthology exploitation methods and Gene Ontology (GO)-based information measures helps to predict protein function in Mycoplasma genitalium. Conclusions To our knowledge, the proposed method is the first that combines functional module detection among species, exploiting an orthology procedure and using information theory-based GO semantic similarity in PPAN of the Mycoplasma species. The results of an evaluation show a higher recall than previously reported methods that focused on only one organism network. PMID:21486441

  3. p53 gene alterations and protein accumulation in colorectal cancer

    PubMed Central

    Bertorelle, R; Esposito, G; Belluco, C; Bonaldi, L; Del Mistro, A; Nitti, D; Lise, M; Chieco-Bianchi, L

    1996-01-01

    Aim—To correlate immunohistochemical staining with single strand conformation polymorphism (SSCP) analysis of the p53 gene in colorectal cancer in order to understand how the findings provided by the two techniques complement each other in defining p53 functional status. Methods—Frozen tumour tissue from 94 patients with colorectal cancer was studied for p53 protein accumulation and gene mutations. Accumulation of p53 protein was detected by immunohistochemistry using PAb1801 and BP53-12-1 monoclonal antibodies. The findings were then compared with SSCP analysis of exons 5 to 8 of the p53 gene. All cases with a positive result by SSCP analysis were confirmed by sequencing. Results—Nuclear staining was observed in 51 (54.2%) cases. SSCP analysis of the DNA amplified by PCR revealed that the electrophoretic pattern had shifted in 30 cases; sequence analysis confirmed the occurrence of a mutation in 29 cases and of a polymorphism in one. In 27 cases both assays gave a positive result, and in 40 both were negative; therefore, concordance between PCR-SSCP and immunohistochemistry was seen in 72% of cases. Conclusion—The data indicate that positive immunostaining corresponds with the presence of a mutation in most, but not all, cases studied; other mechanisms could be responsible for stabilisation and accumulation of p53 protein in the nucleus. Nonsense mutations which do not confer stability on the protein will not be detected by immunohistochemistry and false negative results can also occur with SSCP analysis. Images PMID:16696056

  4. Diverse nucleotide compositions and sequence fluctuation in Rubisco protein genes

    NASA Astrophysics Data System (ADS)

    Holden, Todd; Dehipawala, S.; Cheung, E.; Bienaime, R.; Ye, J.; Tremberger, G., Jr.; Schneider, P.; Lieberman, D.; Cheung, T.

    2011-10-01

    The Rubisco protein-enzyme is arguably the most abundance protein on Earth. The biology dogma of transcription and translation necessitates the study of the Rubisco genes and Rubisco-like genes in various species. Stronger correlation of fractal dimension of the atomic number fluctuation along a DNA sequence with Shannon entropy has been observed in the studied Rubisco-like gene sequences, suggesting a more diverse evolutionary pressure and constraints in the Rubisco sequences. The strategy of using metal for structural stabilization appears to be an ancient mechanism, with data from the porphobilinogen deaminase gene in Capsaspora owczarzaki and Monosiga brevicollis. Using the chi-square distance probability, our analysis supports the conjecture that the more ancient Rubisco-like sequence in Microcystis aeruginosa would have experienced very different evolutionary pressure and bio-chemical constraint as compared to Bordetella bronchiseptica, the two microbes occupying either end of the correlation graph. Our exploratory study would indicate that high fractal dimension Rubisco sequence would support high carbon dioxide rate via the Michaelis- Menten coefficient; with implication for the control of the whooping cough pathogen Bordetella bronchiseptica, a microbe containing a high fractal dimension Rubisco-like sequence (2.07). Using the internal comparison of chi-square distance probability for 16S rRNA (~ E-22) versus radiation repair Rec-A gene (~ E-05) in high GC content Deinococcus radiodurans, our analysis supports the conjecture that high GC content microbes containing Rubisco-like sequence are likely to include an extra-terrestrial origin, relative to Deinococcus radiodurans. Similar photosynthesis process that could utilize host star radiation would not compete with radiation resistant process from the biology dogma perspective in environments such as Mars and exoplanets.

  5. Molecular cloning and analysis of the CRY1 gene: a yeast ribosomal protein gene.

    PubMed Central

    Larkin, J C; Woolford, J L

    1983-01-01

    Using cloned DNA from the vicinity of the yeast mating type locus (MAT) as a probe, the wild type allele of the cryptopleurine resistance gene CRY1 has been isolated by the technique of chromosome walking and has been shown to be identical to the gene for ribosomal protein 59. A recessive cryR1 allele has also been cloned, using the integration excision method. The genetic distance from MAT to CRY1 is 2.2 cM, while the physical distance is 21 kb, giving a ratio of about 10 kb/cM for this interval. The phenotypic expression of both plasmid borne alleles of the gene can be detected in vivo. The use of this gene as a hybridization probe to examine RNA processing defects in the rna 2, rna 3, rna 4, rna 8, and rna 11 mutants is also discussed. Images PMID:6338478

  6. Leber congenital amaurosis: genes, proteins and disease mechanisms.

    PubMed

    den Hollander, Anneke I; Roepman, Ronald; Koenekoop, Robert K; Cremers, Frans P M

    2008-07-01

    Leber congenital amaurosis (LCA) is the most severe retinal dystrophy causing blindness or severe visual impairment before the age of 1 year. Linkage analysis, homozygosity mapping and candidate gene analysis facilitated the identification of 14 genes mutated in patients with LCA and juvenile retinal degeneration, which together explain approximately 70% of the cases. Several of these genes have also been implicated in other non-syndromic or syndromic retinal diseases, such as retinitis pigmentosa and Joubert syndrome, respectively. CEP290 (15%), GUCY2D (12%), and CRB1 (10%) are the most frequently mutated LCA genes; one intronic CEP290 mutation (p.Cys998X) is found in approximately 20% of all LCA patients from north-western Europe, although this frequency is lower in other populations. Despite the large degree of genetic and allelic heterogeneity, it is possible to identify the causative mutations in approximately 55% of LCA patients by employing a microarray-based, allele-specific primer extension analysis of all known DNA variants. The LCA genes encode proteins with a wide variety of retinal functions, such as photoreceptor morphogenesis (CRB1, CRX), phototransduction (AIPL1, GUCY2D), vitamin A cycling (LRAT, RDH12, RPE65), guanine synthesis (IMPDH1), and outer segment phagocytosis (MERTK). Recently, several defects were identified that are likely to affect intra-photoreceptor ciliary transport processes (CEP290, LCA5, RPGRIP1, TULP1). As the eye represents an accessible and immune-privileged organ, it appears to be uniquely suitable for human gene replacement therapy. Rodent (Crb1, Lrat, Mertk, Rpe65, Rpgrip1), avian (Gucy2D) and canine (Rpe65) models for LCA and profound visual impairment have been successfully corrected employing adeno-associated virus or lentivirus-based gene therapy. Moreover, phase 1 clinical trials have been carried out in humans with RPE65 deficiencies. Apart from ethical considerations inherently linked to treating children, major

  7. Gene Delivery into Plant Cells for Recombinant Protein Production

    PubMed Central

    Chen, Qiang

    2015-01-01

    Recombinant proteins are primarily produced from cultures of mammalian, insect, and bacteria cells. In recent years, the development of deconstructed virus-based vectors has allowed plants to become a viable platform for recombinant protein production, with advantages in versatility, speed, cost, scalability, and safety over the current production paradigms. In this paper, we review the recent progress in the methodology of agroinfiltration, a solution to overcome the challenge of transgene delivery into plant cells for large-scale manufacturing of recombinant proteins. General gene delivery methodologies in plants are first summarized, followed by extensive discussion on the application and scalability of each agroinfiltration method. New development of a spray-based agroinfiltration and its application on field-grown plants is highlighted. The discussion of agroinfiltration vectors focuses on their applications for producing complex and heteromultimeric proteins and is updated with the development of bridge vectors. Progress on agroinfiltration in Nicotiana and non-Nicotiana plant hosts is subsequently showcased in context of their applications for producing high-value human biologics and low-cost and high-volume industrial enzymes. These new advancements in agroinfiltration greatly enhance the robustness and scalability of transgene delivery in plants, facilitating the adoption of plant transient expression systems for manufacturing recombinant proteins with a broad range of applications. PMID:26075275

  8. Transcription of antifreeze protein genes in Choristoneura fumiferana.

    PubMed

    Qin, W; Doucet, D; Tyshenko, M G; Walker, V K

    2007-08-01

    Antifreeze proteins (AFPs) are encoded by approximately 17 genes in the spruce budworm, Choristoneura fumiferana. Northern analysis using 6 different cDNA probes showed isoform-specific patterns that varied during development. Transcripts for the majority of isoforms were most abundant in the second instar overwintering stage, but some were also detected in first instar and even in egg stages. In situ hybridization using riboprobes corresponding to two 9 kDa protein isoforms showed differential AFP expression even in second instars; CfAFP10 RNA was detected in all tissues, but CfAFP337 RNA distribution was more limited. Two genomic regions encoding three AFP genes have been isolated. Presumptive regulatory regions conferred transcriptional activity when placed upstream of a luciferase reporter sequence and transfected into a C. fumiferana cell line. The CfAFP2.26 core promoter is an 87 bp sequence containing a TATA box, whereas the CfAFP2.7 core promoter is a 76 bp sequence with both a TATA box and CAAT box, which directed higher reporter activities when tested in vitro. Reporter activity was not enhanced with five different hormones, although lower activities were observed with all intron-containing constructs. AFP message half-life, as assessed using reporter assays, was not appreciably influenced by isoform-specific-3'UTRs. These studies successfully demonstrate the temporal and spatial diversity of AFP expression encoded by this small gene family, and underscore the complexity of their regulation.

  9. Protein-Protein Interactions: Gene Acronym Redundancies and Current Limitations Precluding Automated Data Integration.

    PubMed

    Casado-Vela, Juan; Matthiesen, Rune; Sellés, Susana; Naranjo, José Ramón

    2013-05-31

    Understanding protein interaction networks and their dynamic changes is a major challenge in modern biology. Currently, several experimental and in silico approaches allow the screening of protein interactors in a large-scale manner. Therefore, the bulk of information on protein interactions deposited in databases and peer-reviewed published literature is constantly growing. Multiple databases interfaced from user-friendly web tools recently emerged to facilitate the task of protein interaction data retrieval and data integration. Nevertheless, as we evidence in this report, despite the current efforts towards data integration, the quality of the information on protein interactions retrieved by in silico approaches is frequently incomplete and may even list false interactions. Here we point to some obstacles precluding confident data integration, with special emphasis on protein interactions, which include gene acronym redundancies and protein synonyms. Three human proteins (choline kinase, PPIase and uromodulin) and three different web-based data search engines focused on protein interaction data retrieval (PSICQUIC, DASMI and BIPS) were used to explain the potential occurrence of undesired errors that should be considered by researchers in the field. We demonstrate that, despite the recent initiatives towards data standardization, manual curation of protein interaction networks based on literature searches are still required to remove potential false positives. A three-step workflow consisting of: (i) data retrieval from multiple databases, (ii) peer-reviewed literature searches, and (iii) data curation and integration, is proposed as the best strategy to gather updated information on protein interactions. Finally, this strategy was applied to compile bona fide information on human DREAM protein interactome, which constitutes liable training datasets that can be used to improve computational predictions.

  10. Protein-Protein Interactions: Gene Acronym Redundancies and Current Limitations Precluding Automated Data Integration

    PubMed Central

    Casado-Vela, Juan; Matthiesen, Rune; Sellés, Susana; Naranjo, José Ramón

    2013-01-01

    Understanding protein interaction networks and their dynamic changes is a major challenge in modern biology. Currently, several experimental and in silico approaches allow the screening of protein interactors in a large-scale manner. Therefore, the bulk of information on protein interactions deposited in databases and peer-reviewed published literature is constantly growing. Multiple databases interfaced from user-friendly web tools recently emerged to facilitate the task of protein interaction data retrieval and data integration. Nevertheless, as we evidence in this report, despite the current efforts towards data integration, the quality of the information on protein interactions retrieved by in silico approaches is frequently incomplete and may even list false interactions. Here we point to some obstacles precluding confident data integration, with special emphasis on protein interactions, which include gene acronym redundancies and protein synonyms. Three human proteins (choline kinase, PPIase and uromodulin) and three different web-based data search engines focused on protein interaction data retrieval (PSICQUIC, DASMI and BIPS) were used to explain the potential occurrence of undesired errors that should be considered by researchers in the field. We demonstrate that, despite the recent initiatives towards data standardization, manual curation of protein interaction networks based on literature searches are still required to remove potential false positives. A three-step workflow consisting of: (i) data retrieval from multiple databases, (ii) peer-reviewed literature searches, and (iii) data curation and integration, is proposed as the best strategy to gather updated information on protein interactions. Finally, this strategy was applied to compile bona fide information on human DREAM protein interactome, which constitutes liable training datasets that can be used to improve computational predictions. PMID:28250396

  11. False positive reduction in protein-protein interaction predictions using gene ontology annotations.

    PubMed

    Mahdavi, Mahmoud A; Lin, Yen-Han

    2007-07-23

    Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. Gene Ontology (GO) annotations were used to reduce false positive protein-protein interactions (PPI) pairs resulting from computational predictions. Using experimentally obtained PPI pairs as a training dataset, eight top-ranking keywords were extracted from GO molecular function annotations. The sensitivity of these keywords is 64.21% in the yeast experimental dataset and 80.83% in the worm experimental dataset. The specificities, a measure of recovery power, of these keywords applied to four predicted PPI datasets for each studied organisms, are 48.32% and 46.49% (by average of four datasets) in yeast and worm, respectively. Based on eight top-ranking keywords and co-localization of interacting proteins a set of two knowledge rules were deduced and applied to remove false positive protein pairs. The 'strength', a measure of improvement provided by the rules was defined based on the signal-to-noise ratio and implemented to measure the applicability of knowledge rules applying to the predicted PPI datasets. Depending on the employed PPI-predicting methods, the strength varies between two and ten-fold of randomly removing protein pairs from the datasets. Gene Ontology annotations along with the deduced knowledge rules could be implemented to partially remove false predicted PPI pairs. Removal of false positives

  12. Functions of BET proteins in erythroid gene expression.

    PubMed

    Stonestrom, Aaron J; Hsu, Sarah C; Jahn, Kristen S; Huang, Peng; Keller, Cheryl A; Giardine, Belinda M; Kadauke, Stephan; Campbell, Amy E; Evans, Perry; Hardison, Ross C; Blobel, Gerd A

    2015-04-30

    Inhibitors of bromodomain and extraterminal motif proteins (BETs) are being evaluated for the treatment of cancer and other diseases, yet much remains to be learned about how BET proteins function during normal physiology. We used genomic and genetic approaches to examine BET function in a hematopoietic maturation system driven by GATA1, an acetylated transcription factor previously shown to interact with BETs. We found that BRD2, BRD3, and BRD4 were variably recruited to GATA1-regulated genes, with BRD3 binding the greatest number of GATA1-occupied sites. Pharmacologic BET inhibition impaired GATA1-mediated transcriptional activation, but not repression, genome-wide. Mechanistically, BETs promoted chromatin occupancy of GATA1 and subsequently supported transcriptional activation. Using a combination of CRISPR-Cas9-mediated genomic engineering and shRNA approaches, we observed that depletion of either BRD2 or BRD4 alone blunted erythroid gene activation. Surprisingly, depletion of BRD3 only affected erythroid transcription in the context of BRD2 deficiency. Consistent with functional overlap among BET proteins, forced BRD3 expression substantially rescued defects caused by BRD2 deficiency. These results suggest that pharmacologic BET inhibition should be interpreted in the context of distinct steps in transcriptional activation and overlapping functions among BET family members.

  13. pGenN, a Gene Normalization Tool for Plant Genes and Proteins in Scientific Literature

    PubMed Central

    Ding, Ruoyao; Arighi, Cecilia N.; Lee, Jung-Youn; Wu, Cathy H.; Vijay-Shanker, K.

    2015-01-01

    Background Automatically detecting gene/protein names in the literature and connecting them to databases records, also known as gene normalization, provides a means to structure the information buried in free-text literature. Gene normalization is critical for improving the coverage of annotation in the databases, and is an essential component of many text mining systems and database curation pipelines. Methods In this manuscript, we describe a gene normalization system specifically tailored for plant species, called pGenN (pivot-based Gene Normalization). The system consists of three steps: dictionary-based gene mention detection, species assignment, and intra species normalization. We have developed new heuristics to improve each of these phases. Results We evaluated the performance of pGenN on an in-house expertly annotated corpus consisting of 104 plant relevant abstracts. Our system achieved an F-value of 88.9% (Precision 90.9% and Recall 87.2%) on this corpus, outperforming state-of-art systems presented in BioCreative III. We have processed over 440,000 plant-related Medline abstracts using pGenN. The gene normalization results are stored in a local database for direct query from the pGenN web interface (proteininformationresource.org/pgenn/). The annotated literature corpus is also publicly available through the PIR text mining portal (proteininformationresource.org/iprolink/). PMID:26258475

  14. Evolutionary Characteristics of Missing Proteins: Insights into the Evolution of Human Chromosomes Related to Missing-Protein-Encoding Genes.

    PubMed

    Xu, Aishi; Li, Guang; Yang, Dong; Wu, Songfeng; Ouyang, Hongsheng; Xu, Ping; He, Fuchu

    2015-12-04

    Although the "missing protein" is a temporary concept in C-HPP, the biological information for their "missing" could be an important clue in evolutionary studies. Here we classified missing-protein-encoding genes into two groups, the genes encoding PE2 proteins (with transcript evidence) and the genes encoding PE3/4 proteins (with no transcript evidence). These missing-protein-encoding genes distribute unevenly among different chromosomes, chromosomal regions, or gene clusters. In the view of evolutionary features, PE3/4 genes tend to be young, spreading at the nonhomology chromosomal regions and evolving at higher rates. Interestingly, there is a higher proportion of singletons in PE3/4 genes than the proportion of singletons in all genes (background) and OTCSGs (organ, tissue, cell type-specific genes). More importantly, most of the paralogous PE3/4 genes belong to the newly duplicated members of the paralogous gene groups, which mainly contribute to special biological functions, such as "smell perception". These functions are heavily restricted into specific type of cells, tissues, or specific developmental stages, acting as the new functional requirements that facilitated the emergence of the missing-protein-encoding genes during evolution. In addition, the criteria for the extremely special physical-chemical proteins were first set up based on the properties of PE2 proteins, and the evolutionary characteristics of those proteins were explored. Overall, the evolutionary analyses of missing-protein-encoding genes are expected to be highly instructive for proteomics and functional studies in the future.

  15. Eukaryotic expression vectors bearing genes encoding cytotoxic proteins for cancer gene therapy.

    PubMed

    Glinka, Elena M

    2012-09-01

    Cancer gene therapy is a promising direction for the treatment of cancer patients. A primary goal of all cancer therapies is to selectively target and kill tumour cells. Such therapies are administered via different approaches, including both viral and non-viral delivery; however, both methods have advantages and disadvantages. Transcriptional targeting enables genes encoding toxic proteins to be expressed directly in cancer cells. Numerous vectors have been created with the purpose of killing cancer cells, and some have successfully suppressed malignant tumours. Data concerning the function of vectors bearing genes that encode cytotoxic proteins under the control of different promoters, including tissue/tumour specific and constitutive promoters, is summarised here. This review focuses on vectors that bear genes encoding diphtheria toxin, Pseudomonas exotoxin A, caspases, gef, streptolysin, and melittin. Data describing the efficacy of such vectors have been summarised. Notably, there are vectors that killed cancer cell lines originating from the same type of cancer with differential efficiency. Thus, there is differential inhibition of cancer cell growth dependent on the cell line. In this review, the constructs employing genes whose expression induces cell death and the efficiency with which they suppress cancer cell growth will be summarised.

  16. Fc-receptor and M-protein genes of group A streptococci are products of gene duplication.

    PubMed Central

    Heath, D G; Cleary, P P

    1989-01-01

    The partial nucleotide sequence for an Fc-receptor gene from an M-type 76 group A streptococcus was determined. DNA sequence analysis revealed considerable sequence similarity between the Fc-receptor and M-protein genes in their proposed promoter regions, signal sequences, and 3' termini. Additional analysis indicated that the deduced Fc-receptor protein contains a proline-rich region and membrane anchor region highly similar to that of M protein. In view of these results, we postulated that Fc-receptor and M-protein genes of group A streptococci are the products of gene duplication from a common ancestral gene. It is proposed that DNA sequence similarity between these two genes may allow for extragenic homologous recombination as a means of generating antigenic diversity in these two surface proteins. PMID:2660147

  17. From patenting genes to proteins: the search for utility via function.

    PubMed

    Ilag, Lawrence L; Ilag, Leodegario M; Ilag, Leodevico L

    2002-05-01

    The debate regarding the patenting of genes has extended into the post-genome era. With only approximately 35000 genes deduced from the draft sequence of the human genome, there are fears that a few companies have already gained monopoly on the potential benefits from this knowledge. Nevertheless, it is accepted that proteins determine gene function and function is not readily predicted from gene sequence. Furthermore, genes can encode multiple proteins and a single protein can have multiple functions. Here, we argue that unraveling the intrinsic complexity of proteins and their functions is the key towards determining the utility requirement for patenting protein inventions and consider the possible socioeconomic impact.

  18. Nucleotide variation in the Toxoplasma gondii micronemal protein 8 gene.

    PubMed

    Li, Z Y; Song, H Q; Wang, C R; Zhu, X Q

    2016-05-09

    Toxoplasma gondii is a successful opportunistic protozoan distributed worldwide, which can infect all vertebrates, leading to serious infection, blindness, and abortion. Micronemal (MIC) proteins are critically important for T. gondii infection, as they participate in various stages of the Toxoplasma life cycle, including invasion and attachment to host cells. MIC8 secretion relies on the concentration of intracellular calcium, and can mediate the invasion of T. gondii by interacting with soluble MIC3. To investigate genetic diversity of the MIC8 gene, 16 T. gondii strains from different hosts and geographical locations, and two reference isolates (ToxoDB: TGME49_245490 and TGVEG_245490) were examined in this study. The results showed that all the examined MIC8 genes are 2055 bp, with an A+T content ranging from 50.2 to 50.6%. Conversely, lower levels of variation were detected within their nucleotide and amino acid sequences. Phylogenetic analyses indicated that three classical genotypes of T. gondii and the ToxoDB#9 genotype did not group exclusively via Bayesian inference, maximum parsimony, neighbor joining, and/or maximum likelihood assays based on the nucleotide and amino acid sequences of the MIC8 gene. In summary, the T. gondii MIC8 gene is not a suitable marker for population genetic studies of this parasite.

  19. Nuclear genes encoding plastid proteins expressed early in chloroplast development

    SciTech Connect

    Mullet, J.E.

    1991-01-01

    The overall objective of this grant was to characterize events which occur early in chloroplast biogenesis and to isolate nuclear genes encoding plastid proteins which are expressed during this developmental phase. In addition, the possible requirement of plastid transcription for the expression of the nuclear genes such as rbcS and cab was to be tested. The impetus for this research came from studies of chloroplast biogenesis in barley. We found that plastid DNA copy number was relatively high (120 copies/plastid vs 200 at maximal accumulation) in the meristematic region of the leaf base whereas plastid transcription activity was low in this plastid population. Later in chloroplast development transcription activity increased at least 5 fold per plastid or per template indicating that activation of plastid transcription occurred after most of the build up in plastid DNA per plastid. This suggested that activation of plastid DNA synthesis occurred early in chloroplast development and that nuclear genes involved in this activity must be regulated differently from genes such rbcS or cab which are expressed later in development. 3 refs., 7 figs.

  20. A rhodopsin-like protein in Cyanophora paradoxa: gene sequence and protein immunolocalization.

    PubMed

    Frassanito, Anna Maria; Barsanti, Laura; Passarelli, Vincenzo; Evangelista, Valtere; Gualtieri, Paolo

    2010-03-01

    Here, we report the DNA sequence of the rhodopsin gene in the alga Cyanophora paradoxa (Glaucophyta). The primers were designed according to the conserved regions of prokaryotic and eukaryotic rhodopsin-like proteins deposited in the GenBank. The sequence consists of 1,272 bp comprised of 5 introns. The correspondent protein, named Cyanophopsin, showed high identity to rhodopsin-like proteins of Archea, Bacteria, Fungi, and Algae. At the N-terminal, the protein is characterized by a region with no transmembrane alpha-helices (80 aa), followed by a region with 7alpha-helices (219 aa) and a shorter 35-aa C-terminal region. The DNA sequence of the N-terminal region was expressed in E. coli and the recombinant purified peptide was used as antigen in hens to obtain polyclonal antibodies. Indirect immunofluorescence in C. paradoxa cells showed a marked labeling of the muroplast (aka cyanelle) membrane.

  1. [Drosophila melanogaster gene Merlin interacts with the clathrin adaptor protein gene lap].

    PubMed

    Kopyl, S A; Dorogova, N V; Akhmamet'eva, E M; Omel'ianchuk, L V; Chang, L -S

    2010-03-01

    The protein Merlin is involved in the regulation of cell proliferation and differentiation in the eyes and wings of Drosophila and is a homolog of the human protein encoded by the Neurofibromatosis 2 (NF2) gene whose mutations cause auricular nerve tumors. Recent studies show that Merlin and Expanded cooperatively regulate the recycling of membrane receptors, such as the epidermal growth factor receptor (EGFR). By performing a search for potential genetic interactions between Merlin (Mer) and the genes important for vesicular trafficking, we found that ectopic expression in the wing pouch of the clathrin adapter protein Lap involved in clathrin-mediated receptor endocytosis resulted in the formation of extra vein materials. On the one hand, coexpression of wild-type Merlin and lap in the wing pouch restored normal venation, while overexpression of a dominant-negative mutant Mer(DBB) together with lap enhanced ectopic vein formation. Using various constructs with Merlin truncated copies, we showed the C-terminal portion of the Merlin protein to be responsible for the Merlin-lap genetic interaction. Furthermore, we showed that the Merlin and Lap proteins colocalized at the cortex of the wing imaginal disc cells.

  2. Targeted genes and interacting proteins of hypoxia inducible factor-1

    PubMed Central

    Liu, Wei; Shen, Shao-Ming; Zhao, Xu-Yun; Chen, Guo-Qiang

    2012-01-01

    Heterodimeric transcription factor hypoxia inducible factor-1 (HIF-1) functions as a master regulator of oxygen homeostasis in almost all nucleated mammalian cells. The fundamental process adapted to cellular oxygen alteration largely depends on the refined regulation on its alpha subunit, HIF-1α. Recent studies have unraveled expanding and critical roles of HIF-1α, involving in a multitude of developmental, physiological, and pathophysiological processes. This review will focus on the current knowledge of HIF-1α-targeting genes and its interacting proteins, as well as the concomitant functional relationships between them. PMID:22773957

  3. Mouse Genetic Nomenclature: Standardization of Strain, Gene, and Protein Symbols

    PubMed Central

    Sundberg, John P.; Schofield, Paul N

    2011-01-01

    The use of standard nomenclatures for describing the strains, genes, and proteins of species is vital for the interpretation, archiving, analysis, and recovery of experimental data on the laboratory mouse. At a time when sharing of data and meta- analysis of experimental results is becoming a dominant mode of scientific investigation, failure to respect formal nomenclatures can cause confusion, errors, and in some cases contribute to poor science. Here we present the basic nomenclature rules for laboratory mice and explain how these rules should be applied to complex genetic manipulations and crosses. PMID:20685919

  4. Genes and proteins involved in bacterial magnetic particle formation.

    PubMed

    Matsunaga, Tadashi; Okamura, Yoshiko

    2003-11-01

    Magnetic bacteria synthesize intracellular magnetosomes that impart a cellular swimming behaviour referred to as magnetotaxis. The magnetic structures aligned in chains are postulated to function as biological compass needles allowing the bacterium to migrate along redox gradients through the Earth's geomagnetic field lines. Despite the discovery of this unique group of microorganisms 28 years ago, the mechanisms of magnetic crystal biomineralization have yet to be fully elucidated. This review describes the current knowledge of the genes and proteins involved in magnetite formation in magnetic bacteria and the biotechnological applications of biomagnetites in the interdisciplinary fields of nanobiotechnology, medicine and environmental management.

  5. PREFACE: Physics approaches to protein interactions and gene regulation Physics approaches to protein interactions and gene regulation

    NASA Astrophysics Data System (ADS)

    Nussinov, Ruth; Panchenko, Anna R.; Przytycka, Teresa

    2011-06-01

    networks have been identified, including scale free distribution of the vertex degree, network motifs, and modularity, to name a few. These studies of network organization require the network to be as complete as possible, which given the limitations of experimental techniques is not currently the case. Therefore, experimental procedures for detecting biomolecular interactions should be complemented by computational approaches. The paper by Lees et al provides a review of computational methods, integrating multiple independent sources of data to infer physical and functional protein-protein interaction networks. One of the important aspects of protein interactions that should be accounted for in the prediction of protein interaction networks is that many proteins are composed of distinct domains. Protein domains may mediate protein interactions while proteins and their interaction networks may gain complexity through gene duplication and expansion of existing domain architectures via domain rearrangements. The latter mechanisms have been explored in detail in the paper by Cohen-Gihon et al. Protein-protein interactions are not the only component of the cell's interactome. Regulation of cell activity can be achieved at the level of transcription and involve a transcription factor—DNA binding which typically requires recognition of a specific DNA sequence motif. Chip-Chip and the more recent Chip-Seq technologies allow in vivo identification of DNA binding sites and, together with novel in vitro approaches, provide data necessary for deciphering the corresponding binding motifs. Such information, complemented by structures of protein-DNA complexes and knowledge of the differences in binding sites among homologs, opens the door to constructing predictive binding models. The paper by Persikov and Singh provides an example of such a model in the Cys2His2 zinc finger family. Recent studies have indicated that the presence of such binding motifs is, however, neither necessary

  6. Structural basis for cAMP-mediated allosteric control of the catabolite activator protein.

    PubMed

    Popovych, Nataliya; Tzeng, Shiou-Ru; Tonelli, Marco; Ebright, Richard H; Kalodimos, Charalampos G

    2009-04-28

    The cAMP-mediated allosteric transition in the catabolite activator protein (CAP; also known as the cAMP receptor protein, CRP) is a textbook example of modulation of DNA-binding activity by small-molecule binding. Here we report the structure of CAP in the absence of cAMP, which, together with structures of CAP in the presence of cAMP, defines atomic details of the cAMP-mediated allosteric transition. The structural changes, and their relationship to cAMP binding and DNA binding, are remarkably clear and simple. Binding of cAMP results in a coil-to-helix transition that extends the coiled-coil dimerization interface of CAP by 3 turns of helix and concomitantly causes rotation, by approximately 60 degrees , and translation, by approximately 7 A, of the DNA-binding domains (DBDs) of CAP, positioning the recognition helices in the DBDs in the correct orientation to interact with DNA. The allosteric transition is stabilized further by expulsion of an aromatic residue from the cAMP-binding pocket upon cAMP binding. The results define the structural mechanisms that underlie allosteric control of this prototypic transcriptional regulatory factor and provide an illustrative example of how effector-mediated structural changes can control the activity of regulatory proteins.

  7. The nucleocapsid protein gene of bovine coronavirus is bicistronic.

    PubMed Central

    Senanayake, S D; Hofmann, M A; Maki, J L; Brian, D A

    1992-01-01

    For animal RNA viruses that replicate through an RNA intermediate, reported examples of bicistronic mRNAs with overlapping open reading frames in which one cistron is contained entirely within another have been made only for those with negative-strand or double-stranded genomes. In this report, we demonstrate for the positive-strand bovine coronavirus that an overlapping open reading frame potentially encoding a 23-kDa protein (names the I [for internal open reading frame] protein) and lying entirely within the gene for the 49-kDa nucleocapsid phosphoprotein is expressed during virus replication from a single species of unedited mRNA. The I protein was specifically immunoprecipitated from virus-infected cells with an I-specific antipeptide serum and was shown to be membrane associated. Many features of I protein synthesis conform to the leaky ribosomal scanning model for regulation of translation. This, to our knowledge, is the first example of a bicistronic mRNA for a cytoplasmically replicating, positive-strand animal RNA virus in which one cistron entirely overlaps another. Images PMID:1501275

  8. Protein-protein interactions prediction based on iterative clique extension with gene ontology filtering.

    PubMed

    Yang, Lei; Tang, Xianglong

    2014-01-01

    Cliques (maximal complete subnets) in protein-protein interaction (PPI) network are an important resource used to analyze protein complexes and functional modules. Clique-based methods of predicting PPI complement the data defection from biological experiments. However, clique-based predicting methods only depend on the topology of network. The false-positive and false-negative interactions in a network usually interfere with prediction. Therefore, we propose a method combining clique-based method of prediction and gene ontology (GO) annotations to overcome the shortcoming and improve the accuracy of predictions. According to different GO correcting rules, we generate two predicted interaction sets which guarantee the quality and quantity of predicted protein interactions. The proposed method is applied to the PPI network from the Database of Interacting Proteins (DIP) and most of the predicted interactions are verified by another biological database, BioGRID. The predicted protein interactions are appended to the original protein network, which leads to clique extension and shows the significance of biological meaning.

  9. Growing functional modules from a seed protein via integration of protein interaction and gene expression data.

    PubMed

    Maraziotis, Ioannis A; Dimitrakopoulou, Konstantina; Bezerianos, Anastasios

    2007-10-23

    Nowadays modern biology aims at unravelling the strands of complex biological structures such as the protein-protein interaction (PPI) networks. A key concept in the organization of PPI networks is the existence of dense subnetworks (functional modules) in them. In recent approaches clustering algorithms were applied at these networks and the resulting subnetworks were evaluated by estimating the coverage of well-established protein complexes they contained. However, most of these algorithms elaborate on an unweighted graph structure which in turn fails to elevate those interactions that would contribute to the construction of biologically more valid and coherent functional modules. In the current study, we present a method that corroborates the integration of protein interaction and microarray data via the discovery of biologically valid functional modules. Initially the gene expression information is overlaid as weights onto the PPI network and the enriched PPI graph allows us to exploit its topological aspects, while simultaneously highlights enhanced functional association in specific pairs of proteins. Then we present an algorithm that unveils the functional modules of the weighted graph by expanding a kernel protein set, which originates from a given 'seed' protein used as starting-point. The integrated data and the concept of our approach provide reliable functional modules. We give proofs based on yeast data that our method manages to give accurate results in terms both of structural coherency, as well as functional consistency.

  10. Histone deacetylase inhibitors modulate the transcriptional regulation of guanylyl cyclase/natriuretic peptide receptor-a gene: interactive roles of modified histones, histone acetyltransferase, p300, AND Sp1.

    PubMed

    Kumar, Prerna; Tripathi, Satyabha; Pandey, Kailash N

    2014-03-07

    Atrial natriuretic peptide (ANP) binds guanylyl cyclase-A/natriuretic peptide receptor-A (GC-A/NPRA) and produces the intracellular second messenger, cGMP, which regulates cardiovascular homeostasis. We sought to determine the function of histone deacetylases (HDACs) in regulating Npr1 (coding for GC-A/NPRA) gene transcription, using primary mouse mesangial cells treated with class-specific HDAC inhibitors (HDACi). Trichostatin A, a pan inhibitor, and mocetinostat (MGCD0103), a class I HDAC inhibitor, significantly enhanced Npr1 promoter activity (by 8- and 10-fold, respectively), mRNA levels (4- and 5.3-fold, respectively), and NPRA protein (2.7- and 3.5-fold, respectively). However, MC1568 (class II HDAC inhibitor) had no discernible effect. Overexpression of HDAC1 and HDAC2 significantly attenuated Npr1 promoter activity, whereas HDAC3 and HDAC8 had no effect. HDACi-treated cultured cells in vitro and intact animals in vivo showed significantly reduced binding of HDAC1 and -2 and increased accumulation of acetylated H3-K9/14 and H4-K12 at the Npr1 promoter. Deletional analyses of the Npr1 promoter along with ectopic overexpression and inhibition of Sp1 confirmed that HDACi-induced Npr1 gene transcription is accomplished by Sp1 activation. Furthermore, HDACi attenuated the interaction of Sp1 with HDAC1/2 and promoted Sp1 association with p300 and p300/cAMP-binding protein-associated factor; it also promoted the recruitment of p300 and p300/cAMP-binding protein-associated factor to the Npr1 promoter. Our results demonstrate that trichostatin A and MGCD0103 enhanced Npr1 gene expression through inhibition of HDAC1/2 and increased both acetylation of histones (H3-K9/14, H4-K12) and Sp1 by p300, and their recruitment to Npr1 promoter. Our findings define a novel epigenetic regulatory mechanism that governs Npr1 gene transcription.

  11. Protein inhibitor of activated STAT3 inhibits adipogenic gene expression

    SciTech Connect

    Deng Jianbei; Hua Kunjie; Caveney, Erica J.; Takahashi, Nobuyuki; Harp, Joyce B. . E-mail: jharp@unc.edu

    2006-01-20

    Protein inhibitor of activated STAT3 (PIAS3), a cytokine-induced repressor of signal transducer and activator of transcription 3 (STAT3) and a modulator of a broad array of nuclear proteins, is expressed in white adipose tissue, but its role in adipogenesis is not known. Here, we determined that PIAS3 was constitutively expressed in 3T3-L1 cells at all stages of adipogenesis. However, it translocated from the nucleus to the cytoplasm 4 days after induction of differentiation by isobutylmethylxanthine, dexamethasone, and insulin (MDI). In ob/ob mice, PIAS3 expression was increased in white adipose tissue depots compared to lean mice and was found in the cytoplasm of adipocytes. Overexpression of PIAS3 in differentiating preadipocytes, which localized primarily to the nucleus, inhibited mRNA level gene expression of adipogenic transcription factors C/EBP{alpha} and PPAR{gamma}, as well as their downstream target genes aP2 and adiponectin. PIAS3 also inhibited C/EBP{alpha} promoter activation mediated specifically by insulin, but not dexamethasone or isobutylmethylxanthine. Taken together, these data suggest that PIAS3 may play an inhibitory role in adipogenesis by modulating insulin-activated transcriptional activation events. Increased PIAS3 expression in adipose tissue may play a role in the metabolic disturbances of obesity.

  12. Two secretory protein genes in Chironomus tentans have arisen by gene duplication and exhibit different developmental expression patterns.

    PubMed

    Galli, J; Wieslander, L

    1993-05-20

    The salivary gland cells in the dipteran Chironomus tentans produce approximately 15 different secretory proteins, with relative molecular masses ranging between 1 x 10(4) and 1 x 10(6). Together these proteins form two types of extra corporal tubes, a larval protective housing and feeding tube or a pupation tube. The developmental change in tube formation is accompanied by a switch in production from one combination of secretory proteins to another. Here we characterize two genes, the sp38-40.A and B genes, which encode secretory proteins with relative molecular masses of 38,000 to 40,000. The two genes are located 346 base-pairs apart in the same orientation and have presumably arisen by gene duplication as the result of an illegitimate recombination event. Both genes contain two regions with cysteine codons, surrounded by regions with short repeats coding for proline and charged amino acid residues. The two genes and alleles of the genes differ in their number of repeats. This structure resembles the structure of the Balbiani ring (BR) genes, which encode the four largest salivary gland secretory proteins. The sp38-40.A and B genes are therefore likely to belong to a BR multigene family containing all or most of the 15 salivary gland secretory protein genes. The expression of the sp38-40.A and B genes are different: the A gene is expressed throughout the larval fourth instar but considerably less in the prepupal stage, while the B gene shows the opposite expression pattern. The developmental regulation of the expression of the two genes has therefore diverged after the gene duplication event.

  13. Parathion hydrolase specified by the Flavobacterium opd gene: relationship between the gene and protein.

    PubMed Central

    Mulbry, W W; Karns, J S

    1989-01-01

    The sequence of a 1,693-base-pair plasmid DNA fragment from Flavobacterium sp. strain ATCC 27551 containing the parathion hydrolase gene (opd) was determined. Within this sequence, there is only one open reading frame large enough to encode the 35,000-dalton membrane-associated hydrolase protein purified from Flavobacterium extracts. Amino-terminal sequence analysis of the purified Flavobacterium hydrolase demonstrated that serine is the amino-terminal residue of the hydrolase protein. The amino-terminal serine corresponds to a TCG codon located 87 base pairs downstream of the presumptive ATG initiation codon in the nucleotide sequence. The amino acid composition of the purified protein agrees well with that predicted from the nucleotide sequence, using serine as the amino-terminal residue. These data suggest that the parathion hydrolase protein is processed at its amino terminus in Flavobacterium sp. Construction in Escherichia coli of a lacZ-opd gene fusion in which the first 33 amino-terminal residues of opd were replaced by the first 5 residues of lacZ resulted in the production of an active hydrolase identical in molecular mass to the hydrolase isolated from Flavobacterium sp. E. coli cells containing the lacZ-opd fusion showed higher levels of hydrolase activity than did cells containing the parent plasmid. Images PMID:2556372

  14. Parathion hydrolase specified by the Flavobacterium opd gene: relationship between the gene and protein.

    PubMed

    Mulbry, W W; Karns, J S

    1989-12-01

    The sequence of a 1,693-base-pair plasmid DNA fragment from Flavobacterium sp. strain ATCC 27551 containing the parathion hydrolase gene (opd) was determined. Within this sequence, there is only one open reading frame large enough to encode the 35,000-dalton membrane-associated hydrolase protein purified from Flavobacterium extracts. Amino-terminal sequence analysis of the purified Flavobacterium hydrolase demonstrated that serine is the amino-terminal residue of the hydrolase protein. The amino-terminal serine corresponds to a TCG codon located 87 base pairs downstream of the presumptive ATG initiation codon in the nucleotide sequence. The amino acid composition of the purified protein agrees well with that predicted from the nucleotide sequence, using serine as the amino-terminal residue. These data suggest that the parathion hydrolase protein is processed at its amino terminus in Flavobacterium sp. Construction in Escherichia coli of a lacZ-opd gene fusion in which the first 33 amino-terminal residues of opd were replaced by the first 5 residues of lacZ resulted in the production of an active hydrolase identical in molecular mass to the hydrolase isolated from Flavobacterium sp. E. coli cells containing the lacZ-opd fusion showed higher levels of hydrolase activity than did cells containing the parent plasmid.

  15. Regulatory elements of Caenorhabditis elegans ribosomal protein genes

    PubMed Central

    2012-01-01

    Background Ribosomal protein genes (RPGs) are essential, tightly regulated, and highly expressed during embryonic development and cell growth. Even though their protein sequences are strongly conserved, their mechanism of regulation is not conserved across yeast, Drosophila, and vertebrates. A recent investigation of genomic sequences conserved across both nematode species and associated with different gene groups indicated the existence of several elements in the upstream regions of C. elegans RPGs, providing a new insight regarding the regulation of these genes in C. elegans. Results In this study, we performed an in-depth examination of C. elegans RPG regulation and found nine highly conserved motifs in the upstream regions of C. elegans RPGs using the motif discovery algorithm DME. Four motifs were partially similar to transcription factor binding sites from C. elegans, Drosophila, yeast, and human. One pair of these motifs was found to co-occur in the upstream regions of 250 transcripts including 22 RPGs. The distance between the two motifs displayed a complex frequency pattern that was related to their relative orientation. We tested the impact of three of these motifs on the expression of rpl-2 using a series of reporter gene constructs and showed that all three motifs are necessary to maintain the high natural expression level of this gene. One of the motifs was similar to the binding site of an orthologue of POP-1, and we showed that RNAi knockdown of pop-1 impacts the expression of rpl-2. We further determined the transcription start site of rpl-2 by 5’ RACE and found that the motifs lie 40–90 bases upstream of the start site. We also found evidence that a noncoding RNA, contained within the outron of rpl-2, is co-transcribed with rpl-2 and cleaved during trans-splicing. Conclusions Our results indicate that C. elegans RPGs are regulated by a complex novel series of regulatory elements that is evolutionarily distinct from those of all other species

  16. Transient protein-protein interactions perturb E. coli metabolome and cause gene dosage toxicity

    PubMed Central

    Bhattacharyya, Sanchari; Bershtein, Shimon; Yan, Jin; Argun, Tijda; Gilson, Amy I; Trauger, Sunia A; Shakhnovich, Eugene I

    2016-01-01

    Gene dosage toxicity (GDT) is an important factor that determines optimal levels of protein abundances, yet its molecular underpinnings remain unknown. Here, we demonstrate that overexpression of DHFR in E. coli causes a toxic metabolic imbalance triggered by interactions with several functionally related enzymes. Though deleterious in the overexpression regime, surprisingly, these interactions are beneficial at physiological concentrations, implying their functional significance in vivo. Moreover, we found that overexpression of orthologous DHFR proteins had minimal effect on all levels of cellular organization – molecular, systems, and phenotypic, in sharp contrast to E. coli DHFR. Dramatic difference of GDT between ‘E. coli’s self’ and ‘foreign’ proteins suggests the crucial role of evolutionary selection in shaping protein-protein interaction (PPI) networks at the whole proteome level. This study shows how protein overexpression perturbs a dynamic metabolon of weak yet potentially functional PPI, with consequences for the metabolic state of cells and their fitness. DOI: http://dx.doi.org/10.7554/eLife.20309.001 PMID:27938662

  17. Mining topological structures of protein-protein interaction networks for human brain-specific genes.

    PubMed

    Cui, W J; Gong, X J; Yu, H; Zhang, X C

    2015-10-16

    Compared to other placental mammals, humans have unique thinking and cognitive abilities because of their developed cerebral cortex composed of billions of neurons and synaptic connections. As the primary effectors of the mechanisms of life, proteins and their interactions form the basis of cellular and molecular functions in the living body. In this paper, we developed a pipeline for mining topological structures, identifying functional modules, and analyzing their functions from publically available datasets. A human brain-specific protein-protein interaction network with 1482 nodes and 3105 edges was built using a MapReduce based shortest path algorithm. Within this, 7 functional cliques were identified using a network clustering method, 98 hub proteins were obtained by the calculation of betweenness and connectivity, and 5 closest relationship to clique connector proteins were recognized by the combination scores of topological distance and gene ontology similarity. Furthermore, we discovered functional modules interacting with TP53 protein, which involves several fragmented research study conclusions and might be an important clue for further in vivo or in silico experiments to confirm these associations.

  18. Gene 5 protein-DNA complex: modeling binding interactions.

    PubMed

    Hutchinson, D L; Barnett, B L; Bobst, A M

    1990-08-01

    A helical (not toroidal) complex consisting of eight gene 5 protein dimers per turn is proposed for the extension of DNA from dimer to dimer using known bond length constraints, postulated protein-nucleic acid interactions (determined from NMR and chemical modification studies), other physical properties of the complex, and data from electron micrographs. The binding channel has been dictated by these known parameters and the relative ease of geometrically fitting these constituents. This channel is different from that previously reported by other modelers. The channel lies underneath the long arm "claw-like" extension of the monomer, so that it rests inside the outer surface of the protein complex. An explanation is proposed for the two binding modes, n = 4 (the predominate mode) and n = 3, based on the weak binding interaction of Tyrosine 34. Also, the site of the less mobile nucleic acid base as reported from ESR studies (S.-C. Kao, E.V. Bobst, G.T. Pauly and A.M. Bobst, J. Biom. Struc. Dyn. 3,261 (1985)) is postulated as involving the fourth nucleotide, and this particular base is stacked between Tyrosine 34 and Phenylalanine 73'.

  19. Secretion Trap Tagging of Secreted and Membrane-Spanning Proteins Using Arabidopsis Gene Traps

    Treesearch

    Andrew T. Groover; Joseph R. Fontana; Juana M. Arroyo; Cristina Yordan; W. Richard McCombie; Robert A. Martienssen

    2003-01-01

    Secreted and membrane-spanning proteins play fundamental roles in plant development but pose challenges for genetic identification and characterization. We describe a "secretion trap" screen for gene trap insertions in genes encoding proteins routed through the secretory pathway. The gene trap transposon encodes a ß-glucuronidase reporter enzyme...

  20. Distribution of SR protein exonic splicing enhancer motifs in human protein-coding genes.

    PubMed

    Wang, Jinhua; Smith, Philip J; Krainer, Adrian R; Zhang, Michael Q

    2005-01-01

    Exonic splicing enhancers (ESEs) are pre-mRNA cis-acting elements required for splice-site recognition. We previously developed a web-based program called ESEfinder that scores any sequence for the presence of ESE motifs recognized by the human SR proteins SF2/ASF, SRp40, SRp55 and SC35 (http://rulai.cshl.edu/tools/ESE/). Using ESEfinder, we have undertaken a large-scale analysis of ESE motif distribution in human protein-coding genes. Significantly higher frequencies of ESE motifs were observed in constitutive internal protein-coding exons, compared with both their flanking intronic regions and with pseudo exons. Statistical analysis of ESE motif frequency distributions revealed a complex relationship between splice-site strength and increased or decreased frequencies of particular SR protein motifs. Comparison of constitutively and alternatively spliced exons demonstrated slightly weaker splice-site scores, as well as significantly fewer ESE motifs, in the alternatively spliced group. Our results underline the importance of ESE-mediated SR protein function in the process of exon definition, in the context of both constitutive splicing and regulated alternative splicing.

  1. A Codon Deletion at the Beginning of Green Fluorescent Protein Genes Enhances Protein Expression.

    PubMed

    Rodríguez-Mejía, José-Luis; Roldán-Salgado, Abigail; Osuna, Joel; Merino, Enrique; Gaytán, Paul

    2017-01-01

    Recombinant protein expression is one of the key issues in protein engineering and biotechnology. Among the different models for assessing protein production and structure-function studies, green fluorescent protein (GFP) is one of the preferred models because of its importance as a reporter in cellular and molecular studies. In this research we analyze the effect of codon deletions near the amino terminus of different GFP proteins on fluorescence. Our study includes Gly4 deletions in the enhanced GFP (EGFP), the red-shifted GFP and the red-shifted EGFP. The Gly4 deletion mutants and their corresponding wild-type counterparts were transcribed under the control of the T7 or Trc promoters and their expression patterns were analyzed. Different fluorescent outcomes were observed depending on the type of fluorescent gene versions. In silico analysis of the RNA secondary structures near the ribosome binding site revealed a direct relationship between their minimum free energy and GFP production. Integrative analysis of these results, including SDS-PAGE analysis, led us to conclude that the fluorescence improvement of cells expressing different versions of GFPs with Gly4 deleted is due to an enhancement of the accessibility of the ribosome binding site by reducing the stability of the RNA secondary structures at their mRNA leader regions. © 2016 S. Karger AG, Basel.

  2. Ebolavirus Database: Gene and Protein Information Resource for Ebolaviruses

    PubMed Central

    Sekar, Kanagaraj

    2016-01-01

    Ebola Virus Disease (EVD) is a life-threatening haemorrhagic fever in humans. Even though there are many reports on EVD, the protein precursor functions and virulent factors of ebolaviruses remain poorly understood. Comparative analyses of Ebolavirus genomes will help in the identification of these important features. This prompted us to develop the Ebolavirus Database (EDB) and we have provided links to various tools that will aid researchers to locate important regions in both the genomes and proteomes of Ebolavirus. The genomic analyses of ebolaviruses will provide important clues for locating the essential and core functional genes. The aim of EDB is to act as an integrated resource for ebolaviruses and we strongly believe that the database will be a useful tool for clinicians, microbiologists, health care workers, and bioscience researchers. PMID:27190508

  3. Systematically characterizing and prioritizing chemosensitivity related gene based on Gene Ontology and protein interaction network.

    PubMed

    Chen, Xin; Jiang, Wei; Wang, Qianghu; Huang, Teng; Wang, Peng; Li, Yan; Chen, Xiaowen; Lv, Yingli; Li, Xia

    2012-10-02

    The identification of genes that predict in vitro cellular chemosensitivity of cancer cells is of great importance. Chemosensitivity related genes (CRGs) have been widely utilized to guide clinical and cancer chemotherapy decisions. In addition, CRGs potentially share functional characteristics and network features in protein interaction networks (PPIN). In this study, we proposed a method to identify CRGs based on Gene Ontology (GO) and PPIN. Firstly, we documented 150 pairs of drug-CCRG (curated chemosensitivity related gene) from 492 published papers. Secondly, we characterized CCRGs from the perspective of GO and PPIN. Thirdly, we prioritized CRGs based on CCRGs' GO and network characteristics. Lastly, we evaluated the performance of the proposed method. We found that CCRG enriched GO terms were most often related to chemosensitivity and exhibited higher similarity scores compared to randomly selected genes. Moreover, CCRGs played key roles in maintaining the connectivity and controlling the information flow of PPINs. We then prioritized CRGs using CCRG enriched GO terms and CCRG network characteristics in order to obtain a database of predicted drug-CRGs that included 53 CRGs, 32 of which have been reported to affect susceptibility to drugs. Our proposed method identifies a greater number of drug-CCRGs, and drug-CCRGs are much more significantly enriched in predicted drug-CRGs, compared to a method based on the correlation of gene expression and drug activity. The mean area under ROC curve (AUC) for our method is 65.2%, whereas that for the traditional method is 55.2%. Our method not only identifies CRGs with expression patterns strongly correlated with drug activity, but also identifies CRGs in which expression is weakly correlated with drug activity. This study provides the framework for the identification of signatures that predict in vitro cellular chemosensitivity and offers a valuable database for pharmacogenomics research.

  4. Analysis of the fusion protein gene of the porcine rubulavirus LPMV: comparative analysis of paramyxovirus F proteins.

    PubMed

    Berg, M; Bergvall, A C; Svenda, M; Sundqvist, A; Moreno-López, J; Linné, T

    1997-01-01

    Complementary DNA clones representing the fusion (F) protein gene of the porcine rubulavirus LPMV were isolated and sequenced. The F gene was found to be 1,845 nucleotides long containing one long open reading frame capable of encoding a protein of 541 amino acids. The cleavage motif for F0 into F1 and F2 is His-Arg-Lys-Lys-Arg. A sequence comparison and a phylogenetic analysis was performed in order to identify possible functional domains of paramyxovirus fusion proteins and also to classify the porcine rubulavirus. The F gene of LPMV is most closely related to the human mumps virus and simian virus type 5 F genes, and is therefore classified into the rubulavirus genus. A coding region for a small hydrophobic protein was however not found between the F and hemagglutinin-neuraminidase (HN) genes as previously found in both SV5 and mumps.

  5. Complex character of senS, a novel gene regulating expression of extracellular-protein genes of Bacillus subtilis.

    PubMed Central

    Wang, L F; Doi, R H

    1990-01-01

    The senS gene of Bacillus subtilis, which in high copy number stimulates the expression of several extracellular-protein genes, has been cloned, genetically mapped, and sequenced. The gene codes for a highly charged basic protein containing 65 amino acid residues. The gene is characterized by the presence of a transcription terminator (attenuator) located between the promoter and open reading frame, a strong ribosome-binding site, and a strong transcription terminator at the 3' end of this monocistronic gene. The amino acid sequence of SenS showed partial homology with the N-terminal core binding domain region of bacterial RNA polymerase sigma factors and a helix-turn-helix motif found in DNA-binding proteins. The gene can be deleted without any effect on growth or sporulation. Images FIG. 7 FIG. 9 PMID:2108127

  6. The human BDNF gene: peripheral gene expression and protein levels as biomarkers for psychiatric disorders

    PubMed Central

    Cattaneo, A; Cattane, N; Begni, V; Pariante, C M; Riva, M A

    2016-01-01

    Brain-derived neurotrophic factor (BDNF) regulates the survival and growth of neurons, and influences synaptic efficiency and plasticity. The human BDNF gene consists of 11 exons, and distinct BDNF transcripts are produced through the use of alternative promoters and splicing events. The majority of the BDNF transcripts can be detected not only in the brain but also in the blood cells, although no study has yet investigated the differential expression of BDNF transcripts at the peripheral level. This review provides a description of the human BDNF gene structure as well as a summary of clinical and preclinical evidence supporting the role of BDNF in the pathogenesis of psychiatric disorders. We will discuss several mechanisms as possibly underlying BDNF modulation, including epigenetic mechanisms. We will also discuss the potential use of peripheral BDNF as a biomarker for psychiatric disorders, focusing on the factors that can influence BDNF gene expression and protein levels. Within this context, we have also characterized, for we believe the first time, the expression of BDNF transcripts in the blood, with the aim to provide novel insights into the molecular mechanisms and signaling that may regulate peripheral BDNF gene expression levels. PMID:27874848

  7. Combinatorial codon scrambling enables scalable gene synthesis and amplification of repetitive proteins

    NASA Astrophysics Data System (ADS)

    Tang, Nicholas C.; Chilkoti, Ashutosh

    2016-04-01

    Most genes are synthesized using seamless assembly methods that rely on the polymerase chain reaction (PCR). However, PCR of genes encoding repetitive proteins either fails or generates nonspecific products. Motivated by the need to efficiently generate new protein polymers through high-throughput gene synthesis, here we report a codon-scrambling algorithm that enables the PCR-based gene synthesis of repetitive proteins by exploiting the codon redundancy of amino acids and finding the least-repetitive synonymous gene sequence. We also show that the codon-scrambling problem is analogous to the well-known travelling salesman problem, and obtain an exact solution to it by using De Bruijn graphs and a modern mixed integer linear programme solver. As experimental proof of the utility of this approach, we use it to optimize the synthetic genes for 19 repetitive proteins, and show that the gene fragments are amenable to PCR-based gene assembly and recombinant expression.

  8. (Genetic engineering with a gene encoding a soybean storage protein). Progress report

    SciTech Connect

    Beachy, R.N.

    1985-01-01

    Progress is reported on research directed toward introducing a gene (Gmg 17.1) encoding the ..cap alpha..'-subunit of ..beta..-conglycinin, a soybean seed protein, into petunia plants using gene transfer mechanisms. (ACR)

  9. Orthogonal Cas9 proteins for RNA-guided gene regulation and editing

    DOEpatents

    Church, George M.; Esvelt, Kevin; Mali, Prashant

    2017-03-07

    Methods of modulating expression of a target nucleic acid in a cell are provided including use of multiple orthogonal Cas9 proteins to simultaneously and independently regulate corresponding genes or simultaneously and independently edit corresponding genes.

  10. Topology association analysis in weighted protein interaction network for gene prioritization

    NASA Astrophysics Data System (ADS)

    Wu, Shunyao; Shao, Fengjing; Zhang, Qi; Ji, Jun; Xu, Shaojie; Sun, Rencheng; Sun, Gengxin; Du, Xiangjun; Sui, Yi

    2016-11-01

    Although lots of algorithms for disease gene prediction have been proposed, the weights of edges are rarely taken into account. In this paper, the strengths of topology associations between disease and essential genes are analyzed in weighted protein interaction network. Empirical analysis demonstrates that compared to other genes, disease genes are weakly connected with essential genes in protein interaction network. Based on this finding, a novel global distance measurement for gene prioritization with weighted protein interaction network is proposed in this paper. Positive and negative flow is allocated to disease and essential genes, respectively. Additionally network propagation model is extended for weighted network. Experimental results on 110 diseases verify the effectiveness and potential of the proposed measurement. Moreover, weak links play more important role than strong links for gene prioritization, which is meaningful to deeply understand protein interaction network.

  11. Functions of milk protein gene 5' flanking regions on human growth hormone gene.

    PubMed

    Ninomiya, T; Hirabayashi, M; Sagara, J; Yuki, A

    1994-03-01

    Fragments containing 5' flanking regions of four bovine milk protein genes--alpha lactalbumin (b alpha LA), alpha S1 casein (b alpha S1CN), beta casein (b beta CN), kappa casein (b kappa CN)--and mouse whey acidic protein (mWAP) gene were prepared by PCR and ligated to human growth hormone (hGH) gene. These recombinant DNAs were microinjected into rat embryos to produce transgenic rats, and the functions of the 5' regions to direct secretion of hGH in the milk were tested. Although milk was obtained only in 5 of 19 mWAP/hGH rat lines, more than two-thirds of the rats carrying the other four DNAs produced milk. More than 80% of the lactated rats carrying b alpha LA/, b beta CN/, and mWAP/hGH, and 33% of the lactated b alpha S1CN/hGH rats secreted detectable amounts of hGH (> 0.05 microgram/ml) in the milk. In some rats, the hGH concentrations in the milk were comparable to or more than that of the corresponding milk protein in bovine milk. The ranges of hGH concentrations in the milk of b alpha LA/, b beta CN/, b alpha S1CN/, and mWAP/hGH rats were 1.13-4,360 micrograms/ml, 0.11-10,900 micrograms/ml, 86.8-6,480 micrograms/ml, and 6.87-151 micrograms/ml, respectively. HGH was also detected in the sera of these rats, and some abnormalities of growth and reproduction were observed. All but one virgin mWAP/hGH rat secreted up to 0.0722 microgram/ml of hGH in the serum, and more than half of them showed abnormal fat accumulations at their abdomen.(ABSTRACT TRUNCATED AT 250 WORDS)

  12. Uncoupling protein 2 gene polymorphisms are associated with obesity

    PubMed Central

    2012-01-01

    Background Uncoupling protein 2 (UCP2) gene polymorphisms have been reported as genetic risk factors for obesity and type 2 diabetes mellitus (T2DM). We examined the association of commonly observed UCP2 G(−866)A (rs659366) and Ala55Val (C > T) (rs660339) single nucleotide polymorphisms (SNPs) with obesity, high fasting plasma glucose, and serum lipids in a Balinese population. Methods A total of 603 participants (278 urban and 325 rural subjects) were recruited from Bali Island, Indonesia. Fasting plasma glucose (FPG), triglyceride (TG), high density lipoprotein cholesterol (HDL-C), low density lipoprotein cholesterol (LDL-C) and total cholesterol (TC) were measured. Obesity was determined based on WHO classifications for adult Asians. Participants were genotyped for G(−866)A and Ala55Val polymorphisms of the UCP2 gene. Results Obesity prevalence was higher in urban subjects (51%) as compared to rural subjects (23%). The genotype, minor allele (MAF), and heterozygosity frequencies were similar between urban and rural subjects for both SNPs. All genotype frequencies were in Hardy-Weinberg equilibrium. A combined analysis of genotypes and environment revealed that the urban subjects carrying the A/A genotype of the G(−866)A SNP have higher BMI than the rural subjects with the same genotype. Since the two SNPs showed strong linkage disequilibrium (D’ = 0.946, r2 = 0.657), a haplotype analysis was performed. We found that the AT haplotype was associated with high BMI only when the urban environment was taken into account. Conclusions We have demonstrated the importance of environmental settings in studying the influence of the common UCP2 gene polymorphisms in the development of obesity in a Balinese population. PMID:22533685

  13. Learning to Extract Gene-Protein Names from Weakly-Labeled Text

    DTIC Science & Technology

    2008-01-01

    gene -protein entities. We trained a VP -HMM extractor on the training corpus of YAPEX using Minorthird’s default features. The GENIA dataset...Learning to Extract Gene -Protein Names from Weakly-Labeled Text Richard C. Wang, Anthony Tomasic, Robert E. Frederking...Extract Gene -Protein Names from Weakly-Labeled Text 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S) 5d. PROJECT

  14. Bacteriophage T5 gene A2 protein alters the outer membrane of Escherichia coli.

    PubMed Central

    Snyder, C E

    1984-01-01

    Evidence for changes in Escherichia coli envelope structure caused by the bacteriophage T5 gene A2 protein was obtained by the use of mutant bacteriophages, envelope fractionation procedures, electrophoretic analysis, and in vitro binding studies with purified gene A2 protein. The results suggested that the T5 gene A2 protein perturbs the host envelope as it functions to promote DNA transfer. Images PMID:6389511

  15. [The study of the uncoulping protein gene as the candidate gene for fatness traits in chicken].

    PubMed

    Zhao, Jian-Guo; Li, Hui; Meng, He; Gu, Zhi-Liang; Wang, Qi-Gui; Wang, Yu-Xiang

    2002-06-01

    The UCPs are integral membrane proteins of the mitochondrial respiration from oxidative phosphorylation, diminishing the resulting production of ATP and instead yielding dissipative heat. The action of those proteins creates a futile cycle that decreases the metabolic efficiency of the organism. Thus UCPs provide new clue to obesity's causes. This study was designed to investigate the effect of UCP gene on chicken fatness traits. The fifth generation population of divergent selection broiler line, Hyline Brown layer and three native breeds (shiqiza, Beijing You, baier) were used in this research. Body weight and body composition traits were measured in broiler lines at 7 weeks of age. Primers for the 3'-untranslator region in UCP were designed from database of chicken genomic sequence. Polymorphisms were detected by PCR-SSCP and DNA sequencing. The results showed that there was significant difference (P < 0.01) in the frequency of genotype among breeds except broiler vs Beijing You and Baier vs Hyline Brown layer in mutation sites detected by the two pairs of primers. The distribution of genotype in Beijing You and broiler had no difference. It deduced that Beijing You belongs to the native breed that has dominant meat type traits and has the same genetic background with broiler. Baier and Hyline Brown Layer have no difference in the genotype, it can be viewed as they have same genetic background. A A/C mutation at base position 1197 was found among individuals in broiler line and the least square analysis showed that BB birds had significant lower (P < 0.01) abdominal fat weight and percentage of abdominal fat than AB or AA birds. From the results we can putatively draw the conclusion that UCP gene is the major gene to affect the fatness traits or it links with the major gene.

  16. Evolutionary hallmarks of the human proteome: chasing the age and coregulation of protein-coding genes.

    PubMed

    Lopes, Katia de Paiva; Campos-Laborie, Francisco José; Vialle, Ricardo Assunção; Ortega, José Miguel; De Las Rivas, Javier

    2016-10-25

    The development of large-scale technologies for quantitative transcriptomics has enabled comprehensive analysis of the gene expression profiles in complete genomes. RNA-Seq allows the measurement of gene expression levels in a manner far more precise and global than previous methods. Studies using this technology are altering our view about the extent and complexity of the eukaryotic transcriptomes. In this respect, multiple efforts have been done to determine and analyse the gene expression patterns of human cell types in different conditions, either in normal or pathological states. However, until recently, little has been reported about the evolutionary marks present in human protein-coding genes, particularly from the combined perspective of gene expression and protein evolution. We present a combined analysis of human protein-coding gene expression profiling and time-scale ancestry mapping, that places the genes in taxonomy clades and reveals eight evolutionary major steps ("hallmarks"), that include clusters of functionally coherent proteins. The human expressed genes are analysed using a RNA-Seq dataset of 116 samples from 32 tissues. The evolutionary analysis of the human proteins is performed combining the information from: (i) a database of orthologous proteins (OMA), (ii) the taxonomy mapping of genes to lineage clades (from NCBI Taxonomy) and (iii) the evolution time-scale mapping provided by TimeTree (Timescale of Life). The human protein-coding genes are also placed in a relational context based in the construction of a robust gene coexpression network, that reveals tighter links between age-related protein-coding genes and finds functionally coherent gene modules. Understanding the relational landscape of the human protein-coding genes is essential for interpreting the functional elements and modules of our active genome. Moreover, decoding the evolutionary history of the human genes can provide very valuable information to reveal or uncover their

  17. Nuclear actin-binding proteins as modulators of gene transcription.

    PubMed

    Gettemans, Jan; Van Impe, Katrien; Delanote, Veerle; Hubert, Thomas; Vandekerckhove, Joël; De Corte, Veerle

    2005-10-01

    Dynamic transformations in the organization of the cellular microfilament system are the driving force behind fundamental biological processes such as cellular motility, cytokinesis, wound healing and secretion. Eukaryotic cells express a plethora of actin-binding proteins (ABPs) allowing cells to control the organization of the actin cytoskeleton in a flexible manner. These structural proteins were, not surprisingly, originally described as (major) constituents of the cytoplasm. However, in recent years, there has been a steady flow of reports detailing not only translocation of ABPs into and out of the nucleus but also describing their role in the nuclear compartment. This review focuses on recent developments pertaining to nucleocytoplasmic transport of ABPs, including their mode of translocation and nuclear function. In particular, evidence that structurally and functionally unrelated cytoplasmic ABPs regulate transcription activation by various nuclear (steroid hormone) receptors is steadily accruing. Furthermore, the recent finding that actin is a necessary component of the RNA polymerase II-containing preinitiation complex opens up new opportunities for nuclear ABPs in gene transcription regulation.

  18. Protein and gene model inference based on statistical modeling in k-partite graphs.

    PubMed

    Gerster, Sarah; Qeli, Ermir; Ahrens, Christian H; Bühlmann, Peter

    2010-07-06

    One of the major goals of proteomics is the comprehensive and accurate description of a proteome. Shotgun proteomics, the method of choice for the analysis of complex protein mixtures, requires that experimentally observed peptides are mapped back to the proteins they were derived from. This process is also known as protein inference. We present Markovian Inference of Proteins and Gene Models (MIPGEM), a statistical model based on clearly stated assumptions to address the problem of protein and gene model inference for shotgun proteomics data. In particular, we are dealing with dependencies among peptides and proteins using a Markovian assumption on k-partite graphs. We are also addressing the problems of shared peptides and ambiguous proteins by scoring the encoding gene models. Empirical results on two control datasets with synthetic mixtures of proteins and on complex protein samples of Saccharomyces cerevisiae, Drosophila melanogaster, and Arabidopsis thaliana suggest that the results with MIPGEM are competitive with existing tools for protein inference.

  19. Protein Modulator of Multidrug Efflux Gene Expression in Pseudomonas aeruginosa▿

    PubMed Central

    Daigle, Denis M.; Cao, Lily; Fraud, Sebastien; Wilke, Mark S.; Pacey, Angela; Klinoski, Rachael; Strynadka, Natalie C.; Dean, Charles R.; Poole, Keith

    2007-01-01

    nalC multidrug-resistant mutants of Pseudomonas aeruginosa show enhanced expression of the mexAB-oprM multidrug efflux system as a direct result of the production of a ca. 6,100-Da protein, PA3719, in these mutants. Using a bacterial two-hybrid system, PA3719 was shown to interact in vivo with MexR, a repressor of mexAB-oprM expression. Isothermal titration calorimetry (ITC) studies confirmed a high-affinity interaction (equilibrium dissociation constant [KD], 158.0 ± 18.1 nM) of PA3719 with MexR in vitro. PA3719 binding to and formation of a complex with MexR obviated repressor binding to its operator, which overlaps the efflux operon promoter, suggesting that mexAB-oprM hyperexpression in nalC mutants results from PA3719 modulation of MexR repressor activity. Consistent with this, MexR repression of mexA transcription in an in vitro transcription assay was alleviated by PA3719. Mutations in MexR compromising its interaction with PA3719 in vivo were isolated and shown to be located internally and distributed throughout the protein, suggesting that they impacted PA3719 binding by altering MexR structure or conformation rather than by having residues interacting specifically with PA3719. Four of six mutant MexR proteins studied retained repressor activity even in a nalC strain producing PA3719. Again, this is consistent with a PA3719 interaction with MexR being necessary to obviate MexR repressor activity. The gene encoding PA3719 has thus been renamed armR (antirepressor for MexR). A representative “noninteracting” mutant MexR protein, MexRI104F, was purified, and ITC confirmed that it bound PA3719 with reduced affinity (5.4-fold reduced; KD, 853.2 ± 151.1 nM). Consistent with this, MexRI104F repressor activity, as assessed using the in vitro transcription assay, was only weakly compromised by PA3719. Finally, two mutations (L36P and W45A) in ArmR compromising its interaction with MexR have been isolated and mapped to a putative C-terminal α-helix of the

  20. Cytochrome b5 gene and protein of Candida tropicalis and methods relating thereto

    DOEpatents

    Craft, David L.; Madduri, Krishna M.; Loper, John C.

    2003-01-01

    A novel gene has been isolated which encodes cytochrome b5 (CYTb5) protein of the .omega.-hydroxylase complex of C. tropicalis 20336. Vectors including this gene, and transformed host cells are provided. Methods of increasing the production of a CYTb5 protein are also provided which involve transforming a host cell with a gene encoding this protein and culturing the cells. Methods of increasing the production of a dicarboxylic acid are also provided which involve increasing in the host cell the number of genes encoding this protein.

  1. Chicken Ovalbumin Upstream Promoter Transcription Factor II Regulates Renin Gene Expression*

    PubMed Central

    Mayer, Sandra; Roeser, Marc; Lachmann, Peter; Ishii, Sumiyashi; Suh, Jae Mi; Harlander, Sabine; Desch, Michael; Brunssen, Coy; Morawietz, Henning; Tsai, Sophia Y.; Tsai, Ming-Jer; Hohenstein, Bernd; Hugo, Christian; Todorov, Vladimir T.

    2012-01-01

    This study aimed to investigate the possible involvement of the orphan nuclear receptor chicken ovalbumin upstream promoter transcription factor II (COUP-TFII) in the regulation of renin gene expression. COUP-TFII colocalized with renin in the juxtaglomerular cells of the kidney, which are the main source of renin in vivo. Protein-DNA binding studies demonstrated that COUP-TFII binds to an imperfect direct repeat COUP-TFII recognition sequence (termed hereafter proxDR) in the proximal renin promoter. Because cAMP signaling plays a central role in the control of the renin gene expression, we suggested that COUP-TFII may modulate this cAMP effect. Accordingly, knockdown of COUP-TFII in the clonal renin-producing cell lines As4.1 and Calu-6 diminished the stimulation of the renin mRNA expression by cAMP agonists. In addition, the mutation of the proxDR element in renin promoter reporter gene constructs abrogated the inducibility by cAMP. The proxDR sequence was found to be necessary for the function of a proximal renin promoter cAMP-response element (CRE). Knockdown of COUP-TFII or cAMP-binding protein (CREB), which is the archetypal transcription factor binding to CRE, decreased the basal renin gene expression. However, the deficiency of COUP-TFII did not further diminish the renin expression when CREB was knocked down. In agreement with the cell culture studies, mutant mice deficient in COUP-TFII have lower renin expression than their control strain. Altogether our data show that COUP-TFII is involved in the control of renin gene expression. PMID:22645148

  2. Chicken ovalbumin upstream promoter transcription factor II regulates renin gene expression.

    PubMed

    Mayer, Sandra; Roeser, Marc; Lachmann, Peter; Ishii, Sumiyashi; Suh, Jae Mi; Harlander, Sabine; Desch, Michael; Brunssen, Coy; Morawietz, Henning; Tsai, Sophia Y; Tsai, Ming-Jer; Hohenstein, Bernd; Hugo, Christian; Todorov, Vladimir T

    2012-07-13

    This study aimed to investigate the possible involvement of the orphan nuclear receptor chicken ovalbumin upstream promoter transcription factor II (COUP-TFII) in the regulation of renin gene expression. COUP-TFII colocalized with renin in the juxtaglomerular cells of the kidney, which are the main source of renin in vivo. Protein-DNA binding studies demonstrated that COUP-TFII binds to an imperfect direct repeat COUP-TFII recognition sequence (termed hereafter proxDR) in the proximal renin promoter. Because cAMP signaling plays a central role in the control of the renin gene expression, we suggested that COUP-TFII may modulate this cAMP effect. Accordingly, knockdown of COUP-TFII in the clonal renin-producing cell lines As4.1 and Calu-6 diminished the stimulation of the renin mRNA expression by cAMP agonists. In addition, the mutation of the proxDR element in renin promoter reporter gene constructs abrogated the inducibility by cAMP. The proxDR sequence was found to be necessary for the function of a proximal renin promoter cAMP-response element (CRE). Knockdown of COUP-TFII or cAMP-binding protein (CREB), which is the archetypal transcription factor binding to CRE, decreased the basal renin gene expression. However, the deficiency of COUP-TFII did not further diminish the renin expression when CREB was knocked down. In agreement with the cell culture studies, mutant mice deficient in COUP-TFII have lower renin expression than their control strain. Altogether our data show that COUP-TFII is involved in the control of renin gene expression.

  3. Functional Gene Group Analysis Indicates No Role for Heterotrimeric G Proteins in Cognitive Ability

    PubMed Central

    Davies, Gail; Liewald, David Cherry McLachlan; Payton, Anthony; Craig, Leone C. A.; Whalley, Lawrence J.; Horan, Mike; Ollier, William; Starr, John M.; Pendleton, Neil; Posthuma, Danielle; Bates, Timothy C.; Deary, Ian J.

    2014-01-01

    Previous functional gene group analyses implicated common single nucleotide polymorphisms (SNPs) in heterotrimeric G protein coding genes as being associated with differences in human intelligence. Here, we sought to replicate this finding using five independent cohorts of older adults including current IQ and childhood IQ, and using both gene- and SNP-based analytic strategies. No significant associations were found between variation in heterotrimeric G protein genes and intelligence in any cohort at either of the two time points. These results indicate that, whereas G protein systems are important in cognition, common genetic variation in these genes is unlikely to be a substantial influence on human intelligence differences. PMID:24626473

  4. Dosage Sensitivity of RPL9 and Concerted Evolution of Ribosomal Protein Genes in Plants

    PubMed Central

    Devis, Deborah; Firth, Sue M.; Liang, Zhe; Byrne, Mary E.

    2015-01-01

    The ribosome in higher eukaryotes is a large macromolecular complex composed of four rRNAs and eighty different ribosomal proteins. In plants, each ribosomal protein is encoded by multiple genes. Duplicate genes within a family are often necessary to provide a threshold dose of a ribosomal protein but in some instances appear to have non-redundant functions. Here, we addressed whether divergent members of the RPL9 gene family are dosage sensitive or whether these genes have non-overlapping functions. The RPL9 family in Arabidopsis thaliana comprises two nearly identical members, RPL9B and RPL9C, and a more divergent member, RPL9D. Mutations in RPL9C and RPL9D genes lead to delayed growth early in development, and loss of both genes is embryo lethal, indicating that these are dosage-sensitive and redundant genes. Phylogenetic analysis of RPL9 as well as RPL4, RPL5, RPL27a, RPL36a, and RPS6 family genes in the Brassicaceae indicated that multicopy ribosomal protein genes have been largely retained following whole genome duplication. However, these gene families also show instances of tandem duplication, small scale deletion, and evidence of gene conversion. Furthermore, phylogenetic analysis of RPL9 genes in angiosperm species showed that genes within a species are more closely related to each other than to RPL9 genes in other species, suggesting ribosomal protein genes undergo convergent evolution. Our analysis indicates that ribosomal protein gene retention following whole genome duplication contributes to the number of genes in a family. However, small scale rearrangements influence copy number and likely drive concerted evolution of these dosage-sensitive genes. PMID:26734020

  5. A novel class of gene controlling virulence in plant pathogenic ascomycete fungi

    PubMed Central

    Lu, Shun-Wen; Kroken, Scott; Lee, Bee-Na; Robbertse, Barbara; Churchill, Alice C. L.; Yoder, O. C.; Turgeon, B. Gillian

    2003-01-01

    Insertional mutants of the fungal maize pathogen Cochliobolus heterostrophus were screened for altered virulence. One mutant had 60% reduction in lesion size relative to WT but no other detectable change in phenotype. Analysis of sequence at the insertion site revealed a gene (CPS1) encoding a protein with two AMP-binding domains. CPS1 orthologs were detected in all Cochliobolus spp. examined, in several other classes of ascomycete fungi, and in animals but not in basidiomycete fungi, bacteria, or plants. Phylogenetic analysis suggested that CPS1 represents a previously undescribed subset of adenylate-forming enzymes that have diverged from certain acyl-CoA ligases, which in bacteria are involved in biosynthesis of nonribosomal peptides or polyketide/peptide hybrids. Disruption of CPS1 caused reduced virulence of both race T and race O of C. heterostrophus on maize, of Cochliobolus victoriae on oats, and of Gibberella zeae on wheat. These results suggest that CPS1 functions as a general fungal virulence factor in plant pathogenic ascomycetes. PMID:12730371

  6. Cytoskeletal protein filamin A is a nucleolar protein that suppresses ribosomal RNA gene transcription

    PubMed Central

    Deng, Wensheng; Lopez-Camacho, Cesar; Tang, Jen-Yang; Mendoza-Villanueva, Daniel; Maya-Mendoza, Apolinar; Jackson, Dean A.; Shore, Paul

    2012-01-01

    Filamin A (FLNA) is an actin-binding protein with a well-established role in the cytoskeleton, where it determines cell shape and locomotion by cross-linking actin filaments. Mutations in FLNA are associated with a wide range of genetic disorders. Here we demonstrate a unique role for FLNA as a nucleolar protein that associates with the RNA polymerase I (Pol I) transcription machinery to suppress rRNA gene transcription. We show that depletion of FLNA by siRNAs increased rRNA expression, rDNA promoter activity and cell proliferation. Immunodepletion of FLNA from nuclear extracts resulted in a decrease in rDNA promoter-driven transcription in vitro. FLNA coimmunoprecipitated with the Pol I components actin, TIF-IA, and RPA40, and their occupancy of the rDNA promoter was increased in the absence of FLNA in vivo. The FLNA actin-binding domain is essential for the suppression of rRNA expression and for inhibiting recruitment of the Pol I machinery to the rDNA promoter. These findings reveal an additional role for FLNA as a regulator of rRNA gene expression and have important implications for our understanding of the role of FLNA in human disease. PMID:22307607

  7. Preservation of Gene Duplication Increases the Regulatory Spectrum of Ribosomal Protein Genes and Enhances Growth under Stress.

    PubMed

    Parenteau, Julie; Lavoie, Mathieu; Catala, Mathieu; Malik-Ghulam, Mustafa; Gagnon, Jules; Abou Elela, Sherif

    2015-12-22

    In baker's yeast, the majority of ribosomal protein genes (RPGs) are duplicated, and it was recently proposed that such duplications are preserved via the functional specialization of the duplicated genes. However, the origin and nature of duplicated RPGs' (dRPGs) functional specificity remain unclear. In this study, we show that differences in dRPG functions are generated by variations in the modality of gene expression and, to a lesser extent, by protein sequence. Analysis of the sequence and expression patterns of non-intron-containing RPGs indicates that each dRPG is controlled by specific regulatory sequences modulating its expression levels in response to changing growth conditions. Homogenization of dRPG sequences reduces cell tolerance to growth under stress without changing the number of expressed genes. Together, the data reveal a model where duplicated genes provide a means for modulating the expression of ribosomal proteins in response to stress.

  8. Bcl-2-related protein family gene expression during oligodendroglial differentiation.

    PubMed

    Itoh, Takayuki; Itoh, Aki; Pleasure, David

    2003-06-01

    Oligodendroglial lineage cells (OLC) vary in susceptibility to both necrosis and apoptosis depending on their developmental stages, which might be regulated by differential expression of Bcl-2-related genes. As an initial step to test this hypothesis, we examined the expression of 19 Bcl-2-related genes in purified cultures of rat oligodendroglial progenitors, immature and mature oligodendrocytes. All 'multidomain' anti-apoptotic members (Bcl-x, Bcl-2, Mcl-1, Bcl-w and Bcl2l10/Diva/Boo) except Bcl2a1/A1 are expressed in OLC. Semiquantitative and real-time RT-PCR revealed that Bcl-xL and Mcl-1 mRNAs are the dominant anti-apoptotic members and increase four- and twofold, respectively, with maturation. Bcl-2 mRNA is less abundant than Bcl-xL mRNA in progenitors and falls an additional 10-fold during differentiation. Bcl-w mRNA also increases, with significant changes in its splicing pattern, as OLC mature. Transfection studies demonstrated that Bcl-xL overexpression protects against kainate-induced excitotoxicity, whereas Bcl-2 overexpression does not. As for 'multidomain' pro-apoptotic members (Bax, Bad and Bok/Mtd), Bax and Bak are highly expressed throughout differentiation. Among 'BH3 domain-only' members examined (Bim, Biklk, DP5/Hrk, Bad, Bid, Noxa, Puma/Bbc3, Bmf, BNip3 and BNip3L), BNip3 and Bmf mRNAs increase markedly during differentiation. These results provide basic information to guide further studies on the roles for Bcl-2-related family proteins in OLC death.

  9. Roles of the Polymerase-Associated Protein Genes in Newcastle Disease Virus Virulence

    PubMed Central

    Yu, Xiao-hui; Cheng, Jin-long; Xue, Jia; Jin, Ji-hui; Song, Yang; Zhao, Jing; Zhang, Guo-zhong

    2017-01-01

    The virulence of Newcastle disease virus varies greatly and is determined by multiple genetic factors. In this study, we systematically evaluated the roles of the polymerase-associated (NP, P and L) protein genes in genotype VII NDV virulence after confirming the envelope-associated (F and HN) proteins contributed greatly to NDV virulence. The results revealed that the polymerase-associated protein genes individually had certain effect on virulence, while transfer of these three genes in combination significantly affected the chimeric virus virulence, especially when the L gene was involved. These results indicated that the L protein was a major contributor to NDV virulence when combined with the homologous NP and P proteins. We also investigated viral RNA synthesis using NDV minigenome systems to assess the interaction between the NP, P, and L proteins, which showed that the activity of the polymerase-associated proteins were directly related to viral RNA transcription and replication. PMID:28220114

  10. Centrin protein and genes in Trichomonas vaginalis and close relatives.

    PubMed

    Brugerolle, G; Bricheux, G; Coffe, G

    2000-01-01

    Anti-centrin monoclonal antibodies 20H5 and 11B2 produced against Clamydomononas centrin decorated the group of basal bodies as well as very closely attached structures in all trichomonads studied and in the devescovinids Foaina and Devescovina. Moreover, these antibodies decorated the undulating membrane in Trichomonas vaginalis, Trichomitus batrachorum, and Tritrichomonas foetus, and the cresta in Foaina. Centrin was not demonstrated in the dividing spindle and paradesmosis. Immunogold labeling, both in pre- and post-embedding, confirmed that centrin is associated with the basal body cylinder and is a component of the nine anchoring arms between the terminal plate of flagellar bases and the plasma-membrane. Centrin is also associated with the hook-shaped fibers attached to basal bodies (F1, F3), the X-fiber, and along sigmoid fibers (F2) at the pelta-axostyle junction, which is the microtubule organizing center for pelta-axostyle microtubules. There was no labeling on the striated costa and parabasal fibers nor on microtubular pelta-axostyle, but the fibrous structure inside the undulating membrane was labeled in T. vaginalis. Two proteins of 22-20 kDa corresponding to the centrin molecular mass were recognized by immunoblotting using these antibodies in the three trichomonad species examined. By screening a T. vaginalis cDNA library with 20H5 antibody, two genes encoding identical protein sequences were found. The sequence comprises the 4 typical EF-hand Ca++-binding domains present in every known centrin. Trichomonad centrin is closer to the green algal cluster (70% identity) than to the yeast Cdc31 cluster (55% identity) or the Alveolata cluster (46% identity).

  11. A complete approach for recombinant protein expression training: From gene cloning to assessment of protein functionality*.

    PubMed

    Novo, M Teresa Marques; Soares-Costa, Andrea; de Souza, Antonia Q L; Figueira, Ana Carolina M; Molina, Gustavo C; Palacios, Carlos A; Kull, Claudia R; Monteiro, Izabel F; Baldan-Pineda, Paulo H; Henrique-Silva, Flavio

    2005-01-01

    A practical course was given to undergraduate biology students enrolled in the elective course "Introduction to Genetic Engineering" at the Federal University of São Carlos (UFSCar), São Paulo, Brazil. The goal of the course was to teach current molecular biology tools applied to a real research situation that could be reported by the students themselves. The purpose was to produce a plant recombinant protein and demonstrate a heretofore unreported biological activity. Cystatins, natural inhibitors of cysteine proteases, were proposed for these studies. Initially, the students searched for plant cystatin cDNA sequences in the NCBI databases and selected the Oryzacystatin I gene (ocI) from rice, Oriza sativa, as the target gene for this study. Total RNA was extracted from rice-germinating seeds and primers containing restriction sites for NdeI and EcoRI were designed based on the ocI cDNA sequence and then used to amplify the open reading frame (ORF). RT-PCR amplification provided a band of the expected size for ocI ORF (309 bp). The PCR product was cut with NdeI and EcoRI restriction enzymes and cloned directly in the pET28a expression vector digested with the same enzymes. A pET28-ocI recombinant clone was selected, checked by sequencing, and used to transform Escherichia coli BL21 (DE3) expression strain. After induction of the bacteria with isopropylthiogalactoside and cellular disruption, the His-tagged OCI protein, present mainly in the soluble fraction, was purified by affinity chromatography in a nickel column. The purified protein was successfully used to inhibit fungal growth (Trichoderma reesei). The results were discussed extensively and the students contributed to the writing of this article, of which they are co-authors.

  12. Computer-aided design of modular protein devices: Boolean AND gene activation

    NASA Astrophysics Data System (ADS)

    Salis, H.; Kaznessis, Y. N.

    2006-12-01

    Many potentially useful synthetic gene networks require the expression of an engineered gene if and only if two different DNA-binding proteins exist in sufficient concentration. While some natural and engineered systems activate gene expression according to a logical AND-like behavior, they often utilize allosteric or cooperative protein-protein interactions, rendering their components unsuitable for a toolbox of modular parts for use in multiple applications. Here, we develop a quantitative model to demonstrate that a small system of interacting fusion proteins, called a protein device, can activate an engineered gene according to the Boolean AND behavior while using only modular protein domains and DNA sites. The fusion proteins are created from transactivating, DNA-binding, non-DNA binding, and protein-protein interaction domains along with the corresponding peptide ligands. Using a combined kinetic and thermodynamic model, we identify the characteristics of the molecular components and their rates of constitutive production that maximize the fidelity of AND behavior. These AND protein devices facilitate the creation of complex genetic programs and may be used to create gene therapies, biosensors and other biomedical and biotechnological applications that turn on gene expression only when multiple DNA-binding proteins are simultaneously present.

  13. Avirulence gene mapping in the Hessian fly (Mayetiola destructor) reveals a protein phosphatase 2C effector gene family.

    PubMed

    Zhao, Chaoyang; Shukle, Richard; Navarro-Escalante, Lucio; Chen, Mingshun; Richards, Stephen; Stuart, Jeffrey J

    2016-01-01

    The genetic tractability of the Hessian fly (HF, Mayetiola destructor) provides an opportunity to investigate the mechanisms insects use to induce plant gall formation. Here we demonstrate that capacity using the newly sequenced HF genome by identifying the gene (vH24) that elicits effector-triggered immunity in wheat (Triticum spp.) seedlings carrying HF resistance gene H24. vH24 was mapped within a 230-kb genomic fragment near the telomere of HF chromosome X1. That fragment contains only 21 putative genes. The best candidate vH24 gene in this region encodes a protein containing a secretion signal and a type-2 serine/threonine protein phosphatase (PP2C) domain. This gene has an H24-virulence associated insertion in its promoter that appears to silence transcription of the gene in H24-virulent larvae. Candidate vH24 is a member of a small family of genes that encode secretion signals and PP2C domains. It belongs to the fraction of genes in the HF genome previously predicted to encode effector proteins. Because PP2C proteins are not normally secreted, our results suggest that these are PP2C effectors that HF larvae inject into wheat cells to redirect, or interfere, with wheat signal transduction pathways.

  14. An Introductory Bioinformatics Exercise to Reinforce Gene Structure and Expression and Analyze the Relationship between Gene and Protein Sequences

    ERIC Educational Resources Information Center

    Almeida, Craig A.; Tardiff, Daniel F.; De Luca, Jane P.

    2004-01-01

    We have developed an introductory bioinformatics exercise for sophomore biology and biochemistry students that reinforces the understanding of the structure of a gene and the principles and events involved in its expression. In addition, the activity illustrates the severe effect mutations in a gene sequence can have on the protein product.…

  15. An Introductory Bioinformatics Exercise to Reinforce Gene Structure and Expression and Analyze the Relationship between Gene and Protein Sequences

    ERIC Educational Resources Information Center

    Almeida, Craig A.; Tardiff, Daniel F.; De Luca, Jane P.

    2004-01-01

    We have developed an introductory bioinformatics exercise for sophomore biology and biochemistry students that reinforces the understanding of the structure of a gene and the principles and events involved in its expression. In addition, the activity illustrates the severe effect mutations in a gene sequence can have on the protein product.…

  16. General theory for integrated analysis of growth, gene, and protein expression in biofilms.

    PubMed

    Zhang, Tianyu; Pabst, Breana; Klapper, Isaac; Stewart, Philip S

    2013-01-01

    A theory for analysis and prediction of spatial and temporal patterns of gene and protein expression within microbial biofilms is derived. The theory integrates phenomena of solute reaction and diffusion, microbial growth, mRNA or protein synthesis, biomass advection, and gene transcript or protein turnover. Case studies illustrate the capacity of the theory to simulate heterogeneous spatial patterns and predict microbial activities in biofilms that are qualitatively different from those of planktonic cells. Specific scenarios analyzed include an inducible GFP or fluorescent protein reporter, a denitrification gene repressed by oxygen, an acid stress response gene, and a quorum sensing circuit. It is shown that the patterns of activity revealed by inducible stable fluorescent proteins or reporter unstable proteins overestimate the region of activity. This is due to advective spreading and finite protein turnover rates. In the cases of a gene induced by either limitation for a metabolic substrate or accumulation of a metabolic product, maximal expression is predicted in an internal stratum of the biofilm. A quorum sensing system that includes an oxygen-responsive negative regulator exhibits behavior that is distinct from any stage of a batch planktonic culture. Though here the analyses have been limited to simultaneous interactions of up to two substrates and two genes, the framework applies to arbitrarily large networks of genes and metabolites. Extension of reaction-diffusion modeling in biofilms to the analysis of individual genes and gene networks is an important advance that dovetails with the growing toolkit of molecular and genetic experimental techniques.

  17. General Theory for Integrated Analysis of Growth, Gene, and Protein Expression in Biofilms

    PubMed Central

    Zhang, Tianyu; Pabst, Breana; Klapper, Isaac; Stewart, Philip S.

    2013-01-01

    A theory for analysis and prediction of spatial and temporal patterns of gene and protein expression within microbial biofilms is derived. The theory integrates phenomena of solute reaction and diffusion, microbial growth, mRNA or protein synthesis, biomass advection, and gene transcript or protein turnover. Case studies illustrate the capacity of the theory to simulate heterogeneous spatial patterns and predict microbial activities in biofilms that are qualitatively different from those of planktonic cells. Specific scenarios analyzed include an inducible GFP or fluorescent protein reporter, a denitrification gene repressed by oxygen, an acid stress response gene, and a quorum sensing circuit. It is shown that the patterns of activity revealed by inducible stable fluorescent proteins or reporter unstable proteins overestimate the region of activity. This is due to advective spreading and finite protein turnover rates. In the cases of a gene induced by either limitation for a metabolic substrate or accumulation of a metabolic product, maximal expression is predicted in an internal stratum of the biofilm. A quorum sensing system that includes an oxygen-responsive negative regulator exhibits behavior that is distinct from any stage of a batch planktonic culture. Though here the analyses have been limited to simultaneous interactions of up to two substrates and two genes, the framework applies to arbitrarily large networks of genes and metabolites. Extension of reaction-diffusion modeling in biofilms to the analysis of individual genes and gene networks is an important advance that dovetails with the growing toolkit of molecular and genetic experimental techniques. PMID:24376726

  18. Autogenous Regulation of Splicing of the Transcript of a Yeast Ribosomal Protein Gene

    NASA Astrophysics Data System (ADS)

    Dabeva, Mariana D.; Post-Beittenmiller, Martha A.; Warner, Jonathan R.

    1986-08-01

    The gene for a yeast ribosomal protein, RPL32, contains a single intron. The product of this gene appears to participate in feedback control of the splicing of the intron from the transcript. This autogenous regulation of splicing provides a striking analogy to the autogenous regulation of translation of ribosomal proteins in Escherichia coli.

  19. A homolog of an Escherichia coli phosphate-binding protein gene from Xanthomonas oryzae pv. oryzae

    NASA Technical Reports Server (NTRS)

    Hopkins, C. M.; White, F. F.; Heaton, L. A.; Guikema, J. A.; Leach, J. E.; Spooner, B. S. (Principal Investigator)

    1995-01-01

    A Xanthomonas oryzae pv. oryzae gene with sequence similarity to an Escherichia coli phosphate-binding protein gene (phoS) produces a periplasmic protein of apparent M(r) 35,000 when expressed in E. coli. Amino terminal sequencing revealed that a signal peptide is removed during transport to the periplasm in E. coli.

  20. A homolog of an Escherichia coli phosphate-binding protein gene from Xanthomonas oryzae pv. oryzae

    NASA Technical Reports Server (NTRS)

    Hopkins, C. M.; White, F. F.; Heaton, L. A.; Guikema, J. A.; Leach, J. E.; Spooner, B. S. (Principal Investigator)

    1995-01-01

    A Xanthomonas oryzae pv. oryzae gene with sequence similarity to an Escherichia coli phosphate-binding protein gene (phoS) produces a periplasmic protein of apparent M(r) 35,000 when expressed in E. coli. Amino terminal sequencing revealed that a signal peptide is removed during transport to the periplasm in E. coli.

  1. Functional dissection of Odorant binding protein genes in Drosophila melanogaster

    PubMed Central

    Swarup, S; Williams, T I; Anholt, R R H

    2011-01-01

    Most organisms rely on olfaction for survival and reproduction. The olfactory system of Drosophila melanogaster is one of the best characterized chemosensory systems and serves as a prototype for understanding insect olfaction. Olfaction in Drosophila is mediated by multigene families of odorant receptors and odorant binding proteins (OBPs). Although molecular response profiles of odorant receptors have been well documented, the contributions of OBPs to olfactory behavior remain largely unknown. Here, we used RNAi-mediated suppression of Obp gene expression and measurements of behavioral responses to 16 ecologically relevant odorants to systematically dissect the functions of 17 OBPs. We quantified the effectiveness of RNAi-mediated suppression by quantitative real-time polymerase chain reaction and used a proteomic liquid chromatography and tandem mass spectrometry procedure to show target-specific suppression of OBPs expressed in the antennae. Flies in which expression of a specific OBP is suppressed often show altered behavioral responses to more than one, but not all, odorants, in a sex-dependent manner. Similarly, responses to a specific odorant are frequently affected by suppression of expression of multiple, but not all, OBPs. These results show that OBPs are essential for mediating olfactory behavioral responses and suggest that OBP-dependent odorant recognition is combinatorial. PMID:21605338

  2. Polymorphism of prion protein gene in Arctic fox (Vulpes lagopus).

    PubMed

    Wan, Jiayu; Bai, Xue; Liu, Wensen; Xu, Jing; Xu, Ming; Gao, Hongwei

    2009-07-01

    Prion diseases are fatal neurodegenerative disorders of humans and certain other mammals. Prion protein gene (Prnp) is associated with susceptibility and species barrier to prion diseases. No natural and experimental prion diseases have been documented to date in Arctic fox. In the present study, coding region of Prnp from 135 Arctic foxes were cloned and screened for polymorphisms. Our results indicated that the Arctic fox Prnp open reading frame (ORF) contains 771 nucleotides encoding 257 amino acids. Four single nucleotide polymorphisms (SNPs) (G312C, A337G, C541T, and A723G) were identified. SNPs G312C and A723G produced silent mutations, but SNPs A337G and C541T resulted in a M-V change at codon 113 and R-C at codon 181, respectively. The Arctic fox Prnp amino acid sequence was similar to that of the dog (XM 542906). In short, this study provides preliminary information about genotypes of Prnp in Arctic fox.

  3. Tenebrio molitor antifreeze protein gene identification and regulation.

    PubMed

    Qin, Wensheng; Walker, Virginia K

    2006-02-15

    The yellow mealworm, Tenebrio molitor, is a freeze susceptible, stored product pest. Its winter survival is facilitated by the accumulation of antifreeze proteins (AFPs), encoded by a small gene family. We have now isolated 11 different AFP genomic clones from 3 genomic libraries. All the clones had a single coding sequence, with no evidence of intervening sequences. Three genomic clones were further characterized. All have putative TATA box sequences upstream of the coding regions and multiple potential poly(A) signal sequences downstream of the coding regions. A TmAFP regulatory region, B1037, conferred transcriptional activity when ligated to a luciferase reporter sequence and after transfection into an insect cell line. A 143 bp core promoter including a TATA box sequence was identified. Its promoter activity was increased 4.4 times by inserting an exotic 245 bp intron into the construct, similar to the enhancement of transgenic expression seen in several other systems. The addition of a duplication of the first 120 bp sequence from the 143 bp core promoter decreased promoter activity by half. Although putative hormonal response sequences were identified, none of the five hormones tested enhanced reporter activity. These studies on the mechanisms of AFP transcriptional control are important for the consideration of any transfer of freeze-resistance phenotypes to beneficial hosts.

  4. Automatic extraction of gene/protein biological functions from biomedical text.

    PubMed

    Koike, Asako; Niwa, Yoshiki; Takagi, Toshihisa

    2005-04-01

    With the rapid advancement of biomedical science and the development of high-throughput analysis methods, the extraction of various types of information from biomedical text has become critical. Since automatic functional annotations of genes are quite useful for interpreting large amounts of high-throughput data efficiently, the demand for automatic extraction of information related to gene functions from text has been increasing. We have developed a method for automatically extracting the biological process functions of genes/protein/families based on Gene Ontology (GO) from text using a shallow parser and sentence structure analysis techniques. When the gene/protein/family names and their functions are described in ACTOR (doer of action) and OBJECT (receiver of action) relationships, the corresponding GO-IDs are assigned to the genes/proteins/families. The gene/protein/family names are recognized using the gene/protein/family name dictionaries developed by our group. To achieve wide recognition of the gene/protein/family functions, we semi-automatically gather functional terms based on GO using co-occurrence, collocation similarities and rule-based techniques. A preliminary experiment demonstrated that our method has an estimated recall of 54-64% with a precision of 91-94% for actually described functions in abstracts. When applied to the PUBMED, it extracted over 190 000 gene-GO relationships and 150 000 family-GO relationships for major eukaryotes.

  5. Expression of circadian clock genes and proteins in urothelial cancer is related to cancer-associated genes.

    PubMed

    Litlekalsoy, Jorunn; Rostad, Kari; Kalland, Karl-Henning; Hostmark, Jens G; Laerum, Ole Didrik

    2016-07-27

    The purpose of this study was to evaluate invasive and metastatic potential of urothelial cancer by investigating differential expression of various clock genes/proteins participating in the 24 h circadian rhythms and to compare these gene expressions with transcription of other cancer-associated genes. Twenty seven paired samples of tumour and benign tissue collected from patients who underwent cystectomy were analysed and compared to 15 samples of normal bladder tissue taken from patients who underwent cystoscopy for benign prostate hyperplasia (unrelated donors). Immunohistochemical analyses were made for clock and clock-related proteins. In addition, the gene-expression levels of 22 genes (clock genes, casein kinases, oncogenes, tumour suppressor genes and cytokeratins) were analysed by real-time quantitative PCR (qPCR). Considerable up- or down-regulation and altered cellular distribution of different clock proteins, a reduction of casein kinase1A1 (CSNK1A1) and increase of casein kinase alpha 1 E (CSNK1E) were found. The pattern was significantly correlated with simultaneous up-regulation of stimulatory tumour markers, and a down-regulation of several suppressor genes. The pattern was mainly seen in aneuploid high-grade cancers. Considerable alterations were also found in the neighbouring bladder mucosa. The close correlation between altered expression of various clock genes and common tumour markers in urothelial cancer indicates that disturbed function in the cellular clock work may be an important additional mechanism contributing to cancer progression and malignant behaviour.

  6. Characterizing genes with distinct methylation patterns in the context of protein-protein interaction network: application to human brain tissues.

    PubMed

    Li, Yongsheng; Xu, Juan; Chen, Hong; Zhao, Zheng; Li, Shengli; Bai, Jing; Wu, Aiwei; Jiang, Chunjie; Wang, Yuan; Su, Bin; Li, Xia

    2013-01-01

    DNA methylation is an essential epigenetic mechanism involved in transcriptional control. However, how genes with different methylation patterns are assembled in the protein-protein interaction network (PPIN) remains a mystery. In the present study, we systematically dissected the characterization of genes with different methylation patterns in the PPIN. A negative association was detected between the methylation levels in the brain tissues and topological centralities. By focusing on two classes of genes with considerably different methylation levels in the brain tissues, namely the low methylated genes (LMGs) and high methylated genes (HMGs), we found that their organizing principles in the PPIN are distinct. The LMGs tend to be the center of the PPIN, and attacking them causes a more deleterious effect on the network integrity. Furthermore, the LMGs express their functions in a modular pattern and substantial differences in functions are observed between the two types of genes. The LMGs are enriched in the basic biological functions, such as binding activity and regulation of transcription. More importantly, cancer genes, especially recessive cancer genes, essential genes, and aging-related genes were all found more often in the LMGs. Additionally, our analysis presented that the intra-classes communications are enhanced, but inter-classes communications are repressed. Finally, a functional complementation was revealed between methylation and miRNA regulation in the human genome. We have elucidated the assembling principles of genes with different methylation levels in the context of the PPIN, providing key insights into the complex epigenetic regulation mechanisms.

  7. Characterizing Genes with Distinct Methylation Patterns in the Context of Protein-Protein Interaction Network: Application to Human Brain Tissues

    PubMed Central

    Zhao, Zheng; Li, Shengli; Bai, Jing; Wu, Aiwei; Jiang, Chunjie; Wang, Yuan; Su, Bin; Li, Xia

    2013-01-01

    Background DNA methylation is an essential epigenetic mechanism involved in transcriptional control. However, how genes with different methylation patterns are assembled in the protein-protein interaction network (PPIN) remains a mystery. Results In the present study, we systematically dissected the characterization of genes with different methylation patterns in the PPIN. A negative association was detected between the methylation levels in the brain tissues and topological centralities. By focusing on two classes of genes with considerably different methylation levels in the brain tissues, namely the low methylated genes (LMGs) and high methylated genes (HMGs), we found that their organizing principles in the PPIN are distinct. The LMGs tend to be the center of the PPIN, and attacking them causes a more deleterious effect on the network integrity. Furthermore, the LMGs express their functions in a modular pattern and substantial differences in functions are observed between the two types of genes. The LMGs are enriched in the basic biological functions, such as binding activity and regulation of transcription. More importantly, cancer genes, especially recessive cancer genes, essential genes, and aging-related genes were all found more often in the LMGs. Additionally, our analysis presented that the intra-classes communications are enhanced, but inter-classes communications are repressed. Finally, a functional complementation was revealed between methylation and miRNA regulation in the human genome. Conclusions We have elucidated the assembling principles of genes with different methylation levels in the context of the PPIN, providing key insights into the complex epigenetic regulation mechanisms. PMID:23776563

  8. Automatic extraction of gene and protein synonyms from MEDLINE and journal articles.

    PubMed

    Yu, Hong; Hatzivassiloglou, Vasileios; Friedman, Carol; Rzhetsky, Andrey; Wilbur, W John

    2002-01-01

    Genes and proteins are often associated with multiple names, and more names are added as new functional or structural information is discovered. Because authors often alternate between these synonyms, information retrieval and extraction benefits from identifying these synonymous names. We have developed a method to extract automatically synonymous gene and protein names from MEDLINE and journal articles. We first identified patterns authors use to list synonymous gene and protein names. We developed SGPE (for synonym extraction of gene and protein names), a software program that recognizes the patterns and extracts from MEDLINE abstracts and full-text journal articles candidate synonymous terms. SGPE then applies a sequence of filters that automatically screen out those terms that are not gene and protein names. We evaluated our method to have an overall precision of 71% on both MEDLINE and journal articles, and 90% precision on the more suitable full-text articles alone

  9. Automatic extraction of gene and protein synonyms from MEDLINE and journal articles.

    PubMed Central

    Yu, Hong; Hatzivassiloglou, Vasileios; Friedman, Carol; Rzhetsky, Andrey; Wilbur, W. John

    2002-01-01

    Genes and proteins are often associated with multiple names, and more names are added as new functional or structural information is discovered. Because authors often alternate between these synonyms, information retrieval and extraction benefits from identifying these synonymous names. We have developed a method to extract automatically synonymous gene and protein names from MEDLINE and journal articles. We first identified patterns authors use to list synonymous gene and protein names. We developed SGPE (for synonym extraction of gene and protein names), a software program that recognizes the patterns and extracts from MEDLINE abstracts and full-text journal articles candidate synonymous terms. SGPE then applies a sequence of filters that automatically screen out those terms that are not gene and protein names. We evaluated our method to have an overall precision of 71% on both MEDLINE and journal articles, and 90% precision on the more suitable full-text articles alone PMID:12463959

  10. Wool Keratin-Associated Protein Genes in Sheep-A Review.

    PubMed

    Gong, Hua; Zhou, Huitong; Forrest, Rachel H J; Li, Shaobin; Wang, Jiqing; Dyer, Jolon M; Luo, Yuzhu; Hickford, Jon G H

    2016-05-28

    The importance of sheep's wool in making textiles has inspired extensive research into its structure and the underlying genetics since the 1960s. Wool keratin-associated proteins (KAPs) are a key structural component of the wool fibre. The characterisation of the genes encoding these proteins has progressed rapidly with advances in the nucleotide and protein sequencing. This review describes our knowledge of ovine KAPs, their categorisation into families, polymorphism in the proteins and genes, the clustering and chromosomal location of the genes, some characteristics of gene expression and some potential effects of the KAPs on wool traits. The extent and nature of genetic variation in wool KAP genes and its association with fibre characteristics, provides an opportunity for the development of gene-markers for selective breeding of sheep to produce better wool with properties highly matched to specific end-uses.

  11. [Modification and expression of insecticidal protein structural gene of Bacillus thuringiensis var. aizawai 7-29].

    PubMed

    Guo, S; Hong, Z; Wang, J; Wang, M; Yu, M; Fan, Y

    1992-06-01

    The regulative region (181bp) and the fifth toxic active domain (217bp) were removed from the insecticidal protein gene of Bacillus thuringiensis var. aizawai 7-29. After the synthesis of the adaptor (15bp) that contains initiation codon (ATG) and the PCR synthesis of the fifth toxic active domain (229bp) that contains stop codon (TAA), were inserted into on 5' truncated and 3' truncated of the coding fod N-terminal peptid's DNA fragment, that to become a modified structural gene. The modified structural gene can be play initiatic translation-function and stop translation-function during translation of insecticidal protein. The insecticidal protein was determined by western blotting, showed the expression of modified structural gene in Escherichia coli JM 103. The bioassay of insecticidal proteins showed the 3' truncated and 5' truncated of insecticidal gene was higher toxic active than the 3' truncated of insecticidal gene in Escherichia coli JM 103.

  12. Regeneration of transgenic plants of Prunus armeniaca containing the coat protein gene of Plum Pox Virus.

    PubMed

    da Câmara Machado, M L; da Câmara Machado, A; Hanzer, V; Weiss, H; Regner, F; Steinkellner, H; Mattanovich, D; Plail, R; Knapp, E; Kalthoff, B; Katinger, H

    1992-02-01

    A system was developed which allows the transfer of foreign genes into apricot cultivars. We report the transformation and regeneration of Prunus armeniaca plants with Agrobacterium tumefaciens strain LBA 4404 containing various binary plasmids, pBinGUSint, carrying the marker gene ß-glucuronidase (GUS) and pBinPPVm, carrying the coat protein gene of Plum Pox Virus (PPV). The marker gene GUS was used for optical evaluation of the efficiency of the transformation system. The coat protein gene of PPV was used to introduce coat protein mediated resistance against one of the most important pathogens of stone fruit trees in Europe and the whole Mediterranean area. This is the first report of the successful integration of a viral coat protein gene into a fruit tree species, opening a new perspective on the control of the disease.

  13. A plethora of plant serine/arginine-rich proteins: redundancy or evolution of novel gene functions?

    PubMed Central

    Kalyna, Maria; Barta, Andrea

    2017-01-01

    Pre-mRNA processing is an important step in gene expression and its regulation leads to the expansion of the gene product repertoire. Serine/arginine-rich (SR) proteins are key players in intron recognition and spliceosome assembly and contribute significantly to the alternative splicing process. Due to several duplication events, at least 19 SR proteins are present in the Arabidopsis genome which is almost twice as much as in humans. They fall into seven different subfamilies, three of them homologous to metazoan splicing factors whereas the other four seem to be specific for plants. The current data show that most duplicated genes have different spatio-temporal expression patterns indicating functional diversification. Interestingly, the majority of SR protein genes are alternatively spliced and in some cases this process was shown to be under developmental and/or environmental control. This might greatly influence gene expression of target genes as also exemplified by ectopic expression studies of particular SR proteins. PMID:15270675

  14. Wool Keratin-Associated Protein Genes in Sheep—A Review

    PubMed Central

    Gong, Hua; Zhou, Huitong; Forrest, Rachel H. J.; Li, Shaobin; Wang, Jiqing; Dyer, Jolon M.; Luo, Yuzhu; Hickford, Jon G. H.

    2016-01-01

    The importance of sheep’s wool in making textiles has inspired extensive research into its structure and the underlying genetics since the 1960s. Wool keratin-associated proteins (KAPs) are a key structural component of the wool fibre. The characterisation of the genes encoding these proteins has progressed rapidly with advances in the nucleotide and protein sequencing. This review describes our knowledge of ovine KAPs, their categorisation into families, polymorphism in the proteins and genes, the clustering and chromosomal location of the genes, some characteristics of gene expression and some potential effects of the KAPs on wool traits. The extent and nature of genetic variation in wool KAP genes and its association with fibre characteristics, provides an opportunity for the development of gene-markers for selective breeding of sheep to produce better wool with properties highly matched to specific end-uses. PMID:27240405

  15. Mycobacterium tuberculosis Rv1096 protein: gene cloning, protein expression, and peptidoglycan deacetylase activity

    PubMed Central

    2014-01-01

    Background Many bacteria modulate and evade the immune defenses of their hosts through peptidoglycan (PG) deacetylation. The PG deacetylases from Streptococcus pneumonia, Listeria monocytogenes and Lactococcus lactis have been characterized. However, thus far, the PG deacetylase of Mycobacterium tuberculosis has not been identified. Results In this study, we cloned the Rv1096 gene from the M. tuberculosis H37Rv strain and expressed Rv1096 protein in both Escherichia coli and M. smegmatis. The results showed that the purified Rv1096 protein possessed metallo-dependent PG deacetylase activity, which increased in the presence of Co2+. The kinetic parameters of the PG deacetylase towards M. smegmatis PG as a substrate were as follows: Km, 0.910 ± 0.007 mM; Vmax, 0.514 ± 0.038 μMmin-1; and Kcat = 0.099 ± 0.007 (S-1). Additionally, the viability of M. smegmatis in the presence of over-expressed Rv1096 protein was 109-fold higher than that of wild-type M. smegmatis after lysozyme treatment. Additionally, light microscopy and scanning electron microscopy showed that in the presence of over-expressed Rv1096 protein, M. smegmatis kept its regular shape, with an undamaged cell wall and smooth surface. These results indicate that Rv1096 caused deacetylation of cell wall PG, leading to lysozyme resistance in M. smegmatis. Conclusion We have determined that M. tuberculosis Rv1096 is a PG deacetylase. The PG deacetylase activity of Rv1096 contributed to lysozyme resistance in M. smegmatis. Our findings suggest that deacetylation of cell wall PG may be involved in evasion of host immune defenses by M. tuberculosis. PMID:24975018

  16. Circumsporozoite protein of Plasmodium berghei: gene cloning and identification of the immunodominant epitopes.

    PubMed Central

    Eichinger, D J; Arnot, D E; Tam, J P; Nussenzweig, V; Enea, V

    1986-01-01

    The gene encoding the circumsporozoite (CS) protein of the rodent malaria parasite Plasmodium berghei was cloned and characterized. A cDNA library made from P. berghei sporozoite RNA was screened with a monoclonal antibody for expression of CS protein epitopes. The resulting cDNA clone was used to isolate the CS protein gene from a lambda library containing parasite blood-stage DNA. The CS protein gene contains a central region encoding two types of tandemly repeated amino acid units, flanked by nonrepeated regions encoding amino- and carboxy-terminal signal and anchorlike sequences, respectively. One of the central repeated amino acid unit types contains the immunodominant epitopes. Images PMID:2432395

  17. Fatal infantile cardiac glycogenosis with phosphorylase kinase deficiency and a mutation in the gamma2-subunit of AMP-activated protein kinase.

    PubMed

    Akman, Hasan O; Sampayo, James N; Ross, Fiona A; Scott, John W; Wilson, Gregory; Benson, Lee; Bruno, Claudio; Shanske, Sara; Hardie, D Grahame; Dimauro, Salvatore

    2007-10-01

    A 10-wk-old infant girl with severe hypertrophy of the septal and atrial walls by cardiac ultrasound, developed progressive ventricular wall thickening and died of aspiration pneumonia at 5 mo of age. Postmortem examination revealed ventricular hypertrophy and massive atrial wall thickening due to glycogen accumulation. A skeletal muscle biopsy showed increased free glycogen and decreased activity of phosphorylase b kinase (PHK). The report of a pathogenic mutation (R531Q) in the gene (PRKAG2) encoding the gamma2 subunit of AMP-activated protein kinase (AMPK) in three infants with congenital hypertrophic cardiomyopathy, glycogen storage, and "pseudo PHK deficiency" prompted us to screen this gene in our patient. We found a novel (R384T) heterozygous mutation in PRKAG2, affecting an arginine residue in the N-terminal AMP-binding domain. Like R531Q, this mutation reduces the binding of AMP and ATP to the isolated nucleotide-binding domains, and prevents activation of the heterotrimer by metabolic stress in intact cells. The mutation was not found in DNA from the patient's father, the only available parent, and is likely to have arisen de novo. Our studies confirm that mutations in PRKAG2 can cause fatal infantile cardiomyopathy, often associated with apparent PHK deficiency.

  18. Trithorax group proteins: switching genes on and keeping them active.

    PubMed

    Schuettengruber, Bernd; Martinez, Anne-Marie; Iovino, Nicola; Cavalli, Giacomo

    2011-11-23

    Cellular memory is provided by two counteracting groups of chromatin proteins termed Trithorax group (TrxG) and Polycomb group (PcG) proteins. TrxG proteins activate transcription and are perhaps best known because of the involvement of the TrxG protein MLL in leukaemia. However, in terms of molecular analysis, they have lived in the shadow of their more famous counterparts, the PcG proteins. Recent advances have improved our understanding of TrxG protein function and demonstrated that the heterogeneous group of TrxG proteins is of critical importance in the epigenetic regulation of the cell cycle, senescence, DNA damage and stem cell biology.

  19. Amplification of prolamin storage protein genes in different subfamilies of the Poaceae.

    PubMed

    Xu, Jian-Hong; Messing, Joachim

    2009-11-01

    Prolamins are seed storage proteins in cereals and represent an important source of essential amino acids for feed and food. Genes encoding these proteins resulted from dispersed and tandem amplification. While previous studies have concentrated on protein sequences from different grass species, we now can add a new perspective to their relationships by asking how their genes are shared by ancestry and copied in different lineages of the same family of species. These differences are derived from alignment of chromosomal regions, where collinearity is used to identify prolamin genes in syntenic positions, also called orthologous gene copies. New or paralogous gene copies are inserted in tandem or new locations of the same genome. More importantly, one can detect the loss of older genes. We analyzed chromosomal intervals containing prolamin genes from rice, sorghum, wheat, barley, and Brachypodium, representing different subfamilies of the Poaceae. The Poaceae commonly known as the grasses includes three major subfamilies, the Ehrhartoideae (rice), Pooideae (wheat, barley, and Brachypodium), and Panicoideae (millets, maize, sorghum, and switchgrass). Based on chromosomal position and sequence divergence, it becomes possible to infer the order of gene amplification events. Furthermore, the loss of older genes in different subfamilies seems to permit a faster pace of divergence of paralogous genes. Change in protein structure affects their physical properties, subcellular location, and amino acid composition. On the other hand, regulatory sequence elements and corresponding transcriptional activators of new gene copies are more conserved than coding sequences, consistent with the tissue-specific expression of these genes.

  20. Protein functional links in Trypanosoma brucei, identified by gene fusion analysis

    PubMed Central

    2011-01-01

    Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions, to the development of new trypanosome therapeutic drugs. PMID:21729286

  1. Overlapping Genes Produce Proteins with Unusual Sequence Properties and Offer Insight into De Novo Protein Creation▿ †

    PubMed Central

    Rancurel, Corinne; Khosravi, Mahvash; Dunker, A. Keith; Romero, Pedro R.; Karlin, David

    2009-01-01

    It is widely assumed that new proteins are created by duplication, fusion, or fission of existing coding sequences. Another mechanism of protein birth is provided by overlapping genes. They are created de novo by mutations within a coding sequence that lead to the expression of a novel protein in another reading frame, a process called “overprinting.” To investigate this mechanism, we have analyzed the sequences of the protein products of manually curated overlapping genes from 43 genera of unspliced RNA viruses infecting eukaryotes. Overlapping proteins have a sequence composition globally biased toward disorder-promoting amino acids and are predicted to contain significantly more structural disorder than nonoverlapping proteins. By analyzing the phylogenetic distribution of overlapping proteins, we were able to confirm that 17 of these had been created de novo and to study them individually. Most proteins created de novo are orphans (i.e., restricted to one species or genus). Almost all are accessory proteins that play a role in viral pathogenicity or spread, rather than proteins central to viral replication or structure. Most proteins created de novo are predicted to be fully disordered and have a highly unusual sequence composition. This suggests that some viral overlapping reading frames encoding hypothetical proteins with highly biased composition, often discarded as noncoding, might in fact encode proteins. Some proteins created de novo are predicted to be ordered, however, and whenever a three-dimensional structure of such a protein has been solved, it corresponds to a fold previously unobserved, suggesting that the study of these proteins could enhance our knowledge of protein space. PMID:19640978

  2. Testing the exon theory of genes: the evidence from protein structure.

    PubMed

    Stoltzfus, A; Spencer, D F; Zuker, M; Logsdon, J M; Doolittle, W F

    1994-07-08

    A tendency for exons to correspond to discrete units of protein structure in protein-coding genes of ancient origin would provide clear evidence in favor of the exon theory of genes, which proposes that split genes arose not by insertion of introns into unsplit genes, but from combinations of primordial mini-genes (exons) separated by spacers (introns). Although putative examples of such correspondence have strongly influenced previous debate on the origin of introns, a general correspondence has not been rigorously proved. Objective methods for detecting correspondences were developed and applied to four examples that have been cited previously as evidence of the exon theory of genes. No significant correspondence between exons and units of protein structure was detected, suggesting that the putative correspondence does not exist and that the exon theory of genes is untenable.

  3. Identification and Properties of the Genes Encoding Microcin E492 and Its Immunity Protein

    PubMed Central

    Lagos, Rosalba; Villanueva, Jorge E.; Monasterio, Octavio

    1999-01-01

    The gene coding for the immunity protein (mceB) and the structural gene of microcin E492 (mceA), a low-molecular-weight channel-forming bacteriocin produced by a strain of Klebsiella pneumoniae, have been characterized. The microcin gene codes for a precursor protein of either 99 or 103 amino acids. Protein sequencing of the N-terminal region of microcin E492 unequivocally identified this gene as the microcin structural gene and indicated that this microcin is synthesized as a precursor protein that is cleaved at either amino acid 15 or 19, at a site resembling the double-glycine motif. The gene encoding the 95-amino-acid immunity protein (mceB) was identified by cloning the DNA segment that encodes only this polypeptide into an expression vector and demonstrating the acquisition of immunity to microcin E492. As expected, the immunity protein was found to be associated with the inner membrane. Analysis of the DNA sequence indicates that these genes belong to the same family as microcin 24, and they do not share structural motifs with any other known channel-forming bacteriocin. The organization of the microcin- and immunity protein-encoding genes suggests that they are coordinately expressed. PMID:9864332

  4. Sieve element occlusion (SEO) genes encode structural phloem proteins involved in wound sealing of the phloem.

    PubMed

    Ernst, Antonia M; Jekat, Stephan B; Zielonka, Sascia; Müller, Boje; Neumann, Ulla; Rüping, Boris; Twyman, Richard M; Krzyzanek, Vladislav; Prüfer, Dirk; Noll, Gundula A

    2012-07-10

    The sieve element occlusion (SEO) gene family originally was delimited to genes encoding structural components of forisomes, which are specialized crystalloid phloem proteins found solely in the Fabaceae. More recently, SEO genes discovered in various non-Fabaceae plants were proposed to encode the common phloem proteins (P-proteins) that plug sieve plates after wounding. We carried out a comprehensive characterization of two tobacco (Nicotiana tabacum) SEO genes (NtSEO). Reporter genes controlled by the NtSEO promoters were expressed specifically in immature sieve elements, and GFP-SEO fusion proteins formed parietal agglomerates in intact sieve elements as well as sieve plate plugs after wounding. NtSEO proteins with and without fluorescent protein tags formed agglomerates similar in structure to native P-protein bodies when transiently coexpressed in Nicotiana benthamiana, and the analysis of these protein complexes by electron microscopy revealed ultrastructural features resembling those of native P-proteins. NtSEO-RNA interference lines were essentially devoid of P-protein structures and lost photoassimilates more rapidly after injury than control plants, thus confirming the role of P-proteins in sieve tube sealing. We therefore provide direct evidence that SEO genes in tobacco encode P-protein subunits that affect translocation. We also found that peptides recently identified in fascicular phloem P-protein plugs from squash (Cucurbita maxima) represent cucurbit members of the SEO family. Our results therefore suggest a common evolutionary origin for P-proteins found in the sieve elements of all dicotyledonous plants and demonstrate the exceptional status of extrafascicular P-proteins in cucurbits.

  5. Cross-tissue Analysis of Gene and Protein Expression in Normal and Cancer Tissues

    PubMed Central

    Kosti, Idit; Jain, Nishant; Aran, Dvir; Butte, Atul J.; Sirota, Marina

    2016-01-01

    The central dogma of molecular biology describes the translation of genetic information from mRNA to protein, but does not specify the quantitation or timing of this process across the genome. We have analyzed protein and gene expression in a diverse set of human tissues. To study concordance and discordance of gene and protein expression, we integrated mass spectrometry data from the Human Proteome Map project and RNA-Seq measurements from the Genotype-Tissue Expression project. We analyzed 16,561 genes and the corresponding proteins in 14 tissue types across nearly 200 samples. A comprehensive tissue- and gene-specific analysis revealed that across the 14 tissues, correlation between mRNA and protein expression was positive and ranged from 0.36 to 0.5. We also identified 1,012 genes whose RNA and protein expression was correlated across all the tissues and examined genes and proteins that were concordantly and discordantly expressed for each tissue of interest. We extended our analysis to look for genes and proteins that were differentially correlated in cancer compared to normal tissues, showing higher levels of correlation in normal tissues. Finally, we explored the implications of these findings in the context of biomarker and drug target discovery. PMID:27142790

  6. A Hox Gene, Antennapedia, Regulates Expression of Multiple Major Silk Protein Genes in the Silkworm Bombyx mori*

    PubMed Central

    Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki

    2016-01-01

    Hox genes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hox genes can also function in terminally differentiated tissue of the lepidopteran Bombyx mori. In this species, Antennapedia (Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antp can regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antp in the posterior silk gland induced ectopic expression of major silk protein genes such as sericin-3, fhxh4, and fhxh5. These genes are normally expressed specifically in the middle silk gland as is Antp. Therefore, the evidence strongly suggests that Antp activates these silk protein genes in the middle silk gland. The putative sericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antp directly activates their expression. We also found that the pattern of gene expression was well conserved between B. mori and the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori. We suggest that Hox genes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. PMID:26814126

  7. [Isolation and study of the properties of the regulator subunit of cAMP-dependent protein kinase].

    PubMed

    Iurkiv, V A; Severin, E S; Petukhov, S P; Bulargina, T V

    1982-12-01

    The regulatory subunit of type II cAMP-dependent proteinkinase was isolated from cytosol of the rabbit small intestinal mucosa by affinity chromatography. The preparation contained 3 proteolytic enzymes and occurred in two forms differing as regards cAMP affinity. The cAMP-binding capacity of the preparation was equal to 17 nmol cAMP/mg protein. To study the topography of the cAMP-binding center, use was made of cAMP analogs. It was demonstrated that introduction of the substituents into the 8th position of the purine ring and substitution with respect to the N6-exoaminogroup affected insignificantly the analog affinity for the cAMP-binding center. At the same time the substituents introduced into the first position of the adenine base, into the area of the 2'-hydroxyl group of ribose and into the cyclophosphate part of the cAMP molecule considerably decreased the analog affinity for the regulatory center of type II cAMP-dependent proteinkinase.

  8. Developmental expression patterns of cuticular protein genes with the R&R Consensus from Anopheles gambiae

    PubMed Central

    Togawa, Toru; Dunn, W. Augustine; Emmons, Aaron C.; Nagao, John; Willis, Judith H.

    2008-01-01

    CPR proteins are the largest cuticular protein family in arthropods. The whole genome sequence of Anopheles gambiae revealed 156 genes that code for proteins with the R&R Consensus and named CPRs. This protein family can be divided into RR-1 and RR-2 subgroups, postulated to contribute to different regions of the cuticle. We determined the temporal expression patterns of these genes throughout post-embryonic development by means of real-time qRT-PCR. Based on expression profiles, these genes were grouped into 21 clusters. Most of the genes were expressed with sharp peaks at single or multiple periods associated with molting. Genes coding for RR-1 and RR-2 proteins were found together in several co-expression clusters. Twenty-five genes were expressed exclusively at one metamorphic stage. Five out of six X-linked genes showed equal expression in males and females, supporting the presence of a gene dosage compensation system in An. gambiae. Many RR-2 genes are organized into sequence clusters whose members are extremely similar to each other and generally closely associated on a chromosome. Most genes in each sequence cluster are expressed with the same temporal expression pattern and at the same level, suggesting a shared mechanism to regulate their expression. PMID:18405829

  9. CCG: an integrative resource of cancer protein-coding genes and long noncoding RNAs.

    PubMed

    Liu, Mengrong; Yang, Yu-Cheng T; Xu, Gang; Tan, Chang; Lu, Zhi John

    2016-12-01

    The identification of cancer genes remains a main aim of cancer research. With the advances of high-throughput sequencing technologies, thousands of novel cancer genes were identified through recurrent mutation analyses and differential expression analyses between normal tissues and tumors in large populations. Many databases were developed to document the cancer genes. However, no public database providing both cancer protein-coding genes and cancer lncRNAs is available presently. Here, we present the Catalogue of Cancer Genes (CCG) database (http://ccg.xingene.net), a catalogue of cancer genes. It includes both well-supported and candidate cancer protein-coding genes and cancer lncRNAs collected from literature search and public databases. In addition, uniform genomic aberration information (such as somatic mutation and copy number variation) and drug-gene interactions were assigned to cancer genes in the database. CCG represents an effort on integrative assembly of well-supported and candidate cancer protein-coding and long noncoding RNA genes and takes advantages of high-throughput sequencing results on large populations. With the help of CCG, users can easily access a comprehensive list of cancer genes as well as genomic aberration related with these genes. The availability of integrative information will facilitate the understanding of cancer mechanisms. In addition, drug-gene information in CCG provides a useful guide to the development of new anti-cancer drugs and selection of rational combination therapies.

  10. Gene structure of murine Gna11 and Gna15: tandemly duplicated Gq class G protein alpha subunit genes.

    PubMed

    Davignon, I; Barnard, M; Gavrilova, O; Sweet, K; Wilkie, T M

    1996-02-01

    G protein alpha subunits are encoded by a multigene family of 16 genes that can be grouped into four classes, Gq, Gs, Gi, and G12. The Gq class is composed of four genes in mouse and human, and two of these genes, Gna11 and Gna15, cosegregate on mouse chromosome 10. We have characterized the gene structures of murine Gna11 and Gna15. The two genes are tandemly duplicated in a head-to-tail array. The upstream gene, Gna11, is ubiquitously expressed, whereas expression of the downstream gene, Gna15, is restricted to hematopoietic cells. The coding sequence of each gene is contained within seven exons, and the two genes together span 43 kb, separated by 6 kb of intergenic region. We have found no evidence for alternative splicing within the coding sequence of either gene. Sequence alignments show that the positions of the six intervening sequences are conserved in the two genes, consistent with Gna11 and Gna15 arising by tandem duplication from a common progenitor gene in vertebrates. Phylogenetic trees reveal unequal evolutionary rates among alpha subunits of the Gq class. The rate of change is approximately six fold higher in Gna15 than in Gna11.

  11. Gene structure of murine Gna11 and Gna15: Tandemly duplicated Gq class G protein {alpha} subunit genes

    SciTech Connect

    Davignon, I.; Barnard, M.; Sweet, K.

    1996-02-01

    G protein {alpha} subunits are encoded by a multigene family of 16 genes that can be grouped into four classes, Gq, Gs, Gi, and G12. The Gq class is composed of four genes in mouse and human, and two of these genes, Gna11 and Gna15, cosegregate on mouse chromosome 10. We have characterized the gene structures of murine Gna11 and Gna15. The two genes are tandemly duplicated in a head-to-tail array. The upstream gene, Gna11, is ubiquitously expressed, whereas expression of the downstream gene, Gna15, is restricted to hematopoietic cells. The coding sequence of each gene is contained within seven exons, and the two genes together span 43 kb, separated by 6 kb of intergenic region. We have found no evidence for alternative splicing within the coding sequence of either gene. Sequence alignments show that the positions of the six intervening sequences are conserved in the two genes, consistent with Gna11 and Gna15 arising by tandem duplication from a common progenitor gene in vertebrates. Phylogenetic trees reveal unequal evolutionary rates among {alpha} subunits of the Gq class. The rate of change is approximately six fold higher in Gna15 than in Gna11. 43 refs., 3 figs., 2 tabs.

  12. Downregulation of ATM Gene and Protein Expression in Canine Mammary Tumors.

    PubMed

    Raposo-Ferreira, T M M; Bueno, R C; Terra, E M; Avante, M L; Tinucci-Costa, M; Carvalho, M; Cassali, G D; Linde, S D; Rogatto, S R; Laufer-Amorim, R

    2016-11-01

    The ataxia telangiectasia mutated (ATM) gene encodes a protein associated with DNA damage repair and maintenance of genomic integrity. In women, ATM transcript and protein downregulation have been reported in sporadic breast carcinomas, and the absence of ATM protein expression has been associated with poor prognosis. The aim of this study was to evaluate ATM gene and protein expression in canine mammary tumors and their association with clinical outcome. ATM gene and protein expression was evaluated by reverse transcription-quantitative polymerase chain reaction and immunohistochemistry, respectively, in normal mammary gland samples (n = 10), benign mammary tumors (n = 11), nonmetastatic mammary carcinomas (n = 19), and metastatic mammary carcinomas (n = 11). Lower ATM transcript levels were detected in benign mammary tumors and carcinomas compared with normal mammary glands (P = .011). Similarly, lower ATM protein expression was observed in benign tumors (P = .0003), nonmetastatic mammary carcinomas (P < .0001), and the primary sites of metastatic carcinomas (P < .0001) compared with normal mammary glands. No significant differences in ATM gene or protein levels were detected among benign tumors and nonmetastatic and metastatic mammary carcinomas (P > .05). The levels of ATM gene or protein expression were not significantly associated with clinical and pathological features or with survival. Similar to human breast cancer, the data in this study suggest that ATM gene and protein downregulation is involved in canine mammary gland tumorigenesis. © The Author(s) 2016.

  13. Bacillus subtilis acyl carrier protein is encoded in a cluster of lipid biosynthesis genes.

    PubMed Central

    Morbidoni, H R; de Mendoza, D; Cronan, J E

    1996-01-01

    A cluster of Bacillus subtilis fatty acid synthetic genes was isolated by complementation of an Escherichia coli fabD mutant encoding a thermosensitive malonyl coenzyme A-acyl carrier protein transacylase. The B. subtilis genomic segment contains genes that encode three fatty acid synthetic proteins, malonyl coenzyme A-acyl carrier protein transacylase (fabD), 3-ketoacyl-acyl carrier protein reductase (fabG), and the N-terminal 14 amino acid residues of acyl carrier protein (acpP). Also present is a sequence that encodes a homolog of E. coli plsX, a gene that plays a poorly understood role in phospholipid synthesis. The B. subtilis plsX gene weakly complemented an E. coli plsX mutant. The order of genes in the cluster is plsX fabD fabG acpP, the same order found in E. coli, except that in E. coli the fabH gene lies between plsX and fabD. The absence of fabH in the B. subtilis cluster is consistent with the different fatty acid compositions of the two organisms. The amino acid sequence of B. subtilis acyl carrier protein was obtained by sequencing the purified protein, and the sequence obtained strongly resembled that of E. coli acyl carrier protein, except that most of the protein retained the initiating methionine residue. The B. subtilis fab cluster was mapped to the 135 to 145 degrees region of the chromosome. PMID:8759840

  14. Protein-protein interaction and gene co-expression maps of ARFs and Aux/IAAs in Arabidopsis

    PubMed Central

    Piya, Sarbottam; Shrestha, Sandesh K.; Binder, Brad; Stewart, C. Neal; Hewezi, Tarek

    2014-01-01

    The phytohormone auxin regulates nearly all aspects of plant growth and development. Based on the current model in Arabidopsis thaliana, Auxin/indole-3-acetic acid (Aux/IAA) proteins repress auxin-inducible genes by inhibiting auxin response transcription factors (ARFs). Experimental evidence suggests that heterodimerization between Aux/IAA and ARF proteins are related to their unique biological functions. The objective of this study was to generate the Aux/IAA-ARF protein-protein interaction map using full length sequences and locate the interacting protein pairs to specific gene co-expression networks in order to define tissue-specific responses of the Aux/IAA-ARF interactome. Pairwise interactions between 19 ARFs and 29 Aux/IAAs resulted in the identification of 213 specific interactions of which 79 interactions were previously unknown. The incorporation of co-expression profiles with protein-protein interaction data revealed a strong correlation of gene co-expression for 70% of the ARF-Aux/IAA interacting pairs in at least one tissue/organ, indicative of the biological significance of these interactions. Importantly, ARF4-8 and 19, which were found to interact with almost all Aux-Aux/IAA showed broad co-expression relationships with Aux/IAA genes, thus, formed the central hubs of the co-expression network. Our analyses provide new insights into the biological significance of ARF-Aux/IAA associations in the morphogenesis and development of various plant tissues and organs. PMID:25566309

  15. Gene cloning and prokaryotic expression of recombinant outer membrane protein from Vibrio parahaemolyticus

    NASA Astrophysics Data System (ADS)

    Yuan, Ye; Wang, Xiuli; Guo, Sheping; Qiu, Xuemei

    2011-06-01

    Gram-negative Vibrio parahaemolyticus is a common pathogen in humans and marine animals. The outer membrane protein of bacteria plays an important role in the infection and pathogenicity to the host. Thus, the outer membrane proteins are an ideal target for vaccines. We amplified a complete outer membrane protein gene (ompW) from V. parahaemolyticus ATCC 17802. We then cloned and expressed the gene into Escherichia coli BL21 (DE3) cells. The gene coded for a protein that was 42.78 kDa. We purified the protein using Ni-NTA affinity chromatography and Anti-His antibody Western blotting, respectively. Our results provide a basis for future application of the OmpW protein as a vaccine candidate against infection by V. parahaemolyticus. In addition, the purified OmpW protein can be used for further functional and structural studies.

  16. Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes.

    PubMed

    Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M

    2017-08-01

    Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.

  17. Direct TFIIA-TFIID protein contacts drive budding yeast ribosomal protein gene transcription.

    PubMed

    Layer, Justin H; Weil, P Anthony

    2013-08-09

    We have previously shown that yeast TFIID provides coactivator function on the promoters of ribosomal protein-encoding genes (RPGs) by making direct contact with the transactivator repressor activator protein 1 (Rap1). Further, our structural studies of assemblies generated with purified Rap1, TFIID, and TFIIA on RPG enhancer-promoter DNA indicate that Rap1-TFIID interaction induces dramatic conformational rearrangements of enhancer-promoter DNA and TFIID-bound TFIIA. These data indicate a previously unknown yet critical role for yeast TFIIA in the integration of activator-TFIID contacts with promoter conformation and downstream preinitiation complex formation and/or function. Here we describe the use of systematic mutagenesis to define how specific TFIIA contacts contribute to these processes. We have verified that TFIIA is required for RPG transcription in vivo and in vitro, consistent with the existence of a critical Rap1-TFIIA-TFIID interaction network. We also identified essential points of contact for TFIIA and Rap1 within the Rap1 binding domain of the Taf4 subunit of TFIID. These data suggest a mechanism for how interactions between TFIID, TFIIA, and Rap1 contribute to the high rate of transcription initiation seen on RPGs in vivo.

  18. Direct TFIIA-TFIID Protein Contacts Drive Budding Yeast Ribosomal Protein Gene Transcription*

    PubMed Central

    Layer, Justin H.; Weil, P. Anthony

    2013-01-01

    We have previously shown that yeast TFIID provides coactivator function on the promoters of ribosomal protein-encoding genes (RPGs) by making direct contact with the transactivator repressor activator protein 1 (Rap1). Further, our structural studies of assemblies generated with purified Rap1, TFIID, and TFIIA on RPG enhancer-promoter DNA indicate that Rap1-TFIID interaction induces dramatic conformational rearrangements of enhancer-promoter DNA and TFIID-bound TFIIA. These data indicate a previously unknown yet critical role for yeast TFIIA in the integration of activator-TFIID contacts with promoter conformation and downstream preinitiation complex formation and/or function. Here we describe the use of systematic mutagenesis to define how specific TFIIA contacts contribute to these processes. We have verified that TFIIA is required for RPG transcription in vivo and in vitro, consistent with the existence of a critical Rap1-TFIIA-TFIID interaction network. We also identified essential points of contact for TFIIA and Rap1 within the Rap1 binding domain of the Taf4 subunit of TFIID. These data suggest a mechanism for how interactions between TFIID, TFIIA, and Rap1 contribute to the high rate of transcription initiation seen on RPGs in vivo. PMID:23814059

  19. Response gene to complement 32 protein promotes macrophage phagocytosis via activation of protein kinase C pathway.

    PubMed

    Tang, Rui; Zhang, Gui; Chen, Shi-You

    2014-08-15

    Macrophage phagocytosis plays an important role in host defense. The molecular mechanism, especially factors regulating the phagocytosis, however, is not completely understood. In the present study, we found that response gene to complement 32 (RGC-32) is an important regulator of phagocytosis. Although RGC-32 is induced and abundantly expressed in macrophage during monocyte-macrophage differentiation, RGC-32 appears not to be important for this process because RGC-32-deficient bone marrow progenitor can normally differentiate to macrophage. However, both peritoneal macrophages and bone marrow-derived macrophages with RGC-32 deficiency exhibit significant defects in phagocytosis, whereas RGC-32-overexpressed macrophages show increased phagocytosis. Mechanistically, RGC-32 is recruited to macrophage membrane where it promotes F-actin assembly and the formation of phagocytic cups. RGC-32 knock-out impairs F-actin assembly. RGC-32 appears to interact with PKC to regulate PKC-induced phosphorylation of F-actin cross-linking protein myristoylated alanine-rich protein kinase C substrate. Taken together, our results demonstrate for the first time that RGC-32 is a novel membrane regulator for macrophage phagocytosis.

  20. Tempo and mode of gene duplication in mammalian ribosomal protein evolution.

    PubMed

    Dharia, Asav P; Obla, Ajay; Gajdosik, Matthew D; Simon, Amanda; Nelson, Craig E

    2014-01-01

    Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism.

  1. Involvement of regucalcin as a suppressor protein in human carcinogenesis: insight into the gene therapy.

    PubMed

    Yamaguchi, Masayoshi

    2015-08-01

    Regucalcin, which its gene is located on the X chromosome, plays a multifunctional role as a suppressor protein in cell signal transduction in various types of cells and tissues. The suppression of regucalcin gene expression has been shown to involve in carcinogenesis. Regucalcin gene expression was uniquely downregulated in carcinogenesis of rat liver in vivo, although the expression of other many genes was upregulated, indicating that endogenous regucalcin plays a suppressive role in the development of hepatocarcinogenesis. Overexpression of endogenous regucalcin was found to suppress proliferation of rat cloned hepatoma cells in vitro. Moreover, the regucalcin gene and its protein levels were demonstrated specifically to downregulate in human hepatocellular carcinoma by analysis with multiple gene expression profiles and proteomics. Regucalcin gene expression was also found to suppress in human tumor tissues including kidney, lung, brain, breast and prostate, suggesting that repressed regucalcin gene expression leads to the development of carcinogenesis in various tissues. Regucalcin may play a role as a suppressor protein in carcinogenesis. Overexpression of endogenous regucalcin is suggested to reveal preventive and therapeutic effects on carcinogenesis. Delivery of the regucalcin gene may be a novel useful tool in the gene therapy of carcinogenesis. This review will discuss regarding to an involvement of regucalcin as a suppressor protein in human carcinogenesis in insight into the gene therapy.

  2. Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution

    PubMed Central

    Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.

    2014-01-01

    Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106

  3. Optimizing heterologous protein production in the periplasm of E. coli by regulating gene expression levels

    PubMed Central

    2013-01-01

    Background In Escherichia coli many heterologous proteins are produced in the periplasm. To direct these proteins to the periplasm, they are equipped with an N-terminal signal sequence so that they can traverse the cytoplasmic membrane via the protein-conducting Sec-translocon. For poorly understood reasons, the production of heterologous secretory proteins is often toxic to the cell thereby limiting yields. To gain insight into the mechanism(s) that underlie this toxicity we produced two secretory heterologous proteins, super folder green fluorescent protein and a single-chain variable antibody fragment, in the Lemo21(DE3) strain. In this strain, the expression intensity of the gene encoding the target protein can be precisely controlled. Results Both SFGFP and the single-chain variable antibody fragment were equipped with a DsbA-derived signal sequence. Producing these proteins following different gene expression levels in Lemo21(DE3) allowed us to identify the optimal expression level for each target gene. Too high gene expression levels resulted in saturation of the Sec-translocon capacity as shown by hampered translocation of endogenous secretory proteins and a protein misfolding/aggregation problem in the cytoplasm. At the optimal gene expression levels, the negative effects of the production of the heterologous secretory proteins were minimized and yields in the periplasm were optimized. Conclusions Saturating the Sec-translocon capacity can be a major bottleneck hampering heterologous protein production in the periplasm. This bottleneck can be alleviated by harmonizing expression levels of the genes encoding the heterologous secretory proteins with the Sec-translocon capacity. Mechanistic insight into the production of proteins in the periplasm is key to optimizing yields in this compartment. PMID:23497240

  4. Use of gene fusions to study outer membrane protein localization in Escherichia coli.

    PubMed Central

    Silhavy, T J; Shuman, H A; Beckwith, J; Schwartz, M

    1977-01-01

    Escherichia coli strains have been isolated that produce hybrid proteins comprised of an NH2-terminal sequence from the lamB gene product (an outer membrane protein) and a major portion of the COOH-terminal sequence of beta-galactosidase (beta-D-galactoside galactohydrolase, EC 3.2.1.23; a cytoplasmic protein). These proteins exhibit beta-galactosidase activity. One such strain, pop 3105, produces a hybrid protein containing very little of the lamB gene protein; the protein is found in the cytoplasm. The protein found in a second strain, pop 3186, contains much more of the lamB gene protein; a substantial fraction of the beta-galactosidase activity is found in the outer membrane, probably facing outward. These results indicate that information necessary to direct the lamB gene product to its outer membrane location is located within the lamB gene itself. The properties of such fusion strains open up the prospect of a precise genetic analysis of the genetic components involved in protein transport. Images PMID:414221

  5. The Protein Disulfide Isomerase gene family in bread wheat (T. aestivum L.)

    PubMed Central

    2010-01-01

    Background The Protein Disulfide Isomerase (PDI) gene family encodes several PDI and PDI-like proteins containing thioredoxin domains and controlling diversified metabolic functions, including disulfide bond formation and isomerisation during protein folding. Genomic, cDNA and promoter sequences of the three homoeologous wheat genes encoding the "typical" PDI had been cloned and characterized in a previous work. The purpose of present research was the cloning and characterization of the complete set of genes encoding PDI and PDI like proteins in bread wheat (Triticum aestivum cv Chinese Spring) and the comparison of their sequence, structure and expression with homologous genes from other plant species. Results Eight new non-homoeologous wheat genes were cloned and characterized. The nine PDI and PDI-like sequences of wheat were located in chromosome regions syntenic to those in rice and assigned to eight plant phylogenetic groups. The nine wheat genes differed in their sequences, genomic organization as well as in the domain composition and architecture of their deduced proteins; conversely each of them showed high structural conservation with genes from other plant species in the same phylogenetic group. The extensive quantitative RT-PCR analysis of the nine genes in a set of 23 wheat samples, including tissues and developmental stages, showed their constitutive, even though highly variable expression. Conclusions The nine wheat genes showed high diversity, while the members of each phylogenetic group were highly conserved even between taxonomically distant plant species like the moss Physcomitrella patens. Although constitutively expressed the nine wheat genes were characterized by different expression profiles reflecting their different genomic organization, protein domain architecture and probably promoter sequences; the high conservation among species indicated the ancient origin and diversification of the still evolving gene family. The comprehensive

  6. Identification and characterization of a gene and protein required for glycosylation in the yeast Golgi.

    PubMed

    Devlin, C; Ballou, C E

    1990-11-01

    The MNN2 gene of Saccharomyces cerevisiae has been cloned by complementation of the mnn2 mutant phenotype scored by a change in cell surface carbohydrate structure resulting from a lack of alpha 1----2-mannose branching in the outer chain. The gene was subcloned as a 3 kb DNA fragment that integrated at the MNN2 locus, and a gene disruption yielded the mnn2 phenotype. A lacZ-MNN2 gene fusion protein, produced in Escherichia coli, was used to raise a specific antiserum that recognized a 65 kD wild-type yeast protein. This MNN2 gene product lacks N-linked carbohydrate but appears to be an integral membrane protein. Overproduction of MNN2p does not enhance the alpha 1----2-mannosyltransferase activity of yeast cells. The results suggest that MNN2p is a Golgi-associated protein that is involved in mannoprotein sorting rather than glycosylation.

  7. Predicting Essential Genes and Proteins Based on Machine Learning and Network Topological Features: A Comprehensive Review

    PubMed Central

    Zhang, Xue; Acencio, Marcio Luis; Lemke, Ney

    2016-01-01

    Essential proteins/genes are indispensable to the survival or reproduction of an organism, and the deletion of such essential proteins will result in lethality or infertility. The identification of essential genes is very important not only for understanding the minimal requirements for survival of an organism, but also for finding human disease genes and new drug targets. Experimental methods for identifying essential genes are costly, time-consuming, and laborious. With the accumulation of sequenced genomes data and high-throughput experimental data, many computational methods for identifying essential proteins are proposed, which are useful complements to experimental methods. In this review, we show the state-of-the-art methods for identifying essential genes and proteins based on machine learning and network topological features, point out the progress and limitations of current methods, and discuss the challenges and directions for further research. PMID:27014079

  8. Recombinant HT.sub.m4 gene, protein and assays

    SciTech Connect

    Lim, Bing; Adra, Chaker N.; Lelias, Jean-Michel

    1996-01-01

    The invention relates to a recombinant DNA molecule which encodes a HT.sub.m4 protein, a transformed host cell which has been stably transfected with a DNA molecule which encodes a HT.sub.m4 protein and a recombinant HT.sub.m4 protein. The invention also relates to a method for detecting the presence of a hereditary atopy.

  9. Distinguishing protein-coding and noncoding genes in the human genome.

    PubMed

    Clamp, Michele; Fry, Ben; Kamal, Mike; Xie, Xiaohui; Cuff, James; Lin, Michael F; Kellis, Manolis; Lindblad-Toh, Kerstin; Lander, Eric S

    2007-12-04

    Although the Human Genome Project was completed 4 years ago, the catalog of human protein-coding genes remains a matter of controversy. Current catalogs list a total of approximately 24,500 putative protein-coding genes. It is broadly suspected that a large fraction of these entries are functionally meaningless ORFs present by chance in RNA transcripts, because they show no evidence of evolutionary conservation with mouse or dog. However, there is currently no scientific justification for excluding ORFs simply because they fail to show evolutionary conservation: the alternative hypothesis is that most of these ORFs are actually valid human genes that reflect gene innovation in the primate lineage or gene loss in the other lineages. Here, we reject this hypothesis by carefully analyzing the nonconserved ORFs-specifically, their properties in other primates. We show that the vast majority of these ORFs are random occurrences. The analysis yields, as a by-product, a major revision of the current human catalogs, cutting the number of protein-coding genes to approximately 20,500. Specifically, it suggests that nonconserved ORFs should be added to the human gene catalog only if there is clear evidence of an encoded protein. It also provides a principled methodology for evaluating future proposed additions to the human gene catalog. Finally, the results indicate that there has been relatively little true innovation in mammalian protein-coding genes.

  10. Distinguishing protein-coding and noncoding genes in the human genome

    PubMed Central

    Clamp, Michele; Fry, Ben; Kamal, Mike; Xie, Xiaohui; Cuff, James; Lin, Michael F.; Kellis, Manolis; Lindblad-Toh, Kerstin; Lander, Eric S.

    2007-01-01

    Although the Human Genome Project was completed 4 years ago, the catalog of human protein-coding genes remains a matter of controversy. Current catalogs list a total of ≈24,500 putative protein-coding genes. It is broadly suspected that a large fraction of these entries are functionally meaningless ORFs present by chance in RNA transcripts, because they show no evidence of evolutionary conservation with mouse or dog. However, there is currently no scientific justification for excluding ORFs simply because they fail to show evolutionary conservation: the alternative hypothesis is that most of these ORFs are actually valid human genes that reflect gene innovation in the primate lineage or gene loss in the other lineages. Here, we reject this hypothesis by carefully analyzing the nonconserved ORFs—specifically, their properties in other primates. We show that the vast majority of these ORFs are random occurrences. The analysis yields, as a by-product, a major revision of the current human catalogs, cutting the number of protein-coding genes to ≈20,500. Specifically, it suggests that nonconserved ORFs should be added to the human gene catalog only if there is clear evidence of an encoded protein. It also provides a principled methodology for evaluating future proposed additions to the human gene catalog. Finally, the results indicate that there has been relatively little true innovation in mammalian protein-coding genes. PMID:18040051

  11. Molecular structure of the immunity gene and immunity protein of the bacteriocinogenic plasmid Clo DF13.

    PubMed Central

    van den Elzen, P J; Gaastra, W; Spelt, C E; de Graaf, F K; Veltkamp, E; Nijkamp, H J

    1980-01-01

    The nucleotide sequence of the Clo DF13 DNA region comprising the immunity gene has been determined. We also elucidated the aminoacid sequence of the 40 N-terminal and 7 C-terminal aminoacids of the purified immunity protein. From analysis of the data obtained we were able to locate the immunity gene between 11.7 and 14.5% on the Clo DF13 map, and to determine the complete aminoacid sequence of the immunity protein. It was observed that the Clo DF13 immunity gene encodes an 85 aminoacid protein and is transcribed in the same direction as the cloacin gene. These experimental data support our model, presented elsewhere, which implicates that the cloacin and immunity genes of Clo DF13 are coordinately transcribed from the cloacin promoter. We also present DNA sequence data indicating that an extra ribosome binding site precedes the immunity gene on the polycistronic mRNA. This ribosome binding site might explain the fact that in cloacinogenic cells more immunity protein than cloacin is synthesized. The comparison of the complete aminoacid sequence of the Clo DF13 immunity protein, with the aminoacid sequence data of the purified, comparable Col E3 immunity protein revealed that both proteins have extensive homologies in primary and secondary structure, although they are exchangeable only to a low extent in vivo and in vitro. It was also observed that a lysine residue was modified in immunity protein isolated from excreted bacteriocin complexes. Images PMID:6253914

  12. Comparative study of human mitochondrial proteome reveals extensive protein subcellular relocalization after gene duplications

    PubMed Central

    2009-01-01

    Background Gene and genome duplication is the principle creative force in evolution. Recently, protein subcellular relocalization, or neolocalization was proposed as one of the mechanisms responsible for the retention of duplicated genes. This hypothesis received support from the analysis of yeast genomes, but has not been tested thoroughly on animal genomes. In order to evaluate the importance of subcellular relocalizations for retention of duplicated genes in animal genomes, we systematically analyzed nuclear encoded mitochondrial proteins in the human genome by reconstructing phylogenies of mitochondrial multigene families. Results The 456 human mitochondrial proteins selected for this study were clustered into 305 gene families including 92 multigene families. Among the multigene families, 59 (64%) consisted of both mitochondrial and cytosolic (non-mitochondrial) proteins (mt-cy families) while the remaining 33 (36%) were composed of mitochondrial proteins (mt-mt families). Phylogenetic analyses of mt-cy families revealed three different scenarios of their neolocalization following gene duplication: 1) relocalization from mitochondria to cytosol, 2) from cytosol to mitochondria and 3) multiple subcellular relocalizations. The neolocalizations were most commonly enabled by the gain or loss of N-terminal mitochondrial targeting signals. The majority of detected subcellular relocalization events occurred early in animal evolution, preceding the evolution of tetrapods. Mt-mt protein families showed a somewhat different pattern, where gene duplication occurred more evenly in time. However, for both types of protein families, most duplication events appear to roughly coincide with two rounds of genome duplications early in vertebrate evolution. Finally, we evaluated the effects of inaccurate and incomplete annotation of mitochondrial proteins and found that our conclusion of the importance of subcellular relocalization after gene duplication on the genomic scale was

  13. Unusually high frequency of genes encoding vegetative insecticidal proteins in an Australian Bacillus thuringiensis collection.

    PubMed

    Beard, Cheryl E; Court, Leon; Boets, Annemie; Mourant, Roslyn; Van Rie, Jeroen; Akhurst, Raymond J

    2008-09-01

    Of 188 Australian Bacillus thuringiensis strains screened for genes encoding soluble insecticidal proteins by polymerase chain reaction/restriction-length fragment polymorphism (RFLP) analysis, 87% showed the presence of such genes. Although 135 isolates (72%) produced an RFLP pattern identical to that expected for vip3A genes, 29 isolates possessed a novel vip-like gene. The novel vip-like gene was cloned from B. thuringiensis isolate C81, and sequence analysis demonstrated that it was 94% identical to the vip3Ba1 gene. The new gene was designated vip3Bb2. Cell-free supernatants from both the B. thuringiensis strain C81 and from Escherichia coli expressing the Vip3Bb2 protein were toxic for the cotton bollworm, Helicoverpa armigera.

  14. Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks.

    PubMed

    Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing

    2009-03-11

    Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene

  15. Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks

    PubMed Central

    Lin, Wen-hsien; Liu, Wei-chung; Hwang, Ming-jing

    2009-01-01

    Background Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Results Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Conclusion Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while

  16. Cloning and mapping of a human gene (TBX2) sharing a highly conserved protein motif with a Drosophila omb gene

    SciTech Connect

    Campbell, C.; Goodrich, K.; Casey, G.; Beatty, B.

    1995-07-20

    We have identified and cloned a human gene (TBX2) that exhibits strong sequence homology within a putative DNA binding domain to the drosophila optomotor-blind (omb) gene and lesser homology to the DNA binding domain of the murine brachyury or T gene. Unlike omb, which is expressed in neural tissue, or T, which is not expressed in adult animals, TBX2 is expressed primarily in adult in kidney, lung, and placenta as multiple transcripts of between {approximately} 2 and 4 kb. At least part of this transcript heterogenity appears to be due to alternative polyadenylation. This is the first reported human member of a new family of highly evolutionarily conserved DNA binding proteins, the Tbx or T-box proteins. The human gene has been mapped by somatic cell hybrid mapping and chromosomal in situ hybridization to chromosome 17q23, a region frequently altered in ovarian carcinomas. 19 refs., 6 figs.

  17. Alternative splicing in the human gene for the core protein A1 generates another hnRNP protein.

    PubMed Central

    Buvoli, M; Cobianchi, F; Bestagno, M G; Mangiarotti, A; Bassi, M T; Biamonti, G; Riva, S

    1990-01-01

    The human hnRNP core protein A1 (34 kd) is encoded by a 4.6 kb gene split into 10 exons. Here we show that the A1 gene can be differentially spliced by the addition of an extra exon. The new transcript encodes a minor protein of the hnRNP complex, here defined A1B protein, with a calculated mol. wt of 38 kd, that coincides with a protein previously designated as B2 by some authors. In vitro translation of the mRNAs selected by hybridization with A1 cDNA produced two proteins of 34 and 38 kd; Northern blot analysis of poly(A)+ RNA from HeLa cells revealed that the abundance of the A1B mRNA was approximately 5% that of A1. The A1B protein was detected by Western blotting with an anti-A1 monoclonal antibody both in enriched preparations of basic hnRNP proteins and in 40S hnRNP particles. The A1B protein exhibits a significantly higher affinity than A1 for ssDNA. The recombinant A1B protein, expressed in Escherichia coli, shows the same electrophoretic mobility and charge as the cellular one. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 6. Fig. 7. PMID:1691095

  18. Ribosomal protein methylation in Escherichia coli: the gene prmA, encoding the ribosomal protein L11 methyltransferase, is dispensable.

    PubMed

    Vanet, A; Plumbridge, J A; Guérin, M F; Alix, J H

    1994-12-01

    The prmA gene, located at 72 min on the Escherichia coli chromosome, is the genetic determinant of ribosomal protein L11-methyltransferase activity. Mutations at this locus, prmA1 and prmA3, result in a severely undermethylated form of L11. No effect, other than the lack of methyl groups on L11, has been ascribed to these mutations. DNA sequence analysis of the mutant alleles prmA1 and prmA3 detected point mutations near the C-terminus of the protein and plasmids overproducing the wild-type and the two mutant proteins have been constructed. The wild-type PrmA protein could be crosslinked to its radiolabelled substrate, S-adenosyl-L-methionine (SAM), by u.v. irradiation indicating that it is the gene for the methyltransferase rather than a regulatory protein. One of the mutant proteins, PrmA3, was also weakly crosslinked to SAM. Both mutant enzymes when expressed from the overproducing plasmids were capable of catalysing the incorporation of 3H-labelled methyl groups from SAM to L11 in vitro. This confirmed the observation that the mutant proteins possess significant residual activity which could account for their lack of growth phenotype. However, a strain carrying an in vitro-constructed null mutation of the prmA gene, transferred to the E. coli chromosome by homologous recombination, was perfectly viable.

  19. RNA-Seq improves annotation of protein-coding genes in the cucumber genome

    PubMed Central

    2011-01-01

    Background As more and more genomes are sequenced, genome annotation becomes increasingly important in bridging the gap between sequence and biology. Gene prediction, which is at the center of genome annotation, usually integrates various resources to compute consensus gene structures. However, many newly sequenced genomes have limited resources for gene predictions. In an effort to create high-quality gene models of the cucumber genome (Cucumis sativus var. sativus), based on the EVidenceModeler gene prediction pipeline, we incorporated the massively parallel complementary DNA sequencing (RNA-Seq) reads of 10 cucumber tissues into EVidenceModeler. We applied the new pipeline to the reassembled cucumber genome and included a comparison between our predicted protein-coding gene sets and a published set. Results The reassembled cucumber genome, annotated with RNA-Seq reads from 10 tissues, has 23, 248 identified protein-coding genes. Compared with the published prediction in 2009, approximately 8, 700 genes reveal structural modifications and 5, 285 genes only appear in the reassembled cucumber genome. All the related results, including genome sequence and annotations, are available at http://cmb.bnu.edu.cn/Cucumis_sativus_v20/. Conclusions We conclude that RNA-Seq greatly improves the accuracy of prediction of protein-coding genes in the reassembled cucumber genome. The comparison between the two gene sets also suggests that it is feasible to use RNA-Seq reads to annotate newly sequenced or less-studied genomes. PMID:22047402

  20. Protein X is the product of the recA gene of Escherichia coli.

    PubMed Central

    McEntee, K

    1977-01-01

    The inducible protein X of Escherichia coli has been compared to the recA+ protein made by specialized recA transducing phages. The molecular weights and isoelectric points of these proteins are identical. Two mutations located in the recA gene that alter the electrophoretic mobility or the isoelectric point of protein X have been studied. A recA12 mutant strain, deficient in homologous recombination and repair, produces a smaller-than-normal protein X. A transducing phage carrying the recA12 allele directs the synthesis of a smaller recA protein after infection of irradiated cells. A transducing phage carrying the recA region of a tif-1 mutant strain codes for a recA protein with an isoelectric point more basic than that of the lambdaprecA+ product. The protein X of a tif-1 mutant strain shows an identical shift in its isoelectric properties. Examination of several tsl- recA- strains indicates that protein X can be induced in several missense recA mutants but is not detected in tsl- strains carrying amber or deletion mutations of the recA gene. These results demonstrate that protein X is the product of the recA gene and that the tif-1 mutation alters the properties of the recA protein. A model is suggested for autoregulation of the recA protein in the induction of functions expressed in response to DNA damage (SOS functions). Images PMID:341151

  1. Impact of Adenovirus E4-ORF3 Oligomerization and Protein Localization on Cellular Gene Expression.

    PubMed

    Vink, Elizabeth I; Zheng, Yueting; Yeasmin, Rukhsana; Stamminger, Thomas; Krug, Laurie T; Hearing, Patrick

    2015-05-13

    The Adenovirus E4-ORF3 protein facilitates virus replication through the relocalization of cellular proteins into nuclear inclusions termed tracks. This sequestration event disrupts antiviral properties associated with target proteins. Relocalization of Mre11-Rad50-Nbs1 proteins prevents the DNA damage response from inhibiting Ad replication. Relocalization of PML and Daxx impedes the interferon-mediated antiviral response. Several E4-ORF3 targets regulate gene expression, linking E4-ORF3 to transcriptional control. Furthermore, E4-ORF3 was shown to promote the formation of heterochromatin, down-regulating p53-dependent gene expression. Here, we characterize how E4-ORF3 alters cellular gene expression. Using an inducible, E4-ORF3-expressing cell line, we performed microarray experiments to highlight cellular gene expression changes influenced by E4-ORF3 expression, identifying over four hundred target genes. Enrichment analysis of these genes suggests that E4-ORF3 influences factors involved in signal transduction and cellular defense, among others. The expression of mutant E4-ORF3 proteins revealed that nuclear track formation is necessary to induce these expression changes. Through the generation of knockdown cells, we demonstrate that the observed expression changes may be independent of Daxx and TRIM33 suggesting that an additional factor(s) may be responsible. The ability of E4-ORF3 to manipulate cellular gene expression through the sequestration of cellular proteins implicates a novel role for E4-ORF3 in transcriptional regulation.

  2. Impact of Adenovirus E4-ORF3 Oligomerization and Protein Localization on Cellular Gene Expression

    PubMed Central

    Vink, Elizabeth I.; Zheng, Yueting; Yeasmin, Rukhsana; Stamminger, Thomas; Krug, Laurie T.; Hearing, Patrick

    2015-01-01

    The Adenovirus E4-ORF3 protein facilitates virus replication through the relocalization of cellular proteins into nuclear inclusions termed tracks. This sequestration event disrupts antiviral properties associated with target proteins. Relocalization of Mre11-Rad50-Nbs1 proteins prevents the DNA damage response from inhibiting Ad replication. Relocalization of PML and Daxx impedes the interferon-mediated antiviral response. Several E4-ORF3 targets regulate gene expression, linking E4-ORF3 to transcriptional control. Furthermore, E4-ORF3 was shown to promote the formation of heterochromatin, down-regulating p53-dependent gene expression. Here, we characterize how E4-ORF3 alters cellular gene expression. Using an inducible, E4-ORF3-expressing cell line, we performed microarray experiments to highlight cellular gene expression changes influenced by E4-ORF3 expression, identifying over four hundred target genes. Enrichment analysis of these genes suggests that E4-ORF3 influences factors involved in signal transduction and cellular defense, among others. The expression of mutant E4-ORF3 proteins revealed that nuclear track formation is necessary to induce these expression changes. Through the generation of knockdown cells, we demonstrate that the observed expression changes may be independent of Daxx and TRIM33 suggesting that an additional factor(s) may be responsible. The ability of E4-ORF3 to manipulate cellular gene expression through the sequestration of cellular proteins implicates a novel role for E4-ORF3 in transcriptional regulation. PMID:25984715

  3. Accelerated Evolution of Schistosome Genes Coding for Proteins Located at the Host–Parasite Interface

    PubMed Central

    Philippsen, Gisele S.; Wilson, R. Alan; DeMarco, Ricardo

    2015-01-01

    Study of proteins located at the host–parasite interface in schistosomes might provide clues about the mechanisms utilized by the parasite to escape the host immune system attack. Micro-exon gene (MEG) protein products and venom allergen-like (VAL) proteins have been shown to be present in schistosome secretions or associated with glands, which led to the hypothesis that they are important components in the molecular interaction of the parasite with the host. Phylogenetic and structural analysis of genes and their transcripts in these two classes shows that recent species-specific expansion of gene number for these families occurred separately in three different species of schistosomes. Enrichment of transposable elements in MEG and VAL genes in Schistosoma mansoni provides a credible mechanism for preferential expansion of gene numbers for these families. Analysis of the ratio between synonymous and nonsynonymous substitution rates (dN/dS) in the comparison between schistosome orthologs for the two classes of genes reveals significantly higher values when compared with a set of a control genes coding for secreted proteins, and for proteins previously localized in the tegument. Additional analyses of paralog genes indicate that exposure of the protein to the definitive host immune system is a determining factor leading to the higher than usual dN/dS values in those genes. The observation that two genes encoding S. mansoni vaccine candidate proteins, known to be exposed at the parasite surface, also display similar evolutionary dynamics suggests a broad response of the parasite to evolutionary pressure imposed by the definitive host immune system. PMID:25567667

  4. Identification of Gene-Expression Signatures and Protein Markers for Breast Cancer Grading and Staging.

    PubMed

    Yao, Fang; Zhang, Chi; Du, Wei; Liu, Chao; Xu, Ying

    2015-01-01

    The grade of a cancer is a measure of the cancer's malignancy level, and the stage of a cancer refers to the size and the extent that the cancer has spread. Here we present a computational method for prediction of gene signatures and blood/urine protein markers for breast cancer grades and stages based on RNA-seq data, which are retrieved from the TCGA breast cancer dataset and cover 111 pairs of disease and matching adjacent noncancerous tissues with pathologists-assigned stages and grades. By applying a differential expression and an SVM-based classification approach, we found that 324 and 227 genes in cancer have their expression levels consistently up-regulated vs. their matching controls in a grade- and stage-dependent manner, respectively. By using these genes, we predicted a 9-gene panel as a gene signature for distinguishing poorly differentiated from moderately and well differentiated breast cancers, and a 19-gene panel as a gene signature for discriminating between the moderately and well differentiated breast cancers. Similarly, a 30-gene panel and a 21-gene panel are predicted as gene signatures for distinguishing advanced stage (stages III-IV) from early stage (stages I-II) cancer samples and for distinguishing stage II from stage I samples, respectively. We expect these gene panels can be used as gene-expression signatures for cancer grade and stage classification. In addition, of the 324 grade-dependent genes, 188 and 66 encode proteins that are predicted to be blood-secretory and urine-excretory, respectively; and of the 227 stage-dependent genes, 123 and 51 encode proteins predicted to be blood-secretory and urine-excretory, respectively. We anticipate that some combinations of these blood and urine proteins could serve as markers for monitoring breast cancer at specific grades and stages through blood and urine tests.

  5. Identification of Gene-Expression Signatures and Protein Markers for Breast Cancer Grading and Staging

    PubMed Central

    Yao, Fang; Zhang, Chi; Du, Wei; Liu, Chao; Xu, Ying

    2015-01-01

    The grade of a cancer is a measure of the cancer's malignancy level, and the stage of a cancer refers to the size and the extent that the cancer has spread. Here we present a computational method for prediction of gene signatures and blood/urine protein markers for breast cancer grades and stages based on RNA-seq data, which are retrieved from the TCGA breast cancer dataset and cover 111 pairs of disease and matching adjacent noncancerous tissues with pathologists-assigned stages and grades. By applying a differential expression and an SVM-based classification approach, we found that 324 and 227 genes in cancer have their expression levels consistently up-regulated vs. their matching controls in a grade- and stage-dependent manner, respectively. By using these genes, we predicted a 9-gene panel as a gene signature for distinguishing poorly differentiated from moderately and well differentiated breast cancers, and a 19-gene panel as a gene signature for discriminating between the moderately and well differentiated breast cancers. Similarly, a 30-gene panel and a 21-gene panel are predicted as gene signatures for distinguishing advanced stage (stages III-IV) from early stage (stages I-II) cancer samples and for distinguishing stage II from stage I samples, respectively. We expect these gene panels can be used as gene-expression signatures for cancer grade and stage classification. In addition, of the 324 grade-dependent genes, 188 and 66 encode proteins that are predicted to be blood-secretory and urine-excretory, respectively; and of the 227 stage-dependent genes, 123 and 51 encode proteins predicted to be blood-secretory and urine-excretory, respectively. We anticipate that some combinations of these blood and urine proteins could serve as markers for monitoring breast cancer at specific grades and stages through blood and urine tests. PMID:26375396

  6. A "housekeeping" gene on the X chromosome encodes a protein similar to ubiquitin.

    PubMed Central

    Toniolo, D; Persico, M; Alcalay, M

    1988-01-01

    An X chromosome gene located 40 kilobases downstream from the G6PD gene, at Xq28, was isolated and sequenced. This gene, which we named GdX, spans about 3.5 kilobases of genomic DNA. GdX is a single-copy gene, is conserved in evolution, and has the features of a "housekeeping" gene. At its 5' end, a cluster of CpG dinucleotides is methylated on the inactive X chromosome and unmethylated on the active X chromosome. The GdX gene can code for a 157 amino acid protein, GdX. Residues 1-74 of GdX show 43% identity to ubiquitin, a highly conserved 76 amino acid protein. The COOH-terminal moiety of GdX is characterized in its central part (residues 110-128) by a sequence homologous to the COOH-terminal hormonogenic site of thyroglobulin. The structural organization of the GdX protein suggests the existence of a family of genes, in addition to the ubiquitin gene, that could play specific roles in key cellular processes, possibly through protein-protein recognition. Images PMID:2829204

  7. Recombinant HT{sub m4} gene, protein and assays

    DOEpatents

    Lim, B.; Adra, C.N.; Lelias, J.M.

    1996-09-03

    The invention relates to a recombinant DNA molecule which encodes a HT{sub m4} protein, a transformed host cell which has been stably transfected with a DNA molecule which encodes a HT{sub m4} protein and a recombinant HT{sub m4} protein. The invention also relates to a method for detecting the presence of a hereditary atopy. 2 figs.

  8. Specific binding of TUF factor to upstream activation sites of yeast ribosomal protein genes.

    PubMed Central

    Vignais, M L; Woudt, L P; Wassenaar, G M; Mager, W H; Sentenac, A; Planta, R J

    1987-01-01

    Transcription activation of yeast ribosomal protein genes is mediated through homologous, 12-nucleotide-long and, in general, duplicated upstream promoter elements (HOMOL1 and RPG, referred to as UASrpg). As shown previously, a yeast protein factor, TUF, interacts specifically with these conserved boxes in the 5'-flanking sequences of the elongation factor genes TEF1 and TEF2 and the ribosomal protein gene RP51A. We have now extended our studies of TUF-UASrpg binding by analysing--using footprinting and gel electrophoretic retardation techniques--the genes encoding the ribosomal proteins L25, rp28 (both copy genes), S24 + L46 and S33. Most, but not all, conserved sequence elements occurring in front of these genes, turned out to represent binding sites for the same factor, TUF. The two functionally important boxes that are found in a tandem arrangement (a characteristic of many rp genes) upstream of the L25 gene are indistinguishable in their factor binding specificity. Large differences were shown to exist in the affinity of the TUF factor for the various individual boxes and in the half-life of the protein-DNA complexes. No binding cooperativity could be demonstrated on adjacent sites on L25 or RP51A promoters. Based on binding data, the UASrpg sequence ACACCCATACAT appears to be the one recognized most efficiently by the TUF factor. Previously, no conserved box was found in front of the gene encoding S33. Nevertheless, complex formation with the protein fraction used was observed in the upstream region of the S33 gene. Competition experiments disclosed the existence of an additional binding component, distinct from TUF. This component may possibly regulate a subset of genes for the translational apparatus. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 7. PMID:3301327

  9. Identification of nuclear genes encoding chloroplast-localized proteins required for embryo development in Arabidopsis.

    PubMed

    Bryant, Nicole; Lloyd, Johnny; Sweeney, Colleen; Myouga, Fumiyoshi; Meinke, David

    2011-04-01

    We describe here the diversity of chloroplast proteins required for embryo development in Arabidopsis (Arabidopsis thaliana). Interfering with certain chloroplast functions has long been known to result in embryo lethality. What has not been reported before is a comprehensive screen for embryo-defective (emb) mutants altered in chloroplast proteins. From a collection of transposon and T-DNA insertion lines at the RIKEN chloroplast function database (http://rarge.psc.riken.jp/chloroplast/) that initially appeared to lack homozygotes and segregate for defective seeds, we identified 23 additional examples of EMB genes that likely encode chloroplast-localized proteins. Fourteen gene identities were confirmed with allelism tests involving duplicate mutant alleles. We then queried journal publications and the SeedGenes database (www.seedgenes.org) to establish a comprehensive dataset of 381 nuclear genes encoding chloroplast proteins of Arabidopsis associated with embryo-defective (119 genes), plant pigment (121 genes), gametophyte (three genes), and alternate (138 genes) phenotypes. Loci were ranked based on the level of certainty that the gene responsible for the phenotype had been identified and the protein product localized to chloroplasts. Embryo development is frequently arrested when amino acid, vitamin, or nucleotide biosynthesis is disrupted but proceeds when photosynthesis is compromised and when levels of chlorophyll, carotenoids, or terpenoids are reduced. Chloroplast translation is also required for embryo development, with genes encoding chloroplast ribosomal and pentatricopeptide repeat proteins well represented among EMB datasets. The chloroplast accD locus, which is necessary for fatty acid biosynthesis, is essential in Arabidopsis but not in Brassica napus or maize (Zea mays), where duplicated nuclear genes compensate for its absence or loss of function.

  10. Cloning, sequencing and expression of the transferrin-binding protein 1 gene from Actinobacillus pleuropneumoniae.

    PubMed Central

    Daban, M; Medrano, A; Querol, E

    1996-01-01

    Two outer-membrane proteins are involved in the uptake of iron from transferrin by certain Gram-negative bacteria, transferrin-binding proteins 1 and 2. The gene encoding transferrin-binding protein 1 from a serotype 1 isolate of the Gram-negative pathogen Actinobacillus pleuropneumoniae was cloned, and a fragment encoding 700 amino acids of Tbp1 was expressed in Escherichia coli. We also report here sequencing of the tbpl gene and a comparison of the deduced amino acid sequence with Tbpls from related species. The predicted polypeptide product of tbpl is a 106 kDa protein with a 22-residue signal peptide. PMID:8670116

  11. Molecular and Genomic Analysis of Genes Encoding Surface-Anchored Proteins from Clostridium difficile

    PubMed Central

    Karjalainen, Tuomo; Waligora-Dupriet, Anne-Judith; Cerquetti, Marina; Spigaglia, Patrizia; Maggioni, Andrea; Mauri, Pierluigi; Mastrantonio, Paola

    2001-01-01

    The gene slpA, encoding the S-layer precursor protein in the virulent Clostridium difficile strains C253 and 79–685, was identified. The precursor protein carries a C-terminal highly conserved anchoring domain, similar to the one found in the Cwp66 adhesin (previously characterized in strain 79–685), an SLH domain, and a variable N-terminal domain mediating cell adherence. The genes encoding the S-layer precursor proteins and the Cwp66 adhesin are present in a genetic locus carrying 17 open reading frames, 11 of which encode a similar two-domain architecture, likely to include surface-anchored proteins. PMID:11292772

  12. Dominant-Negative Proteins in Herpesviruses – From Assigning Gene Function to Intracellular Immunization

    PubMed Central

    Mühlbach, Hermine; Mohr, Christian A.; Ruzsics, Zsolt; Koszinowski, Ulrich H.

    2009-01-01

    Investigating and assigning gene functions of herpesviruses is a process, which profits from consistent technical innovation. Cloning of bacterial artificial chromosomes encoding herpesvirus genomes permits nearly unlimited possibilities in the construction of genetically modified viruses. Targeted or randomized screening approaches allow rapid identification of essential viral proteins. Nevertheless, mapping of essential genes reveals only limited insight into function. The usage of dominant-negative (DN) proteins has been the tool of choice to dissect functions of proteins during the viral life cycle. DN proteins also facilitate the analysis of host-virus interactions. Finally, DNs serve as starting-point for design of new antiviral strategies. PMID:21994555

  13. Protein-Protein Interaction and Pathway Analyses of Top Schizophrenia Genes Reveal Schizophrenia Susceptibility Genes Converge on Common Molecular Networks and Enrichment of Nucleosome (Chromatin) Assembly Genes in Schizophrenia Susceptibility Loci

    PubMed Central

    Luo, Xiongjian; Huang, Liang; Jia, Peilin

    2014-01-01

    Recent genome-wide association studies have identified many promising schizophrenia candidate genes and demonstrated that common polygenic variation contributes to schizophrenia risk. However, whether these genes represent perturbations to a common but limited set of underlying molecular processes (pathways) that modulate risk to schizophrenia remains elusive, and it is not known whether these genes converge on common biological pathways (networks) or represent different pathways. In addition, the theoretical and genetic mechanisms underlying the strong genetic heterogeneity of schizophrenia remain largely unknown. Using 4 well-defined data sets that contain top schizophrenia susceptibility genes and applying protein-protein interaction (PPI) network analysis, we investigated the interactions among proteins encoded by top schizophrenia susceptibility genes. We found proteins encoded by top schizophrenia susceptibility genes formed a highly significant interconnected network, and, compared with random networks, these PPI networks are statistically highly significant for both direct connectivity and indirect connectivity. We further validated these results using empirical functional data (transcriptome data from a clinical sample). These highly significant findings indicate that top schizophrenia susceptibility genes encode proteins that significantly directly interacted and formed a densely interconnected network, suggesting perturbations of common underlying molecular processes or pathways that modulate risk to schizophrenia. Our findings that schizophrenia susceptibility genes encode a highly interconnected protein network may also provide a novel explanation for the observed genetic heterogeneity of schizophrenia, ie, mutation in any member of this molecular network will lead to same functional consequences that eventually contribute to risk of schizophrenia. PMID:23671194

  14. Protein-protein interaction and pathway analyses of top schizophrenia genes reveal schizophrenia susceptibility genes converge on common molecular networks and enrichment of nucleosome (chromatin) assembly genes in schizophrenia susceptibility loci.

    PubMed

    Luo, Xiongjian; Huang, Liang; Jia, Peilin; Li, Ming; Su, Bing; Zhao, Zhongming; Gan, Lin

    2014-01-01

    Recent genome-wide association studies have identified many promising schizophrenia candidate genes and demonstrated that common polygenic variation contributes to schizophrenia risk. However, whether these genes represent perturbations to a common but limited set of underlying molecular processes (pathways) that modulate risk to schizophrenia remains elusive, and it is not known whether these genes converge on common biological pathways (networks) or represent different pathways. In addition, the theoretical and genetic mechanisms underlying the strong genetic heterogeneity of schizophrenia remain largely unknown. Using 4 well-defined data sets that contain top schizophrenia susceptibility genes and applying protein-protein interaction (PPI) network analysis, we investigated the interactions among proteins encoded by top schizophrenia susceptibility genes. We found proteins encoded by top schizophrenia susceptibility genes formed a highly significant interconnected network, and, compared with random networks, these PPI networks are statistically highly significant for both direct connectivity and indirect connectivity. We further validated these results using empirical functional data (transcriptome data from a clinical sample). These highly significant findings indicate that top schizophrenia susceptibility genes encode proteins that significantly directly interacted and formed a densely interconnected network, suggesting perturbations of common underlying molecular processes or pathways that modulate risk to schizophrenia. Our findings that schizophrenia susceptibility genes encode a highly interconnected protein network may also provide a novel explanation for the observed genetic heterogeneity of schizophrenia, ie, mutation in any member of this molecular network will lead to same functional consequences that eventually contribute to risk of schizophrenia.

  15. Cloning, sequencing and expression of the flagellin core protein and other genes encoding structural proteins of the Vibrio cholerae flagellum.

    PubMed

    Das, M; Chopra, A K; Wood, T; Peterson, J W

    1998-08-15

    Vibrio cholerae is a Gram-negative bacterium with a single polar flagellum. Motility is an important virulence factor for this non-invasive pathogen. We cloned and sequenced a locus in V. cholerae V86 (El Tor, Inaba) that contained five different structural genes of the flagellum. The cloned genes and their products were assigned names and functions based on homology with sequences of similar genes and their products from other related bacteria. All of these genes of V. cholerae V86, namely, flgI, J, M, L and flaA, were transcribed in the same direction. These genes respectively encoded the P- and L-ring proteins, the hook-associated proteins 1 and 3 and the flagellin core protein of the flagellum. Our data indicated the presence of more than one flagellar locus in V. cholerae which could provide a means of immunoavoidance during infection. When compared with homologs in other bacteria, the flagellin core protein of V. cholerae exhibited conservation in the N- and C-termini, but had diverged in the central region.

  16. The blue copper protein gene of Alcaligenes faecalis S-6 directs secretion of blue copper protein from Escherichia coli cells.

    PubMed Central

    Yamamoto, K; Uozumi, T; Beppu, T

    1987-01-01

    The gene encoding a blue copper protein (a member of the pseudoazurins) of 123 amino acid residues, containing a single type I Cu2+ ion, was cloned from Alcaligenes faecalis S-6. The nucleotide sequence of the coding region, as well as the 5'- and 3'-flanking regions, was determined. The deduced amino acid sequence after Glu-24 coincided with the reported sequence of the blue protein, and its NH2-terminal sequence of 23 residues resembled a typical signal peptide. The cloned gene was expressed under the control of the tac promoter in Escherichia coli, and the correctly processed blue protein was secreted into the periplasm. The blue protein produced in E. coli possessed the activity to transfer electrons to the copper-containing nitrite reductase of A. faecalis S-6 in vitro. Images PMID:2824441

  17. A Hox Gene, Antennapedia, Regulates Expression of Multiple Major Silk Protein Genes in the Silkworm Bombyx mori.

    PubMed

    Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki

    2016-03-25

    Hoxgenes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hoxgenes can also function in terminally differentiated tissue of the lepidopteranBombyx mori In this species,Antennapedia(Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antpcan regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antpin the posterior silk gland induced ectopic expression of major silk protein genes such assericin-3,fhxh4, and fhxh5 These genes are normally expressed specifically in the middle silk gland as is Antp Therefore, the evidence strongly suggests that Antpactivates these silk protein genes in the middle silk gland. The putativesericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antpdirectly activates their expression. We also found that the pattern of gene expression was well conserved between B. moriand the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori We suggest that Hoxgenes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  18. Bimolecular fluorescence complementation (BiFC) assay for protein-protein interaction in onion cells using the helios gene gun.

    PubMed

    Hollender, Courtney A; Liu, Zhongchi

    2010-06-12

    Investigation of gene function in diverse organisms relies on knowledge of how the gene products interact with each other in their normal cellular environment. The Bimolecular Fluorescence Complementation (BiFC) Assay(1) allows researchers to visualize protein-protein interactions in living cells and has become an essential research tool. This assay is based on the facilitated association of two fragments of a fluorescent protein (GFP) that are each fused to a potential interacting protein partner. The interaction of the two protein partners would facilitate the association of the N-terminal and C-terminal fragment of GFP, leading to fluorescence. For plant researchers, onion epidermal cells are an ideal experimental system for conducting the BiFC assay because of the ease in obtaining and preparing onion tissues and the direct visualization of fluorescence with minimal background fluorescence. The Helios Gene Gun (BioRad) is commonly used for bombarding plasmid DNA into onion cells. We demonstrate the use of Helios Gene Gun to introduce plasmid constructs for two interacting Arabidopsis thaliana transcription factors, SEUSS (SEU) and LEUNIG HOMOLOG (LUH)(2) and the visualization of their interactions mediated by BiFC in onion epidermal cells.

  19. Locating overlapping dense subgraphs in gene (protein) association networks and predicting novel protein functional groups among these subgraphs

    NASA Astrophysics Data System (ADS)

    Palla, Gergely; Derenyi, Imre; Farkas, Illes J.; Vicsek, Tamas

    2006-03-01

    Most tasks in a cell are performed not by individual proteins, but by functional groups of proteins (either physically interacting with each other or associated in other ways). In gene (protein) association networks these groups show up as sets of densely connected nodes. In the yeast, Saccharomyces cerevisiae, known physically interacting groups of proteins (called protein complexes) strongly overlap: the total number of proteins contained by these complexes by far underestimates the sum of their sizes (2750 vs. 8932). Thus, most functional groups of proteins, both physically interacting and other, are likely to share many of their members with other groups. However, current algorithms searching for dense groups of nodes in networks usually exclude overlaps. With the aim to discover both novel functions of individual proteins and novel protein functional groups we combine in protein association networks (i) a search for overlapping dense subgraphs based on the Clique Percolation Method (CPM) (Palla, G., et.al. Nature 435, 814-818 (2005), http://angel.elte.hu/clustering), which explicitly allows for overlaps among the groups, and (ii) a verification and characterization of the identified groups of nodes (proteins) with the help of standard annotation databases listing known functions.

  20. Regulated protein expression for in vivo gene therapy for neurological disorders: progress, strategies, and issues.

    PubMed

    Manfredsson, Fredric P; Bloom, David C; Mandel, Ronald J

    2012-11-01

    The field of in vivo gene therapy has matured to the point where there are numerous clinical trials underway including late-stage clinical trials. Several viral vectors are especially efficient and support lifetime protein expression in the brain and a number of clinical trials are underway for various progressive or chronic neurological disorders including Parkinson's disease, Alzheimer's disease, and Batten's disease. To date, however, none of the vectors in clinical use have any direct way to reverse or control their transgene product in the event continued protein expression should become problematic. Several schemes that use elements within the vector design have been developed that allow an external drug or pro-drug to alter ongoing protein expression after in vivo gene transfer. The most promising and most studied regulated protein expression methods for in vivo gene transfer are reviewed. In addition, potential scientific and clinical advantages of transgene regulation for gene therapy are discussed.

  1. Post-transcriptional regulation of noise in protein distributions during gene expression.

    PubMed

    Jia, Tao; Kulkarni, Rahul V

    2010-07-02

    The intrinsic stochasticity of gene expression can lead to a large variability of protein levels across a population of cells. Variability (or noise) in protein distributions can be modulated by cellular mechanisms of gene regulation; in particular, there is considerable interest in understanding the role of post-transcriptional regulation. To address this issue, we propose and analyze a stochastic model for post-transcriptional regulation of gene expression. The analytical solution of the model provides insight into the effects of different mechanisms of post-transcriptional regulation on the noise in protein distributions. The results obtained also demonstrate how different sources of intrinsic noise in gene expression can be discriminated based on observations of regulated protein distributions.

  2. Post-Transcriptional Regulation of Noise in Protein Distributions during Gene Expression

    NASA Astrophysics Data System (ADS)

    Jia, Tao; Kulkarni, Rahul V.

    2010-07-01

    The intrinsic stochasticity of gene expression can lead to a large variability of protein levels across a population of cells. Variability (or noise) in protein distributions can be modulated by cellular mechanisms of gene regulation; in particular, there is considerable interest in understanding the role of post-transcriptional regulation. To address this issue, we propose and analyze a stochastic model for post-transcriptional regulation of gene expression. The analytical solution of the model provides insight into the effects of different mechanisms of post-transcriptional regulation on the noise in protein distributions. The results obtained also demonstrate how different sources of intrinsic noise in gene expression can be discriminated based on observations of regulated protein distributions.

  3. Regulation of Drosophila yolk protein genes by an ovary-specific GATA factor

    SciTech Connect

    Lossky, M.; Wensink, P.C.

    1995-12-01

    This report investigates the expression of the genes for yolk protein of Drosophila melanogaster and the tissue specific function of the regulatory element which activates transcription in vivo. 70 refs., 8 figs.

  4. Products of lipid, protein and RNA oxidation as signals and regulators of gene expression in plants

    PubMed Central

    Chmielowska-Bąk, Jagna; Izbiańska, Karolina; Deckert, Joanna

    2015-01-01

    Reactive oxygen species (ROS) are engaged in several processes essential for normal cell functioning, such as differentiation, anti-microbial defense, stimulus sensing and signaling. Interestingly, recent studies imply that cellular signal transduction and gene regulation are mediated not only directly by ROS but also by the molecules derived from ROS-mediated oxidation. Lipid peroxidation leads to non-enzymatic formation of oxylipins. These molecules were shown to modulate expression of signaling associated genes including genes encoding phosphatases, kinases and transcription factors. Oxidized peptides derived from protein oxidation might be engaged in organelle-specific ROS signaling. In turn, oxidation of particular mRNAs leads to decrease in the level of encoded proteins and thus, contributes to the post-transcriptional regulation of gene expression. Present mini review summarizes latest findings concerning involvement of products of lipid, protein and RNA oxidation in signal transduction and gene regulation. PMID:26082792

  5. EvoTol: a protein-sequence based evolutionary intolerance framework for disease-gene prioritization

    PubMed Central

    Rackham, Owen J. L.; Shihab, Hashem A.; Johnson, Michael R.; Petretto, Enrico

    2015-01-01

    Methods to interpret personal genome sequences are increasingly required. Here, we report a novel framework (EvoTol) to identify disease-causing genes using patient sequence data from within protein coding-regions. EvoTol quantifies a gene's intolerance to mutation using evolutionary conservation of protein sequences and can incorporate tissue-specific gene expression data. We apply this framework to the analysis of whole-exome sequence data in epilepsy and congenital heart disease, and demonstrate EvoTol's ability to identify known disease-causing genes is unmatched by competing methods. Application of EvoTol to the human interactome revealed networks enriched for genes intolerant to protein sequence variation, informing novel polygenic contributions to human disease. PMID:25550428

  6. The role of green fluorescent protein (GFP) in transgenic plants to reduce gene silencing phenomena.

    PubMed

    El-Shemy, Hany A; Khalafalla, Mutasim M; Ishimoto, Masao

    2009-01-01

    The green fluorescent protein (GFP) of jellyfish (Aequorea victoria) has significant advantages over other reporter genes, because expression can be detected in living cells without any substrates. Recently, epigenetic phenomena are important to consider in plant biotechnology experiments for elucidate unknown mechanism. Therefore, soybean immature cotyledons were generated embryogenesis cells and engineered with two different gene constructs (pHV and pHVS) using gene gun method. Both constructs contain a gene conferring resistance to hygromycin (hpt) as a selective marker and a modified glycinin (11S globulin) gene (V3-1) as a target. However, sGFP(S65T) as a reporter gene was used only in pHVS as a reporter gene for study the relation between using sGFP(S65T) and gene silencing phenomena. Fluorescence microscopic was used for screening after the selection of hygromycin, identified clearly the expression of sGFP(S65T) in the transformed soybean embryos bombarded with the pHVS construct. Protein analysis was used to detect gene expression overall seeds using SDS-PAGE. Percentage of gene down regulation was highly in pHV construct compared with pHVS. Thus, sGFP(S65T ) as a reporter gene in vector system may be play useful role for transgenic evaluation and avoid gene silencing in plants for the benefit of plant transformation system.

  7. Chromatin poises miRNA- and protein-coding genes for expression.

    PubMed

    Barski, Artem; Jothi, Raja; Cuddapah, Suresh; Cui, Kairong; Roh, Tae-Young; Schones, Dustin E; Zhao, Keji

    2009-10-01

    Chromatin modifications have been implicated in the regulation of gene expression. While association of certain modifications with expressed or silent genes has been established, it remains unclear how changes in chromatin environment relate to changes in gene expression. In this article, we used ChIP-seq (chromatin immunoprecipitation with massively parallel sequencing) to analyze the genome-wide changes in chromatin modifications during activation of total human CD4(+) T cells by T-cell receptor (TCR) signaling. Surprisingly, we found that the chromatin modification patterns at many induced and silenced genes are relatively stable during the short-term activation of resting T cells. Active chromatin modifications were already in place for a majority of inducible protein-coding genes, even while the genes were silent in resting cells. Similarly, genes that were silenced upon T-cell activation retained positive chromatin modifications even after being silenced. To investigate if these observations are also valid for miRNA-coding genes, we systematically identified promoters for known miRNA genes using epigenetic marks and profiled their expression patterns using deep sequencing. We found that chromatin modifications can poise miRNA-coding genes as well. Our data suggest that miRNA- and protein-coding genes share similar mechanisms of regulation by chromatin modifications, which poise inducible genes for activation in response to environmental stimuli.

  8. Combining sequence and Gene Ontology for protein module detection in the Weighted Network.

    PubMed

    Yu, Yang; Liu, Jie; Feng, Nuan; Song, Bo; Zheng, Zeyu

    2017-01-07

    Studies of protein modules in a Protein-Protein Interaction (PPI) network contribute greatly to the understanding of biological mechanisms. With the development of computing science, computational approaches have played an important role in locating protein modules. In this paper, a new approach combining Gene Ontology and amino acid background frequency is introduced to detect the protein modules in the weighted PPI networks. The proposed approach mainly consists of three parts: the feature extraction, the weighted graph construction and the protein complex detection. Firstly, the topology-sequence information is utilized to present the feature of protein complex. Secondly, six types of the weighed graph are constructed by combining PPI network and Gene Ontology information. Lastly, protein complex algorithm is applied to the weighted graph, which locates the clusters based on three conditions, including density, network diameter and the included angle cosine. Experiments have been conducted on two protein complex benchmark sets for yeast and the results show that the approach is more effective compared to five typical algorithms with the performance of f-measure and precision. The combination of protein interaction network with sequence and gene ontology data is helpful to improve the performance and provide a optional method for protein module detection. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Combining Sequence and Gene Ontology for Protein Module Detection in the Weighted Network.

    PubMed

    Yu, Yang; Liu, Jie; Feng, Nuan; Song, Bo; Zheng, Zeyu

    2016-10-29

    Studies of protein modules in a Protein-Protein Interaction (PPI) network contribute greatly to the understanding of biological mechanisms. With the development of computing science, computational approaches have played an important role in locating protein modules. In this paper, a new approach combining Gene Ontology and amino acid background frequency is introduced to detect the protein modules in the weighted PPI networks. The proposed approach mainly consists of three parts: the feature extraction, the weighted graph construction and the protein complex detection. Firstly, the topology-sequence information is utilized to present the feature of protein complex. Secondly, six types of the weighed graph are constructed by combining PPI network and Gene Ontology information. Lastly, protein complex algorithm is applied to the weighted graph, which locates the clusters based on three conditions, including density, network diameter and the included angle cosine. Experiments have been conducted on two protein complex benchmark sets for yeast and the results show that the approach is more effective compared to five typical algorithms with the performance of f-measure and precision. The combination of protein interaction network with sequence and gene ontology data is helpful to improve the performance and provide a optional method for protein module detection.

  10. Genes for Drosophila small heat shock proteins are regulated differently by ecdysterone

    SciTech Connect

    Amin, J.; Voellmy, R. ); Mestril, R. )

    1991-12-01

    Genes for small heat shock proteins (hsp27 to hsp22) are activated in late third-instar larvae of Drosophila melanogaster in the absence of heat stress. This regulation has been stimulated in cultured Drosophila cells in which the genes are activated by the addition of ecdysterone. Sequence elements (HERE) involved in ecdysterone regulation of the hsp27 and hsp23 genes have been defined by transfection studies and have recently been identified as binding sites for ecdysterone receptor. The authors report here that the shp27 and hsp23 genes are regulated differently by ecdysterone. The hsp27 gene is activated rapidly by ecdysterone, even in the absence of protein synthesis. In contrast, high-level expression of the hsp23 gene begins only after a lag of about 6 h, is dependent on the continuous presence of ecdysterone, and is sensitive to low concentrations of protein synthesis inhibitors. Transfection experiments with reported constructs show that this difference in regulation is at the transcriptional level. Synthetic hsp27 or hsp23 HERE sequences confer hsp27- or hsp23-type ecdysterone regulation on a basal promoter. These findings indicate that the hsp27 gene is primary, and the hsp23 gene is mainly a secondary, hormone-responsive gene. Ecdysterone receptor is implied to play a role in the regulation of both genes.

  11. Predicting protein phosphorylation from gene expression: top methods from the IMPROVER Species Translation Challenge

    PubMed Central

    Biehl, Michael; Bilal, Erhan; Hormoz, Sahand; Meyer, Pablo; Norel, Raquel; Rhrissorrakrai, Kahn; Bhanot, Gyan; Luo, Feng; Tarca, Adi L.

    2015-01-01

    Motivation: Using gene expression to infer changes in protein phosphorylation levels induced in cells by various stimuli is an outstanding problem. The intra-species protein phosphorylation challenge organized by the IMPROVER consortium provided the framework to identify the best approaches to address this issue. Results: Rat lung epithelial cells were treated with 52 stimuli, and gene expression and phosphorylation levels were measured. Competing teams used gene expression data from 26 stimuli to develop protein phosphorylation prediction models and were ranked based on prediction performance for the remaining 26 stimuli. Three teams were tied in first place in this challenge achieving a balanced accuracy of about 70%, indicating that gene expression is only moderately predictive of protein phosphorylation. In spite of the similar performance, the approaches used by these three teams, described in detail in this article, were different, with the average number of predictor genes per phosphoprotein used by the teams ranging from 3 to 124. However, a significant overlap of gene signatures between teams was observed for the majority of the proteins considered, while Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were enriched in the union of the predictor genes of the three teams for multiple proteins. Availability and implementation: Gene expression and protein phosphorylation data are available from ArrayExpress (E-MTAB-2091). Software implementation of the approach of Teams 49 and 75 are available at http://bioinformaticsprb.med.wayne.edu and http://people.cs.clemson.edu/∼luofeng/sbv.rar, respectively. Contact: gyanbhanot@gmail.com or luofeng@clemson.edu or atarca@med.wayne.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25061067

  12. Molecular characterization of a human gene for S28 ribosomal binding protein

    SciTech Connect

    Wong, P.; Borst, D.E.; Chader, G.J.

    1994-09-01

    The mechanism of ribosome action and the ribosomal binding proteins which cooperatively interact in the working of this structure are not completely understood. Theoretically, mutations in genes that encode these proteins may compromise the efficiency of protein synthesis and therefore lead to a functional disorder. In the course of our search for human genes which show homology to the C. elegans CED-4 death gene, we have serendipitously identified one of the human S28 ribosomal binding protein genes as a random fragment fused to the end of one of our putative CED-4 positive homologue clones. The cloned S28 fragment consists of 381 nucleotides with a putative open reading frame of 113 amino acids. Sequence comparisons to GenBank revealed significant homologies to ribosomal binding protein genes in other species (including the rat S28 ribosomal binding protein gene) indicating that the S28 gene sequence is highly conserved. This finding is confirmed by zooblot analysis. Significant homologies also exist to two human expressed tagged sites (HUMRIBPROB; L05091 and HSAFIF072; Z21908). Analysis of the putative S28 peptide sequence allows insights into possible functional regions of the protein. The identification of 8 distinct bands upon Southern analysis of the S28 fragments suggests that there are multiple copies of the S28 gene in the human genome. Mapping of the S28 fragment on somatic cell hybrid panels identified distinct S28 gene loci on chromosomes 1, 2, 7, 10, 11, 12, 17 expression in adult tissues (pancreas, kidney, muscle, liver, lung, placenta, brain, heart, and retina) as well as in fetal tissues (kidney, liver, lung, brain, and heart).

  13. Integrated Method to Attach DNA Handles and Functionally Select Proteins to Study Folding and Protein-Ligand Interactions with Optical Tweezers.

    PubMed

    Hao, Yuxin; Canavan, Clare; Taylor, Susan S; Maillard, Rodrigo A

    2017-09-07

    Optical tweezers has emerged as a powerful tool to study folding, ligand binding, and motor enzymes. The manipulation of proteins with optical tweezers requires attaching molecular handles to the protein of interest. Here, we describe a novel method that integrates the covalent attachment of DNA handles to target proteins with a selection step for functional and properly folded molecules. In addition, this method enables obtaining protein molecules in different liganded states and can be used with handles of different lengths. We apply this method to study the cAMP binding domain A (CBD-A) of Protein kinase A. We find that the functional selection step drastically improves the reproducibility and homogeneity of the single molecule data. In contrast, without a functional selection step, proteins often display misfolded conformations. cAMP binding stabilizes the CBD-A against a denaturing force, and increases the folded state lifetime. Data obtained with handles of 370 and 70 base pairs are indistinguishable, but at low forces short handles provide a higher spatial resolution. Altogether, this method is flexible, selects for properly folded molecules in different liganded states, and can be readily applicable to study protein folding or protein-ligand interactions with force spectroscopy that require molecular handles.

  14. Recommended nomenclature for five mammalian carboxylesterase gene families: human, mouse, and rat genes and proteins.

    PubMed

    Holmes, Roger S; Wright, Matthew W; Laulederkind, Stanley J F; Cox, Laura A; Hosokawa, Masakiyo; Imai, Teruko; Ishibashi, Shun; Lehner, Richard; Miyazaki, Masao; Perkins, Everett J; Potter, Phillip M; Redinbo, Matthew R; Robert, Jacques; Satoh, Tetsuo; Yamashita, Tetsuro; Yan, Bingfan; Yokoi, Tsuyoshi; Zechner, Rudolf; Maltais, Lois J

    2010-10-01

    Mammalian carboxylesterase (CES or Ces) genes encode enzymes that participate in xenobiotic, drug, and lipid metabolism in the body and are members of at least five gene families. Tandem duplications have added more genes for some families, particularly for mouse and rat genomes, which has caused confusion in naming rodent Ces genes. This article describes a new nomenclature system for human, mouse, and rat carboxylesterase genes that identifies homolog gene families and allocates a unique name for each gene. The guidelines of human, mouse, and rat gene nomenclature committees were followed and "CES" (human) and "Ces" (mouse and rat) root symbols were used followed by the family number (e.g., human CES1). Where multiple genes were identified for a family or where a clash occurred with an existing gene name, a letter was added (e.g., human CES4A; mouse and rat Ces1a) that reflected gene relatedness among rodent species (e.g., mouse and rat Ces1a). Pseudogenes were named by adding "P" and a number to the human gene name (e.g., human CES1P1) or by using a new letter followed by ps for mouse and rat Ces pseudogenes (e.g., Ces2d-ps). Gene transcript isoforms were named by adding the GenBank accession ID to the gene symbol (e.g., human CES1_AB119995 or mouse Ces1e_BC019208). This nomenclature improves our understanding of human, mouse, and rat CES/Ces gene families and facilitates research into the structure, function, and evolution of these gene families. It also serves as a model for naming CES genes from other mammalian species.

  15. Molecular cloning and sequencing of the gene encoding the fimbrial subunit protein of Bacteroides gingivalis.

    PubMed Central

    Dickinson, D P; Kubiniec, M A; Yoshimura, F; Genco, R J

    1988-01-01

    The gene encoding the fimbrial subunit protein of Bacteroides gingivalis 381, fimbrilin, has been cloned and sequenced. The gene was present as a single copy on the bacterial chromosome, and the codon usage in the gene conformed closely to that expected for an abundant protein. The predicted size of the mature protein was 35,924 daltons, and the secretory form may have had a 10-amino-acid, hydrophilic leader sequence similar to the leader sequences of the MePhe fimbriae family. The protein sequence had no marked similarity to known fimbrial sequences, and no homologous sequences could be found in other black-pigmented Bacteroides species, suggesting that fimbrillin represents a class of fimbrial subunit protein of limited distribution. Images PMID:2895100

  16. Phylogenetic analysis of immunomodulatory protein genes of camelpoxvirus obtained from India.

    PubMed

    Nagarajan, G; Swami, Shelesh Kumar; Dahiya, Shyam Singh; Sivakumar, G; Yadav, Vinay Kumar; Tuteja, F C; Narnaware, S D; Patil, N V

    2013-07-01

    The haemagglutinin (HA) encoding gene and genes encoding for immunomodulatory proteins i.e., schlafen-like protein, epidermal growth factor and golgi anti apoptotic protein of camelpoxvirus (CMLV) obtained from Indian dromedarian camels were cloned and characterized. In this study, the size of the HA encoding gene obtained from the Indian CMLV is 941 bp which is only partial. Sequence analysis of schlafen-like protein gene revealed that CMLV obtained from India shared 99.6% identity with CMLV-Iran and CMLV-Kazakhstan strains both at nucleotide and amino acid level. The size of epidermal growth factor (EGF) gene of Indian CMLV obtained in this study was 418 bp, which was due to the addition of one cytosine residue position 132 of EGF gene of Indian CMLV. Sequence analysis revealed that the Golgi anti-apoptotic protein (GAAP) of Indian CMLV shared 99.5% sequence identity both at the nucleotide and amino acid level with CMLV-Kazakhstan. Based on the nucleotide and amino acid sequence identities and phylogenetic analyses of these genes, it is found that CMLV-India is forming a cluster with Kazakhstan and Iranian CMLV isolates. Copyright © 2013 Elsevier Ltd. All rights reserved.

  17. Eukaryotic expression vectors containing genes encoding plant proteins for killing of cancer cells.

    PubMed

    Glinka, Elena M

    2013-12-01

    Gene therapy has attracted attention for its potential to specifically and efficiently target cancer cells with minimal toxicity to normal cells. At present, it offers a promising direction for the treatment of cancer patients. Numerous vectors have been engineered for the sole purpose of killing cancer cells, and some have successfully suppressed malignant tumours. Many plant proteins have anticancer properties; consequently, genes encoding some of these proteins are being used to design constructs for the inhibition of multiplying cancer cells. Data addressing the function of vectors harbouring genes specifically encoding ricin, saporin, lunasin, linamarase, and tomato thymidine kinase 1 under the control of different promoters are summarised here. Constructs employing genes to encode cytotoxic proteins as well as constructs employing genes of enzymes that convert a nontoxic prodrug into a toxic drug are considered here. Generation of eukaryotic expression vectors containing genes encoding plant proteins for killing of cancer cells may permit the broadening of cancer gene therapy strategy, particularly because of the specific mode of action of anticancer plant proteins. Copyright © 2013 Elsevier Ltd. All rights reserved.

  18. MYB98 positively regulates a battery of synergid-expressed genes encoding filiform apparatus localized proteins.

    PubMed

    Punwani, Jayson A; Rabiger, David S; Drews, Gary N

    2007-08-01

    The synergid cells within the female gametophyte are essential for reproduction in angiosperms. MYB98 encodes an R2R3-MYB protein required for pollen tube guidance and filiform apparatus formation by the synergid cells. To test the predicted function of MYB98 as a transcriptional regulator, we determined its subcellular localization and examined its DNA binding properties. We show that MYB98 binds to a specific DNA sequence (TAAC) and that a MYB98-green fluorescent protein fusion protein localizes to the nucleus, consistent with a role in transcriptional regulation. To identify genes regulated by MYB98, we tested previously identified synergid-expressed genes for reduced expression in myb98 female gametophytes and identified 16 such genes. We dissected the promoter of one of the downstream genes, DD11, and show that it contains a MYB98 binding site required for synergid expression, suggesting that DD11 is regulated directly by MYB98. To gain insight into the functions of the downstream genes, we chose five genes and determined the subcellular localization of the encoded proteins. We show that these five proteins are secreted into the filiform apparatus, suggesting that they play a role in either the formation or the function of this unique structure. Together, these data suggest that MYB98 functions as a transcriptional regulator in the synergid cells and activates the expression of genes required for pollen tube guidance and filiform apparatus formation.

  19. HIVed, a knowledgebase for differentially expressed human genes and proteins during HIV infection, replication and latency

    PubMed Central

    Li, Chen; Ramarathinam, Sri H.; Revote, Jerico; Khoury, Georges; Song, Jiangning; Purcell, Anthony W.

    2017-01-01

    Measuring the altered gene expression level and identifying differentially expressed genes/proteins during HIV infection, replication and latency is fundamental for broadening our understanding of the mechanisms of HIV infection and T-cell dysfunction. Such studies are crucial for developing effective strategies for virus eradication from the body. Inspired by the availability and enrichment of gene expression data during HIV infection, replication and latency, in this study, we proposed a novel compendium termed HIVed (HIV expression database; http://hivlatency.erc.monash.edu/) that harbours comprehensive functional annotations of proteins, whose genes have been shown to be dysregulated during HIV infection, replication and latency using different experimental designs and measurements. We manually curated a variety of third-party databases for structural and functional annotations of the protein entries in HIVed. With the goal of benefiting HIV related research, we collected a number of biological annotations for all the entries in HIVed besides their expression profile, including basic protein information, Gene Ontology terms, secondary structure, HIV-1 interaction and pathway information. We hope this comprehensive protein-centric knowledgebase can bridge the gap between the understanding of differentially expressed genes and the functions of their protein products, facilitating the generation of novel hypotheses and treatment strategies to fight against the HIV pandemic. PMID:28358052

  20. Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

    PubMed Central

    Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H

    2005-01-01

    Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476

  1. Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family.

    PubMed

    Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H

    2005-03-18

    A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins.

  2. HIVed, a knowledgebase for differentially expressed human genes and proteins during HIV infection, replication and latency.

    PubMed

    Li, Chen; Ramarathinam, Sri H; Revote, Jerico; Khoury, Georges; Song, Jiangning; Purcell, Anthony W

    2017-03-30

    Measuring the altered gene expression level and identifying differentially expressed genes/proteins during HIV infection, replication and latency is fundamental for broadening our understanding of the mechanisms of HIV infection and T-cell dysfunction. Such studies are crucial for developing effective strategies for virus eradication from the body. Inspired by the availability and enrichment of gene expression data during HIV infection, replication and latency, in this study, we proposed a novel compendium termed HIVed (HIV expression database; http://hivlatency.erc.monash.edu/) that harbours comprehensive functional annotations of proteins, whose genes have been shown to be dysregulated during HIV infection, replication and latency using different experimental designs and measurements. We manually curated a variety of third-party databases for structural and functional annotations of the protein entries in HIVed. With the goal of benefiting HIV related research, we collected a number of biological annotations for all the entries in HIVed besides their expression profile, including basic protein information, Gene Ontology terms, secondary structure, HIV-1 interaction and pathway information. We hope this comprehensive protein-centric knowledgebase can bridge the gap between the understanding of differentially expressed genes and the functions of their protein products, facilitating the generation of novel hypotheses and treatment strategies to fight against the HIV pandemic.

  3. Gene expression analysis uncovers novel Hedgehog interacting protein (HHIP) effects in human bronchial epithelial cells

    PubMed Central

    Zhou, Xiaobo; Qiu, Weiliang; Sathirapongsasuti, J. Fah.; Cho, Michael H.; Mancini, John D.; Lao, Taotao; Thibault, Derek M.; Litonjua, Gus; Bakke, Per S.; Gulsvik, Amund; Lomas, David A.; Beaty, Terri H.; Hersh, Craig P.; Anderson, Christopher; Geigenmuller, Ute; Raby, Benjamin A.; Rennard, Stephen I.; Perrella, Mark A.; Choi, Augustine M.K.; Quackenbush, John; Silverman, Edwin K.

    2013-01-01

    Hedgehog Interacting Protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis. PMID:23459001

  4. Expression of hcp in freshwater Synechococcus spp., a gene encoding a hyperconserved protein in picocyanobacteria.

    PubMed

    Kutovaya, Olga A; McKay, R Michael L; Bullerjahn, George S

    2010-06-01

    Marine picoplankton of the genus Synechococcus and Prochlorococcus spp. are widely studied members of the picocyanobacterial clade, composed of unicellular cyanobacteria that dominate pelagic regions of the ocean. Less studied are the related freshwater Synechococcus spp. that similarly dominate the euphotic zone of oligotrophic lakes. Previous work has shown that marine picocyanobacteria harbor a small gene, hcp, that encodes a 62 amino acid protein 100% conserved among all strains examined. The gene is restricted exclusively to the picocyanobacterial lineage. The current study reveals that hcp is also 100% conserved in four freshwater Synechococcus spp. strains isolated from the Laurentian Great Lakes, and that the gene constitutively expressed with genes encoding a ribosomal protein and two tRNA genes. The synteny of the hcp region is also conserved between the marine and freshwater strains. Last, the hcp gene and the organization of the surrounding genetic region has been retained in the reduced genome of a picocyanobacterial endosymbiont of the amoeba Paulinella sp.

  5. Inducible Amplification of Gene Copy Number and Heterologous Protein Production in the Yeast Kluyveromyces lactis

    PubMed Central

    Morlino, Giovanni B.; Tizzani, Lorenza; Fleer, Reinhard; Frontali, Laura; Bianchi, Michele M.

    1999-01-01

    Heterologous protein production can be doubled by increasing the copy number of the corresponding heterologous gene. We constructed a host-vector system in the yeast Kluyveromyces lactis that was able to induce copy number amplification of pKD1 plasmid-based vectors upon expression of an integrated copy of the plasmid recombinase gene. We increased the production and secretion of two heterologous proteins, glucoamylase from the yeast Arxula adeninivorans and mammalian interleukin-1β, following gene dosage amplification when the heterologous genes were carried by pKD1-based vectors. The choice of the promoters for expression of the integrated recombinase gene and of the episomal heterologous genes are critical for the mitotic stability of the host-vector system. PMID:10543790

  6. Inducible amplification of gene copy number and heterologous protein production in the yeast Kluyveromyces lactis.

    PubMed

    Morlino, G B; Tizzani, L; Fleer, R; Frontali, L; Bianchi, M M

    1999-11-01

    Heterologous protein production can be doubled by increasing the copy number of the corresponding heterologous gene. We constructed a host-vector system in the yeast Kluyveromyces lactis that was able to induce copy number amplification of pKD1 plasmid-based vectors upon expression of an integrated copy of the plasmid recombinase gene. We increased the production and secretion of two heterologous proteins, glucoamylase from the yeast Arxula adeninivorans and mammalian interleukin-1beta, following gene dosage amplification when the heterologous genes were carried by pKD1-based vectors. The choice of the promoters for expression of the integrated recombinase gene and of the episomal heterologous genes are critical for the mitotic stability of the host-vector system.

  7. Identification and expression pattern of the chemosensory protein gene family in the silkworm, Bombyx mori.

    PubMed

    Gong, Da-Ping; Zhang, Hui-Jie; Zhao, Ping; Lin, Ying; Xia, Qing-You; Xiang, Zhong-Huai

    2007-03-01

    Insect chemosensory proteins (CSPs) as well as odorant-binding proteins (OBPs) have been supposed to transport hydrophobic chemicals to receptors on sensory neurons. Compared with OBPs, CSPs are expressed more broadly in various insect tissues. We performed a genome-wide analysis of the candidate CSP gene family in the silkworm. A total of 20 candidate CSPs, including 3 gene fragments and 2 pseudogenes, were characterized based on their conserved cysteine residues and their similarity to CSPs in other insects. Some of these genes were clustered in the silkworm genome. The gene expression pattern of these candidates was investigated using RT-PCR and microarray, and the results showed that these genes were expressed primarily in mature larvae and the adult moth, suggesting silkworm CSPs may be involved in development. The majority of silkworm CSP genes are expressed broadly in tissues including the antennae, head, thorax, legs, wings, epithelium, testes, ovaries, pheromone glands, wing disks, and compound eyes.

  8. The ZIC gene family encodes multi-functional proteins essential for patterning and morphogenesis.

    PubMed

    Houtmeyers, Rob; Souopgui, Jacob; Tejpar, Sabine; Arkell, Ruth

    2013-10-01

    The zinc finger of the cerebellum gene (ZIC) discovered in Drosophila melanogaster (odd-paired) has five homologs in Xenopus, chicken, mice, and humans, and seven in zebrafish. This pattern of gene copy expansion is accompanied by a divergence in gene and protein structure, suggesting that Zic family members share some, but not all, functions. ZIC genes are implicated in neuroectodermal development and neural crest cell induction. All share conserved regions encoding zinc finger domains, however their heterogeneity and specification remain unexplained. In this review, the evolution, structure, and expression patterns of the ZIC homologs are described; specific functions attributable to individual family members are supported. A review of data from functional studies in Xenopus and murine models suggest that ZIC genes encode multifunctional proteins operating in a context-specific manner to drive critical events during embryogenesis. The identification of ZIC mutations in congenital syndromes highlights the relevance of these genes in human development.

  9. Nucleoprotein filament formation is the structural basis for bacterial protein H-NS gene silencing

    NASA Astrophysics Data System (ADS)

    Lim, Ci Ji; Lee, Sin Yi; Kenney, Linda J.; Yan, Jie

    2012-07-01

    H-NS is an abundant nucleoid-associated protein in bacteria that globally silences genes, including horizontally-acquired genes related to pathogenesis. Although it has been shown that H-NS has multiple modes of DNA-binding, which mode is employed in gene silencing is still unclear. Here, we report that in H-NS mutants that are unable to silence genes, are unable to form a rigid H-NS nucleoprotein filament. These results indicate that the H-NS nucleoprotein filament is crucial for its gene silencing function, and serves as the fundamental structural basis for gene silencing by H-NS and likely other H-NS-like bacterial proteins.

  10. A protein disulfide isomerase gene fusion expression system that increases the extracellular productivity of Bacillus brevis.

    PubMed

    Kajino, T; Ohto, C; Muramatsu, M; Obata, S; Udaka, S; Yamada, Y; Takahashi, H

    2000-02-01

    We have developed a versatile Bacillus brevis expression and secretion system based on the use of fungal protein disulfide isomerase (PDI) as a gene fusion partner. Fusion with PDI increased the extracellular production of heterologous proteins (light chain of immunoglobulin G, 8-fold; geranylgeranyl pyrophosphate synthase, 12-fold). Linkage to PDI prevented the aggregation of the secreted proteins, resulting in high-level accumulation of fusion proteins in soluble and biologically active forms. We also show that the disulfide isomerase activity of PDI in a fusion protein is responsible for the suppression of the aggregation of the protein with intradisulfide, whereas aggregation of the protein without intradisulfide was prevented even when the protein was fused to a mutant PDI whose two active sites were disrupted, suggesting that another PDI function, such as chaperone-like activity, synergistically prevented the aggregation of heterologous proteins in the PDI fusion expression system.

  11. Molecular evolution of the fusion protein gene in human respiratory syncytial virus subgroup A.

    PubMed

    Kimura, Hirokazu; Nagasawa, Koo; Tsukagoshi, Hiroyuki; Matsushima, Yuki; Fujita, Kiyotaka; Yoshida, Lay Myint; Tanaka, Ryota; Ishii, Haruyuki; Shimojo, Naoki; Kuroda, Makoto; Ryo, Akihide

    2016-09-01

    We studied the molecular evolution of the fusion protein (F) gene in the human respiratory syncytial virus subgroup A (HRSV-A). We performed time-scaled phylogenetic analyses using the Bayesian Markov chain Monte Carlo (MCMC) method. We also conducted genetic distance (p-distance), positive/negative selection, and Bayesian skyline plot analyses. Furthermore, we mapped the amino acid substitutions of the protein. The MCMC-constructed tree indicated that the HRSV F gene diverged from the bovine RSV (BRSV) gene approximately 550years ago and had a relatively low substitution rate (7.59×10(-4) substitutions/site/year). Moreover, a common ancestor of HRSV-A and -B diverged approximately 280years ago, which has since formed four distinct clusters. The present HRSV-A strains were assigned six genotypes based on F gene sequences and attachment glycoprotein gene sequences. The present strains exhibited high F gene sequence similarity values and low genetic divergence. No positive selection sites were identified; however, 50 negative selection sites were identified. F protein amino acid substitutions at 17 sites were distributed in the F protein. The effective population size of the gene has remained relatively constant, but the population size of the prevalent genotype (GA2) has increased in the last 10years. These results suggest that the HRSV-AF gene has evolved independently and formed some genotypes. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. Elevated expression of protein biosynthesis genes in liver and muscle of hibernating black bears (Ursus americanus).

    PubMed

    Fedorov, Vadim B; Goropashnaya, Anna V; Tøien, Oivind; Stewart, Nathan C; Gracey, Andrew Y; Chang, Celia; Qin, Shizhen; Pertea, Geo; Quackenbush, John; Showe, Louise C; Showe, Michael K; Boyer, Bert B; Barnes, Brian M

    2009-04-10

    We conducted a large-scale gene expression screen using the 3,200 cDNA probe microarray developed specifically for Ursus americanus to detect expression differences in liver and skeletal muscle that occur during winter hibernation compared with animals sampled during summer. The expression of 12 genes, including RNA binding protein motif 3 (Rbm3), that are mostly involved in protein biosynthesis, was induced during hibernation in both liver and muscle. The Gene Ontology and Gene Set Enrichment analysis consistently showed a highly significant enrichment of the protein biosynthesis category by overexpressed genes in both liver and skeletal muscle during hibernation. Coordinated induction in transcriptional level of genes involved in protein biosynthesis is a distinctive feature of the transcriptome in hibernating black bears. This finding implies induction of translation and suggests an adaptive mechanism that contributes to a unique ability to reduce muscle atrophy over prolonged periods of immobility during hibernation. Comparing expression profiles in bears to small mammalian hibernators shows a general trend during hibernation of transcriptional changes that include induction of genes involved in lipid metabolism and carbohydrate synthesis as well as depression of genes involved in the urea cycle and detoxification function in liver.

  13. A review of the occurrence of grain softness protein-1 genes in wheat (Triticum aestivum L.)

    USDA-ARS?s Scientific Manuscript database

    Grain softness protein-1 (Gsp-1) is a small, 495-bp intronless gene found throughout the Triticeae tribe at the distal end of group 5 chromosomes. With the Puroindolines, it constitutes a key component of the Hardness locus. In the polyploid wheats, Triticum aestivum and T. turgidum, the gene is pr...

  14. Unconventional conservation among genes encoding small secreted salivary sland proteins from a gall midge

    USDA-ARS?s Scientific Manuscript database

    Due to functional constraints associated with protein-coding sequences, introns and the 3’-untranslated region (UTR) of most genes vary the most, followed by the 5’-UTR. The coding region is the most conserved due to stronger functional constraints. During characterization of transcripts and gene...

  15. Gene Mining for Proline Based Signaling Proteins in Cell Wall of Arabidopsis thaliana.

    PubMed

    Ihsan, Muhammad Z; Ahmad, Samina J N; Shah, Zahid Hussain; Rehman, Hafiz M; Aslam, Zubair; Ahuja, Ishita; Bones, Atle M; Ahmad, Jam N

    2017-01-01

    The cell wall (CW) as a first line of defense against biotic and abiotic stresses is of primary importance in plant biology. The proteins associated with cell walls play a significant role in determining a plant's sustainability to adverse environmental conditions. In this work, the genes encoding cell wall proteins (CWPs) in Arabidopsis were identified and functionally classified using geneMANIA and GENEVESTIGATOR with published microarrays data. This yielded 1605 genes, out of which 58 genes encoded proline-rich proteins (PRPs) and glycine-rich proteins (GRPs). Here, we have focused on the cellular compartmentalization, biological processes, and molecular functioning of proline-rich CWPs along with their expression at different plant developmental stages. The mined genes were categorized into five classes on the basis of the type of PRPs encoded in the cell wall of Arabidopsis thaliana. We review the domain structure and function of each class of protein, many with respect to the developmental stages of the plant. We have then used networks, hierarchical clustering and correlations to analyze co-expression, co-localization, genetic, and physical interactions and shared protein domains of these PRPs. This has given us further insight into these functionally important CWPs and identified a number of potentially new cell-wall related proteins in A. thaliana.

  16. Human diabetes associated with defects in nuclear regulatory proteins for the insulin receptor gene.

    PubMed Central

    Brunetti, A; Brunetti, L; Foti, D; Accili, D; Goldfine, I D

    1996-01-01

    The control of gene transcription is mediated by sequence-specific DNA-binding proteins (trans-acting factors) that bind to upstream regulatory elements (cis elements). We have previously identified two DNA-binding proteins that specifically interact with two unique AT-rich sequences of the 5' regulatory region of the insulin receptor gene which have in vivo promoter activity. Herein we have investigated the expression of these DNA-binding proteins in cells from two unrelated patients with insulin resistance and non-insulin-dependent diabetes mellitus. In these patients, the insulin receptor gene was normal. In EBV-transformed lymphoblasts from both patients, insulin receptor mRNA levels and insulin receptor expression were decreased. The expression of nuclear-binding proteins for the 5' regulatory region of the insulin receptor gene was markedly reduced, and this defect paralleled the decrease in insulin receptor protein expression. These studies indicate that DNA-binding proteins to the regulatory region of the insulin receptor gene are important for expression of the insulin receptor. Further, they suggest that in affected individuals, defects in the expression of these proteins may cause decreased insulin receptor expression and insulin resistance. PMID:8550844

  17. Hormonal regulation of platypus Beta-lactoglobulin and monotreme lactation protein genes.

    PubMed

    Enjapoori, Ashwantha Kumar; Lefèvre, Christophe M; Nicholas, Kevin R; Sharp, Julie A

    2017-02-01

    Endocrine regulation of milk protein gene expression in marsupials and eutherians is well studied. However, the evolution of this complex regulation that began with monotremes is unknown. Monotremes represent the oldest lineage of extant mammals and the endocrine regulation of lactation in these mammals has not been investigated. Here we characterised the proximal promoter and hormonal regulation of two platypus milk protein genes, Beta-lactoglobulin (BLG), a whey protein and monotreme lactation protein (MLP), a monotreme specific milk protein, using in vitro reporter assays and a bovine mammary epithelial cell line (BME-UV1). Insulin and dexamethasone alone provided partial induction of MLP, while the combination of insulin, dexamethasone and prolactin was required for maximal induction. Partial induction of BLG was achieved by insulin, dexamethasone and prolactin alone, with maximal induction using all three hormones. Platypus MLP and BLG core promoter regions comprised transcription factor binding sites (e.g. STAT5, NF-1 and C/EBPα) that were conserved in marsupial and eutherian lineages that regulate caseins and whey protein gene expression. Our analysis suggests that insulin, dexamethasone and/or prolactin alone can regulate the platypus MLP and BLG gene expression, unlike those of therian lineage. The induction of platypus milk protein genes by lactogenic hormones suggests they originated before the divergence of marsupial and eutherians.

  18. Gene Mining for Proline Based Signaling Proteins in Cell Wall of Arabidopsis thaliana

    PubMed Central

    Ihsan, Muhammad Z.; Ahmad, Samina J. N.; Shah, Zahid Hussain; Rehman, Hafiz M.; Aslam, Zubair; Ahuja, Ishita; Bones, Atle M.; Ahmad, Jam N.

    2017-01-01

    The cell wall (CW) as a first line of defense against biotic and abiotic stresses is of primary importance in plant biology. The proteins associated with cell walls play a significant role in determining a plant's sustainability to adverse environmental conditions. In this work, the genes encoding cell wall proteins (CWPs) in Arabidopsis were identified and functionally classified using geneMANIA and GENEVESTIGATOR with published microarrays data. This yielded 1605 genes, out of which 58 genes encoded proline-rich proteins (PRPs) and glycine-rich proteins (GRPs). Here, we have focused on the cellular compartmentalization, biological processes, and molecular functioning of proline-rich CWPs along with their expression at different plant developmental stages. The mined genes were categorized into five classes on the basis of the type of PRPs encoded in the cell wall of Arabidopsis thaliana. We review the domain structure and function of each class of protein, many with respect to the developmental stages of the plant. We have then used networks, hierarchical clustering and correlations to analyze co-expression, co-localization, genetic, and physical interactions and shared protein domains of these PRPs. This has given us further insight into these functionally important CWPs and identified a number of potentially new cell-wall related proteins in A. thaliana. PMID:28289422

  19. Cloning and expression of prion protein encoding gene of flounder ( Paralichthys olivaceus)

    NASA Astrophysics Data System (ADS)

    Zhang, Zhiwen; Sun, Xiuqin; Zhang, Jinxing; Zan, Jindong

    2008-02-01

    The prion protein (PrP) encoding gene of flounder ( Paralichthys olivaceus) was cloned. It was not interrupted by an intron. This gene has two promoters in its 5' upstream, indicating that its transcription may be intensive, and should have an important function. It was expressed in all 14 tissues tested, demonstrating that it is a house-keeping gene. Its expression in digestion and reproduction systems implies that the possible prions of fish may transfer horizontally.

  20. Hepatitis B viral core protein disrupts human host gene expression by binding to promoter regions

    PubMed Central

    2012-01-01

    Background The core protein (HBc) of hepatitis B virus (HBV) has been implicated in the malignant transformation of chronically-infected hepatocytes and displays pleiotropic functions, including RNA- and DNA-binding activities. However, the mechanism by which HBc interacts with the human genome to exert effects on hepatocyte function remains unknown. This study investigated the distribution of HBc binding to promoters in the human genome and evaluated its effects on the related genes’ expression. Results Whole-genome chromatin immunoprecipitation microarray (ChIP-on-chip) analysis was used to identify HBc-bound human gene promoters. Gene Ontology and pathway analyses were performed on related genes. The quantitative polymerase chain reaction assay was used to verify ChIP-on-chip results. Five novel genes were selected for luciferase reporter assay evaluation to assess the influence of HBc promoter binding. The HBc antibody immunoprecipitated approximately 3100 human gene promoters. Among these, 1993 are associated with known biological processes, and 2208 regulate genes with defined molecular functions. In total, 1286 of the related genes mediate primary metabolic processes, and 1398 encode proteins with binding activity. Sixty-four of the promoters regulate genes related to the mitogen-activated protein kinase (MAPK) pathways, and 41 regulate Wnt/beta-catenin pathway genes. The reporter gene assay indicated that HBc binding up-regulates proto-oncogene tyrosine-protein kinase (SRC), type 1 insulin-like growth factor receptor (IGF1R), and neurotrophic tyrosine kinase receptor 2 (NTRK2), and down-regulates v-Ha-ras Harvey rat sarcoma viral oncogene (HRAS). Conclusion HBc has the ability to bind a large number of human gene promoters, and can disrupt normal host gene expression. Manipulation of the transcriptional profile in HBV-infected hepatocytes may represent a key pathogenic mechanism of HBV infection. PMID:23088787

  1. Identification and analysis of YELLOW protein family genes in the silkworm, Bombyx mori

    PubMed Central

    Xia, Ai-Hua; Zhou, Qing-Xiang; Yu, Lin-Lin; Li, Wei-Guo; Yi, Yong-Zhu; Zhang, Yao-Zhou; Zhang, Zhi-Fang

    2006-01-01

    Background The major royal jelly proteins/yellow (MRJP/YELLOW) family possesses several physiological and chemical functions in the development of Apis mellifera and Drosophila melanogaster. Each protein of the family has a conserved domain named MRJP. However, there is no report of MRJP/YELLOW family proteins in the Lepidoptera. Results Using the YELLOW protein sequence in Drosophila melanogaster to BLAST silkworm EST database, we found a gene family composed of seven members with a conserved MRJP domain each and named it YELLOW protein family of Bombyx mori. We completed the cDNA sequences with RACE method. The protein of each member possesses a MRJP domain and a putative cleavable signal peptide consisting of a hydrophobic sequence. In view of genetic evolution, the whole Bm YELLOW protein family composes a monophyletic group, which is distinctly separate from Drosophila melanogaster and Apis mellifera. We then showed the tissue expression profiles of Bm YELLOW protein family genes by RT-PCR. Conclusion A Bombyx mori YELLOW protein family is found to be composed of at least seven members. The low homogeneity and unique pattern of gene expression by each member among the family ensure us to prophesy that the members of Bm YELLOW protein family would play some important physiological functions in silkworm development. PMID:16884544

  2. Synthesis of Bacteriophage M13-Specific Proteins in a DNA-Dependent Cell-Free System II. In Vitro Synthesis of Biologically Active Gene 5 Protein

    PubMed Central

    Konings, Ruud N. H.; Jansen, Josephine; Cuypers, Theo; Schoenmakers, John G. G.

    1973-01-01

    It is shown that gene 5 protein of bacteriophage M13 is one of the major proteins synthesized in vitro under the direction of M13 replicative-form DNA. By means of DNA-cellulose chromatography, this protein has been purified to homogeneity and its biological characteristics have been compared with those of its native counterpart. Like native gene 5 protein, the purified, in vitro-synthesized protein binds tightly and selectively to single-stranded, but not to double-stranded, DNAs. These results suggest that truly functional gene 5 protein is made in the cell-free system. Images PMID:4586780

  3. POPDC1S201F causes muscular dystrophy and arrhythmia by affecting protein trafficking

    PubMed Central

    Schindler, Roland F.R.; Scotton, Chiara; Zhang, Jianguo; Passarelli, Chiara; Ortiz-Bonnin, Beatriz; Simrick, Subreena; Schwerte, Thorsten; Poon, Kar-Lai; Fang, Mingyan; Rinné, Susanne; Froese, Alexander; Nikolaev, Viacheslav O.; Grunert, Christiane; Müller, Thomas; Tasca, Giorgio; Sarathchandra, Padmini; Drago, Fabrizio; Dallapiccola, Bruno; Rapezzi, Claudio; Arbustini, Eloisa; Di Raimo, Francesca Romana; Neri, Marcella; Selvatici, Rita; Gualandi, Francesca; Fattori, Fabiana; Pietrangelo, Antonello; Li, Wenyan; Jiang, Hui; Xu, Xun; Bertini, Enrico; Decher, Niels; Wang, Jun; Brand, Thomas; Ferlini, Alessandra

    2015-01-01

    The Popeye domain–containing 1 (POPDC1) gene encodes a plasma membrane–localized cAMP-binding protein that is abundantly expressed in striated muscle. In animal models, POPDC1 is an essential regulator of structure and function of cardiac and skeletal muscle; however, POPDC1 mutations have not been associated with human cardiac and muscular diseases. Here, we have described a homozygous missense variant (c.602C>T, p.S201F) in POPDC1, identified by whole-exome sequencing, in a family of 4 with cardiac arrhythmia and limb-girdle muscular dystrophy (LGMD). This allele was absent in known databases and segregated with the pathological phenotype in this family. We did not find the allele in a further screen of 104 patients with a similar phenotype, suggesting this mutation to be family specific. Compared with WT protein, POPDC1S201F displayed a 50% reduction in cAMP affinity, and in skeletal muscle from patients, both POPDC1S201F and WT POPDC2 displayed impaired membrane trafficking. Forced expression of POPDC1S201F in a murine cardiac muscle cell line (HL-1) increased hyperpolarization and upstroke velocity of the action potential. In zebrafish, expression of the homologous mutation (popdc1S191F) caused heart and skeletal muscle phenotypes that resembled those observed in patients. Our study therefore identifies POPDC1 as a disease gene causing a very rare autosomal recessive cardiac arrhythmia and LGMD, expanding the genetic causes of this heterogeneous group of inherited rare diseases. PMID:26642364

  4. POPDC1(S201F) causes muscular dystrophy and arrhythmia by affecting protein trafficking.

    PubMed

    Schindler, Roland F R; Scotton, Chiara; Zhang, Jianguo; Passarelli, Chiara; Ortiz-Bonnin, Beatriz; Simrick, Subreena; Schwerte, Thorsten; Poon, Kar-Lai; Fang, Mingyan; Rinné, Susanne; Froese, Alexander; Nikolaev, Viacheslav O; Grunert, Christiane; Müller, Thomas; Tasca, Giorgio; Sarathchandra, Padmini; Drago, Fabrizio; Dallapiccola, Bruno; Rapezzi, Claudio; Arbustini, Eloisa; Di Raimo, Francesca Romana; Neri, Marcella; Selvatici, Rita; Gualandi, Francesca; Fattori, Fabiana; Pietrangelo, Antonello; Li, Wenyan; Jiang, Hui; Xu, Xun; Bertini, Enrico; Decher, Niels; Wang, Jun; Brand, Thomas; Ferlini, Alessandra

    2016-01-01

    The Popeye domain-containing 1 (POPDC1) gene encodes a plasma membrane-localized cAMP-binding protein that is abundantly expressed in striated muscle. In animal models, POPDC1 is an essential regulator of structure and function of cardiac and skeletal muscle; however, POPDC1 mutations have not been associated with human cardiac and muscular diseases. Here, we have described a homozygous missense variant (c.602C>T, p.S201F) in POPDC1, identified by whole-exome sequencing, in a family of 4 with cardiac arrhythmia and limb-girdle muscular dystrophy (LGMD). This allele was absent in known databases and segregated with the pathological phenotype in this family. We did not find the allele in a further screen of 104 patients with a similar phenotype, suggesting this mutation to be family specific. Compared with WT protein, POPDC1(S201F) displayed a 50% reduction in cAMP affinity, and in skeletal muscle from patients, both POPDC1(S201F) and WT POPDC2 displayed impaired membrane trafficking. Forced expression of POPDC1(S201F) in a murine cardiac muscle cell line (HL-1) increased hyperpolarization and upstroke velocity of the action potential. In zebrafish, expression of the homologous mutation (popdc1(S191F)) caused heart and skeletal muscle phenotypes that resembled those observed in patients. Our study therefore identifies POPDC1 as a disease gene causing a very rare autosomal recessive cardiac arrhythmia and LGMD, expanding the genetic causes of this heterogeneous group of inherited rare diseases.

  5. Protein-protein and DNA-protein interactions mediate induction of defense genes by fruit extract of Azadirachta indica A. Juss. in Solanum lycopersicum L.

    PubMed

    Bhuvaneshwari, V; Goel, N; Paul, P K

    2015-10-01

    The present work demonstrates that induction of defense-related genes in tomato by neem extract was mediated by protein-protein and DNA-protein interactions. The induction of elicitor-mediated defense responses in plants is known, but the molecular mechanisms underlying its induction are not well studied. In the present study, third node leaf from the base of aseptically raised tomato plants was treated with aqueous fruit extracts of Azadirachta indica A. Juss. (neem). Samples were collected from the treated node at 24-h intervals for up to 96 h and analyzed for the gene expression of phenylalanine ammonia lyase (PAL), Peroxidase (POX) and Polyphenol Oxidase (PPO), β-actin (standard). Samples were collected from elicitor-induced node at 5-min interval up to 70 min for analysis of protein-protein and DNA-protein interactions. The results demonstrated the induction of expression of PAL, POX and PPO due to the treatment whereas no change was observed in the expression of β-actin. There was disappearance of lower molecular weight proteins which cross-linked with other proteins to form complexes. MALDI-TOF MS analysis revealed the interaction of mitogen-activated protein kinases (MAPK). The analysis of proteins interacted with DNA after induction by neem extract indicated the involvement of WRKY transcriptional factors. Neem-elicited defense responses could possibly due to interaction of proteins with other proteins and transcription factors with DNA which might be crucial in enhancing the expression of defense-related genes (PAL, POX and PPO).

  6. apl-1, a Caenorhabditis elegans gene encoding a protein related to the human beta-amyloid protein precursor.

    PubMed Central

    Daigle, I; Li, C

    1993-01-01

    The major component of senile plaques found in the brains of Alzheimer disease patients is the beta-amyloid peptide, which is derived from a larger amyloid precursor protein (APP). Recently, a number of APP and APP-related proteins have been identified in different organisms and constitute the family of APP proteins. We have isolated several cDNAs encoding an APP-related protein in the nematode Caenorhabditis elegans and have designated the corresponding gene as apl-1. The apl-1 transcripts undergo two forms of posttranscriptional modification: trans-splicing and alternative polyadenylylation. In vitro translation of an apl-1 cDNA results in a protein of approximately the expected size. Similar to the Drosophila, human, and mouse APP-related proteins, APL-1 does not appear to contain the beta-amyloid peptide. Because APP-related proteins seem to be conserved through evolution, the apl-1 gene from C. elegans should be important for determining the normal function of human APP. Images Fig. 2 Fig. 3 PMID:8265668

  7. Gene Sequence Variability of the Three Surface Proteins of Human Respiratory Syncytial Virus (HRSV) in Texas

    PubMed Central

    Tapia, Lorena I.; Shaw, Chad A.; Aideyan, Letisha O.; Jewell, Alan M.; Dawson, Brian C.; Haq, Taha R.; Piedra, Pedro A.

    2014-01-01

    Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004–2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community. PMID:24625544

  8. Gene sequence variability of the three surface proteins of human respiratory syncytial virus (HRSV) in Texas.

    PubMed

    Tapia, Lorena I; Shaw, Chad A; Aideyan, Letisha O; Jewell, Alan M; Dawson, Brian C; Haq, Taha R; Piedra, Pedro A

    2014-01-01

    Human respiratory syncytial virus (HRSV) has three surface glycoproteins: small hydrophobic (SH), attachment (G) and fusion (F), encoded by three consecutive genes (SH-G-F). A 270-nt fragment of the G gene is used to genotype HRSV isolates. This study genotyped and investigated the variability of the gene and amino acid sequences of the three surface proteins of HRSV strains collected from 1987 to 2005 from one center. Sixty original clinical isolates and 5 prototype strains were analyzed. Sequences containing SH, F and G genes were generated, and multiple alignments and phylogenetic trees were analyzed. Genetic variability by protein domains comparing virus genotypes was assessed. Complete sequences of the SH-G-F genes were obtained for all 65 samples: HRSV-A = 35; HRSV-B = 30. In group A strains, genotypes GA5 and GA2 were predominant. For HRSV-B strains, the genotype GB4 was predominant from 1992 to 1994 and only genotype BA viruses were detected in 2004-2005. Different genetic variability at nucleotide level was detected between the genes, with G gene being the most variable and the highest variability detected in the 270-nt G fragment that is frequently used to genotype the virus. High variability (>10%) was also detected in the signal peptide and transmembrane domains of the F gene of HRSV A strains. Variability among the HRSV strains resulting in non-synonymous changes was detected in hypervariable domains of G protein, the signal peptide of the F protein, a not previously defined domain in the F protein, and the antigenic site Ø in the pre-fusion F. Divergent trends were observed between HRSV -A and -B groups for some functional domains. A diverse population of HRSV -A and -B genotypes circulated in Houston during an 18 year period. We hypothesize that diverse sequence variation of the surface protein genes provide HRSV strains a survival advantage in a partially immune-protected community.

  9. Proteolipid protein gene: Pelizaeus-Merzbacher disease in humans and neurodegeneration in mice.

    PubMed

    Woodward, K; Malcolm, S

    1999-04-01

    The dosage of the myelin gene and mutant forms of the protein can affect the CNS and PNS. Pelizaeus-Merzbacher disease (PMD) is a myelin disorder of the CNS that arises from both mutational mechanisms. Investigating the molecular basis of PMD in patients and animal models is furthering our understanding of the disease, dosage sensitivity and proteolipid protein function during myelinogenesis.

  10. Molecular Principles of Gene Fusion Mediated Rewiring of Protein Interaction Networks in Cancer.

    PubMed

    Latysheva, Natasha S; Oates, Matt E; Maddox, Louis; Flock, Tilman; Gough, Julian; Buljan, Marija; Weatheritt, Robert J; Babu, M Madan

    2016-08-18

    Gene fusions are common cancer-causing mutations, but the molecular principles by which fusion protein products affect interaction networks and cause disease are not well understood. Here, we perform an integrative analysis of the structural, interactomic, and regulatory properties of thousands of putative fusion proteins. We demonstrate that genes that form fusions (i.e., parent genes) tend to be highly connected hub genes, whose protein products are enriched in structured and disordered interaction-mediating features. Fusion often results in the loss of these parental features and the depletion of regulatory sites such as post-translational modifications. Fusion products disproportionately connect proteins that did not previously interact in the protein interaction network. In this manner, fusion products can escape cellular regulation and constitutively rewire protein interaction networks. We suggest that the deregulation of central, interaction-prone proteins may represent a widespread mechanism by which fusion proteins alter the topology of cellular signaling pathways and promote cancer. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  11. Maternal low protein diet and postnatal high fat diet increases adipose imprinted gene expression

    USDA-ARS?s Scientific Manuscript database

    Maternal and postnatal diet can alter Igf2 gene expression and DNA methylation. To test whether maternal low protein and postnatal high fat (HF) diet result in alteration in Igf2 expression and obesity, we fed obese-prone Sprague-Dawley rats 8% (LP) or 20% (NP) protein for 3 wk prior to breeding and...

  12. Protein Methylation and Interaction with the Antiproliferative Gene, BTG2/TIS21/Pc3

    PubMed Central

    Kim, Sangduk

    2014-01-01

    The last one and half a decade witnessed an outstanding re-emergence of attention and remarkable progress in the field of protein methylation. In the present article, we describe the early discoveries in research and review the role protein methylation played in the biological function of the antiproliferative gene, BTG2/TIS21/PC3. PMID:24532495

  13. Protein methylation and interaction with the antiproliferative gene, BTG2/TIS21/Pc3.

    PubMed

    Paik, Woon Ki; Kim, Sangduk; Lim, In Kyoung

    2014-03-01

    The last one and half a decade witnessed an outstanding re-emergence of attention and remarkable progress in the field of protein methylation. In the present article, we describe the early discoveries in research and review the role protein methylation played in the biological function of the antiproliferative gene, BTG2/TIS21/PC3.

  14. Comprehensive identification of LMW-GS genes and their protein products in a common wheat variety

    USDA-ARS?s Scientific Manuscript database

    Although it is well known that low-molecular-weight glutenin subunits (LMW-GS) affect bread and noodle processing quality, the function of specific LMW-GS proteins remains unclear. It is important to find the genes that correspond to individual LMW-GS proteins in order to understand the functions o...

  15. Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression

    PubMed Central

    Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

    2006-01-01

    The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (Pgpdh-1::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using Pgpdh-1::GFP expression as a phenotype, we screened ≈16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function. PMID:16880390

  16. Three genes encoding citrate synthases in Saccharopolyspora erythraea are regulated by the global nutrient-sensing regulators GlnR, DasR, and CRP.

    PubMed

    Liao, Cheng-Heng; Yao, Li-Li; Ye, Bang-Ce

    2014-10-08

    Saccharopolyspora erythraea has three citrate synthases encoded by gltA-2, citA, and citA4. Here, we characterized and identified the expression and regulatory properties of these synthases. Three pleiotropic global regulatory proteins of S. erythraea - CRP, GlnR, and DasR - are involved in carbon metabolism, nitrogen metabolism, and amino-sugar (chitin and GlcNAc) metabolism. Using electrophoretic mobility shift assays (EMSAs), we identified these regulators as proteins that bind directly to the promoter regions of all citrate synthase genes (gltA-2, citA, and citA4). Footprinting assays indicated the exact protect sequences of CRP, GlnR, and DasR on the promoter region of gltA-2, revealing binding competition between GlnR and DasR. Moreover, by comparing the transcription levels of citrate synthase genes between parental and glnR mutant or dasR mutant strains, or by comparing the transcription response of citrate synthases under various nutrient conditions, we found that GlnR and DasR negatively regulated citA and citA4 transcription but had no regulatory effects on the gltA-2 gene. Although no CRP mutant was available, the results indicated that CRP was a cAMP-binding receptor affecting gltA-2 transcription when the intracellular cAMP concentration increased. Thus, an overall model of CS regulation by C and/or N metabolism regulators and cAMP receptor protein was proposed.

  17. Differential analysis of "protein corona" profile adsorbed onto different nonviral gene delivery systems.

    PubMed

    Capriotti, Anna Laura; Caracciolo, Giulio; Caruso, Giuseppe; Foglia, Patrizia; Pozzi, Daniela; Samperi, Roberto; Laganà, Aldo

    2011-12-15

    A shotgun proteomics approach was used to characterize and compare the proteins that lead to the formation of a rich "protein corona" adsorbed onto the surfaces of cationic liposomes (CLs), lipoplexes, and lipid/polycation/DNA (LPD) complexes, when they come into contact with plasma. After separation of the nanoparticle-protein complex from plasma, the protein mixture was digested, and peptides were analyzed by nanoliquid chromatography-Orbitrap LTQ-XL mass spectrometry. The number of proteins bound to lipoplexes was double that of those identified in the corona of CLs (208 vs 105), while 77 proteins were common to both coronas. The number of proteins bound to the surface of the LPD complexes (158, 133 of which are common to lipoplexes) is intermediate between those found in the protein corona of both CLs and lipoplexes. About half of them were found in the protein corona of CLs. By overlapping the three formulations, it can be seen that only 12 proteins are peculiar to LPD complexes. These results may help in designing gene delivery systems capable of binding the minimum possible quantity of proteins that influence transfection negatively, binding selectively proteins capable of helping in steering in vivo the vector toward the target, and obtaining more efficient and effective gene therapy.

  18. A comparative study of disease genes and drug targets in the human protein interactome

    PubMed Central

    2015-01-01

    Background Disease genes cause or contribute genetically to the development of the most complex diseases. Drugs are the major approaches to treat the complex disease through interacting with their targets. Thus, drug targets are critical for treatment efficacy. However, the interrelationship between the disease genes and drug targets is not clear. Results In this study, we comprehensively compared the network properties of disease genes and drug targets for five major disease categories (cancer, cardiovascular disease, immune system disease, metabolic disease, and nervous system disease). We first collected disease genes from genome-wide association studies (GWAS) for five disease categories and collected their corresponding drugs based on drugs' Anatomical Therapeutic Chemical (ATC) classification. Then, we obtained the drug targets for these five different disease categories. We found that, though the intersections between disease genes and drug targets were small, disease genes were significantly enriched in targets compared to their enrichment in human protein-coding genes. We further compared network properties of the proteins encoded by disease genes and drug targets in human protein-protein interaction networks (interactome). The results showed that the drug targets tended to have higher degree, higher betweenness, and lower clustering coefficient in cancer Furthermore, we observed a clear fraction increase of disease proteins or drug targets in the near neighborhood compared with the randomized genes. Conclusions The study presents the first comprehensive comparison of the disease genes and drug targets in the context of interactome. The results provide some foundational network characteristics for further designing computational strategies to predict novel drug targets and drug repurposing. PMID:25861037

  19. A comparative study of disease genes and drug targets in the human protein interactome.

    PubMed

    Sun, Jingchun; Zhu, Kevin; Zheng, W; Xu, Hua

    2015-01-01

    Disease genes cause or contribute genetically to the development of the most complex diseases. Drugs are the major approaches to treat the complex disease through interacting with their targets. Thus, drug targets are critical for treatment efficacy. However, the interrelationship between the disease genes and drug targets is not clear. In this study, we comprehensively compared the network properties of disease genes and drug targets for five major disease categories (cancer, cardiovascular disease, immune system disease, metabolic disease, and nervous system disease). We first collected disease genes from genome-wide association studies (GWAS) for five disease categories and collected their corresponding drugs based on drugs' Anatomical Therapeutic Chemical (ATC) classification. Then, we obtained the drug targets for these five different disease categories. We found that, though the intersections between disease genes and drug targets were small, disease genes were significantly enriched in targets compared to their enrichment in human protein-coding genes. We further compared network properties of the proteins encoded by disease genes and drug targets in human protein-protein interaction networks (interactome). The results showed that the drug targets tended to have higher degree, higher betweenness, and lower clustering coefficient in cancer Furthermore, we observed a clear fraction increase of disease proteins or drug targets in the near neighborhood compared with the randomized genes. The study presents the first comprehensive comparison of the disease genes and drug targets in the context of interactome. The results provide some foundational network characteristics for further designing computational strategies to predict novel drug targets and drug repurposing.

  20. Hominoid-Specific De Novo Protein-Coding Genes Originating from Long Non-Coding RNAs

    PubMed Central

    Liu, Chu-Jun; Zhou, Wei-Zhen; Li, Ying; Zhang, Mao; Zhang, Rongli; Wei, Liping; Li, Chuan-Yun

    2012-01-01

    Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. Strand-specific RNA–Seq analyses were performed in five rhesus macaque tissues (liver, prefrontal cortex, skeletal muscle, adipose, and testis), which were then integrated with public transcriptome data from human, chimpanzee, and rhesus macaque. On the basis of comparing the RNA expression profiles in the three species, we found that most of the hominoid-specific de novo protein-coding genes encoded polyadenylated non-coding RNAs in rhesus macaque or chimpanzee with a similar transcript structure and correlated tissue expression profile. According to the rule of parsimony, the majority of these hominoid-specific de novo protein-coding genes appear to have acquired a regulated transcript structure and expression profile before acquiring coding potential. Interestingly, although the expression profile was largely correlated, the coding genes in human often showed higher transcriptional abundance than their non-coding counterparts in rhesus macaque. The major findings we report in this manuscript are robust and insensitive to the parameters used in the identification and analysis of de novo genes. Our results suggest that at least a portion of long non-coding RNAs, especially those with active and regulated transcription, may serve as a birth pool for protein-coding genes, which are then further optimized at the transcriptional level. PMID:23028352

  1. Ribosomal protein genes are highly enriched among genes with allele-specific expression in the interspecific F1 hybrid catfish.

    PubMed

    Chen, Ailu; Wang, Ruijia; Liu, Shikai; Peatman, Eric; Sun, Luyang; Bao, Lisui; Jiang, Chen; Li, Chao; Li, Yun; Zeng, Qifan; Liu, Zhanjiang

    2016-06-01

    Interspecific hybrids provide a rich source for the analysis of allele-specific expression (ASE). In this work, we analyzed ASE in F1 hybrid catfish using RNA-Seq datasets. While the vast majority of genes were expressed with both alleles, 7-8 % SNPs exhibited significant differences in allele ratios of expression. Of the 66,251 and 177,841 SNPs identified from the datasets of the liver and gill, 5420 (8.2 %) and 13,390 (7.5 %) SNPs were identified as significant ASE-SNPs, respectively. With these SNPs, a total of 1519 and 3075 ASE-genes were identified. Gene Ontology analysis revealed that genes encoding cytoplasmic ribosomal proteins (RP) were highly enriched among ASE genes. Parent-of-origin was determined for 27 and 30 ASE RP genes in the liver and gill, respectively. The results indicated that genes from both channel catfish and blue catfish were involved in ASE. However, each RP gene appeared to be almost exclusively expressed from only one parent, indicating that ribosomes in the hybrid catfish were in the "hybrid" form. Overall representation of RP transcripts among the transcriptome appeared lower in the F1 hybrid catfish than in channel catfish or blue catfish, suggesting that the "hybrid" ribosomes may work more efficiently for translation in the F1 hybrid catfish.

  2. Recombinant MafA protein containing its own protein transduction domain stimulates insulin gene expression in IEC-6 cells.

    PubMed

    Lu, Jun; Luo, Hailan; Wu, Huiling; Lan, Michael S; Tan, Jianming; Lu, Daru

    2011-07-18

    MafA, a basic leucine zipper (bZIP) transcription factor, functions as a potent activator of insulin gene transcription in β-cell. In this paper, we aimed to investigate whether the entire MafA protein has the self-delivery activity, and that the arginine- and lysine-rich sequence in MafA bZIP domain is an efficient protein transduction domain (PTD). Entire MafA protein internalization was evaluated with Western blot and immunofluorescence. The distribution of the PTD-EGFP (enhanced green fluorescence protein) was examined by fluorescent microscope observation. Luciferase reporter assay was used to investigate the effect of the transduced MafA protein on insulin 2 promoter activity. Additionally, we conducted RT-PCR to detect the expression of insulin mRNA in MafA treated IEC-6 cells. The arginine- and lysine-rich peptide of MafA serves as a novel PTD. PTD-EGFP can permeate into various cell types including Min6 (a β-cell line), and transduce in a dose- and time-dependent manner. The cellular uptake of MafA PTD can be completely blocked by heparin, whereas cytochalasin D and amiloride were partially effective in blocking the PTD-EGFP protein entry. Transduced intact MafA protein behaves in the same way as the endogenous MafA, stimulating the transcription of insulin promoter and further inducing insulin expression in treated non-β-cell line (IEC-6). These results indicate that the MafA PTD could serve as a therapeutic delivery vehicle, and further suggest that MafA protein transduction could be a valuable strategy for enhancing insulin gene transcription without requiring gene transfer technology. Copyright © 2011. Published by Elsevier Inc.

  3. Evidence for showing gene/protein name suggestions in bioscience literature search interfaces.

    PubMed

    Divoli, Anna; Hearst, Marti A; Wooldridge, Michael A

    2008-01-01

    This paper reports on the results of two questionnaires asking biologists about the incorporation of text-extracted entity information, specifically gene and protein names, into bioscience literature search user interfaces. Among the findings are that study participants want to see gene/protein metadata in combination with organism information; that a significant proportion would like to see gene names grouped by type (synonym, homolog, etc.), and that most participants want to see information that the system is confident about immediately, and see less certain information after taking additional action. These results inform future interface designs.

  4. Characterization of the gene encoding a fibrinogen-related protein expressed in Crassostrea gigas hemocytes.

    PubMed

    Skazina, M A; Gorbushin, A M

    2016-07-01

    Four exons of the CgFrep1 gene (3333 bp long) encode a putative fibrinogen-related protein (324 aa) bearing a single C-terminal FBG domain. Transcripts of the gene obtained from hemocytes of different Pacific oysters show prominent individual variation based on SNP and indels of tandem repeats resulted in polymorphism of N-terminus of the putative CgFrep1 polypeptide. The polypeptide chain bears N-terminal coiled-coil region potentially acting as inter-subunit interface in the protein oligomerization. It is suggested that CgFrep1 gene encodes the oligomeric lectin composed of at least two subunits.

  5. Oscillatory kinetics of gene expression: Protein conversion and slow mRNA transport

    SciTech Connect

    Zhdanov, V. P.

    2009-06-15

    The negative feedback between mRNA and regulatory-protein production may result in oscillations in the kinetics of gene expression if the mRNA-protein interplay includes protein conversion. Using a mean-field kinetic model, we show that such oscillations can be amplified due to limitations of the mRNA transport between the nucleus and cytoplasm. This effect may be dramatic for the mRNA population in the nucleus.

  6. Yeast prion architecture explains how proteins can be genes

    NASA Astrophysics Data System (ADS)

    Wickner, Reed

    2013-03-01

    Prions (infectious proteins) transmit information without an accompanying DNA or RNA. Most yeast prions are self-propagating amyloids that inactivate a normally functional protein. A single protein can become any of several prion variants, with different manifestations due to different amyloid structures. We showed that the yeast prion amyloids of Ure2p, Sup35p and Rnq1p are folded in-register parallel beta sheets using solid state NMR dipolar recoupling experiments, mass-per-filament-length measurements, and filament diameter measurements. The extent of beta sheet structure, measured by chemical shifts in solid-state NMR and acquired protease-resistance on amyloid formation, combined with the measured filament diameters, imply that the beta sheets must be folded along the long axis of the filament. We speculate that prion variants of a single protein sequence differ in the location of these folds. Favorable interactions between identical side chains must hold these structures in-register. The same interactions must guide an unstructured monomer joining the end of a filament to assume the same conformation as molecules already in the filament, with the turns at the same locations. In this way, a protein can template its own conformation, in analogy to the ability of a DNA molecule to template its sequence by specific base-pairing. Bldg. 8, Room 225, NIH, 8 Center Drive MSC 0830, Bethesda, MD 20892-0830, wickner@helix.nih.gov, 301-496-3452

  7. Unusual conservation among genes encoding small secreted salivary gland proteins from a gall midge.

    PubMed

    Chen, Ming-Shun; Liu, Xuming; Yang, Ziheng; Zhao, Huixian; Shukle, Richard H; Stuart, Jeffrey J; Hulbert, Scot

    2010-09-28

    In most protein-coding genes, greater sequence variation is observed in noncoding regions (introns and untranslated regions) than in coding regions due to selective constraints. During characterization of genes and transcripts encoding small secreted salivary gland proteins (SSSGPs) from the Hessian fly, we found exactly the opposite pattern of conservation in several families of genes: the non-coding regions were highly conserved, but the coding regions were highly variable. Seven genes from the SSSGP-1 family are clustered as one inverted and six tandem repeats within a 15 kb region of the genome. Except for SSSGP-1A2, a gene that encodes a protein identical to that encoded by SSSGP-1A1, the other six genes consist of a highly diversified, mature protein-coding region as well as highly conserved regions including the promoter, 5'- and 3'-UTRs, a signal peptide coding region, and an intron. This unusual pattern of highly diversified coding regions coupled with highly conserved regions in the rest of the gene was also observed in several other groups of SSSGP-encoding genes or cDNAs. The unusual conservation pattern was also found in some of the SSSGP cDNAs from the Asian rice gall midge, but not from the orange wheat blossom midge. Strong positive selection was one of the forces driving for diversification whereas concerted homogenization was likely a mechanism for sequence conservation. Rapid diversification in mature SSSGPs suggests that the genes are under selection pressure for functional adaptation. The conservation in the noncoding regions of these genes including introns also suggested potential mechanisms for sequence homogenization that are not yet fully understood. This report should be useful for future studies on genetic mechanisms involved in evolution and functional adaptation of parasite genes.

  8. Abundantly and rarely expressed Lhc protein genes exhibit distinct regulation patterns in plants.

    PubMed

    Klimmek, Frank; Sjödin, Andreas; Noutsos, Christos; Leister, Dario; Jansson, Stefan

    2006-03-01

    We have analyzed gene regulation of the Lhc supergene family in poplar (Populus spp.) and Arabidopsis (Arabidopsis thaliana) using digital expression profiling. Multivariate analysis of the tissue-specific, environmental, and developmental Lhc expression patterns in Arabidopsis and poplar was employed to characterize four rarely expressed Lhc genes, Lhca5, Lhca6, Lhcb7, and Lhcb4.3. Those genes have high expression levels under different conditions and in different tissues than the abundantly expressed Lhca1 to 4 and Lhcb1 to 6 genes that code for the 10 major types of higher plant light-harvesting proteins. However, in some of the datasets analyzed, the Lhcb4 and Lhcb6 genes as well as an Arabidopsis gene not present in poplar (Lhcb2.3) exhibited minor differences to the main cooperative Lhc gene expression pattern. The pattern of the rarely expressed Lhc genes was always found to be more similar to that of PsbS and the various light-harvesting-like genes, which might indicate distinct physiological functions for the rarely and abundantly expressed Lhc proteins. The previously undetected Lhcb7 gene encodes a novel plant Lhcb-type protein that possibly contains an additional, fourth, transmembrane N-terminal helix with a highly conserved motif. As the Lhcb4.3 gene seems to be present only in Eurosid species and as its regulation pattern varies significantly from that of Lhcb4.1 and Lhcb4.2, we conclude it to encode a distinct Lhc protein type, Lhcb8.

  9. Importin-β facilitates nuclear import of human GW proteins and balances cytoplasmic gene silencing protein levels

    PubMed Central

    Schraivogel, Daniel; Schindler, Susann G.; Danner, Johannes; Kremmer, Elisabeth; Pfaff, Janina; Hannus, Stefan; Depping, Reinhard; Meister, Gunter

    2015-01-01

    MicroRNAs (miRNAs) guide Argonaute (Ago) proteins to distinct target mRNAs leading to translational repression and mRNA decay. Ago proteins interact with a member of the GW protein family, referred to as TNRC6A-C in mammals, which coordinate downstream gene-silencing processes. The cytoplasmic functions of TNRC6 and Ago proteins are reasonably well established. Both protein families are found in the nucleus as well. Their detailed nuclear functions, however, remain elusive. Furthermore, it is not clear which import routes Ago and TNRC6 proteins take into the nucleus. Using different nuclear transport assays, we find that Ago as well as TNRC6 proteins shuttle between the cytoplasm and the nucleus. While import receptors might function redundantly to transport Ago2, we demonstrate that TNRC6 proteins are imported by the Importin-β pathway. Finally, we show that nuclear localization of both Ago2 and TNRC6 proteins can depend on each other suggesting actively balanced cytoplasmic Ago – TNRC6 levels. PMID:26170235

  10. A new class of wheat gliadin genes and proteins

    USDA-ARS?s Scientific Manuscript database

    The utility of mining DNA sequence data to understand the structure and expression of cereal prolamin genes is demonstrated by the identification of a new class of wheat prolamins. This previously unrecognized wheat prolamin class, given the name δ-gliadins, is the most direct ortholog of bar...

  11. A novel gene encoding an integral membrane protein is mutated in nephropathic cystinosis.

    PubMed

    Town, M; Jean, G; Cherqui, S; Attard, M; Forestier, L; Whitmore, S A; Callen, D F; Gribouval, O; Broyer, M; Bates, G P; van't Hoff, W; Antignac, C

    1998-04-01

    Nephropathic cystinosis, an autosomal recessive disorder resulting from defective lysosomal transport of cystine, is the most common inherited cause of renal Fanconi syndrome. The cystinosis gene has been mapped to chromosome 17p13. We found that the locus D17S829 was homozygously deleted in 23 out of 70 patients, and identified a novel gene, CTNS, which mapped to the deletion interval. CTNS encodes an integral membrane protein, cystinosin, with features of a lysosomal membrane protein. Eleven different mutations, all predicted to cause loss of function of the protein, were found to segregate with the disorder.

  12. Molecular analysis of CIB4 gene and protein in Kermani sheep.

    PubMed

    Mohammadabadi, M R; Jafari, A H D; Bordbar, F

    2017-09-12

    The human calcium- and integrin-binding protein (CIB) family is composed of CIB1, CIB2, CIB3, and CIB4 proteins and the CIB4 gene affects fertility. Kermani sheep is one of the most important breeds of Iranian sheep breeds. The aim of this study was to analyze for the first time molecular characteristics of the CIB4 gene and protein in Kermani sheep. Different tissues were collected from the Kermani sheep and real time PCR was performed. The PCR products were sequenced, comparative analyses of the nucleotide sequences were performed, a phylogenetic tree was constructed, and different characteristics of CIB4 proteins were predicted. Real time PCR results showed that the CIB4 gene is expressed only in testis of Kermani sheep. The cDNA nucleotide sequence was identical with small tail Han sheep, cattle, goat, camel, horse, dog, mouse and human, respectively 100, 99, 99, 98, 98, 96, 96, and 96%. Hence, it can be suggested that the CIB4 gene plays a role in male fertility. Based on the phylogenetic analysis, sheep CIB4 gene has a close relationship with goat and cattle first, and then with camel and whale. Although we demonstrated that CIB4 is a testis-specific gene, expressed only in the testis and it interacts with other proteins, the mechanisms by which CIB4 expression is regulated need to be elucidated.

  13. Genome-wide identification, evolution and expression analysis of RING finger protein genes in Brassica rapa

    PubMed Central

    Alam, Intikhab; Yang, Yan-Qing; Wang, Yong; Zhu, Mei-Lan; Wang, Heng-Bo; Chalhoub, Boulos; Lu, Yun-Hai

    2017-01-01

    More and more RING finger genes were found to be implicated in various important biological processes. In the present study, a total of 731 RING domains in 715 predicted proteins were identified in Brassica rapa genome (AA, 2n = 20), which were further divided into eight types: RING-H2 (371), RING-HCa (215), RING-HCb (47), RING-v (44), RING-C2 (38), RING-D (10), RING-S/T (5) and RING-G (1). The 715 RING finger proteins were further classified into 51 groups according to the presence of additional domains. 700 RING finger protein genes were mapped to the 10 chromosomes of B. rapa with a range of 47 to 111 genes for each chromosome. 667 RING finger protein genes were expressed in at least one of the six tissues examined, indicating their involvement in various physiological and developmental processes in B. rapa. Hierarchical clustering analysis of RNA-seq data divided them into seven major groups, one of which includes 231 members preferentially expressed in leaf, and constitutes then a panel of gene candidates for studying the genetic and molecular mechanisms of leafy head traits in Brassica crops. Our results lay the foundation for further studies on the classification, evolution and putative functions of RING finger protein genes in Brassica species. PMID:28094809

  14. Proteins.

    ERIC Educational Resources Information Center

    Doolittle, Russell F.

    1985-01-01

    Examines proteins which give rise to structure and, by virtue of selective binding to other molecules, make genes. Binding sites, amino acids, protein evolution, and molecular paleontology are discussed. Work with encoding segments of deoxyribonucleic acid (exons) and noncoding stretches (introns) provides new information for hypotheses. (DH)

  15. Proteins.

    ERIC Educational Resources Information Center

    Doolittle, Russell F.

    1985-01-01

    Examines proteins which give rise to structure and, by virtue of selective binding to other molecules, make genes. Binding sites, amino acids, protein evolution, and molecular paleontology are discussed. Work with encoding segments of deoxyribonucleic acid (exons) and noncoding stretches (introns) provides new information for hypotheses. (DH)

  16. Novel Approaches to the Characterization of Specific Protein-Protein Interactions Important in Gene Expression.

    DTIC Science & Technology

    2007-11-02

    tyrosine phenol lyase-promoter of Citrobacter freundii is regulated not only by the TyrR protein but also by two global transcription factors, namely Integration Host Factor and cyclic AMP receptor protein.

  17. Cloning of fish enzymes and other fish protein genes.

    PubMed

    Macouzet, M; Simpson, B K; Lee, B H

    1999-01-01

    Fish metabolism needs special enzymes that have maximum activity at very different conditions than their mammalian counterparts. Due to the differences in activity, these enzymes, especially cold-adapted proteases, could be used advantageously for the production of some foods. In addition to the enzymes, this review describes some other unique fish polypeptides such as antifreeze proteins, fluorescent proteins, antitumor peptides, antibiotics, and hormones, that have already been cloned and used in food processing, genetic engineering, medicine, and aquaculture. Recombinant DNA technology, which allows these biological molecules to be cloned and overexpressed in microorganisms is also described, highlighting innovative applications. The expected impact of cloning fish proteins in different fields of technology is discussed.

  18. Chromosomal localization of genes encoding guanine nucleotide-binding protein subunits in mouse and human

    SciTech Connect

    Blatt, C.; Eversole-Cire, P.; Cohn, V.H.; Zollman, S.; Fournier, R.E.K.; Mohandas, L.T.; Nesbitt, M.; Lugo, T.; Jones, D.T.; Reed, R.R.; Weiner, L.P.; Sparkes, R.S.; Simon, M.I. )

    1988-10-01

    A variety of genes have been identified that specify the synthesis of the components of guanine nucleotide-binding proteins (G proteins). Eight different guanine nucleotide-binding {alpha}-subunit proteins, two different {beta} subunits, and one {gamma} subunit have been described. Hybridization of cDNA clones with DNA from human-mouse somatic cell hybrids was used to assign many of these genes to human chromosomes. The retinal-specific transducin subunit genes GNAT1 and GNAT2 were on chromosomes 3 and 1; GNAI1, GNAI2, and GNAI3 were assigned to chromosomes 7, 3, and 1, respectively; GNAZ and GNAS were found on chromosomes 22 and 20. The {beta} subunits were also assigned-GNB1 to chromosome 1 and GNB2 to chromosome 7. Restriction fragment length polymorphisms were used to map the homologues of some of these genes in the mouse. GNAT1 and GNAI2 were found to map adjacent to each other on mouse chromosome 9 and GNAT2 was mapped on chromosome 17. The mouse GNB1 gene was assigned to chromosome 19. These mapping assignments will be useful in defining the extend of the G{alpha} gene family and may help in attempts to correlate specific genetic diseases and with genes corresponding to G proteins.

  19. The proteolipid protein gene and myelin disorders in man and animal models.

    PubMed

    Yool, D A; Edgar, J M; Montague, P; Malcolm, S

    2000-04-12

    The two proteins, proteolipid protein and DM20, which are encoded by alternative transcripts from the proteolipid protein ( PLP ) gene, are major components of central nervous system myelin. In man, mutations of these proteins cause Pelizaeus-Merzbacher disease (PMD), an X-linked dysmyelinating neuropathy. The mutations found are very varied, ranging from deletions, loss-of-function and missense mutations to additional copies of the gene. This same range of known genetic defects has been observed in animal models with spontaneous and engineered Plp gene mutations. The relationship between genotype and phenotype is remarkably close in the animal models and the PMD cases, making them useful models for studying the mechanisms of PLP gene-related disease. As a result, it has become clear that the PLP gene plays a wider role in neural development in addition to its function as a structural component of myelin. It has also emerged that duplications of the PLP gene are the commonest mutation in PMD. Genetic disorders arising from a dosage effect may be more common than previously recognized. The study of the PLP gene in this rare disorder is, therefore, contributing both to our understanding of neural development and maintenance and to the mechanisms of human genetic disorders.

  20. Structure and expression of the Drosophila ubiquitin-80-amino-acid fusion-protein gene.

    PubMed Central

    Barrio, R; del Arco, A; Cabrera, H L; Arribas, C

    1994-01-01

    In the fruitfly Drosophila, as in all eukaryotes examined so far, some ubiquitin-coding sequences appear fused to unrelated open reading frames. Two of these fusion genes have been previously described (the homologues of UBI1-UBI2 and UBI4 in yeast), and we report here the organization and expression of a third one, the DUb80 gene (the homologue of UBI3 in yeast). This gene encodes a ubiquitin monomer fused to an 80-amino-acid extension which is homologous with the ribosomal protein encoded by the UB13 gene. The 5' regulatory region of DUb80 shares common features with another ubiquitin fusion gene, DUb52, and with the ribosomal protein genes of Drosophila, Xenopus and mouse. We also find helix-loop-helix protein-binding sequences (E-boxes). The DUb80 gene is transcribed to a 0.9 kb mRNA which is particularly abundant under conditions of high protein synthesis, such as in ovaries and exponentially growing cells. Images Figure 3 Figure 4 PMID:8068011

  1. The Saccharomyces Cerevisiae Spt7 Gene Encodes a Very Acidic Protein Important for Transcription in Vivo

    PubMed Central

    Gansheroff, L. J.; Dollard, C.; Tan, P.; Winston, F.

    1995-01-01

    Mutations in the SPT7 gene of Saccharomyces cerevisiae originally were identified as suppressors of Ty and {delta small} insertion mutations in the 5' regions of the HIS4 and LYS2 genes. Other genes that have been identified in mutant hunts of this type have been shown to play a role in transcription. In this work we show that SPT7 is also important for proper transcription in vivo. We have cloned and sequenced the SPT7 gene and have shown that it encodes a large, acidic protein that is localized to the nucleus. The SPT7 protein contains a bromodomain sequence; a deletion that removes the bromodomain from the SPT7 protein causes no detectable mutant phenotype. Strains that contain an spt7 null mutation are viable but grow very slowly and have transcriptional defects at many loci including insertion mutations, Ty elements, the INO1 gene and the MFA1 gene. These transcriptional defects and other mutant phenotypes are similar to those caused by certain mutations in SPT15, which encodes the TATA binding protein (TBP). The similarity of the phenotypes of spt7 and spt15 mutants, including effects of spt7 mutations on the transcription start site of certain genes, suggests that SPT7 plays an important role in transcription initiation in vivo. PMID:7713415

  2. Cloning and characterization of nuclear genes for two mitochondrial ribosomal proteins in Saccharomyces cerevisiae.

    PubMed Central

    Kitakawa, M; Grohmann, L; Graack, H R; Isono, K

    1990-01-01

    The genes for two large subunit proteins, YmL8 and YmL20, of the mitochondrial ribosome of Saccharomyces cerevisiae were cloned by hybridization with synthetic oligonucleotide mixtures corresponding to their N-terminal amino acid sequences. They were termed MRP-L8 and MRP-L20, respectively, and their nucleotide sequences were determined using a DNA sequencer. The MRP-L8 gene was found to encode a 26.8-kDa protein whose deduced amino acid sequence has a high degree of similarity to ribosomal protein L17 of Escherichia coli. The gene MRP-L20 was found to encode a 22.3-kDa protein with a presequence consisting of 18 amino acid residues. By Southern blot hybridization to the yeast chromosomes separated by field-inversion gel electrophoresis, the MRP-L8 and MRP-L20 genes were located on chromosomes X and XI, respectively. Gene disruption experiments indicate that their products, YmL8 and YmL20 proteins, are essential for the mitochondrial function and the absence of these proteins causes instability of the mitochondrial DNA. Images PMID:2183197

  3. Identification of a STOP1-like protein in Eucalyptus that regulates transcription of Al tolerance genes.

    PubMed

    Sawaki, Yoshiharu; Kobayashi, Yuriko; Kihara-Doi, Tomonori; Nishikubo, Nobuyuki; Kawazu, Tetsu; Kobayashi, Masatomo; Kobayashi, Yasufumi; Iuchi, Satoshi; Koyama, Hiroyuki; Sato, Shigeru

    2014-06-01

    Tolerance to soil acidity is an important trait for eucalyptus clones that are introduced to commercial forestry plantations in pacific Asian countries, where acidic soil is dominant in many locations. A conserved transcription factor regulating aluminum (Al) and proton (H⁺) tolerance in land-plant species, STOP1 (SENSITIVE TOPROTON RHIZOTOXICITY 1)-like protein, was isolated by polymerase chain reaction-based cloning, and then suppressed by RNA interference in hairy roots produced by Agrobacterium rhizogenes-mediated transformation. Eucalyptus STOP1-like protein complemented proton tolerance in an Arabidopsis thaliana stop1-mutant, and localized to the nucleus in a transient assay of a green fluorescent protein fusion protein expressed in tobacco leaves by Agrobacterium tumefaciens-mediated transformation. Genes encoding a citrate transporting MULTIDRUGS AND TOXIC COMPOUND EXTRUSION protein and an orthologue of ALUMINUM SENSITIVE 3 were suppressed in transgenic hairy roots in which the STOP1 orthologue was knocked down. In summary, we identified a series of genes for Al-tolerance in eucalyptus, including a gene for STOP1-like protein and the Al-tolerance genes it regulates. These genes may be useful for molecular breeding and genomic selection of elite clones to introduce into acid soil regions. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  4. HMGN4, a newly discovered nucleosome-binding protein encoded by an intronless gene.

    PubMed

    Birger, Y; Ito, Y; West, K L; Landsman, D; Bustin, M

    2001-05-01

    We describe a newly discovered nuclear protein, HMGN4, that is closely related to the canonical HMGN2 nucleosome-binding protein. The protein is encoded by an intronless gene, which, in humans, is located in the hereditary hemochromatosis [correction of hemachromatosis] region at position 6p21.3. A single approximately 2-kb HMGN4 mRNA was found to be expressed, in variable amounts, in all human tissues tested; however, the HMGN4 transcript was significantly less abundant than that of HMGN2. The HMGN4 protein could be detected in HeLa cells by Western analysis with an antibody elicited against a unique region of the protein. Transfection of HeLa cells with a plasmid expressing HMGN4-GFP indicated that the protein localizes to the nucleus. Our results expand the multiplicity of the HMGN protein family and increase the known cellular repertoire of nucleosome-binding proteins.

  5. Isolation of Drosophila genes encoding G protein-coupled receptor kinases.

    PubMed Central

    Cassill, J A; Whitney, M; Joazeiro, C A; Becker, A; Zuker, C S

    1991-01-01

    G protein-coupled receptors are regulated via phosphorylation by a variety of protein kinases. Recently, termination of the active state of two such receptors, the beta-adrenergic receptor and rhodopsin, has been shown to be mediated by agonist- or light-dependent phosphorylation of the receptor by members of a family of protein-serine/threonine kinases (here referred to as G protein-coupled receptor kinases). We now report the isolation of a family of genes encoding a set of Drosophila protein kinases that appear to code for G protein-coupled receptor kinases. These proteins share a high degree of sequence homology with the bovine beta-adrenergic receptor kinase. The presence of a conserved family of G protein-coupled receptor kinases in vertebrates and invertebrates points to the central role of these kinases in signal transduction cascades. Images PMID:1662381

  6. [Gene expression of Interleukin 1 in vitamin A and proteins deficiency].

    PubMed

    Sánchez Alvarez, Vivian; Hernández Triana, Manuel; Abreu Peñate, Mario; de las Cagigas Reig, Ada; Tam Hurtado, Miguel; Reboso Pérez, José; Noa Puig, Miriam; Arias Verdé, José; Fernández Carriera, Rebeca; González Calderín, Soraya; Sigarroa González, Antonio

    2002-03-01

    The influence of low levels of protein and vitamin A on indicators of the immune response was assayed in rats. The levels of protein and vitamin A intake of the Cuban population affected by epidemic neuropathy in 1993 was reproduced in 4 diets: control, protein deficiency (DP), vitamin A deficiency (DA), protein and vitamin A deficiency (DAP). The Peyer's patches evaluated the Interleukin 1 expression gene and was related with corporal weight, food intake, serum protein, vitamin A, immunology indicators and histology evaluation (spleen, thymus and liver). Protein deficiency generated a significant decrease of the expression gene of Interleukin 1. Atrophy signs in lymphoid tissues and morphologic changes in the liver were associated with the dietary protein utilization. Protein and vitamin A deficiency generated significant stimulation of the Interleukin 1 expression gene with increase of the level of the inflammatory state indicators as serum alpha protein, total complement and neutrophils. This stimulation could be generated by a deficient retinol mobilization to tissues. These results support the hypothesis of the function of cytokines as mediators of subclinical symptoms of the immune system during the nutritional affectations.

  7. Evaluating the significance of protein functional similarity based on gene ontology.

    PubMed

    Konopka, Bogumil M; Golda, Tomasz; Kotulska, Malgorzata

    2014-11-01

    Gene ontology is among the most successful ontologies in the biomedical domain. It is used to describe, unambiguously, protein molecular functions, cellular localizations, and processes in which proteins participate. The hierarchical structure of gene ontology allows quantifying protein functional similarity by application of algorithms that calculate semantic similarities. The scores, however, are meaningless without a given context. Here, we propose how to evaluate the significance of protein function semantic similarity scores by comparing them to reference distributions calculated for randomly chosen proteins. In the study, thresholds for significant functional semantic similarity, in four representative annotation corpuses, were estimated. We also show that the score significance is influenced by the number and specificity of gene ontology terms that are annotated to compared proteins. While proteins with a greater number of terms tend to yield higher similarity scores, proteins with more specific terms produce lower scores. The estimated significance thresholds were validated using protein sequence-function and structure-function relationships. Taking into account the term number and term specificity improves the distinction between significant and insignificant semantic similarity comparisons.

  8. Characterization of vegetative insecticidal protein vip genes of Bacillus thuringiensis from Sichuan Basin in China.

    PubMed

    Yu, Xiumei; Zheng, Aiping; Zhu, Jun; Wang, Shiquan; Wang, Lingxia; Deng, Qiming; Li, Shuangcheng; Liu, Huainian; Li, Ping

    2011-03-01

    Vegetative insecticidal proteins (Vip), the second generation of insecticides, are produced during the vegetative growth stage of Bacillus thuringiensis (Bt). To perform a systematic study of vip genes in Bt strains from different ecological regions of Sichuan Basin, 1,789 soil samples were collected from this basin, which is situated in the western region of China. The basin has a complicated geomorphology and contains mountains, forests, highlands, hursts, and plains. A total of 2,134 Bt strains have been screened from the 1,789 soil samples. According to the results, three vip-type genes were found in this basin, namely the vip1, vip2, and vip3-type genes. Strains containing vip3-type genes were the most abundant in our collection (67.4%), followed by vip2-type genes (14.6%) and vip1-type genes (8.1%). The three types of vip genes were distributed in most of the regions, but E Mei Mountain and the Ba Lang Mountains only contained vip3 genes in environments with high elevation, low temperature, insufficient oxygen, and abundant snow. Moreover, five novel vip3 genes were found, and these Vip proteins were toxic for Chilo suppressalis. All the results mentioned above suggest that Sichuan Basin is a rich resource for vip genes.

  9. Using phylogenomic patterns and gene ontology to identify proteins of importance in plant evolution.

    PubMed

    Cibrián-Jaramillo, Angélica; De la Torre-Bárcena, Jose E; Lee, Ernest K; Katari, Manpreet S; Little, Damon P; Stevenson, Dennis W; Martienssen, Rob; Coruzzi, Gloria M; DeSalle, Rob

    2010-07-12

    We use measures of congruence on a combined expressed sequenced tag genome phylogeny to identify proteins that have potential significance in the evolution of seed plants. Relevant proteins are identified based on the direction of partitioned branch and hidden support on the hypothesis obtained on a 16-species tree, constructed from 2,557 concatenated orthologous genes. We provide a general method for detecting genes or groups of genes that may be under selection in directions that are in agreement with the phylogenetic pattern. Gene partitioning methods and estimates of the degree and direction of support of individual gene partitions to the overall data set are used. Using this approach, we correlate positive branch support of specific genes for key branches in the seed plant phylogeny. In addition to basic metabolic functions, such as photosynthesis or hormones, genes involved in posttranscriptional regulation by small RNAs were significantly overrepresented in key nodes of the phylogeny of seed plants. Two genes in our matrix are of critical importance as they are involved in RNA-dependent regulation, essential during embryo and leaf development. These are Argonaute and the RNA-dependent RNA polymerase 6 found to be overrepresented in the angiosperm clade. We use these genes as examples of our phylogenomics approach and show that identifying partitions or genes in this way provides a platform to explain some of the more interesting organismal differences among species, and in particular, in the evolution of plants.

  10. Using Phylogenomic Patterns and Gene Ontology to Identify Proteins of Importance in Plant Evolution

    PubMed Central

    Cibrián-Jaramillo, Angélica; De la Torre-Bárcena, Jose E.; Lee, Ernest K.; Katari, Manpreet S.; Little, Damon P.; Stevenson, Dennis W.; Martienssen, Rob; Coruzzi, Gloria M.; DeSalle, Rob

    2010-01-01

    We use measures of congruence on a combined expressed sequenced tag genome phylogeny to identify proteins that have potential significance in the evolution of seed plants. Relevant proteins are identified based on the direction of partitioned branch and hidden support on the hypothesis obtained on a 16-species tree, constructed from 2,557 concatenated orthologous genes. We provide a general method for detecting genes or groups of genes that may be under selection in directions that are in agreement with the phylogenetic pattern. Gene partitioning methods and estimates of the degree and direction of support of individual gene partitions to the overall data set are used. Using this approach, we correlate positive branch support of specific genes for key branches in the seed plant phylogeny. In addition to basic metabolic functions, such as photosynthesis or hormones, genes involved in posttranscriptional regulation by small RNAs were significantly overrepresented in key nodes of the phylogeny of seed plants. Two genes in our matrix are of critical importance as they are involved in RNA-dependent regulation, essential during embryo and leaf development. These are Argonaute and the RNA-dependent RNA polymerase 6 found to be overrepresented in the angiosperm clade. We use these genes as examples of our phylogenomics approach and show that identifying partitions or genes in this way provides a platform to explain some of the more interesting organismal differences among species, and in particular, in the evolution of plants. PMID:20624728

  11. Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

    PubMed

    Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi

    2013-01-01

    Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.

  12. Double replacement gene targeting for the production of a series of mouse strains with different prion protein gene alterations

    SciTech Connect

    Moore, R.C.; Redhead, N.J.; Selfridge, J.

    1995-09-01

    We have developed a double replacement gene targeting strategy which enables the production of a series of mouse strains bearing different subtle alterations to endogenous genes. This is a two-step process in which a region of the gene of interest is first replaced with a selectable marker to produce an inactivated allele, which is then re-targeted with a second vector to reconstruct the inactivated allele, concomitantly introducing an engineered mutation. Five independent embryonic stem cell lines have been produced bearing different targeted alterations to the prion protein gene, including one which raises the level of expression. We have constructed mice bearing the codon 101 proline to leucine substitution linked to the human familial prion disease, Gerstmann-Straussler-Scheinker syndrome. We anticipate that this procedure will have applications to the study of human inherited diseases and the development of therapies. 43 refs., 6 figs., 1 tab.

  13. Analysis of the multi-copied genes and the impact of the redundant protein coding sequences on gene annotation in prokaryotic genomes.

    PubMed

    Yu, Jia-Feng; Chen, Qing-Li; Ren, Jing; Yang, Yan-Ling; Wang, Ji-Hua; Sun, Xiao

    2015-07-07

    The important roles of duplicated genes in evolutional process have been recognized in bacteria, archaebacteria and eukaryotes, while there is very little study on the multi-copied protein coding genes that share sequence identity of 100%. In this paper, the multi-copied protein coding genes in a number of prokaryotic genomes are comprehensively analyzed firstly. The results show that 0-15.93% of the protein coding genes in each genome are multi-copied genes and 0-16.49% of the protein coding genes in each genome are highly similar with the sequence identity ≥ 80%. Function and COG (Clusters of Orthologous Groups of proteins) analysis shows that 64.64% of multi-copied genes concentrate on the function of transposase and 86.28% of the COG assigned multi-copied genes concentrate on the COG code of 'L'. Furthermore, the impact of redundant protein coding sequences on the gene prediction results is studied. The results show that the problem of protein coding sequence redundancies cannot be ignored and the consistency of the gene annotation results before and after excluding the redundant sequences is negatively related with the sequences redundancy degree of the protein coding sequences in the training set.

  14. Evolution, gene expression profiling and 3D modeling of CSLD proteins in cotton.

    PubMed

    Li, Yanpeng; Yang, Tiegang; Dai, Dandan; Hu, Ying; Guo, Xiaoyang; Guo, Hongxia

    2017-07-10

    Among CESA-like gene superfamily, the cellulose synthase-like D (CSLD) genes are most similar to cellulose synthase genes and have been reported to be involved in tip-growing cell and stem development. However, there has been no genome-wide characterization of this gene subfamily in cotton. We thus sought to analyze the evolution and functional characterization of CSLD proteins in cotton based on fully sequenced cotton genomes. A total of 23 full-length CSLD proteins were identified in Gossypium raimondii, Gossypium arboreum and Gossypium hirsutum. The phylogenetic tree divided the CSLD proteins into five clades with strong support: CSLD1, CSLD2/3, CSLD4, CSLD5 and CSLD6. The total expression of GhCSLD genes was the highest in androecium & gynoecium (mostly contributed by CSLD1 and CSLD4) compared with other CSL genes. CSLD1 and CSLD4 were only highly expressed in androecium & gynoecium (A&G), and showed tissue-specific expression. The total expression of CSLD2/3, 5 and 6 was highest in the specific tissues. These results suggest that CSLD genes showed the different pattern of expression. Cotton CSLD proteins were subjected to different evolutionary pressures, and the CSLD1 and CSLD4 proteins exhibited episodic and long-term shift positive selection. The predicted three-dimensional structure of GrCSLD1 suggested that GrCSLD1 belongs to glycosyltransferase family 2. The amino acid residues under positive selection in the CSLD1 lineage are positioned in a region adjacent to the class-specific region (CSR), β1-strand and transmembrane helices (TMHs) in the GrCSLD1structure. Our results characterized the CSLD proteins by an integrated approach containing phylogeny, transcriptional profiling and 3D modeling. The study added to the understanding about the importance of the CSLD family and provide a useful reference for selecting candidate genes and their associations with the biosynthesis of the cell wall in cotton.

  15. Mutations of 3c and spike protein genes correlate with the occurrence of feline infectious peritonitis.

    PubMed

    Bank-Wolf, Barbara Regina; Stallkamp, Iris; Wiese, Svenja; Moritz, Andreas; Tekes, Gergely; Thiel, Heinz-Jürgen

    2014-10-10

    The genes encoding accessory proteins 3a, 3b, 3c, 7a and 7b, the S2 domain of the spike (S) protein gene and the membrane (M) protein gene of feline infectious peritonitis virus (FIPV) and feline enteric coronavirus (FECV) samples were amplified, cloned and sequenced. For this faeces and/or ascites samples from 19 cats suffering from feline infectious peritonitis (FIP) as well as from 20 FECV-infected healthy cats were used. Sequence comparisons revealed that 3c genes of animals with FIP were heavily affected by nucleotide deletions and point mutations compared to animals infected with FECV; these alterations resulted either in early termination or destruction of the translation initiation codon. Two ascites-derived samples of cats with FIP which displayed no alterations of ORF3c harboured mutations in the S2 domain of the S protein gene which resulted in amino acid exchanges or deletions. Moreover, changes in 3c were often accompanied by mutations in S2. In contrast, in samples obtained from faeces of healthy cats, the ORF3c was never affected by such mutations. Similarly ORF3c from faecal samples of the cats with FIP was mostly intact and showed only in a few cases the same mutations found in the respective ascites samples. The genes encoding 3a, 3b, 7a and 7b displayed no mutations linked to the feline coronavirus (FCoV) biotype. The M protein gene was found to be conserved between FECV and FIPV samples. Our findings suggest that mutations of 3c and spike protein genes correlate with the occurrence of FIP. Copyright © 2014 Elsevier B.V. All rights reserved.

  16. Massively convergent evolution for ribosomal protein gene content in plastid and mitochondrial genomes.

    PubMed

    Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F; Martin, William F

    2013-01-01

    Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force.

  17. Massively Convergent Evolution for Ribosomal Protein Gene Content in Plastid and Mitochondrial Genomes

    PubMed Central

    Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F.; Martin, William F.

    2013-01-01

    Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force. PMID:24259312

  18. Gene and Protein Expression of Fibronectin and Tenascin-C in Lung Samples from COPD Patients.

    PubMed

    Muñoz-Esquerre, Mariana; Huertas, Daniel; Escobar, Ignacio; López-Sánchez, Marta; Penín, Rosa; Peinado, Victor; Barberà, Joan Albert; Molina-Molina, María; Manresa, Frederic; Dorca, Jordi; Santos, Salud

    2015-06-01

    Fibronectin (Fn) and tenascin-C (TnC) are two extracellular matrix proteins associated with remodeling changes. Fn and TnC gene and protein expression in lung tissue, including their predominant location in bronchial and pulmonary artery structures, have not yet been fully evaluated. The aim of the present study was to assess: (1) gene expression of Fn and TnC in lung samples from chronic obstructive pulmonary disease (COPD) and non-COPD subjects; and (2) protein content and location of Fn and TnC in both groups. Consecutive subjects requiring lung resection due to lung cancer surgery were included. Lung specimens were examined for gene expression by quantitative real-time PCR (values expressed as fold change ratio). The analysis of their protein content and location was performed by western blot and immunohistochemical studies, respectively. Patients were divided into two cohorts according to COPD status. A total of 41 patients (20 with COPD and 21 without COPD) were included. An enhanced Fn gene expression was observed in the COPD group compared to the non-COPD group (4.73 ± 0.54 vs. 2.65 ± 0.57; P = 0.012), whereas no differences in gene TnC expression were observed (2.91 ± 0.44 vs. 2.60 ± 0.48; P = 0.633). No differences in lung protein content and location were found between groups. Immunohistochemical evaluation showed a predominantly vascular and bronchial location of Fn and TnC in both groups. An enhanced lung gene expression of Fn was observed in COPD subjects compared to non-COPD subjects. No differences were found in Fn protein expression or in TnC gene or protein expression among groups.

  19. Developmental Regulation of Genes Encoding Universal Stress Proteins in Schistosoma mansoni

    PubMed Central

    Isokpehi, Raphael D.; Mahmud, Ousman; Mbah, Andreas N.; Simmons, Shaneka S.; Avelar, Lívia; Rajnarayanan, Rajendram V.; Udensi, Udensi K.; Ayensu, Wellington K.; Cohly, Hari H.; Brown, Shyretha D.; Dates, Centdrika R.; Hentz, Sonya D.; Hughes, Shawntae J.; Smith-McInnis, Dominique R.; Patterson, Carvey O.; Sims, Jennifer N.; Turner, Kelisha T.; Williams, Baraka S.; Johnson, Matilda O.; Adubi, Taiwo; Mbuh, Judith V.; Anumudu, Chiaka I.; Adeoye, Grace O.; Thomas, Bolaji N.; Nashiru, Oyekanmi; Oliveira, Guilherme

    2011-01-01

    The draft nuclear genome sequence of the snail-transmitted, dimorphic, parasitic, platyhelminth Schistosoma mansoni revealed eight genes encoding proteins that contain the Universal Stress Protein (USP) domain. Schistosoma mansoni is a causative agent of human schistosomiasis, a severe and debilitating Neglected Tropical Disease (NTD) of poverty, which is endemic in at least 76 countries. The availability of the genome sequences of Schistosoma species presents opportunities for bioinformatics and genomics analyses of associated gene families that could be targets for understanding schistosomiasis ecology, intervention, prevention and control. Proteins with the USP domain are known to provide bacteria, archaea, fungi, protists and plants with the ability to respond to diverse environmental stresses. In this research investigation, the functional annotations of the USP genes and predicted nucleotide and protein sequences were initially verified. Subsequently, sequence clusters and distinctive features of the sequences were determined. A total of twelve ligand binding sites were predicted based on alignment to the ATP-binding universal stress protein from Methanocaldococcus jannaschii. In addition, six USP sequences showed the presence of ATP-binding motif residues indicating that they may be regulated by ATP. Public domain gene expression data and RT-PCR assays confirmed that all the S. mansoni USP genes were transcribed in at least one of the developmental life cycle stages of the helminth. Six of these genes were up-regulated in the miracidium, a free-swimming stage that is critical for transmission to the snail intermediate host. It is possible that during the intra-snail stages, S. mansoni gene transcripts for universal stress proteins are low abundant and are induced to perform specialized functions triggered by environmental stressors such as oxidative stress due to hydrogen peroxide that is present in the snail hemocytes. This report serves to catalyze the

  20. Induction of several acute-phase protein genes by heavy metals: A new class of metal-responsive genes

    SciTech Connect

    Yiangou, Minas; Ge, Xin; Carter, K.C.; Papaconstantinou, J. Shriners Burns Institute, Galveston, TX )

    1991-04-16

    Acute-phase reactants, metallothioneins, and heat-shock proteins are the products of three families of genes that respond to glucocorticoids and cytokines. Metallothioneins and heat-shock proteins, however, are also stimulated by heavy metals whereas very little is known about the effect of heavy metals on acute-phase-reactant genes. The authors have studied the effect of heavy metals (Hg, Cd, Pb, Cu, Ni, and Zn) and Mg on the acute-phase reactants {alpha}{sub 1}-acid glycoprotein, C-reactive protein, {alpha}{sub 1}-antitrypsin and {alpha}{sub 1}-antichymotrypsin. {alpha}{sub 1}-Acid glycoprotein and C-reactive protein mRNA levels were increased severalfold in livers of heavy-metal-treated Balb/c mice. The strongest induction was mediated by Hg, followed in order of response by Cd > Pb > Cu > Ni > Zn > Mg. None of the metals affected the mRNA levels of albumin, {alpha}{sub 1}-antitrypsin, and {alpha}{sub 1}-antichymotrypsin. Furthermore, failure to repress albumin, a negative acute-phase reactant, indicated that the induction of these genes was not due to a metal-mediated inflammatory response. The metals also induced {alpha}{sub 1}-acid glycoprotein and C-reactive protein in adrenalectomized animals, indicating that induction by the heavy metals is not mediated by the glucocorticoid induction pathway. Sequence analysis has revealed a region of homology to metal-responsive elements in the {alpha}{sub 1}-acid glycoprotein and C-reactive protein promoters. The studies indicate that the induction of {alpha}{sub 1}-acid glycoprotein and C-reactive protein by heavy metals may be regulated by these metal-responsive elements at the level of transcription.

  1. Phylogenomic analysis reveals dynamic evolutionary history of the Drosophila heterochromatin protein 1 (HP1) gene family.

    PubMed

    Levine, Mia T; McCoy, Connor; Vermaak, Danielle; Lee, Yuh Chwen G; Hiatt, Mary Alice; Matsen, Frederick A; Malik, Harmit S

    2012-01-01

    Heterochromatin is the gene-poor, satellite-rich eukaryotic genome compartment that supports many essential cellular processes. The functional diversity of proteins that bind and often epigenetically define heterochromatic DNA sequence reflects the diverse functions supported by this enigmatic genome compartment. Moreover, heterogeneous signatures of selection at chromosomal proteins often mirror the heterogeneity of evolutionary forces that act on heterochromatic DNA. To identify new such surrogates for dissecting heterochromatin function and evolution, we conducted a comprehensive phylogenomic analysis of the Heterochromatin Protein 1 gene family across 40 million years of Drosophila evolution. Our study expands this gene family from 5 genes to at least 26 genes, including several uncharacterized genes in Drosophila melanogaster. The 21 newly defined HP1s introduce unprecedented structural diversity, lineage-restriction, and germline-biased expression patterns into the HP1 family. We find little evidence of positive selection at these HP1 genes in both population genetic and molecular evolution analyses. Instead, we find that dynamic evolution occurs via prolific gene gains and losses. Despite this dynamic gene turnover, the number of HP1 genes is relatively constant across species. We propose that karyotype evolution drives at least some HP1 gene turnover. For example, the loss of the male germline-restricted HP1E in the obscura group coincides with one episode of dramatic karyotypic evolution, including the gain of a neo-Y in this lineage. This expanded compendium of ovary- and testis-restricted HP1 genes revealed by our study, together with correlated gain/loss dynamics and chromosome fission/fusion events, will guide functional analyses of novel roles supported by germline chromatin.

  2. GeneValidator: identify problems with protein-coding gene predictions

    PubMed Central

    Drăgan, Monica-Andreea; Moghul, Ismail; Priyam, Anurag; Bustos, Claudio; Wurm, Yannick

    2016-01-01

    Summary: Genomes of emerging model organisms are now being sequenced at very low cost. However, obtaining accurate gene predictions remains challenging: even the best gene prediction algorithms make substantial errors and can jeopardize subsequent analyses. Therefore, many predicted genes must be time-consumingly visually inspected and manually curated. We developed GeneValidator (GV) to automatically identify problematic gene predictions and to aid manual curation. For each gene, GV performs multiple analyses based on comparisons to gene sequences from large databases. The resulting report identifies problematic gene predictions and includes extensive statistics and graphs for each prediction to guide manual curation efforts. GV thus accelerates and enhances the work of biocurators and researchers who need accurate gene predictions from newly sequenced genomes. Availability and implementation: GV can be used through a web interface or in the command-line. GV is open-source (AGPL), available at https://wurmlab.github.io/tools/genevalidator. Contact: y.wurm@qmul.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26787666

  3. Ontogenetic changes in seminal fluid gene expression and the protein composition of cricket seminal fluid.

    PubMed

    Simmons, Leigh W; Beveridge, Maxine; Li, Lei; Li, Lie; Tan, Yew-Foon; Millar, A Harvey

    2014-03-01

    The ejaculates of most internally fertilizing species consists of both sperm and seminal fluid proteins. Seminal fluid proteins have been studied largely in relation to their post-mating effects on female reproductive physiology, and predominantly in genomically well-characterized species. Seminal fluids can also play important roles in sperm maturation and performance. In the field cricket Teleogryllus oceanicus the viability of ejaculated sperm increases as males age, as does their competitive fertilization success. Here, using quantitative proteomics and quantitative real-time PCR, we document ontogenetic changes in seminal fluid protein abundance and in seminal fluid gene expression. We identified at least nine proteins that changed in abundance in the seminal fluid of crickets as they aged. Gene expression was quantified for five seminal fluid protein genes, and in four of these gene expression changed as males aged. These ontogenetic changes were associated with a general increase in the size of the male accessory glands. Several of the seminal fluid proteins that we have identified are novel, and some have BLAST matches to proteins implicated in sperm function. Our data suggest that age related changes in competitive fertilization success may be dependent on seminal fluid chemistry.

  4. Acute myeloid leukemia fusion proteins deregulate genes involved in stem cell maintenance and DNA repair

    PubMed Central

    Alcalay, Myriam; Meani, Natalia; Gelmetti, Vania; Fantozzi, Anna; Fagioli, Marta; Orleth, Annette; Riganelli, Daniela; Sebastiani, Carla; Cappelli, Enrico; Casciari, Cristina; Sciurpi, Maria Teresa; Mariano, Angela Rosa; Minardi, Simone Paolo; Luzi, Lucilla; Muller, Heiko; Di Fiore, Pier Paolo; Frosina, Guido; Pelicci, Pier Giuseppe

    2003-01-01

    Acute myelogenous leukemias (AMLs) are genetically heterogeneous and characterized by chromosomal rearrangements that produce fusion proteins with aberrant transcriptional regulatory activities. Expression of AML fusion proteins in transgenic mice increases the risk of myeloid leukemias, suggesting that they induce a preleukemic state. The underlying molecular and biological mechanisms are, however, unknown. To address this issue, we performed a systematic analysis of fusion protein transcriptional targets. We expressed AML1/ETO, PML/RAR, and PLZF/RAR in U937 hemopoietic precursor cells and measured global gene expression using oligonucleotide chips. We identified 1,555 genes regulated concordantly by at least two fusion proteins that were further validated in patient samples and finally classified according to available functional information. Strikingly, we found that AML fusion proteins induce genes involved in the maintenance of the stem cell phenotype and repress DNA repair genes, mainly of the base excision repair pathway. Functional studies confirmed that ectopic expression of fusion proteins constitutively activates pathways leading to increased stem cell renewal (e.g., the Jagged1/Notch pathway) and provokes accumulation of DNA damage. We propose that expansion of the stem cell compartment and induction of a mutator phenotype are relevant features underlying the leukemic potential of AML-associated fusion proteins. PMID:14660751

  5. Revisiting the protein-coding gene catalog of Drosophila melanogaster using 12 fly genomes

    PubMed Central

    Lin, Michael F.; Carlson, Joseph W.; Crosby, Madeline A.; Matthews, Beverley B.; Yu, Charles; Park, Soo; Wan, Kenneth H.; Schroeder, Andrew J.; Gramates, L. Sian; St. Pierre, Susan E.; Roark, Margaret; Wiley, Kenneth L.; Kulathinal, Rob J.; Zhang, Peili; Myrick, Kyl V.; Antone, Jerry V.; Celniker, Susan E.; Gelbart, William M.; Kellis, Manolis

    2007-01-01

    The availability of sequenced genomes from 12 Drosophila species has enabled the use of comparative genomics for the systematic discovery of functional elements conserved within this genus. We have developed quantitative metrics for the evolutionary signatures specific to protein-coding regions and applied them genome-wide, resulting in 1193 candidate new protein-coding exons in the D. melanogaster genome. We have reviewed these predictions by manual curation and validated a subset by directed cDNA screening and sequencing, revealing both new genes and new alternative splice forms of known genes. We also used these evolutionary signatures to evaluate existing gene annotations, resulting in the validation of 87% of genes lacking descriptive names and identifying 414 poorly conserved genes that are likely to be spurious predictions, noncoding, or species-specific genes. Furthermore, our methods suggest a variety of refinements to hundreds of existing gene models, such as modifications to translation start codons and exon splice boundaries. Finally, we performed directed genome-wide searches for unusual protein-coding structures, discovering 149 possible examples of stop codon readthrough, 125 new candidate ORFs of polycistronic mRNAs, and several candidate translational frameshifts. These results affect >10% of annotated fly genes and demonstrate the power of comparative genomics to enhance our understanding of genome organization, even in a model organism as intensively studied as Drosophila melanogaster. PMID:17989253

  6. Extensive gene amplification and concerted evolution within the CPR family of cuticular proteins in mosquitoes.

    PubMed

    Cornman, R Scott; Willis, Judith H

    2008-06-01

    Annotation of the Anopheles gambiae genome has revealed a large increase in the number of genes encoding cuticular proteins with the Rebers and Riddiford Consensus (the CPR gene family) relative to Drosophila melanogaster. This increase reflects an expansion of the RR-2 group of CPR genes, particularly the amplification of sets of highly similar paralogs. Patterns of nucleotide variation indicate that extensive concerted evolution is occurring within these clusters. The pattern of concerted evolution is complex, however, as sequence similarity within clusters is uncorrelated with gene order and orientation, and no comparable clusters occur within similarly compact arrays of the RR-1 group in mosquitoes or in either group in D. melanogaster. The dearth of pseudogenes suggests that sequence clusters are maintained by selection for high gene-copy number, perhaps due to selection for high expression rates. This hypothesis is consistent with the apparently parallel evolution of compact gene architectures within sequence clusters relative to single-copy genes. We show that RR-2 proteins from sequence-cluster genes have complex repeats and extreme amino-acid compositions relative to single-copy CPR proteins in An. gambiae, and that the amino-acid composition of the N-terminal and C-terminal sequence flanking the chitin-binding consensus region evolves in a correlated fashion.

  7. A substitution mutation in the myosin binding protein C gene in ragdoll hypertrophic cardiomyopathy.

    PubMed

    Meurs, Kathryn M; Norgard, Michelle M; Ederer, Martina M; Hendrix, Kristina P; Kittleson, Mark D

    2007-08-01

    Familial hypertrophic cardiomyopathy (HCM) is a primary myocardial disease with a prevalence of 1 in 500 in human beings. Causative mutations have been identified in several sarcomeric genes, including the cardiac myosin binding protein C (MYBPC3) gene. Heritable HCM also exists in a large-animal model, the cat, and we have previously reported a mutation in the MYBPC3 gene in the Maine coon breed. We now report a separate mutation in the MYBPC3 gene in ragdoll cats with HCM. The mutation changes a conserved arginine to tryptophan and appears to alter the protein structure. The ragdoll is not related to the Maine coon and the mutation identified is in a domain different from that of the previously identified feline mutation. The identification of two separate mutations within this gene in unrelated breeds suggests that these mutations occurred independently rather than being passed on from a common founder.

  8. Human major histocompatibility complex class I gene that encodes a protein with a shortened cytoplasmic segment

    SciTech Connect

    Geraghty, D.E.; Koller, B.H.; Orr, H.T.

    1987-12-01

    The authors have cloned genomic DNA encoding a non-HLA-A, -B, -C class I gene located within a HindIII-generated restriction fragment of 6.0 kilobase pairs. This gene, designated HLA-6.0, is as homologous to HLA-A and HLA-B as they are to each other. The HLA class I protein encoded by HLA-6.0 is similar in organization to the HLA-A-, -B-, and -C-encoded proteins except that an in-frame termination codon prevents translation of a majority of the cytoplasmic region of the HLA-6.0 polypeptide. Moreover, the promoter region of HLA-6.0 resembles the promoter region of a Qa region gene. These structural features of HLA-6.0 suggest that this nonHLA-A, -B, -C gene is a structural homolog of a murine Qa region class I gene.

  9. DNA regions that regulate the ovarian transcriptional specificity of Drosophila yolk protein genes.

    PubMed

    Logan, S K; Garabedian, M J; Wensink, P C

    1989-09-01

    Yolk protein genes 1 and 2 (yp1 and yp2) of Drosophila melanogaster are divergently transcribed neighboring genes. Both are transcribed in only two tissues, the ovarian follicle cells and the fat bodies of adult females. Previous work has identified a yolk protein enhancer between the genes that is sufficient to direct transcription in one of the tissues, female fat bodies. Using germ-line transformation methods, we identify two cis-acting regions with positive effects on transcription in ovaries. One, a 301-bp region located between the genes, influences both genes and is an enhancer determining the stage and cell type specificity of ovarian transcription. The other, a 105-bp region located in the first exon of yp2, acts across the yp2 promoter region to stimulate yp1 transcription in ovaries. Additional observations suggest how a single enhancer influences both promoters.

  10. Rice Ribosomal Protein Large Subunit Genes and Their Spatio-temporal and Stress Regulation

    PubMed Central

    Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Madhav, Sheshu M.; Kirti, P. B.

    2016-01-01

    Ribosomal proteins (RPs) are well-known for their role in mediating protein synthesis and maintaining the stability of the ribosomal complex, which includes small and large subunits. In the present investigation, in a genome-wide survey, we predicted that the large subunit of rice ribosomes is encoded by at least 123 genes including individual gene copies, distributed throughout the 12 chromosomes. We selected 34 candidate genes, each having 2–3 identical copies, for a detailed characterization of their gene structures, protein properties, cis-regulatory elements and comprehensive expression analysis. RPL proteins appear to be involved in interactions with other RP and non-RP proteins and their encoded RNAs have a higher content of alpha-helices in their predicted secondary structures. The majority of RPs have binding sites for metal and non-metal ligands. Native expression profiling of 34 ribosomal protein large (RPL) subunit genes in tissues covering the major stages of rice growth shows that they are predominantly expressed in vegetative tissues and seedlings followed by meiotically active tissues like flowers. The putative promoter regions of these genes also carry cis-elements that respond specifically to stress and signaling molecules. All the 34 genes responded differentially to the abiotic stress treatments. Phytohormone and cold treatments induced significant up-regulation of several RPL genes, while heat and H2O2 treatments down-regulated a majority of them. Furthermore, infection with a bacterial pathogen, Xanthomonas oryzae, which causes leaf blight also induced the expression of 80% of the RPL genes in leaves. Although the expression of RPL genes was detected in all the tissues studied, they are highly responsive to stress and signaling molecules indicating that their encoded proteins appear to have roles in stress amelioration besides house-keeping. This shows that the RPL gene family is a valuable resource for manipulation of stress tolerance in

  11. Gene gun transferring-bone morphogenetic protein 2 (BMP-2) gene enhanced bone fracture healing in rabbits

    PubMed Central

    Li, Wenju; Wei, Haifeng; Xia, Chunmei; Zhu, Xiaomeng; Hou, Guozhu; Xu, Feng; Song, Xinghua; Zhan, Yulin

    2015-01-01

    Purpose: Transferring the bone morphogenetic protein 2 (BMP-2) genes into the tissues or cells can improve the bone healing of the fracture has been widely accepted. We evaluated the efficiency of using gene gun to transfer the BMP-2 gene thereby affected the healing of a fractured bone. Methods: The vector coding for BMP-2 was constructed by a non-replicating encephalo-myocarditis virus (ECMV)-based vector. The segmental bone defect (1.5 cm) model was created by a wire-saw at the middle part of the radius bone of the New Zealand white rabbits. Then either BMP-2 gene or control vector without BMP-2 gene was injected into the tissues around the fracture site. Healing of the defects was monitored radiographically for 9 weeks, bone consolidation was determined by the Lane-Sandhu score pre- and post-operatively, which can evaluated bone formation, bone connect and bone plasticity. Results: The radiographic score and bone consolidation rates were significantly higher in animals injected with BMP-2 gene group as compared with control vector-injected animals (P<0.05). The control group still showed no radiological signs of stable healing. Western-blot and RT-PCR showed BMP-2 expression was significant increase in the tissues around the site of osseous lesions in comparison with the control vector-injected animals (P<0.05). Conclusions: Our results suggested that BMP-2 gene transferred by gene gun could increase the expression of BMP-2 protein and improved the bone callus formation therefore shortened the time of bone defect healing. PMID:26884910

  12. Secreted proteins and genes in fetal and neonatal pig adipose tissue and stromal-vascular cells.

    PubMed

    Hausman, G J; Poulos, S P; Richardson, R L; Barb, C R; Andacht, T; Kirk, H C; Mynatt, R L

    2006-07-01

    Although microarray and proteomic studies have indicated the expression of unique and unexpected genes and their products in human and rodent adipose tissue, similar studies of meat animal adipose tissue have not been reported. Thus, total RNA was isolated from stromal-vascular (S-V) cell cultures (n = 4; 2 arrays; 2 cultures/array) from 90-d (79% of gestation) fetuses and adipose tissue from 105-d (92% of gestation) fetuses (n = 2) and neonatal (5-d-old) pigs (n = 2). Duplicate adipose tissue microarrays (n = 4) represented RNA samples from a pig and a fetus. Dye-labeled cDNA probes were hybridized to custom microarrays (70-mer oligonucleotides) representing more than 600 pig genes involved in growth and reproduction. Microarray studies showed significant expression of 40 genes encoding for known adipose tissue secreted proteins in fetal S-V cell cultures and adipose tissue. Expression of 10 genes encoding secreted proteins not known to be expressed by adipose tissue was also observed in neonatal adipose tissue and fetal S-V cell cultures. Additionally, the agouti gene was detected by reverse transcription-PCR in pig S-V cultures and adipose tissue. Proteomic analysis of adipose tissue and fetal and young pig S-V cell culture-conditioned media identified multiple secreted proteins including heparin-like epidermal growth factor-like growth factor and several apolipoproteins. Another adipose tissue secreted protein, plasminogen activator inhibitor-1, was identified by ELISA in S-V cell culture media. A group of 20 adipose tissue secreted proteins were detected or identified using the gene microarray and the proteomic and protein assay approaches including apolipoprotein-A1, apolipoprotein-E, relaxin, brain-derived neurotrophic factor, and IGF binding protein-5. These studies demonstrate, for the first time, the expression of several major secreted proteins in pig adipose tissue that may influence local and central metabolism and growth.

  13. Epithelial and endothelial expression of the green fluorescent protein reporter gene under the control of bovine prion protein (PrP) gene regulatory sequences in transgenic mice

    NASA Astrophysics Data System (ADS)

    Lemaire-Vieille, Catherine; Schulze, Tobias; Podevin-Dimster, Valérie; Follet, Jérome; Bailly, Yannick; Blanquet-Grossard, Françoise; Decavel, Jean-Pierre; Heinen, Ernst; Cesbron, Jean-Yves

    2000-05-01

    The expression of the cellular form of the prion protein (PrPc) gene is required for prion replication and neuroinvasion in transmissible spongiform encephalopathies. The identification of the cell types expressing PrPc is necessary to understanding how the agent replicates and spreads from peripheral sites to the central nervous system. To determine the nature of the cell types expressing PrPc, a green fluorescent protein reporter gene was expressed in transgenic mice under the control of 6.9 kb of the bovine PrP gene regulatory sequences. It was shown that the bovine PrP gene is expressed as two populations of mRNA differing by alternative splicing of one 115-bp 5' untranslated exon in 17 different bovine tissues. The analysis of transgenic mice showed reporter gene expression in some cells that have been identified as expressing PrP, such as cerebellar Purkinje cells, lymphocytes, and keratinocytes. In addition, expression of green fluorescent protein was observed in the plexus of the enteric nervous system and in a restricted subset of cells not yet clearly identified as expressing PrP: the epithelial cells of the thymic medullary and the endothelial cells of both the mucosal capillaries of the intestine and the renal capillaries. These data provide valuable information on the distribution of PrPc at the cellular level and argue for roles of the epithelial and endothelial cells in the spread of infection from the periphery to the brain. Moreover, the transgenic mice described in this paper provide a model that will allow for the study of the transcriptional activity of the PrP gene promoter in response to scrapie infection.

  14. Genepleio software for effective estimation of gene pleiotropy from protein sequences.

    PubMed

    Chen, Wenhai; Chen, Dandan; Zhao, Ming; Zou, Yangyun; Zeng, Yanwu; Gu, Xun

    2015-01-01

    Though pleiotropy, which refers to the phenomenon of a gene affecting multiple traits, has long played a central role in genetics, development, and evolution, estimation of the number of pleiotropy components remains a hard mission to accomplish. In this paper, we report a newly developed software package, Genepleio, to estimate the effective gene pleiotropy from phylogenetic analysis of protein sequences. Since this estimate can be interpreted as the minimum pleiotropy of a gene, it is used to play a role of reference for many empirical pleiotropy measures. This work would facilitate our understanding of how gene pleiotropy affects the pattern of genotype-phenotype map and the consequence of organismal evolution.

  15. Structural annotation of equine protein-coding genes determined by mRNA sequencing.

    PubMed

    Coleman, S J; Zeng, Z; Wang, K; Luo, S; Khrebtukova, I; Mienaltowski, M J; Schroth, G P; Liu, J; MacLeod, J N

    2010-12-01

    The horse, like the majority of animal species, has a limited amount of species-specific expressed sequence data available in public databases. As a result, structural models for the majority of genes defined in the equine genome are predictions based on ab initio sequence analysis or the projection of gene structures from other mammalian species. The current study used Illumina-based sequencing of messenger RNA (RNA-seq) to help refine structural annotation of equine protein-coding genes and for a preliminary assessment of gene expression patterns. Sequencing of mRNA from eight equine tissues generated 293,758105 sequence tags of 35 bases each, equalling 10.28 gbp of total sequence data. The tag alignments represent approximately 207 × coverage of the equine mRNA transcriptome and confirmed transcriptional activity for roughly 90% of the protein-coding gene structures predicted by Ensembl and NCBI. Tag coverage was sufficient to refine the structural annotation for 11,356 of these predicted genes, while also identifying an additional 456 transcripts with exon/intron features that are not listed by either Ensembl or NCBI. Genomic locus data and intervals for the protein-coding genes predicted by the Ensembl and NCBI annotation pipelines were combined with 75,116 RNA-seq-derived transcriptional units to generate a consensus equine protein-coding gene set of 20,302 defined loci. Gene ontology annotation was used to compare the functional and structural categories of genes expressed in either a tissue-restricted pattern or broadly across all tissue samples. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.

  16. New genes from non-coding sequence: the role of de novo protein-coding genes in eukaryotic evolutionary innovation

    PubMed Central

    McLysaght, Aoife; Guerzoni, Daniele

    2015-01-01

    The origin of novel protein-coding genes de novo was once considered so improbable as to be impossible. In less than a decade, and especially in the last five years, this view has been overturned by extensive evidence from diverse eukaryotic lineages. There is now evidence that this mechanism has contributed a significant number of genes to genomes of organisms as diverse as Saccharomyces, Drosophila, Plasmodium, Arabidopisis and human. From simple beginnings, these genes have in some instances acquired complex structure, regulated expression and important functional roles. New genes are often thought of as dispensable late additions; however, some recent de novo genes in human can play a role in disease. Rather than an extremely rare occurrence, it is now evident that there is a relatively constant trickle of proto-genes released into the testing ground of natural selection. It is currently unknown whether de novo genes arise primarily through an ‘RNA-first’ or ‘ORF-first’ pathway. Either way, evolutionary tinkering with this pool of genetic potential may have been a significant player in the origins of lineage-specific traits and adaptations. PMID:26323763

  17. The genomic structure of the human Charcot-Leyden crystal protein gene is analogous to those of the galectin genes

    SciTech Connect

    Dyer, K.D. |; Handen, J.S.; Rosenberg, H.F.

    1997-03-01

    The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of {beta}-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire {beta}-galactoside binding site encoded by exon III. We have isolated CLC {beta}-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the {beta}-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.

  18. Characterization and mapping of human genes encoding zinc finger proteins.

    PubMed Central

    Bray, P; Lichter, P; Thiesen, H J; Ward, D C; Dawid, I B

    1991-01-01

    The zinc finger motif, exemplified by a segment of the Drosophila gap gene Krüppel, is a nucleic acid-binding domain present in many transcription factors. To investigate the gene family encoding this motif in the human genome, a placental genomic library was screened at moderate stringency with a degenerate oligodeoxynucleotide probe designed to