Music, Nedzad; Gagnon, Carl A
2010-12-01
Porcine reproductive and respiratory syndrome (PRRS) is an economically devastating viral disease affecting the swine industry worldwide. The etiological agent, PRRS virus (PRRSV), possesses a RNA viral genome with nine open reading frames (ORFs). The ORF1a and ORF1b replicase-associated genes encode the polyproteins pp1a and pp1ab, respectively. The pp1a is processed in nine non-structural proteins (nsps): nsp1α, nsp1β, and nsp2 to nsp8. Proteolytic cleavage of pp1ab generates products nsp9 to nsp12. The proteolytic pp1a cleavage products process and cleave pp1a and pp1ab into nsp products. The nsp9 to nsp12 are involved in virus genome transcription and replication. The 3' end of the viral genome encodes four minor and three major structural proteins. The GP(2a), GP₃ and GP₄ (encoded by ORF2a, 3 and 4), are glycosylated membrane associated minor structural proteins. The fourth minor structural protein, the E protein (encoded by ORF2b), is an unglycosylated membrane associated protein. The viral envelope contains two major structural proteins: a glycosylated major envelope protein GP₅ (encoded by ORF5) and an unglycosylated membrane M protein (encoded by ORF6). The third major structural protein is the nucleocapsid N protein (encoded by ORF7). All PRRSV non-structural and structural proteins are essential for virus replication, and PRRSV infectivity is relatively intolerant to subtle changes within the structural proteins. PRRSV virulence is multigenic and resides in both the non-structural and structural viral proteins. This review discusses the molecular characteristics, biological and immunological functions of the PRRSV structural and nsps and their involvement in the virus pathogenesis.
Valles, Steven M; Bell, Susanne; Firth, Andrew E
2014-01-01
Solenopsis invicta virus 3 (SINV-3) is a positive-sense single-stranded RNA virus that infects the red imported fire ant, Solenopsis invicta. We show that the second open reading frame (ORF) of the dicistronic genome is expressed via a frameshifting mechanism and that the sequences encoding the structural proteins map to both ORF2 and the 3' end of ORF1, downstream of the sequence that encodes the RNA-dependent RNA polymerase. The genome organization and structural protein expression strategy resemble those of Acyrthosiphon pisum virus (APV), an aphid virus. The capsid protein that is encoded by the 3' end of ORF1 in SINV-3 and APV is predicted to have a jelly-roll fold similar to the capsid proteins of picornaviruses and caliciviruses. The capsid-extension protein that is produced by frameshifting, includes the jelly-roll fold domain encoded by ORF1 as its N-terminus, while the C-terminus encoded by the 5' half of ORF2 has no clear homology with other viral structural proteins. A third protein, encoded by the 3' half of ORF2, is associated with purified virions at sub-stoichiometric ratios. Although the structural proteins can be translated from the genomic RNA, we show that SINV-3 also produces a subgenomic RNA encoding the structural proteins. Circumstantial evidence suggests that APV may also produce such a subgenomic RNA. Both SINV-3 and APV are unclassified picorna-like viruses distantly related to members of the order Picornavirales and the family Caliciviridae. Within this grouping, features of the genome organization and capsid domain structure of SINV-3 and APV appear more similar to caliciviruses, perhaps suggesting the basis for a "Calicivirales" order.
Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T
1993-01-01
Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. Images PMID:8497043
Distribution and Evolution of Yersinia Leucine-Rich Repeat Proteins
Hu, Yueming; Huang, He; Hui, Xinjie; Cheng, Xi; White, Aaron P.
2016-01-01
Leucine-rich repeat (LRR) proteins are widely distributed in bacteria, playing important roles in various protein-protein interaction processes. In Yersinia, the well-characterized type III secreted effector YopM also belongs to the LRR protein family and is encoded by virulence plasmids. However, little has been known about other LRR members encoded by Yersinia genomes or their evolution. In this study, the Yersinia LRR proteins were comprehensively screened, categorized, and compared. The LRR proteins encoded by chromosomes (LRR1 proteins) appeared to be more similar to each other and different from those encoded by plasmids (LRR2 proteins) with regard to repeat-unit length, amino acid composition profile, and gene expression regulation circuits. LRR1 proteins were also different from LRR2 proteins in that the LRR1 proteins contained an E3 ligase domain (NEL domain) in the C-terminal region or an NEL domain-encoding nucleotide relic in flanking genomic sequences. The LRR1 protein-encoding genes (LRR1 genes) varied dramatically and were categorized into 4 subgroups (a to d), with the LRR1a to -c genes evolving from the same ancestor and LRR1d genes evolving from another ancestor. The consensus and ancestor repeat-unit sequences were inferred for different LRR1 protein subgroups by use of a maximum parsimony modeling strategy. Structural modeling disclosed very similar repeat-unit structures between LRR1 and LRR2 proteins despite the different unit lengths and amino acid compositions. Structural constraints may serve as the driving force to explain the observed mutations in the LRR regions. This study suggests that there may be functional variation and lays the foundation for future experiments investigating the functions of the chromosomally encoded LRR proteins of Yersinia. PMID:27217422
Molecular mechanisms for protein-encoded inheritance
Wiltzius, Jed J. W.; Landau, Meytal; Nelson, Rebecca; Sawaya, Michael R.; Apostol, Marcin I.; Goldschmidt, Lukasz; Soriaga, Angela B.; Cascio, Duilio; Rajashankar, Kanagalaghatta; Eisenberg, David
2013-01-01
Strains are phenotypic variants, encoded by nucleic acid sequences in chromosomal inheritance and by protein “conformations” in prion inheritance and transmission. But how is a protein “conformation” stable enough to endure transmission between cells or organisms? Here new polymorphic crystal structures of segments of prion and other amyloid proteins offer structural mechanisms for prion strains. In packing polymorphism, prion strains are encoded by alternative packings (polymorphs) of β-sheets formed by the same segment of a protein; in a second mechanism, segmental polymorphism, prion strains are encoded by distinct β-sheets built from different segments of a protein. Both forms of polymorphism can produce enduring “conformations,” capable of encoding strains. These molecular mechanisms for transfer of information into prion strains share features with the familiar mechanism for transfer of information by nucleic acid inheritance, including sequence specificity and recognition by non-covalent bonds. PMID:19684598
A protein-dependent side-chain rotamer library.
Bhuyan, Md Shariful Islam; Gao, Xin
2011-12-14
Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries. In this paper, we propose a new type of backbone-dependent rotamer library, which encodes structural information of all the spatially neighboring residues. We call it protein-dependent rotamer libraries. Given any rotamer library and a protein backbone structure, we first model the protein structure as a Markov random field. Then the marginal distributions are estimated by the inference algorithms, without doing global optimization or search. The rotamers from the given library are then re-ranked and associated with the updated probabilities. Experimental results demonstrate that the proposed protein-dependent libraries significantly outperform the widely used backbone-dependent libraries in terms of the side-chain prediction accuracy and the rotamer ranking ability. Furthermore, without global optimization/search, the side-chain prediction power of the protein-dependent library is still comparable to the global-search-based side-chain prediction methods.
Karimi, Ashkan; Milewicz, Dianna M
2016-01-01
The medial layer of the aorta confers elasticity and strength to the aortic wall and is composed of alternating layers of smooth muscle cells (SMCs) and elastic fibres. The SMC elastin-contractile unit is a structural unit that links the elastin fibres to the SMCs and is characterized by the following: (1) layers of elastin fibres that are surrounded by microfibrils; (2) microfibrils that bind to the integrin receptors in focal adhesions on the cell surface of the SMCs; and (3) SMC contractile filaments that are linked to the focal adhesions on the inner side of the membrane. The genes that are altered to cause thoracic aortic aneurysms and aortic dissections encode proteins involved in the structure or function of the SMC elastin-contractile unit. Included in this gene list are the genes encoding protein that are structural components of elastin fibres and microfibrils, FBN1, MFAP5, ELN, and FBLN4. Also included are genes that encode structural proteins in the SMC contractile unit, including ACTA2, which encodes SMC-specific α-actin and MYH11, which encodes SMC-specific myosin heavy chain, along with MYLK and PRKG1, which encode kinases that control SMC contraction. Finally, mutations in the gene encoding the protein linking integrin receptors to the contractile filaments, FLNA, also predispose to thoracic aortic disease. Thus, these data suggest that functional SMC elastin-contractile units are important for maintaining the structural integrity of the aorta. Copyright © 2016 Canadian Cardiovascular Society. Published by Elsevier Inc. All rights reserved.
Gauci, Penelope J.; Wu, Josh Q. H.; Rayner, George A.; Barabé, Nicole D.; Nagata, Leslie P.; Proll, David F.
2010-01-01
DNA vaccines encoding different portions of the structural proteins of western equine encephalitis virus were tested for the efficacy of their protection in a 100% lethal mouse model of the virus. The 6K-E1 structural protein encoded by the DNA vaccine conferred complete protection against challenge with the homologous strain and limited protection against challenge with a heterologous strain. PMID:19923571
Possenti, Andrea; Vendruscolo, Michele; Camilloni, Carlo; Tiana, Guido
2018-05-23
Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.
Ernst, Antonia M; Jekat, Stephan B; Zielonka, Sascia; Müller, Boje; Neumann, Ulla; Rüping, Boris; Twyman, Richard M; Krzyzanek, Vladislav; Prüfer, Dirk; Noll, Gundula A
2012-07-10
The sieve element occlusion (SEO) gene family originally was delimited to genes encoding structural components of forisomes, which are specialized crystalloid phloem proteins found solely in the Fabaceae. More recently, SEO genes discovered in various non-Fabaceae plants were proposed to encode the common phloem proteins (P-proteins) that plug sieve plates after wounding. We carried out a comprehensive characterization of two tobacco (Nicotiana tabacum) SEO genes (NtSEO). Reporter genes controlled by the NtSEO promoters were expressed specifically in immature sieve elements, and GFP-SEO fusion proteins formed parietal agglomerates in intact sieve elements as well as sieve plate plugs after wounding. NtSEO proteins with and without fluorescent protein tags formed agglomerates similar in structure to native P-protein bodies when transiently coexpressed in Nicotiana benthamiana, and the analysis of these protein complexes by electron microscopy revealed ultrastructural features resembling those of native P-proteins. NtSEO-RNA interference lines were essentially devoid of P-protein structures and lost photoassimilates more rapidly after injury than control plants, thus confirming the role of P-proteins in sieve tube sealing. We therefore provide direct evidence that SEO genes in tobacco encode P-protein subunits that affect translocation. We also found that peptides recently identified in fascicular phloem P-protein plugs from squash (Cucurbita maxima) represent cucurbit members of the SEO family. Our results therefore suggest a common evolutionary origin for P-proteins found in the sieve elements of all dicotyledonous plants and demonstrate the exceptional status of extrafascicular P-proteins in cucurbits.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dyer, K.D.; Handen, J.S.; Rosenberg, H.F.
The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of {beta}-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire {beta}-galactoside bindingmore » site encoded by exon III. We have isolated CLC {beta}-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the {beta}-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.« less
Protein Secondary Structure Prediction Using AutoEncoder Network and Bayes Classifier
NASA Astrophysics Data System (ADS)
Wang, Leilei; Cheng, Jinyong
2018-03-01
Protein secondary structure prediction is belong to bioinformatics,and it's important in research area. In this paper, we propose a new prediction way of protein using bayes classifier and autoEncoder network. Our experiments show some algorithms including the construction of the model, the classification of parameters and so on. The data set is a typical CB513 data set for protein. In terms of accuracy, the method is the cross validation based on the 3-fold. Then we can get the Q3 accuracy. Paper results illustrate that the autoencoder network improved the prediction accuracy of protein secondary structure.
Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing
2009-03-11
Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene-encoded proteins are attached to the core at more peripheral positions of the networks.
Livingston, B T; Shaw, R; Bailey, A; Wilt, F
1991-12-01
In order to investigate the role of proteins in the formation of mineralized tissues during development, we have isolated a cDNA that encodes a protein that is a component of the organic matrix of the skeletal spicule of the sea urchin, Lytechinus pictus. The expression of the RNA encoding this protein is regulated over development and is localized to the descendents of the micromere lineage. Comparison of the sequence of this cDNA to homologous cDNAs from other species of urchin reveal that the protein is basic and contains three conserved structural motifs: a signal peptide, a proline-rich region, and an unusual region composed of a series of direct repeats. Studies on the protein encoded by this cDNA confirm the predicted reading frame deduced from the nucleotide sequence and show that the protein is secreted and not glycosylated. Comparison of the amino acid sequence to databases reveal that the repeat domain is similar to proteins that form a unique beta-spiral supersecondary structure.
Transcriptomic analysis of Arabidopsis developing stems: a close-up on cell wall genes
Minic, Zoran; Jamet, Elisabeth; San-Clemente, Hélène; Pelletier, Sandra; Renou, Jean-Pierre; Rihouey, Christophe; Okinyo, Denis PO; Proux, Caroline; Lerouge, Patrice; Jouanin, Lise
2009-01-01
Background Different strategies (genetics, biochemistry, and proteomics) can be used to study proteins involved in cell biogenesis. The availability of the complete sequences of several plant genomes allowed the development of transcriptomic studies. Although the expression patterns of some Arabidopsis thaliana genes involved in cell wall biogenesis were identified at different physiological stages, detailed microarray analysis of plant cell wall genes has not been performed on any plant tissues. Using transcriptomic and bioinformatic tools, we studied the regulation of cell wall genes in Arabidopsis stems, i.e. genes encoding proteins involved in cell wall biogenesis and genes encoding secreted proteins. Results Transcriptomic analyses of stems were performed at three different developmental stages, i.e., young stems, intermediate stage, and mature stems. Many genes involved in the synthesis of cell wall components such as polysaccharides and monolignols were identified. A total of 345 genes encoding predicted secreted proteins with moderate or high level of transcripts were analyzed in details. The encoded proteins were distributed into 8 classes, based on the presence of predicted functional domains. Proteins acting on carbohydrates and proteins of unknown function constituted the two most abundant classes. Other proteins were proteases, oxido-reductases, proteins with interacting domains, proteins involved in signalling, and structural proteins. Particularly high levels of expression were established for genes encoding pectin methylesterases, germin-like proteins, arabinogalactan proteins, fasciclin-like arabinogalactan proteins, and structural proteins. Finally, the results of this transcriptomic analyses were compared with those obtained through a cell wall proteomic analysis from the same material. Only a small proportion of genes identified by previous proteomic analyses were identified by transcriptomics. Conversely, only a few proteins encoded by genes having moderate or high level of transcripts were identified by proteomics. Conclusion Analysis of the genes predicted to encode cell wall proteins revealed that about 345 genes had moderate or high levels of transcripts. Among them, we identified many new genes possibly involved in cell wall biogenesis. The discrepancies observed between results of this transcriptomic study and a previous proteomic study on the same material revealed post-transcriptional mechanisms of regulation of expression of genes encoding cell wall proteins. PMID:19149885
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott; Battaile, Kevin P.; Zhang, Yang; Hefty, P. Scott
2011-01-01
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF) CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-Å Cα root mean square deviation [RMSD]) the high-resolution (1.8-Å) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur. PMID:21965559
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kemege, Kyle E.; Hickey, John M.; Lovell, Scott
2012-02-13
Chlamydia trachomatis is a medically important pathogen that encodes a relatively high percentage of proteins with unknown function. The three-dimensional structure of a protein can be very informative regarding the protein's functional characteristics; however, determining protein structures experimentally can be very challenging. Computational methods that model protein structures with sufficient accuracy to facilitate functional studies have had notable successes. To evaluate the accuracy and potential impact of computational protein structure modeling of hypothetical proteins encoded by Chlamydia, a successful computational method termed I-TASSER was utilized to model the three-dimensional structure of a hypothetical protein encoded by open reading frame (ORF)more » CT296. CT296 has been reported to exhibit functional properties of a divalent cation transcription repressor (DcrA), with similarity to the Escherichia coli iron-responsive transcriptional repressor, Fur. Unexpectedly, the I-TASSER model of CT296 exhibited no structural similarity to any DNA-interacting proteins or motifs. To validate the I-TASSER-generated model, the structure of CT296 was solved experimentally using X-ray crystallography. Impressively, the ab initio I-TASSER-generated model closely matched (2.72-{angstrom} C{alpha} root mean square deviation [RMSD]) the high-resolution (1.8-{angstrom}) crystal structure of CT296. Modeled and experimentally determined structures of CT296 share structural characteristics of non-heme Fe(II) 2-oxoglutarate-dependent enzymes, although key enzymatic residues are not conserved, suggesting a unique biochemical process is likely associated with CT296 function. Additionally, functional analyses did not support prior reports that CT296 has properties shared with divalent cation repressors such as Fur.« less
USDA-ARS?s Scientific Manuscript database
The Rift Valley fever virus (RVFV) encodes structural proteins, nucleoprotein (N), N-terminus glycoprotein (Gn), C-terminus glycoprotein (Gc) and L protein, 78-kDa and non-structural proteins NSm and NSs. Using the baculovirus system we expressed the full-length coding sequence of N, NSs, NSm, Gc an...
Bartl, S; Weissman, I L
1994-01-04
The major histocompatibility complex (MHC) contains a set of linked genes which encode cell surface proteins involved in the binding of small peptide antigens for their subsequent recognition by T lymphocytes. MHC proteins share structural features and the presence and location of polymorphic residues which play a role in the binding of antigens. In order to compare the structure of these molecules and gain insights into their evolution, we have isolated two MHC class IIB genes from the nurse shark, Ginglymostoma cirratum. Two clones, most probably alleles, encode proteins which differ by 13 amino acids located in the putative antigen-binding cleft. The protein structure and the location of polymorphic residues are similar to their mammalian counterparts. Although these genes appear to encode a typical MHC protein, no T-cell-mediated responses have been demonstrated in cartilaginous fish. The nurse shark represents the most phylogenetically primitive organism in which both class IIA [Kasahara, M., Vazquez, M., Sato, K., McKinney, E.C. & Flajnik, M.F. (1992) Proc. Natl. Acad. Sci USA 89, 6688-6692] and class IIB genes, presumably encoding the alpha/beta heterodimer, have been isolated.
Cook, W B; Walker, J C
1992-01-01
A cDNA encoding a nuclear-encoded chloroplast nucleic acid-binding protein (NBP) has been isolated from maize. Identified as an in vitro DNA-binding activity, NBP belongs to a family of nuclear-encoded chloroplast proteins which share a common domain structure and are thought to be involved in posttranscriptional regulation of chloroplast gene expression. NBP contains an N-terminal chloroplast transit peptide, a highly acidic domain and a pair of ribonucleoprotein consensus sequence domains. NBP is expressed in a light-dependent, organ-specific manner which is consistent with its involvement in chloroplast biogenesis. The relationship of NBP to the other members of this protein family and their possible regulatory functions are discussed. Images PMID:1346929
Protein structural similarity search by Ramachandran codes
Lo, Wei-Cheng; Huang, Po-Jung; Chang, Chih-Hung; Lyu, Ping-Chiang
2007-01-01
Background Protein structural data has increased exponentially, such that fast and accurate tools are necessary to access structure similarity search. To improve the search speed, several methods have been designed to reduce three-dimensional protein structures to one-dimensional text strings that are then analyzed by traditional sequence alignment methods; however, the accuracy is usually sacrificed and the speed is still unable to match sequence similarity search tools. Here, we aimed to improve the linear encoding methodology and develop efficient search tools that can rapidly retrieve structural homologs from large protein databases. Results We propose a new linear encoding method, SARST (Structural similarity search Aided by Ramachandran Sequential Transformation). SARST transforms protein structures into text strings through a Ramachandran map organized by nearest-neighbor clustering and uses a regenerative approach to produce substitution matrices. Then, classical sequence similarity search methods can be applied to the structural similarity search. Its accuracy is similar to Combinatorial Extension (CE) and works over 243,000 times faster, searching 34,000 proteins in 0.34 sec with a 3.2-GHz CPU. SARST provides statistically meaningful expectation values to assess the retrieved information. It has been implemented into a web service and a stand-alone Java program that is able to run on many different platforms. Conclusion As a database search method, SARST can rapidly distinguish high from low similarities and efficiently retrieve homologous structures. It demonstrates that the easily accessible linear encoding methodology has the potential to serve as a foundation for efficient protein structural similarity search tools. These search tools are supposed applicable to automated and high-throughput functional annotations or predictions for the ever increasing number of published protein structures in this post-genomic era. PMID:17716377
DOE Office of Scientific and Technical Information (OSTI.GOV)
Domingo Meza-Aguilar, J.; Laboratorio de Patogenicidad Bacteriana, Unidad de Hemato Oncología e Investigación, Hospital Infantil de México Federico Gómez 06720, D.F.; Fromme, Petra
Highlights: • X-ray crystal structure of the passenger domain of Plasmid encoded toxin at 2.3 Å. • Structural differences between Pet passenger domain and EspP protein are described. • High flexibility of the C-terminal beta helix is structurally assigned. - Abstract: Autotransporters (ATs) represent a superfamily of proteins produced by a variety of pathogenic bacteria, which include the pathogenic groups of Escherichia coli (E. coli) associated with gastrointestinal and urinary tract infections. We present the first X-ray structure of the passenger domain from the Plasmid-encoded toxin (Pet) a 100 kDa protein at 2.3 Å resolution which is a cause ofmore » acute diarrhea in both developing and industrialized countries. Pet is a cytoskeleton-altering toxin that induces loss of actin stress fibers. While Pet (pdb code: 4OM9) shows only a sequence identity of 50% compared to the closest related protein sequence, extracellular serine protease plasmid (EspP) the structural features of both proteins are conserved. A closer structural look reveals that Pet contains a β-pleaded sheet at the sequence region of residues 181–190, the corresponding structural domain in EspP consists of a coiled loop. Secondary, the Pet passenger domain features a more pronounced beta sheet between residues 135 and 143 compared to the structure of EspP.« less
Structural alphabets derived from attractors in conformational space
2010-01-01
Background The hierarchical and partially redundant nature of protein structures justifies the definition of frequently occurring conformations of short fragments as 'states'. Collections of selected representatives for these states define Structural Alphabets, describing the most typical local conformations within protein structures. These alphabets form a bridge between the string-oriented methods of sequence analysis and the coordinate-oriented methods of protein structure analysis. Results A Structural Alphabet has been derived by clustering all four-residue fragments of a high-resolution subset of the protein data bank and extracting the high-density states as representative conformational states. Each fragment is uniquely defined by a set of three independent angles corresponding to its degrees of freedom, capturing in simple and intuitive terms the properties of the conformational space. The fragments of the Structural Alphabet are equivalent to the conformational attractors and therefore yield a most informative encoding of proteins. Proteins can be reconstructed within the experimental uncertainty in structure determination and ensembles of structures can be encoded with accuracy and robustness. Conclusions The density-based Structural Alphabet provides a novel tool to describe local conformations and it is specifically suitable for application in studies of protein dynamics. PMID:20170534
Structural evolution of the 4/1 genes and proteins in non-vascular and lower vascular plants.
Morozov, Sergey Y; Milyutina, Irina A; Bobrova, Vera K; Ryazantsev, Dmitry Y; Erokhina, Tatiana N; Zavriev, Sergey K; Agranovsky, Alexey A; Solovyev, Andrey G; Troitsky, Alexey V
2015-12-01
The 4/1 protein of unknown function is encoded by a single-copy gene in most higher plants. The 4/1 protein of Nicotiana tabacum (Nt-4/1 protein) has been shown to be alpha-helical and predominantly expressed in conductive tissues. Here, we report the analysis of 4/1 genes and the encoded proteins of lower land plants. Sequences of a number of 4/1 genes from liverworts, lycophytes, ferns and gymnosperms were determined and analyzed together with sequences available in databases. Most of the vascular plants were found to encode Magnoliophyta-like 4/1 proteins exhibiting previously described gene structure and protein properties. Identification of the 4/1-like proteins in hornworts, liverworts and charophyte algae (sister lineage to all land plants) but not in mosses suggests that 4/1 proteins are likely important for plant development but not required for a primary metabolic function of plant cell. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Three reasons protein disorder analysis makes more sense in the light of collagen
Oates, Matt E.; Tompa, Peter; Gough, Julian
2016-01-01
Abstract We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen‐encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that exons encoding disordered regions are more likely to be symmetric, are due to the abundance of the collagen helix. Other related results, showing increased levels of alternative splicing in disorder‐encoding exons, still hold after considering collagen‐containing proteins. Aside from analyses of exons, we find that the set of proteins that contain collagen significantly alters the amino acid composition of regions predicted as disordered. We conclude that research in this area should be conducted in the light of the collagen helix. PMID:26941008
Lubin, Johnathan W; Rao, Timsi; Mandell, Edward K; Wuttke, Deborah S; Lundblad, Victoria
2013-03-01
Mutations that confer the loss of a single biochemical property (separation-of-function mutations) can often uncover a previously unknown role for a protein in a particular biological process. However, most mutations are identified based on loss-of-function phenotypes, which cannot differentiate between separation-of-function alleles vs. mutations that encode unstable/unfolded proteins. An alternative approach is to use overexpression dominant-negative (ODN) phenotypes to identify mutant proteins that disrupt function in an otherwise wild-type strain when overexpressed. This is based on the assumption that such mutant proteins retain an overall structure that is comparable to that of the wild-type protein and are able to compete with the endogenous protein (Herskowitz 1987). To test this, the in vivo phenotypes of mutations in the Est3 telomerase subunit from Saccharomyces cerevisiae were compared with the in vitro secondary structure of these mutant proteins as analyzed by circular-dichroism spectroscopy, which demonstrates that ODN is a more sensitive assessment of protein stability than the commonly used method of monitoring protein levels from extracts. Reverse mutagenesis of EST3, which targeted different categories of amino acids, also showed that mutating highly conserved charged residues to the oppositely charged amino acid had an increased likelihood of generating a severely defective est3(-) mutation, which nevertheless encoded a structurally stable protein. These results suggest that charge-swap mutagenesis directed at a limited subset of highly conserved charged residues, combined with ODN screening to eliminate partially unfolded proteins, may provide a widely applicable and efficient strategy for generating separation-of-function mutations.
Structural insights into the multifunctional protein VP3 of birnaviruses.
Casañas, Arnau; Navarro, Aitor; Ferrer-Orta, Cristina; González, Dolores; Rodríguez, José F; Verdaguer, Núria
2008-01-01
Infectious bursal disease virus (IBDV), a member of the Birnaviridae family, is the causative agent of one of the most harmful poultry diseases. The IBDV genome encodes five mature proteins; of these, the multifunctional protein VP3 plays an essential role in virus morphogenesis. This protein, which interacts with the structural protein VP2, with the double-stranded RNA genome, and with the virus-encoded, RNA-dependent RNA polymerase, VP1, is involved not only in the formation of the viral capsid, but also in the recruitment of VP1 into the capsid and in the encapsidation of the viral genome. Here, we report the X-ray structure of the central region of VP3, residues 92-220, consisting of two alpha-helical domains connected by a long and flexible hinge that are organized as a dimer. Unexpectedly, the overall fold of the second VP3 domain shows significant structural similarities with different transcription regulation factors.
Necessities for the First Life to Emerge
NASA Astrophysics Data System (ADS)
Ikehara, K.
2017-07-01
For the first life to emerge, the first protein must be produced by random joining of amino acids in protein 0th-order structure. In addition, the first genetic code and the first double-stranded gene must encode the protein 0th-order structure.
Osada, Naoki; Akashi, Hiroshi
2012-01-01
Accelerated rates of mitochondrial protein evolution have been proposed to reflect Darwinian coadaptation for efficient energy production for mammalian flight and brain activity. However, several features of mammalian mtDNA (absence of recombination, small effective population size, and high mutation rate) promote genome degradation through the accumulation of weakly deleterious mutations. Here, we present evidence for "compensatory" adaptive substitutions in nuclear DNA- (nDNA) encoded mitochondrial proteins to prevent fitness decline in primate mitochondrial protein complexes. We show that high mutation rate and small effective population size, key features of primate mitochondrial genomes, can accelerate compensatory adaptive evolution in nDNA-encoded genes. We combine phylogenetic information and the 3D structure of the cytochrome c oxidase (COX) complex to test for accelerated compensatory changes among interacting sites. Physical interactions among mtDNA- and nDNA-encoded components are critical in COX evolution; amino acids in close physical proximity in the 3D structure show a strong tendency for correlated evolution among lineages. Only nuclear-encoded components of COX show evidence for positive selection and adaptive nDNA-encoded changes tend to follow mtDNA-encoded amino acid changes at nearby sites in the 3D structure. This bias in the temporal order of substitutions supports compensatory weak selection as a major factor in accelerated primate COX evolution.
Structure of the virulence-associated protein VapD from the intracellular pathogen Rhodococcus equi
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whittingham, Jean L.; Blagova, Elena V.; Finn, Ciaran E.
2014-08-01
VapD is one of a set of highly homologous virulence-associated proteins from the multi-host pathogen Rhodococcus equi. The crystal structure reveals an eight-stranded β-barrel with a novel fold and a glycine rich ‘bald’ surface. Rhodococcus equi is a multi-host pathogen that infects a range of animals as well as immune-compromised humans. Equine and porcine isolates harbour a virulence plasmid encoding a homologous family of virulence-associated proteins associated with the capacity of R. equi to divert the normal processes of endosomal maturation, enabling bacterial survival and proliferation in alveolar macrophages. To provide a basis for probing the function of the Vapmore » proteins in virulence, the crystal structure of VapD was determined. VapD is a monomer as determined by multi-angle laser light scattering. The structure reveals an elliptical, compact eight-stranded β-barrel with a novel strand topology and pseudo-twofold symmetry, suggesting evolution from an ancestral dimer. Surface-associated octyl-β-d-glucoside molecules may provide clues to function. Circular-dichroism spectroscopic analysis suggests that the β-barrel structure is preceded by a natively disordered region at the N-terminus. Sequence comparisons indicate that the core folds of the other plasmid-encoded virulence-associated proteins from R. equi strains are similar to that of VapD. It is further shown that sequences encoding putative R. equi Vap-like proteins occur in diverse bacterial species. Finally, the functional implications of the structure are discussed in the light of the unique structural features of VapD and its partial structural similarity to other β-barrel proteins.« less
de-Couet, H. G.; Fong, KSK.; Weeds, A. G.; McLaughlin, P. J.; Miklos, GLG.
1995-01-01
The flightless locus of Drosophila melanogaster has been analyzed at the genetic, molecular, ultrastructural and comparative crystallographic levels. The gene encodes a single transcript encoding a protein consisting of a leucine-rich amino terminal half and a carboxyterminal half with high sequence similarity to gelsolin. We determined the genomic sequence of the flightless landscape, the breakpoints of four chromosomal rearrangements, and the molecular lesions in two lethal and two viable alleles of the gene. The two alleles that lead to flight muscle abnormalities encode mutant proteins exhibiting amino acid replacements within the S1-like domain of their gelsolin-like region. Furthermore, the deduced intronexon structure of the D. melanogaster gene has been compared with that of the Caenorhabditis elegans homologue. Furthermore, the sequence similarities of the flightless protein with gelsolin allow it to be evaluated in the context of the published crystallographic structure of the S1 domain of gelsolin. Amino acids considered essential for the structural integrity of the core are found to be highly conserved in the predicted flightless protein. Some of the residues considered essential for actin and calcium binding in gelsolin S1 and villin V1 are also well conserved. These data are discussed in light of the phenotypic characteristics of the mutants and the putative functions of the protein. PMID:8582612
A highly divergent gene cluster in honey bees encodes a novel silk family.
Sutherland, Tara D; Campbell, Peter M; Weisman, Sarah; Trueman, Holly E; Sriskantha, Alagacone; Wanjura, Wolfgang J; Haritos, Victoria S
2006-11-01
The pupal cocoon of the domesticated silk moth Bombyx mori is the best known and most extensively studied insect silk. It is not widely known that Apis mellifera larvae also produce silk. We have used a combination of genomic and proteomic techniques to identify four honey bee fiber genes (AmelFibroin1-4) and two silk-associated genes (AmelSA1 and 2). The four fiber genes are small, comprise a single exon each, and are clustered on a short genomic region where the open reading frames are GC-rich amid low GC intergenic regions. The genes encode similar proteins that are highly helical and predicted to form unusually tight coiled coils. Despite the similarity in size, structure, and composition of the encoded proteins, the genes have low primary sequence identity. We propose that the four fiber genes have arisen from gene duplication events but have subsequently diverged significantly. The silk-associated genes encode proteins likely to act as a glue (AmelSA1) and involved in silk processing (AmelSA2). Although the silks of honey bees and silkmoths both originate in larval labial glands, the silk proteins are completely different in their primary, secondary, and tertiary structures as well as the genomic arrangement of the genes encoding them. This implies independent evolutionary origins for these functionally related proteins.
A Survey of Protein Structures from Archaeal Viruses
Dellas, Nikki; Lawrence, C. Martin; Young, Mark J.
2013-01-01
Viruses that infect the third domain of life, Archaea, are a newly emerging field of interest. To date, all characterized archaeal viruses infect archaea that thrive in extreme conditions, such as halophilic, hyperthermophilic, and methanogenic environments. Viruses in general, especially those replicating in extreme environments, contain highly mosaic genomes with open reading frames (ORFs) whose sequences are often dissimilar to all other known ORFs. It has been estimated that approximately 85% of virally encoded ORFs do not match known sequences in the nucleic acid databases, and this percentage is even higher for archaeal viruses (typically 90%–100%). This statistic suggests that either virus genomes represent a larger segment of sequence space and/or that viruses encode genes of novel fold and/or function. Because the overall three-dimensional fold of a protein evolves more slowly than its sequence, efforts have been geared toward structural characterization of proteins encoded by archaeal viruses in order to gain insight into their potential functions. In this short review, we provide multiple examples where structural characterization of archaeal viral proteins has indeed provided significant functional and evolutionary insight. PMID:25371334
Nucleic acids encoding plant glutamine phenylpyruvate transaminase (GPT) and uses thereof
Unkefer, Pat J.; Anderson, Penelope S.; Knight, Thomas J.
2016-03-29
Glutamine phenylpyruvate transaminase (GPT) proteins, nucleic acid molecules encoding GPT proteins, and uses thereof are disclosed. Provided herein are various GPT proteins and GPT gene coding sequences isolated from a number of plant species. As disclosed herein, GPT proteins share remarkable structural similarity within plant species, and are active in catalyzing the synthesis of 2-hydroxy-5-oxoproline (2-oxoglutaramate), a powerful signal metabolite which regulates the function of a large number of genes involved in the photosynthesis apparatus, carbon fixation and nitrogen metabolism.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parish, D.; Benach, J; Liu, G
2008-01-01
The structure of the 142-residue protein Q8ZP25 SALTY encoded in the genome of Salmonella typhimurium LT2 was determined independently by NMR and X-ray crystallography, and the structure of the 140-residue protein HYAE ECOLI encoded in the genome of Escherichia coli was determined by NMR. The two proteins belong to Pfam (Finn et al. 34:D247-D251, 2006) PF07449, which currently comprises 50 members, and belongs itself to the 'thioredoxin-like clan'. However, protein HYAE ECOLI and the other proteins of Pfam PF07449 do not contain the canonical Cys-X-X-Cys active site sequence motif of thioredoxin. Protein HYAE ECOLI was previously classified as a (NiFe)more » hydrogenase-1 specific chaperone interacting with the twin-arginine translocation (Tat) signal peptide. The structures presented here exhibit the expected thioredoxin-like fold and support the view that members of Pfam family PF07449 specifically interact with Tat signal peptides.« less
Plant, Ewan P; Rakauskaite, Rasa; Taylor, Deborah R; Dinman, Jonathan D
2010-05-01
In retroviruses and the double-stranded RNA totiviruses, the efficiency of programmed -1 ribosomal frameshifting is critical for ensuring the proper ratios of upstream-encoded capsid proteins to downstream-encoded replicase enzymes. The genomic organizations of many other frameshifting viruses, including the coronaviruses, are very different, in that their upstream open reading frames encode nonstructural proteins, the frameshift-dependent downstream open reading frames encode enzymes involved in transcription and replication, and their structural proteins are encoded by subgenomic mRNAs. The biological significance of frameshifting efficiency and how the relative ratios of proteins encoded by the upstream and downstream open reading frames affect virus propagation has not been explored before. Here, three different strategies were employed to test the hypothesis that the -1 PRF signals of coronaviruses have evolved to produce the correct ratios of upstream- to downstream-encoded proteins. Specifically, infectious clones of the severe acute respiratory syndrome (SARS)-associated coronavirus harboring mutations that lower frameshift efficiency decreased infectivity by >4 orders of magnitude. Second, a series of frameshift-promoting mRNA pseudoknot mutants was employed to demonstrate that the frameshift signals of the SARS-associated coronavirus and mouse hepatitis virus have evolved to promote optimal frameshift efficiencies. Finally, we show that a previously described frameshift attenuator element does not actually affect frameshifting per se but rather serves to limit the fraction of ribosomes available for frameshifting. The findings of these analyses all support a "golden mean" model in which viruses use both programmed ribosomal frameshifting and translational attenuation to control the relative ratios of their encoded proteins.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rodríguez Guilbe, María M.; Protein Research and Development Center, University of Puerto Rico; Alfaro Malavé, Elisa C.
The genetically encoded fluorescent calcium-indicator protein GCaMP2 was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution and the structure was solved by molecular replacement. Fluorescent proteins and their engineered variants have played an important role in the study of biology. The genetically encoded calcium-indicator protein GCaMP2 comprises a circularly permuted fluorescent protein coupled to the calcium-binding protein calmodulin and a calmodulin target peptide, M13, derived from the intracellular calmodulin target myosin light-chain kinase and has been used to image calcium transients in vivo. To aid rational efforts to engineer improved variants of GCaMP2, thismore » protein was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution. The crystals belong to space group C2, with unit-cell parameters a = 126.1, b = 47.1, c = 68.8 Å, β = 100.5° and one GCaMP2 molecule in the asymmetric unit. The structure was phased by molecular replacement and refinement is currently under way.« less
Hsu, Jack C-C; Reid, David W; Hoffman, Alyson M; Sarkar, Devanand; Nicchitta, Christopher V
2018-05-01
Astrocyte elevated gene-1 (AEG-1), an oncogene whose overexpression promotes tumor cell proliferation, angiogenesis, invasion, and enhanced chemoresistance, is thought to function primarily as a scaffolding protein, regulating PI3K/Akt and Wnt/β-catenin signaling pathways. Here we report that AEG-1 is an endoplasmic reticulum (ER) resident integral membrane RNA-binding protein (RBP). Examination of the AEG-1 RNA interactome by HITS-CLIP and PAR-CLIP methodologies revealed a high enrichment for endomembrane organelle-encoding transcripts, most prominently those encoding ER resident proteins, and within this cohort, for integral membrane protein-encoding RNAs. Cluster mapping of the AEG-1/RNA interaction sites demonstrated a normalized rank order interaction of coding sequence >5' untranslated region, with 3' untranslated region interactions only weakly represented. Intriguingly, AEG-1/membrane protein mRNA interaction sites clustered downstream from encoded transmembrane domains, suggestive of a role in membrane protein biogenesis. Secretory and cytosolic protein-encoding mRNAs were also represented in the AEG-1 RNA interactome, with the latter category notably enriched in genes functioning in mRNA localization, translational regulation, and RNA quality control. Bioinformatic analyses of RNA-binding motifs and predicted secondary structure characteristics indicate that AEG-1 lacks established RNA-binding sites though shares the property of high intrinsic disorder commonly seen in RBPs. These data implicate AEG-1 in the localization and regulation of secretory and membrane protein-encoding mRNAs and provide a framework for understanding AEG-1 function in health and disease. © 2018 Hsu et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Stayton, M M; Black, M; Bedbrook, J; Dunsmuir, P
1986-12-22
The 16 petunia Cab genes which have been characterized are all closely related at the nucleotide sequence level and they encode Cab precursor polypeptides which are similar in sequence and length. Here we describe a novel petunia Cab gene which encodes a unique Cab precursor protein. This protein is a member of the smallest class of Cab precursor proteins for which no gene has previously been assigned in petunia or any other species. The features of this Cab precursor protein are that it is shorter by 2-3 amino acids than the formerly characterized Cab precursors, its transit peptide sequence is unrelated, and the mature polypeptide is significantly diverged at the functionally important N terminus from other petunia Cab proteins. Gene structure also discriminates this gene which is the only intron containing Cab gene in petunia genomic DNA.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2015-01-01
Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Yang, Hongfang; Medeiros, Patricia F; Raha, Kaushik; Elkins, Patricia; Lind, Kenneth E; Lehr, Ruth; Adams, Nicholas D; Burgess, Joelle L; Schmidt, Stanley J; Knight, Steven D; Auger, Kurt R; Schaber, Michael D; Franklin, G Joseph; Ding, Yun; DeLorey, Jennifer L; Centrella, Paolo A; Mataruse, Sibongile; Skinner, Steven R; Clark, Matthew A; Cuozzo, John W; Evindar, Ghotas
2015-05-14
In the search of PI3K p110α wild type and H1047R mutant selective small molecule leads, an encoded library technology (ELT) campaign against the desired target proteins was performed which led to the discovery of a selective chemotype for PI3K isoforms from a three-cycle DNA encoded library. An X-ray crystal structure of a representative inhibitor from this chemotype demonstrated a unique binding mode in the p110α protein.
2015-01-01
In the search of PI3K p110α wild type and H1047R mutant selective small molecule leads, an encoded library technology (ELT) campaign against the desired target proteins was performed which led to the discovery of a selective chemotype for PI3K isoforms from a three-cycle DNA encoded library. An X-ray crystal structure of a representative inhibitor from this chemotype demonstrated a unique binding mode in the p110α protein. PMID:26005528
USDA-ARS?s Scientific Manuscript database
Bean pod mottle virus (BPMV) is a bipartite, positive-sense (+) RNA plant virus of the family Secoviridae. Its RNA1 encodes all proteins needed for genome replication and is capable of autonomous replication. By contrast, BPMV RNA2 must utilize RNA1-encoded proteins for replication. Here, we sought ...
A lanthipeptide library used to identify a protein-protein interaction inhibitor.
Yang, Xiao; Lennard, Katherine R; He, Chang; Walker, Mark C; Ball, Andrew T; Doigneaux, Cyrielle; Tavassoli, Ali; van der Donk, Wilfred A
2018-04-01
In this article we describe the production and screening of a genetically encoded library of 10 6 lanthipeptides in Escherichia coli using the substrate-tolerant lanthipeptide synthetase ProcM. This plasmid-encoded library was combined with a bacterial reverse two-hybrid system for the interaction of the HIV p6 protein with the UEV domain of the human TSG101 protein, which is a critical protein-protein interaction for HIV budding from infected cells. Using this approach, we identified an inhibitor of this interaction from the lanthipeptide library, whose activity was verified in vitro and in cell-based virus-like particle-budding assays. Given the variety of lanthipeptide backbone scaffolds that may be produced with ProcM, this method may be used for the generation of genetically encoded libraries of natural product-like lanthipeptides containing substantial structural diversity. Such libraries may be combined with any cell-based assay to identify lanthipeptides with new biological activities.
Meza-Aguilar, J. Domingo; Fromme, Petra; Torres-Larios, Alfredo; Mendoza-Hernández, Guillermo; Hernandez-Chiñas, Ulises; Monteros, Roberto A. Arreguin-Espinosa de los; Campos, Carlos A. Eslava; Fromme, Raimund
2014-01-01
Autotransporters (ATs) represent a superfamily of proteins produced by a variety of pathogenic bacteria, which include the pathogenic groups of Escherichia coli (E. coli) associated with gastrointestinal and urinary tract infections. We present the first X-ray structure of the passenger domain from the Plasmid-encoded toxin (Pet) a 100 kDa protein at 2.3 Å resolution which is a cause of acute diarrhea in both developing and industrialized countries. Pet is a cytoskeleton-altering toxin that induces loss of actin stress fibers. While Pet (pdb code: 4OM9) shows only a sequence identity of 50 % compared to the closest related protein sequence, extracellular serine protease plasmid (EspP) the structural features of both proteins are conserved. A closer structural look reveals that Pet contains a β-pleaded sheet at the sequence region of residues 181-190, the corresponding structural domain in EspP consists of a coiled loop. Secondary, the Pet passenger domain features a more pronounced beta sheet between residues 135-143 compared to the structure of EspP. PMID:24530907
Molecular basis of hypohidrotic ectodermal dysplasia: an update.
Trzeciak, Wieslaw H; Koczorowski, Ryszard
2016-02-01
Recent advances in understanding the molecular events underlying hypohidrotic ectodermal dysplasia (HED) caused by mutations of the genes encoding proteins of the tumor necrosis factor α (TNFα)-related signaling pathway have been presented. These proteins are involved in signal transduction from ectoderm to mesenchyme during development of the fetus and are indispensable for the differentiation of ectoderm-derived structures such as eccrine sweat glands, teeth, hair, skin, and/or nails. Novel data were reviewed and discussed on the structure and functions of the components of TNFα-related signaling pathway, the consequences of mutations of the genes encoding these proteins, and the prospect for further investigations, which might elucidate the origin of HED.
D'Onofrio, Giuseppe; Ghosh, Tapash Chandra
2005-01-17
Fluctuations and increments of both C(3) and G(3) levels along the human coding sequences were investigated comparing two sets of Xenopus/human orthologous genes. The first set of genes shows minor differences of the GC(3) levels, the second shows considerable increments of the GC(3) levels in the human genes. In both data sets, the fluctuations of C(3) and G(3) levels along the coding sequences correlated with the secondary structures of the encoded proteins. The human genes that underwent the compositional transition showed a different increment of the C(3) and G(3) levels within and among the structural units of the proteins. The relative synonymous codon usage (RSCU) of several amino acids were also affected during the compositional transition, showing that there exists a correlation between RSCU and protein secondary structures in human genes. The importance of natural selection for the formation of isochore organization of the human genome has been discussed on the basis of these results.
Evolution and Structural Organization of the C Proteins of Paramyxovirinae
Karlin, David G.
2014-01-01
The phosphoprotein (P) gene of most Paramyxovirinae encodes several proteins in overlapping frames: P and V, which share a common N-terminus (PNT), and C, which overlaps PNT. Overlapping genes are of particular interest because they encode proteins originated de novo, some of which have unknown structural folds, challenging the notion that nature utilizes only a limited, well-mapped area of fold space. The C proteins cluster in three groups, comprising measles, Nipah, and Sendai virus. We predicted that all C proteins have a similar organization: a variable, disordered N-terminus and a conserved, α-helical C-terminus. We confirmed this predicted organization by biophysically characterizing recombinant C proteins from Tupaia paramyxovirus (measles group) and human parainfluenza virus 1 (Sendai group). We also found that the C of the measles and Nipah groups have statistically significant sequence similarity, indicating a common origin. Although the C of the Sendai group lack sequence similarity with them, we speculate that they also have a common origin, given their similar genomic location and structural organization. Since C is dispensable for viral replication, unlike PNT, we hypothesize that C may have originated de novo by overprinting PNT in the ancestor of Paramyxovirinae. Intriguingly, in measles virus and Nipah virus, PNT encodes STAT1-binding sites that overlap different regions of the C-terminus of C, indicating they have probably originated independently. This arrangement, in which the same genetic region encodes simultaneously a crucial functional motif (a STAT1-binding site) and a highly constrained region (the C-terminus of C), seems paradoxical, since it should severely reduce the ability of the virus to adapt. The fact that it originated twice suggests that it must be balanced by an evolutionary advantage, perhaps from reducing the size of the genetic region vulnerable to mutations. PMID:24587180
Subversion of cytokine networks by virally encoded decoy receptors
Epperson, Megan L.; Lee, Chung A.; Fremont, Daved H.
2012-01-01
Summary During the course of evolution, viruses have captured or created a diverse array of open reading frames that encode for proteins that serve to evade and sabotage the host innate and adaptive immune responses, which would otherwise lead to their elimination. These viral genomes are some of the best textbooks of immunology ever written. The established arsenal of immunomodulatory proteins encoded by viruses is large and growing and includes specificities for virtually all known inflammatory pathways and targets. The focus of this review is on herpes and poxvirus-encoded cytokine and chemokine binding proteins that serve to undermine the coordination of host immune surveillance. Structural and mechanistic studies of these decoy receptors have provided a wealth of information, not only about viral pathogenesis but also about the inner workings of cytokine signaling networks. PMID:23046131
Kunig, Verena; Potowski, Marco; Gohla, Anne; Brunschweiger, Andreas
2018-06-27
DNA-encoded compound libraries are a highly attractive technology for the discovery of small molecule protein ligands. These compound collections consist of small molecules covalently connected to individual DNA sequences carrying readable information about the compound structure. DNA-tagging allows for efficient synthesis, handling and interrogation of vast numbers of chemically synthesized, drug-like compounds. They are screened on proteins by an efficient, generic assay based on Darwinian principles of selection. To date, selection of DNA-encoded libraries allowed for the identification of numerous bioactive compounds. Some of these compounds uncovered hitherto unknown allosteric binding sites on target proteins; several compounds proved their value as chemical biology probes unraveling complex biology; and the first examples of clinical candidates that trace their ancestry to a DNA-encoded library were reported. Thus, DNA-encoded libraries proved their value for the biomedical sciences as a generic technology for the identification of bioactive drug-like molecules numerous times. However, large scale experiments showed that even the selection of billions of compounds failed to deliver bioactive compounds for the majority of proteins in an unbiased panel of target proteins. This raises the question of compound library design.
Single-Molecule Encoders for Tracking Motor Proteins on DNA
NASA Astrophysics Data System (ADS)
Lipman, Everett A.
2012-02-01
Devices such as inkjet printers and disk drives track position and velocity using optical encoders, which produce periodic signals precisely synchronized with linear or rotational motion. We have implemented this technique at the nanometer scale by labeling DNA with regularly spaced fluorescent dyes. The resulting molecular encoders can be used in several ways for high-resolution continuous tracking of individual motor proteins. These measurements do not require mechanical coupling to macroscopic instrumentation, are automatically calibrated by the underlying structure of DNA, and depend on signal periodicity rather than absolute level. I will describe the synthesis of single-molecule encoders, data from and modeling of experiments on a helicase and a DNA polymerase, and some ideas for future work.
Niskanen, Einari A; Hytönen, Vesa P; Grapputo, Alessandro; Nordlund, Henri R; Kulomaa, Markku S; Laitinen, Olli H
2005-01-01
Background A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins. PMID:15777476
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jackson, P.J.; Walthers, E.A.; Richmond, K.L.
1997-04-01
PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less
Crystal structures of OrfX2 and P47 from a Botulinum neurotoxin OrfX-type gene cluster.
Gustafsson, Robert; Berntsson, Ronnie P-A; Martínez-Carranza, Markel; El Tekle, Geniver; Odegrip, Richard; Johnson, Eric A; Stenmark, Pål
2017-11-01
Botulinum neurotoxins are highly toxic substances and are all encoded together with one of two alternative gene clusters, the HA or the OrfX gene cluster. Very little is known about the function and structure of the proteins encoded in the OrfX gene cluster, which in addition to the toxin contains five proteins (OrfX1, OrfX2, OrfX3, P47, and NTNH). We here present the structures of OrfX2 and P47, solved to 2.1 and 1.8 Å, respectively. We show that they belong to the TULIP protein superfamily, which are often involved in lipid binding. OrfX1 and OrfX2 were both found to bind phosphatidylinositol lipids. © 2017 Federation of European Biochemical Societies.
Structure of adenovirus bound to cellular receptor car
Freimuth, Paul I.
2007-01-02
Disclosed is a mutant CAR-DI-binding adenovirus which has a genome comprising one or more mutations in sequences which encode the fiber protein knob domain wherein the mutation causes the encoded viral particle to have a significantly weakened binding affinity for CAR-DI relative to wild-type adenovirus. Such mutations may be in sequences which encode either the AB loop, or the HI loop of the fiber protein knob domain. Specific residues and mutations are described. Also disclosed is a method for generating a mutant adenovirus which is characterized by a receptor binding affinity or specificity which differs substantially from wild type.
Plant, Ewan P.; Rakauskaitė, Rasa; Taylor, Deborah R.; Dinman, Jonathan D.
2010-01-01
In retroviruses and the double-stranded RNA totiviruses, the efficiency of programmed −1 ribosomal frameshifting is critical for ensuring the proper ratios of upstream-encoded capsid proteins to downstream-encoded replicase enzymes. The genomic organizations of many other frameshifting viruses, including the coronaviruses, are very different, in that their upstream open reading frames encode nonstructural proteins, the frameshift-dependent downstream open reading frames encode enzymes involved in transcription and replication, and their structural proteins are encoded by subgenomic mRNAs. The biological significance of frameshifting efficiency and how the relative ratios of proteins encoded by the upstream and downstream open reading frames affect virus propagation has not been explored before. Here, three different strategies were employed to test the hypothesis that the −1 PRF signals of coronaviruses have evolved to produce the correct ratios of upstream- to downstream-encoded proteins. Specifically, infectious clones of the severe acute respiratory syndrome (SARS)-associated coronavirus harboring mutations that lower frameshift efficiency decreased infectivity by >4 orders of magnitude. Second, a series of frameshift-promoting mRNA pseudoknot mutants was employed to demonstrate that the frameshift signals of the SARS-associated coronavirus and mouse hepatitis virus have evolved to promote optimal frameshift efficiencies. Finally, we show that a previously described frameshift attenuator element does not actually affect frameshifting per se but rather serves to limit the fraction of ribosomes available for frameshifting. The findings of these analyses all support a “golden mean” model in which viruses use both programmed ribosomal frameshifting and translational attenuation to control the relative ratios of their encoded proteins. PMID:20164235
Functional conservation and structural diversification of silk sericins in two moth species.
Zurovec, Michal; Kludkiewicz, Barbara; Fedic, Robert; Sulitkova, Jitka; Mach, Vaclav; Kucerova, Lucie; Sehnal, Frantisek
2013-06-10
Sericins are hydrophilic structural proteins produced by caterpillars in the middle section of silk glands and layered over fibroin proteins secreted in the posterior section. In the process of spinning, fibroins form strong solid filaments, while sericins seal the pair of filaments into a single fiber and glue the fiber into a cocoon. Galleria mellonella and the previously examined Bombyx mori harbor three sericin genes that encode proteins containing long repetitive regions. Galleria sericin genes are similar to each other and the protein repeats are built from short and extremely serine-rich motifs, while Bombyx sericin genes are diversified and encode proteins with long and complex repeats. Developmental changes in sericin properties are controlled at the level of gene expression and splicing. In Galleria , MG-1 sericin is produced throughout larval life until the wandering stage, while the production of MG-2 and MG-3 reaches a peak during cocoon spinning.
A new yeast gene with a myosin-like heptad repeat structure.
Kölling, R; Nguyen, T; Chen, E Y; Botstein, D
1993-03-01
We isolated a gene encoding a 218 kDa myosin-like protein from Saccharomyces cerevisiae using a monoclonal antibody directed against human platelet myosin as a probe. The protein sequence encoded by the MLP1 gene (for myosin-like protein) contains extensive stretches of a heptad-repeat pattern suggesting that the protein can form coiled coils typical of myosins. Immunolocalization experiments using affinity-purified antibodies raised against a TrpE-MLP1 fusion protein showed a dot-like structure adjacent to the nucleus in yeast cells bearing the MLP1 gene on a multicopy plasmid. In mouse epithelial cells the yeast anti-MLP1 antibodies stained the nucleus. Mutants bearing disruptions of the MLP1 gene were viable, but more sensitive to ultraviolet light than wild-type strains, suggesting an involvement of MLP1 in DNA repair. The MLP1 gene was mapped to chromosome 11, 25 cM from met1.
Yeates, Todd O.; Padilla, Jennifer; Colovos, Chris
2004-06-29
Novel fusion proteins capable of self-assembling into regular structures, as well as nucleic acids encoding the same, are provided. The subject fusion proteins comprise at least two oligomerization domains rigidly linked together, e.g. through an alpha helical linking group. Also provided are regular structures comprising a plurality of self-assembled fusion proteins of the subject invention, and methods for producing the same. The subject fusion proteins find use in the preparation of a variety of nanostructures, where such structures include: cages, shells, double-layer rings, two-dimensional layers, three-dimensional crystals, filaments, and tubes.
Structure of adenovirus bound to cellular receptor car
Freimuth, Paul I.
2004-05-18
Disclosed is a mutant adenovirus which has a genome comprising one or more mutations in sequences which encode the fiber protein knob domain wherein the mutation causes the encoded viral particle to have significantly weakened binding affinity for CARD1 relative to wild-type adenovirus. Such mutations may be in sequences which encode either the AB loop, or the HI loop of the fiber protein knob domain. Specific residues and mutations are described. Also disclosed is a method for generating a mutant adenovirus which is characterized by a receptor binding affinity or specificity which differs substantially from wild type. In the method, residues of the adenovirus fiber protein knob domain which are predicted to alter D1 binding when mutated, are identified from the crystal structure coordinates of the AD12knob:CAR-D1 complex. A mutation which alters one or more of the identified residues is introduced into the genome of the adenovirus to generate a mutant adenovirus. Whether or not the mutant produced exhibits altered adenovirus-CAR binding properties is then determined.
Walker, J; Tait, A
1997-11-01
A reverse-transcriptase polymerase chain reaction (PCR) procedure was used to isolate an Ostertagia circumcincta partial cDNA encoding a protein with general primary sequence features characteristic of members of the mitochondrial processing peptidase (MPP) subfamily of M16 metallopeptidases. The structural relationships of the predicted protein (Oc MPPX) with MPP subfamily proteins from other species (including the model free-living nematode Caenorhabditis elegans) were examined, and Northern analysis confirmed the expression of the Oc mppx gene in adult nematodes.
Gurung, Ratna B.; Purdie, Auriol C.; Begg, Douglas J.
2012-01-01
Johne's disease in ruminants is caused by Mycobacterium avium subsp. paratuberculosis. Diagnosis of M. avium subsp. paratuberculosis infection is difficult, especially in the early stages. To date, ideal antigen candidates are not available for efficient immunization or immunodiagnosis. This study reports the in silico selection and subsequent analysis of epitopes of M. avium subsp. paratuberculosis proteins that were found to be upregulated under stress conditions as a means to identify immunogenic candidate proteins. Previous studies have reported differential regulation of proteins when M. avium subsp. paratuberculosis is exposed to stressors which induce a response similar to dormancy. Dormancy may be involved in evading host defense mechanisms, and the host may also mount an immune response against these proteins. Twenty-five M. avium subsp. paratuberculosis proteins that were previously identified as being upregulated under in vitro stress conditions were analyzed for B and T cell epitopes by use of the prediction tools at the Immune Epitope Database and Analysis Resource. Major histocompatibility complex class I T cell epitopes were predicted using an artificial neural network method, and class II T cell epitopes were predicted using the consensus method. Conformational B cell epitopes were predicted from the relevant three-dimensional structure template for each protein. Based on the greatest number of predicted epitopes, eight proteins (MAP2698c [encoded by desA2], MAP2312c [encoded by fadE19], MAP3651c [encoded by fadE3_2], MAP2872c [encoded by fabG5_2], MAP3523c [encoded by oxcA], MAP0187c [encoded by sodA], and the hypothetical proteins MAP3567 and MAP1168c) were identified as potential candidates for study of antibody- and cell-mediated immune responses within infected hosts. PMID:22496492
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa
2015-01-01
Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
A novel extracellular matrix protein from tomato associated with lignified secondary cell walls.
Domingo, C; Gómez, M D; Cañas, L; Hernández-Yago, J; Conejero, V; Vera, P
1994-01-01
A cDNA clone representing a novel cell wall protein was isolated from a tomato cDNA library. The deduced amino acid sequence shows that the encoded protein is very small (88 amino acids), contains an N-terminal hydrophobic signal peptide, and is enriched in lysine and tyrosine. We have designated this protein TLRP for tyrosine- and lysine-rich protein. RNA gel blot hybridization identified TLRP transcripts constitutively present in roots, stems, and leaves from tomato plants. The encoded protein seems to be highly insolubilized in the cell wall, and we present evidence that this protein is specifically localized in the modified secondary cell walls of the xylem and in cells of the sclerenchyma. In addition, the protein is localized in the protective periderm layer of the growing root. The highly localized deposition in cells destined to give support and protection to the plant indicates that this cell wall protein alone and/or in collaboration with other cell wall structural proteins may have a specialized structural function by mechanically strengthening the walls. PMID:7919979
Elrobh, Mohamed S.; Alanazi, Mohammad S.; Khan, Wajahatullah; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Bazzi, Mohammad D.
2011-01-01
Heat shock proteins are ubiquitous, induced under a number of environmental and metabolic stresses, with highly conserved DNA sequences among mammalian species. Camelus dromedaries (the Arabian camel) domesticated under semi-desert environments, is well adapted to tolerate and survive against severe drought and high temperatures for extended periods. This is the first report of molecular cloning and characterization of full length cDNA of encoding a putative stress-induced heat shock HSPA6 protein (also called HSP70B′) from Arabian camel. A full-length cDNA (2417 bp) was obtained by rapid amplification of cDNA ends (RACE) and cloned in pET-b expression vector. The sequence analysis of HSPA6 gene showed 1932 bp-long open reading frame encoding 643 amino acids. The complete cDNA sequence of the Arabian camel HSPA6 gene was submitted to NCBI GeneBank (accession number HQ214118.1). The BLAST analysis indicated that C. dromedaries HSPA6 gene nucleotides shared high similarity (77–91%) with heat shock gene nucleotide of other mammals. The deduced 643 amino acid sequences (accession number ADO12067.1) showed that the predicted protein has an estimated molecular weight of 70.5 kDa with a predicted isoelectric point (pI) of 6.0. The comparative analyses of camel HSPA6 protein sequences with other mammalian heat shock proteins (HSPs) showed high identity (80–94%). Predicted camel HSPA6 protein structure using Protein 3D structural analysis high similarities with human and mouse HSPs. Taken together, this study indicates that the cDNA sequences of HSPA6 gene and its amino acid and protein structure from the Arabian camel are highly conserved and have similarities with other mammalian species. PMID:21845074
NASA Astrophysics Data System (ADS)
Larios, Edgar; Yang, Wei Y.; Schulten, K.; Gruebele, M.
2004-12-01
Computing the root-mean-square deviation (RMSD) of a partially folded protein structure from the folded state requires the two structures to be translationally and rotationally aligned. We examine the constraint matrix L that preserves orthogonality of the rotation matrix during minimization of the RMSD. L is proportional to the sensitivity of the RMSD to the rotational alignment matrix. Its trace yields an isotropic reaction coordinate, while its off-diagonal matrix elements are related to the moment of inertia derivative tensor that encodes anisotropic information about the structure. We use L to compare λ-repressor fragment 6-85 (λ 6-85) to several partially folded structures obtained from molecular dynamics simulation (MD), and find that L as a reaction coordinate indeed encodes some information about protein topology. We also apply C α RMSD, L and tryptophan sidechain mobility as criteria for native state structural fluctuations of several λ 6-85 mutants. The mutants' denaturation curves and fluorescence quenching are measured experimentally for comparison. The results are in accord with a recent proposal that structural fluctuations near the chromophore can induce increased native state fluorescence or hyperfluorescence during unfolding of proteins.
Kinetics and Mechanism of Mammalian Mitochondrial Ribosome Assembly.
Bogenhagen, Daniel F; Ostermeyer-Fay, Anne G; Haley, John D; Garcia-Diaz, Miguel
2018-02-13
Mammalian mtDNA encodes only 13 proteins, all essential components of respiratory complexes, synthesized by mitochondrial ribosomes. Mitoribosomes contain greatly truncated RNAs transcribed from mtDNA, including a structural tRNA in place of 5S RNA as a scaffold for binding 82 nucleus-encoded proteins, mitoribosomal proteins (MRPs). Cryoelectron microscopy (cryo-EM) studies have determined the structure of the mitoribosome, but its mechanism of assembly is unknown. Our SILAC pulse-labeling experiments determine the rates of mitochondrial import of MRPs and their assembly into intact mitoribosomes, providing a basis for distinguishing MRPs that bind at early and late stages in mitoribosome assembly to generate a working model for mitoribosome assembly. Mitoribosome assembly is a slow process initiated at the mtDNA nucleoid driven by excess synthesis of individual MRPs. MRPs that are tightly associated in the structure frequently join the complex in a coordinated manner. Clinically significant MRP mutations reported to date affect proteins that bind early on during assembly. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Zivanovic, Yvan; Confalonieri, Fabrice; Ponchon, Luc; Lurz, Rudi; Chami, Mohamed; Flayhan, Ali; Renouard, Madalena; Huet, Alexis; Decottignies, Paulette; Davidson, Alan R.; Breyton, Cécile
2014-01-01
Bacteriophage T5 represents a large family of lytic Siphoviridae infecting Gram-negative bacteria. The low-resolution structure of T5 showed the T=13 geometry of the capsid and the unusual trimeric organization of the tail tube, and the assembly pathway of the capsid was established. Although major structural proteins of T5 have been identified in these studies, most of the genes encoding the morphogenesis proteins remained to be identified. Here, we combine a proteomic analysis of T5 particles with a bioinformatic study and electron microscopic immunolocalization to assign function to the genes encoding the structural proteins, the packaging proteins, and other nonstructural components required for T5 assembly. A head maturation protease that likely accounts for the cleavage of the different capsid proteins is identified. Two other proteins involved in capsid maturation add originality to the T5 capsid assembly mechanism: the single head-to-tail joining protein, which closes the T5 capsid after DNA packaging, and the nicking endonuclease responsible for the single-strand interruptions in the T5 genome. We localize most of the tail proteins that were hitherto uncharacterized and provide a detailed description of the tail tip composition. Our findings highlight novel variations of viral assembly strategies and of virion particle architecture. They further recommend T5 for exploring phage structure and assembly and for deciphering conformational rearrangements that accompany DNA transfer from the capsid to the host cytoplasm. PMID:24198424
Antonczak, Alicja K; Milholland, Kedric; Tippmann, Eric M
2018-05-01
The target protein, Hcp1, was first described as part of the bacterial Type VI secretion system from Pseudomonas aeruginosa. The protein first self-assembles into a hexamer and then the hexamers further stack into a nanotubular structure. Hcp1 monomers were targeted for mutagenesis with two widely used photoactivatable amino acids: para-benzoyl phenylalanine or para-azidophenylalanine. The ability of these amino acids to form covalent adducts within the Hcp1 self-assembled system was investigated. Multiple residues, putatively of equal distance between the monomer-monomer interface were targeted. The efficiency of each amino acid to covalently link self-assembled hexamers was determined. The results demonstrate the choice and role of genetically encoded tools applied to complicated biological processes such as self-assembly and also suggested some structural dynamics of the Hcp-1 protein not obvious from crystallographic structures.
USDA-ARS?s Scientific Manuscript database
Dirigent proteins regulate coupling of monolignol plant phenols to generate the structural cell wall polymers lignins and lignans that are involved in structural fortification of cell wall and defense against pathogens and pests. Microarray expression profiling of resistant wheat (Triticum aestivum)...
Steininger, Christoph; Widhopf, George F.; Ghia, Emanuela M.; Morello, Christopher S.; Vanura, Katrina; Sanders, Rebecca; Spector, Deborah; Guiney, Don; Jäger, Ulrich
2012-01-01
Leukemia cells from patients with chronic lymphocytic leukemia (CLL) express a highly restricted immunoglobulin heavy variable chain (IGHV) repertoire, suggesting that a limited set of antigens reacts with leukemic cells. Here, we evaluated the reactivity of a panel of different CLL recombinant antibodies (rAbs) encoded by the most commonly expressed IGHV genes with a panel of selected viral and bacterial pathogens. Six different CLL rAbs encoded by IGHV1-69 or IGHV3-21, but not a CLL rAb encoded by IGHV4-39 genes, reacted with a single protein of human cytomegalovirus (CMV). The CMV protein was identified as the large structural phosphoprotein pUL32. In contrast, none of the CLL rAbs bound to any other structure of CMV, adenovirus serotype 2, Salmonella enterica serovar Typhimurium, or of cells used for propagation of these microorganisms. Monoclonal antibodies or humanized rAbs of irrelevant specificity to pUL32 did not react with any of the proteins present in the different lysates. Still, rAbs encoded by a germ line IGHV1-69 51p1 allele from CMV-seropositive and -negative adults also reacted with pUL32. The observed reactivity of multiple different CLL rAbs and natural antibodies from CMV-seronegative adults with pUL32 is consistent with the properties of a superantigen. PMID:22234695
de Bruin, Olle M.; Duplantis, Barry N.; Ludu, Jagjit S.; Hare, Rebekah F.; Nix, Eli B.; Schmerk, Crystal L.; Robb, Craig S.; Boraston, Alisdair B.; Hueffer, Karsten
2011-01-01
The Francisella pathogenicity island (FPI) encodes proteins thought to compose a type VI secretion system (T6SS) that is required for the intracellular growth of Francisella novicida. In this work we used deletion mutagenesis and genetic complementation to determine that the intracellular growth of F. novicida was dependent on 14 of the 18 genes in the FPI. The products of the iglABCD operon were localized by the biochemical fractionation of F. novicida, and Francisella tularensis LVS. Sucrose gradient separation of water-insoluble material showed that the FPI-encoded proteins IglA, IglB and IglC were found in multiple fractions, especially in a fraction that did not correspond to a known membrane fraction. We interpreted these data to suggest that IglA, IglB and IglC are part of a macromolecular structure. Analysis of published structural data suggested that IglC is an analogue of Hcp, which is thought to form long nano-tubes. Thus the fractionation properties of IglA, IglB and IglC are consistent with the current model of the T6SS apparatus, which supposes that IglA and IglB homologues form an outer tube structure that surrounds an inner tube composed of Hcp (IglC) subunits. Fractionation of F. novicida expressing FLAG-tagged DotU (IcmH homologue) and PdpB (IcmF homologue) showed that these proteins localize to the inner membrane. Deletion of dotU led to the cleavage of PdpB, suggesting an interaction of these two proteins that is consistent with results obtained with other T6SSs. Our results may provide a mechanistic basis for many of the studies that have examined the virulence properties of Francisella mutants in FPI genes, namely that the observed phenotypes of the mutants are the result of the disruption of the FPI-encoded T6SS structure. PMID:21980115
Mariadassou, Mahendra; Bardowski, Jacek K.; Bidnenko, Elena
2011-01-01
Background The single-stranded-nucleic acid binding (SSB) protein superfamily includes proteins encoded by different organisms from Bacteria and their phages to Eukaryotes. SSB proteins share common structural characteristics and have been suggested to descend from an ancestor polypeptide. However, as other proteins involved in DNA replication, bacterial SSB proteins are clearly different from those found in Archaea and Eukaryotes. It was proposed that the corresponding genes in the phage genomes were transferred from the bacterial hosts. Recently new SSB proteins encoded by the virulent lactococcal bacteriophages (Orf14bIL67-like proteins) have been identified and characterized structurally and biochemically. Methodology/Principal Findings This study focused on the determination of phylogenetic relationships between Orf14bIL67-like proteins and other SSBs. We have performed a large scale phylogenetic analysis and pairwise sequence comparisons of SSB proteins from different phyla. The results show that, in remarkable contrast to other phage SSBs, the Orf14bIL67–like proteins form a distinct, self-contained and well supported phylogenetic group connected to the archaeal SSBs. Functional studies demonstrated that, despite the structural and amino acid sequence differences from bacterial SSBs, Orf14bIL67 protein complements the conditional lethal ssb-1 mutation of Escherichia coli. Conclusions/Significance Here we identified for the first time a group of phages encoded SSBs which are clearly distinct from their bacterial counterparts. All methods supported the recognition of these phage proteins as a new family within the SSB superfamily. Our findings suggest that unlike other phages, the virulent lactococcal phages carry ssb genes that were not acquired from their hosts, but transferred from an archaeal genome. This represents a unique example of a horizontal gene transfer between Archaea and bacterial phages. PMID:22073223
A bacterial Argonaute with noncanonical guide RNA specificity
Kaya, Emine; Doxzen, Kevin W.; Knoll, Kilian R.; Wilson, Ross C.; Strutt, Steven C.; Kranzusch, Philip J.; Doudna, Jennifer A.
2016-01-01
Eukaryotic Argonaute proteins induce gene silencing by small RNA-guided recognition and cleavage of mRNA targets. Although structural similarities between human and prokaryotic Argonautes are consistent with shared mechanistic properties, sequence and structure-based alignments suggested that Argonautes encoded within CRISPR-cas [clustered regularly interspaced short palindromic repeats (CRISPR)-associated] bacterial immunity operons have divergent activities. We show here that the CRISPR-associated Marinitoga piezophila Argonaute (MpAgo) protein cleaves single-stranded target sequences using 5′-hydroxylated guide RNAs rather than the 5′-phosphorylated guides used by all known Argonautes. The 2.0-Å resolution crystal structure of an MpAgo–RNA complex reveals a guide strand binding site comprising residues that block 5′ phosphate interactions. Using structure-based sequence alignment, we were able to identify other putative MpAgo-like proteins, all of which are encoded within CRISPR-cas loci. Taken together, our data suggest the evolution of an Argonaute subclass with noncanonical specificity for a 5′-hydroxylated guide. PMID:27035975
NASA Astrophysics Data System (ADS)
Li, Yizhou; De Luca, Roberto; Cazzamalli, Samuele; Pretto, Francesca; Bajic, Davor; Scheuermann, Jörg; Neri, Dario
2018-03-01
In nature, specific antibodies can be generated as a result of an adaptive selection and expansion of lymphocytes with suitable protein binding properties. We attempted to mimic antibody-antigen recognition by displaying multiple chemical diversity elements on a defined macrocyclic scaffold. Encoding of the displayed combinations was achieved using distinctive DNA tags, resulting in a library size of 35,393,112. Specific binders could be isolated against a variety of proteins, including carbonic anhydrase IX, horseradish peroxidase, tankyrase 1, human serum albumin, alpha-1 acid glycoprotein, calmodulin, prostate-specific antigen and tumour necrosis factor. Similar to antibodies, the encoded display of multiple chemical elements on a constant scaffold enabled practical applications, such as fluorescence microscopy procedures or the selective in vivo delivery of payloads to tumours. Furthermore, the versatile structure of the scaffold facilitated the generation of protein-specific chemical probes, as illustrated by photo-crosslinking.
Cloning and bioinformatic analysis of lovastatin biosynthesis regulatory gene lovE.
Huang, Xin; Li, Hao-ming
2009-08-05
Lovastatin is an effective drug for treatment of hyperlipidemia. This study aimed to clone lovastatin biosynthesis regulatory gene lovE and analyze the structure and function of its encoding protein. According to the lovastatin synthase gene sequence from genebank, primers were designed to amplify and clone the lovastatin biosynthesis regulatory gene lovE from Aspergillus terrus genomic DNA. Bioinformatic analysis of lovE and its encoding animo acid sequence was performed through internet resources and software like DNAMAN. Target fragment lovE, almost 1500 bp in length, was amplified from Aspergillus terrus genomic DNA and the secondary and three-dimensional structures of LovE protein were predicted. In the lovastatin biosynthesis process lovE is a regulatory gene and LovE protein is a GAL4-like transcriptional factor.
Structure, Function, Interaction, Co-evolution of Rice Blast Resistance Genes
USDA-ARS?s Scientific Manuscript database
Rice blast disease caused by the fungal pathogen Magnaporthe oryzae is one of the most destructive rice diseases worldwide. Resistance (R) genes to blast encode proteins that detect pathogen signaling molecules encoded by M. oryzae avirulence (AVR) genes. R genes can be a single or a member of clu...
Transcriptional analysis of Penaeus stylirostris densovirus genes
USDA-ARS?s Scientific Manuscript database
Penaeus stylirostris densovirus (PstDNV) genome contains three open reading frames (ORFs), left, middle, and right, which encode a non-structural (NS) protein, an unknown protein, and a capsid protein (CP), respectively. Transcription mapping revealed that P2, P11 and P61 promoters transcribe the le...
Favalli, Nicholas; Biendl, Stefan; Hartmann, Marco; Piazzi, Jacopo; Sladojevich, Filippo; Gräslund, Susanne; Brown, Peter J; Näreoja, Katja; Schüler, Herwig; Scheuermann, Jörg; Franzini, Raphael; Neri, Dario
2018-06-01
A DNA-encoded chemical library (DECL) with 1.2 million compounds was synthesized by combinatorial reaction of seven central scaffolds with two sets of 343×492 building blocks. Library screening by affinity capture revealed that for some target proteins, the chemical nature of building blocks dominated the selection results, whereas for other proteins, the central scaffold also crucially contributed to ligand affinity. Molecules based on a 3,5-bis(aminomethyl)benzoic acid core structure were found to bind human serum albumin with a K d value of 6 nm, while compounds with the same substituents on an equidistant but flexible l-lysine scaffold showed 140-fold lower affinity. A 18 nm tankyrase-1 binder featured l-lysine as linking moiety, while molecules based on d-Lysine or (2S,4S)-amino-l-proline showed no detectable binding to the target. This work suggests that central scaffolds which predispose the orientation of chemical building blocks toward the protein target may enhance the screening productivity of encoded libraries. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genomic analysis of Staphylococcus phage Stau2 isolated from medical specimen.
Hsieh, Sue-Er; Tseng, Yi-Hsiung; Lo, Hsueh-Hsia; Chen, Shui-Tu; Wu, Cheng-Nan
2016-02-01
Stau2 is a lytic myophage of Staphylococcus aureus isolated from medical specimen. Exhibiting a broad host range against S. aureus clinical isolates, Stau2 is potentially useful for topical phage therapy or as an additive in food preservation. In this study, Stau2 was firstly revealed to possess a circularly permuted linear genome of 133,798 bp, with low G + C content, containing 146 open reading frames, but encoding no tRNA. The genome is organized into several modules containing genes for packaging, structural proteins, replication/transcription and host-cell-lysis, with the structural proteins and DNA polymerase modules being organized similarly to that in Twort-like phages of Staphylococcus. With the encoded DNA replication genes, Stau2 can possibly use its own system for replication. In addition, analysis in silico found several introns in seven genes, including those involved in DNA metabolism, packaging, and structure, while one of them (helicase gene) is experimentally confirmed to undergo splicing. Furthermore, phylogenetic analysis suggested Stau2 to be most closely related to Staphylococcus phages SA11 and Remus, members of Twort-like phages. The results of sodium dodecyl sulfate polyacrylamide gel electrophoresis showed 14 structural proteins of Stau2 and N-terminal sequencing identified three of them. Importantly, this phage does not encode any proteins which are known or suspected to be involved in toxicity, pathogenicity, or antibiotic resistance. Therefore, further investigations of feasible therapeutic application of Stau2 are needed.
Ubiquitin--conserved protein or selfish gene?
Catic, André; Ploegh, Hidde L
2005-11-01
The posttranslational modifier ubiquitin is encoded by a multigene family containing three primary members, which yield the precursor protein polyubiquitin and two ubiquitin moieties, Ub(L40) and Ub(S27), that are fused to the ribosomal proteins L40 and S27, respectively. The gene encoding polyubiquitin is highly conserved and, until now, those encoding Ub(L40) and Ub(S27) have been generally considered to be equally invariant. The evolution of the ribosomal ubiquitin moieties is, however, proving to be more dynamic. It seems that the genes encoding Ub(L40) and Ub(S27) are actively maintained by homologous recombination with the invariant polyubiquitin locus. Failure to recombine leads to deterioration of the sequence of the ribosomal ubiquitin moieties in several phyla, although this deterioration is evidently constrained by the structural requirements of the ubiquitin fold. Only a few amino acids in ubiquitin are vital for its function, and we propose that conservation of all three ubiquitin genes is driven not only by functional properties of the ubiquitin protein, but also by the propensity of the polyubiquitin locus to act as a 'selfish gene'.
2010-01-01
Background Trichomonas vaginalis is the most common non-viral human sexually transmitted pathogen and importantly, contributes to facilitating the spread of HIV. Yet very little is known about its surface and secreted proteins mediating interactions with, and permitting the invasion and colonisation of, the host mucosa. Initial annotations of T. vaginalis genome identified a plethora of candidate extracellular proteins. Results Data mining of the T. vaginalis genome identified 911 BspA-like entries (TvBspA) sharing TpLRR-like leucine-rich repeats, which represent the largest gene family encoding potential extracellular proteins for the pathogen. A broad range of microorganisms encoding BspA-like proteins was identified and these are mainly known to live on mucosal surfaces, among these T. vaginalis is endowed with the largest gene family. Over 190 TvBspA proteins with inferred transmembrane domains were characterised by a considerable structural diversity between their TpLRR and other types of repetitive sequences and two subfamilies possessed distinct classic sorting signal motifs for endocytosis. One TvBspA subfamily also shared a glycine-rich protein domain with proteins from Clostridium difficile pathogenic strains and C. difficile phages. Consistent with the hypothesis that TvBspA protein structural diversity implies diverse roles, we demonstrated for several TvBspA genes differential expression at the transcript level in different growth conditions. Identified variants of repetitive segments between several TvBspA paralogues and orthologues from two clinical isolates were also consistent with TpLRR and other repetitive sequences to be functionally important. For one TvBspA protein cell surface expression and antibody responses by both female and male T. vaginalis infected patients were also demonstrated. Conclusions The biased mucosal habitat for microbial species encoding BspA-like proteins, the characterisation of a vast structural diversity for the TvBspA proteins, differential expression of a subset of TvBspA genes and the cellular localisation and immunological data for one TvBspA; all point to the importance of the TvBspA proteins to various aspects of T. vaginalis pathobiology at the host-pathogen interface. PMID:20144183
Structure of a putative acetyltransferase (PA1377) from Pseudomonas aeruginosa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Davies, Anna M.; Tata, Renée; Chauviac, François-Xavier
2008-05-01
The crystal structure of an acetyltransferase encoded by the gene PA1377 from Pseudomonas aeruginosa has been determined at 2.25 Å resolution. Comparison with a related acetyltransferase revealed a structural difference in the active site that was taken to reflect a difference in substrate binding and/or specificity between the two enzymes. Gene PA1377 from Pseudomonas aeruginosa encodes a 177-amino-acid conserved hypothetical protein of unknown function. The structure of this protein (termed pitax) has been solved in space group I222 to 2.25 Å resolution. Pitax belongs to the GCN5-related N-acetyltransferase family and contains all four sequence motifs conserved among family members. Themore » β-strand structure in one of these motifs (motif A) is disrupted, which is believed to affect binding of the substrate that accepts the acetyl group from acetyl-CoA.« less
Tian, Guilian; Zhou, Yun; Hajkova, Dagmar; Miyagi, Masaru; Dinculescu, Astra; Hauswirth, William W; Palczewski, Krzysztof; Geng, Ruishuang; Alagramam, Kumar N; Isosomppi, Juha; Sankila, Eeva-Marja; Flannery, John G; Imanishi, Yoshikazu
2009-07-10
Clarin-1 is the protein product encoded by the gene mutated in Usher syndrome III. Although the molecular function of clarin-1 is unknown, its primary structure predicts four transmembrane domains similar to a large family of membrane proteins that include tetraspanins. Here we investigated the role of clarin-1 by using heterologous expression and in vivo model systems. When expressed in HEK293 cells, clarin-1 localized to the plasma membrane and concentrated in low density compartments distinct from lipid rafts. Clarin-1 reorganized actin filament structures and induced lamellipodia. This actin-reorganizing function was absent in the modified protein encoded by the most prevalent North American Usher syndrome III mutation, the N48K form of clarin-1 deficient in N-linked glycosylation. Proteomics analyses revealed a number of clarin-1-interacting proteins involved in cell-cell adhesion, focal adhesions, cell migration, tight junctions, and regulation of the actin cytoskeleton. Consistent with the hypothesized role of clarin-1 in actin organization, F-actin-enriched stereocilia of auditory hair cells evidenced structural disorganization in Clrn1(-/-) mice. These observations suggest a possible role for clarin-1 in the regulation and homeostasis of actin filaments, and link clarin-1 to the interactive network of Usher syndrome gene products.
Johanson, Urban; Karlsson, Maria; Johansson, Ingela; Gustavsson, Sofia; Sjövall, Sara; Fraysse, Laure; Weig, Alfons R.; Kjellbom, Per
2001-01-01
Major intrinsic proteins (MIPs) facilitate the passive transport of small polar molecules across membranes. MIPs constitute a very old family of proteins and different forms have been found in all kinds of living organisms, including bacteria, fungi, animals, and plants. In the genomic sequence of Arabidopsis, we have identified 35 different MIP-encoding genes. Based on sequence similarity, these 35 proteins are divided into four different subfamilies: plasma membrane intrinsic proteins, tonoplast intrinsic proteins, NOD26-like intrinsic proteins also called NOD26-like MIPs, and the recently discovered small basic intrinsic proteins. In Arabidopsis, there are 13 plasma membrane intrinsic proteins, 10 tonoplast intrinsic proteins, nine NOD26-like intrinsic proteins, and three small basic intrinsic proteins. The gene structure in general is conserved within each subfamily, although there is a tendency to lose introns. Based on phylogenetic comparisons of maize (Zea mays) and Arabidopsis MIPs (AtMIPs), it is argued that the general intron patterns in the subfamilies were formed before the split of monocotyledons and dicotyledons. Although the gene structure is unique for each subfamily, there is a common pattern in how transmembrane helices are encoded on the exons in three of the subfamilies. The nomenclature for plant MIPs varies widely between different species but also between subfamilies in the same species. Based on the phylogeny of all AtMIPs, a new and more consistent nomenclature is proposed. The complete set of AtMIPs, together with the new nomenclature, will facilitate the isolation, classification, and labeling of plant MIPs from other species. PMID:11500536
Ankyrin-repeat containing proteins of microbes: a conserved structure with functional diversity
Al-Khodor, Souhaila; Price, Christopher T.; Kalia, Awdhesh; Kwaik, Yousef Abu
2009-01-01
Summary The ankyrin repeat (ANK) is the most common protein-protein interaction motif in nature and predominantly found in eukaryotic proteins. The genome sequencing of various pathogenic or symbiotic bacteria and eukaryotic viruses identified numerous genes encoding ANK-containing proteins that were proposed to have been acquired from eukaryotes by horizontal gene transfer. However, the recent discovery of additional ANK-containing proteins encoded in the genomes of archaea and free-living bacteria suggests either a more ancient origin of the ANK motif or multiple convergent evolution events. Many bacterial pathogens employ various types of secretion systems to deliver ANK-containing proteins into eukaryotic cells where they mimic or manipulate various host functions. Understanding the molecular and biochemical functions of this family of proteins will enhance our understanding of important host-microbe interactions. PMID:19962898
Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W
1997-04-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.
Wise, Carol A.; Chiang, Lydia C.; Paznekas, William A.; Sharma, Mridula; Musy, Maurice M.; Ashley, Jennifer A.; Lovett, Michael; Jabs, Ethylin W.
1997-01-01
Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development. PMID:9096354
Ramakrishnan, Gayatri; Ochoa-Montaño, Bernardo; Raghavender, Upadhyayula S; Mudgal, Richa; Joshi, Adwait G; Chandra, Nagasuma R; Sowdhamini, Ramanathan; Blundell, Tom L; Srinivasan, Narayanaswamy
2015-01-01
The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. Copyright © 2014 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mavromatis, K; Doyle, C Kuyler; Lykidis, A
2006-01-01
Ehrlichia canis, a small obligately intracellular, tick-transmitted, gram-negative, {alpha}-proteobacterium, is the primary etiologic agent of globally distributed canine monocytic ehrlichiosis. Complete genome sequencing revealed that the E. canis genome consists of a single circular chromosome of 1,315,030 bp predicted to encode 925 proteins, 40 stable RNA species, 17 putative pseudogenes, and a substantial proportion of noncoding sequence (27%). Interesting genome features include a large set of proteins with transmembrane helices and/or signal sequences and a unique serine-threonine bias associated with the potential for O glycosylation that was prominent in proteins associated with pathogen-host interactions. Furthermore, two paralogous protein families associatedmore » with immune evasion were identified, one of which contains poly(G-C) tracts, suggesting that they may play a role in phase variation and facilitation of persistent infections. Genes associated with pathogen-host interactions were identified, including a small group encoding proteins (n = 12) with tandem repeats and another group encoding proteins with eukaryote-like ankyrin domains (n = 7).« less
Yang, Yunhuang; Ramelot, Theresa A; Cort, John R; Garcia, Maite; Yee, Adelinda; Arrowsmith, Cheryl H; Kennedy, Michael A
2012-01-01
CV_2116 is a small hypothetical protein of 82 amino acids from the Gram-negative coccobacillus Chromobacterium violaceum. A PSI-BLAST search using the CV_2116 sequence as a query identified only one hit (E = 2e(-07)) corresponding to a hypothetical protein OR16_04617 from Cupriavidus basilensis OR16, which failed to provide insight into the function of CV_2116. The CV_2116 gene was cloned into the p15TvLic expression plasmid, transformed into E. coli, and (13)C- and (15)N-labeled NMR samples of CV_2116 were overexpressed in E. coli and purified for structure determination using NMR spectroscopy. The resulting high-quality solution NMR structure of CV_2116 revealed a novel α + β fold containing two anti-parallel β-sheets in the N-terminal two-thirds of the protein and one α-helix in the C-terminal third of the protein. CV_2116 does not belong to any known protein sequence family and a Dali search indicated that no similar structures exist in the protein data bank. Although no function of CV_2116 could be derived from either sequence or structural similarity searches, the neighboring genes of CV_2116 encode various proteins annotated as similar to bacteriophage tail assembly proteins. Interestingly, C. violaceum exhibits an extensive network of bacteriophage tail-like structures that likely result from lateral gene transfer by incorporation of viral DNA into its genome (prophages) due to bacteriophage infection. Indeed, C. violaceum has been shown to contain four prophage elements and CV_2116 resides in the fourth of these elements. Analysis of the putative operon in which CV_2116 resides indicates that CV_2116 might be a component of the bacteriophage tail-like assembly that occurs in C. violaceum.
Li, Yanan; Zeng, Xiaobo; Zhou, Xuejuan; Li, Youguo
2016-12-04
Lipid transfer protein superfamily is involved in lipid transport and metabolism. This study aimed to construct mutants of three lipid transfer protein encoding genes in Mesorhizobium huakuii 7653R, and to study the phenotypes and function of mutations during symbiosis with Astragalus sinicus. We used bioinformatics to predict structure characteristics and biological functions of lipid transfer proteins, and conducted semi-quantitative and fluorescent quantitative real-time PCR to analyze the expression levels of target genes in free-living and symbiotic conditions. Using pK19mob insertion mutagenesis to construct mutants, we carried out pot plant experiments to observe symbiotic phenotypes. MCHK-5577, MCHK-2172 and MCHK-2779 genes encoding proteins belonged to START/RHO alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) superfamily, involved in lipid transport or metabolism, and were identical to M. loti at 95% level. Gene relative transcription level of the three genes all increased compared to free-living condition. We obtained three mutants. Compared with wild-type 7653R, above-ground biomass of plants and nodulenitrogenase activity induced by the three mutants significantly decreased. Results indicated that lipid transfer protein encoding genes of Mesorhizobium huakuii 7653R may play important roles in symbiotic nitrogen fixation, and the mutations significantly affected the symbiotic phenotypes. The present work provided a basis to study further symbiotic function mechanism associated with lipid transfer proteins from rhizobia.
Serfiotis-Mitsa, Dimitra; Herbert, Andrew P.; Roberts, Gareth A.; Soares, Dinesh C.; White, John H.; Blakely, Garry W.; Uhrín, Dušan; Dryden, David T. F.
2010-01-01
Plasmids, conjugative transposons and phage frequently encode anti-restriction proteins to enhance their chances of entering a new bacterial host that is highly likely to contain a Type I DNA restriction and modification (RM) system. The RM system usually destroys the invading DNA. Some of the anti-restriction proteins are DNA mimics and bind to the RM enzyme to prevent it binding to DNA. In this article, we characterize ArdB anti-restriction proteins and their close homologues, the KlcA proteins from a range of mobile genetic elements; including an ArdB encoded on a pathogenicity island from uropathogenic Escherichia coli and a KlcA from an IncP-1b plasmid, pBP136 isolated from Bordetella pertussis. We show that all the ArdB and KlcA act as anti-restriction proteins and inhibit the four main families of Type I RM systems in vivo, but fail to block the restriction endonuclease activity of the archetypal Type I RM enzyme, EcoKI, in vitro indicating that the action of ArdB is indirect and very different from that of the DNA mimics. We also present the structure determined by NMR spectroscopy of the pBP136 KlcA protein. The structure shows a novel protein fold and it is clearly not a DNA structural mimic. PMID:20007596
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adams, Melanie A.; Udell, Christian M.; Pal, Gour Pada
The crystallization and preliminary X-ray diffraction analysis of MraZ, formerly known as hypothetical protein YabB, from Escherichia coli K-12 is presented. The MraZ family of proteins, also referred to as the UPF0040 family, are highly conserved in bacteria and are thought to play a role in cell-wall biosynthesis and cell division. The murein region A (mra) gene cluster encodes MraZ proteins along with a number of other proteins involved in this complex process. To date, there has been no clear functional assignment provided for MraZ proteins and the structure of a homologue from Mycoplasma pneumoniae, MPN314, failed to suggest amore » molecular function. The b0081 gene from Escherichia coli that encodes the MraZ protein was cloned and the protein was overexpressed, purified and crystallized. This data is presented along with evidence that the E. coli homologue exists in a different oligomeric state to the MPN314 protein.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Marks, Shawn M.; Lockhart, Samuel N.; Baker, Suzanne L.
Normal aging is associated with a decline in episodic memory and also with aggregation of the β-amyloid (Aβ) and tau proteins and atrophy of medial temporal lobe (MTL) structures crucial to memory formation. Although some evidence suggests that Aβ is associated with aberrant neural activity, the relationships among these two aggregated proteins, neural function, and brain structure are poorly understood. Using in vivo human Aβ and tau imaging, we demonstrate that increased Aβ and tau are both associated with aberrant fMRI activity in the MTL during memory encoding in cognitively normal older adults. This pathological neural activity was in turnmore » associated with worse memory performance and atrophy within the MTL. A mediation analysis revealed that the relationship with regional atrophy was explained by MTL tau. These findings broaden the concept of cognitive aging to include evidence of Alzheimer’s disease-related protein aggregation as an underlying mechanism of age-related memory impairment.« less
Protein Structure Prediction by Protein Threading
NASA Astrophysics Data System (ADS)
Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong
The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.
Song, Jiangning; Yuan, Zheng; Tan, Hao; Huber, Thomas; Burrage, Kevin
2007-12-01
Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins that play critical roles in stabilizing the protein structures and are commonly found in extracy-toplasmatic or secreted proteins. In protein folding prediction, the localization of disulfide bonds can greatly reduce the search in conformational space. Therefore, there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that could have potentially important applications. We have developed a novel method to predict disulfide connectivity patterns from protein primary sequence, using a support vector regression (SVR) approach based on multiple sequence feature vectors and predicted secondary structure by the PSIPRED program. The results indicate that our method could achieve a prediction accuracy of 74.4% and 77.9%, respectively, when averaged on proteins with two to five disulfide bridges using 4-fold cross-validation, measured on the protein and cysteine pair on a well-defined non-homologous dataset. We assessed the effects of different sequence encoding schemes on the prediction performance of disulfide connectivity. It has been shown that the sequence encoding scheme based on multiple sequence feature vectors coupled with predicted secondary structure can significantly improve the prediction accuracy, thus enabling our method to outperform most of other currently available predictors. Our work provides a complementary approach to the current algorithms that should be useful in computationally assigning disulfide connectivity patterns and helps in the annotation of protein sequences generated by large-scale whole-genome projects. The prediction web server and Supplementary Material are accessible at http://foo.maths.uq.edu.au/~huber/disulfide
Structural Insights into Helicobacter pylori Cag Protein Interactions with Host Cell Factors.
Bergé, Célia; Terradot, Laurent
2017-01-01
The most virulent strains of Helicobacter pylori carry a genomic island (cagPAI) containing a set of 27-31 genes. The encoded proteins assemble a syringe-like apparatus to inject the cytotoxin-associated gene A (CagA) protein into gastric cells. This molecular device belongs to the type IV secretion system (T4SS) family albeit with unique characteristics. The cagPAI-encoded T4SS and its effector protein CagA have an intricate relationship with the host cell, with multiple interactions that only start to be deciphered from a structural point of view. On the one hand, the major roles of the interactions between CagL and CagA (and perhaps CagI and CagY) and host cell factors are to facilitate H. pylori adhesion and to mediate the injection of the CagA oncoprotein. On the other hand, CagA interactions with host cell partners interfere with cellular pathways to subvert cell defences and to promote H. pylori infection. Although a clear mechanism for CagA translocation is still lacking, the structural definition of CagA and CagL domains involved in interactions with signalling proteins are progressively coming to light. In this chapter, we will focus on the structural aspects of Cag protein interactions with host cell molecules, critical molecular events precluding H. pylori-mediated gastric cancer development.
Andera, L; Spangler, C J; Galeone, A; Mayol, L; Geiduschek, E P
1994-02-11
TF1, a homodimeric DNA-binding and -bending protein with a preference for hydroxymethyluracil-containing DNA is the Bacillus subtilis-encoded homolog of the bacterial HU proteins and of the E. coli integration host factor. A temperature-sensitive mutation at amino acid 25 of TF1 (L25-->A) and two intragenic second site revertants at amino acids 15 (E15-->G) and 32 (L32-->I) were previously identified and their effects on virus development were examined. The DNA-binding properties of these proteins and the thermal stability of their secondary structures have now been analyzed. Amino acids 15 and 32 are far removed from the putative DNA-binding domains of TF1 but changes there exert striking effects on DNA affinity that correlate with effects on structure. The double mutant protein TF1-G15I32 binds to a preferred site in hydroxymethyluracil-containing DNA 40 times more tightly, denatures at higher temperature (delta tm = 21 degrees C), and also exchanges subunits much more slowly than does the wild-type protein. The L25-->A mutation makes TF1 secondary structure and DNA-binding highly salt concentration-dependent. The E15-->G mutation partly suppresses this effect: secondary structure of TF1-A25G15 is restored at 21 degrees C by 1 M NaCl or, at low NaCl concentration, by binding to DNA.
Darbro, Benjamin W.; Mahajan, Vinit B.; Gakhar, Lokesh; Skeie, Jessica M.; Campbell, Elizabeth; Wu, Shu; Bing, Xinyu; Millen, Kathleen J.; Dobyns, William B.; Kessler, John A.; Jalali, Ali; Cremer, James; Segre, Alberto; Manak, J. Robert; Aldinger, Kimerbly A.; Suzuki, Satoshi; Natsume, Nagato; Ono, Maya; Hai, Huynh Dai; Viet, Le Thi; Loddo, Sara; Valente, Enza M.; Bernardini, Laura; Ghonge, Nitin; Ferguson, Polly J.; Bassuk, Alexander G.
2013-01-01
We performed whole-exome sequencing of a family with autosomal dominant Dandy-Walker malformation and occipital cephaloceles (ADDWOC) and detected a mutation in the extracellular matrix protein encoding gene NID1. In a second family, protein interaction network analysis identified a mutation in LAMC1, which encodes a NID1 binding partner. Structural modeling the NID1-LAMC1 complex demonstrated that each mutation disrupts the interaction. These findings implicate the extracellular matrix in the pathogenesis of Dandy-Walker spectrum disorders. PMID:23674478
Brown, Dean G; Brown, Giles A; Centrella, Paolo; Certel, Kaan; Cooke, Robert M; Cuozzo, John W; Dekker, Niek; Dumelin, Christoph E; Ferguson, Andrew; Fiez-Vandal, Cédric; Geschwindner, Stefan; Guié, Marie-Aude; Habeshian, Sevan; Keefe, Anthony D; Schlenker, Oliver; Sigel, Eric A; Snijder, Arjan; Soutter, Holly T; Sundström, Linda; Troast, Dawn M; Wiggin, Giselle; Zhang, Jing; Zhang, Ying; Clark, Matthew A
2018-06-01
The discovery of ligands via affinity-mediated selection of DNA-encoded chemical libraries is driven by the quality and concentration of the protein target. G-protein-coupled receptors (GPCRs) and other membrane-bound targets can be difficult to isolate in their functional state and at high concentrations, and therefore have been challenging for affinity-mediated selection. Here, we report a successful selection campaign against protease-activated receptor 2 (PAR2). Using a thermo-stabilized mutant of PAR2, we conducted affinity selection using our >100-billion-compound DNA-encoded library. We observed a number of putative ligands enriched upon selection, and subsequent cellular profiling revealed these ligands to comprise both agonists and antagonists. The agonist series shared structural similarity with known agonists. The antagonists were shown to bind in a novel allosteric binding site on the PAR2 protein. This report serves to demonstrate that cell-free affinity selection against GPCRs can be achieved with mutant stabilized protein targets.
NASA Technical Reports Server (NTRS)
Lu, C.; Fedoroff, N.
2000-01-01
Both physiological and genetic evidence indicate interconnections among plant responses to different hormones. We describe a pleiotropic recessive Arabidopsis transposon insertion mutation, designated hyponastic leaves (hyl1), that alters the plant's responses to several hormones. The mutant is characterized by shorter stature, delayed flowering, leaf hyponasty, reduced fertility, decreased rate of root growth, and an altered root gravitropic response. It also exhibits less sensitivity to auxin and cytokinin and hypersensitivity to abscisic acid (ABA). The auxin transport inhibitor 2,3,5-triiodobenzoic acid normalizes the mutant phenotype somewhat, whereas another auxin transport inhibitor, N-(1-naph-thyl)phthalamic acid, exacerbates the phenotype. The gene, designated HYL1, encodes a 419-amino acid protein that contains two double-stranded RNA (dsRNA) binding motifs, a nuclear localization motif, and a C-terminal repeat structure suggestive of a protein-protein interaction domain. We present evidence that the HYL1 gene is ABA-regulated and encodes a nuclear dsRNA binding protein. We hypothesize that the HYL1 protein is a regulatory protein functioning at the transcriptional or post-transcriptional level.
Chaturvedi, Navaneet; Kajsik, Michal; Forsythe, Stephen; Pandey, Paras Nath
2015-12-01
The recently annotated genome of the bacterium Cronobacter sakazakii BAA-894 suggests that the organism has the ability to bind heavy metals. This study demonstrates heavy metal tolerance in C. sakazakii, in which proteins with the heavy metal interaction were recognized by computational and experimental study. As the result, approximately one-fourth of proteins encoded on the plasmid pESA3 are proposed to have potential interaction with heavy metals. Interaction between heavy metals and predicted proteins was further corroborated using protein crystal structures from protein data bank database and comparison of metal-binding ligands. In addition, a phylogenetic study was undertaken for the toxic heavy metals, arsenic, cadmium, lead and mercury, which generated relatedness clustering for lead, cadmium and arsenic. Laboratory studies confirmed the organism's tolerance to tellurite, copper and silver. These experimental and computational study data extend our understanding of the genes encoding for proteins of this important neonatal pathogen and provide further insights into the genotypes associated with features that can contribute to its persistence in the environment. The information will be of value for future environmental protection from heavy toxic metals.
Ciok, Anna; Adamczuk, Marcin; Bartosik, Dariusz; Dziewit, Lukasz
2016-11-28
Pseudomonas strains isolated from the heavily contaminated Lubin copper mine and Zelazny Most post-flotation waste reservoir in Poland were screened for the presence of integrons. This analysis revealed that two strains carried homologous DNA regions composed of a gene encoding a DNA_BRE_C domain-containing tyrosine recombinase (with no significant sequence similarity to other integrases of integrons) plus a three-component array of putative integron gene cassettes. The predicted gene cassettes encode three putative polypeptides with homology to (i) transmembrane proteins, (ii) GCN5 family acetyltransferases, and (iii) hypothetical proteins of unknown function (homologous proteins are encoded by the gene cassettes of several class 1 integrons). Comparative sequence analyses identified three structural variants of these novel integron-like elements within the sequenced bacterial genomes. Analysis of their distribution revealed that they are found exclusively in strains of the genus Pseudomonas .
DOE Office of Scientific and Technical Information (OSTI.GOV)
Norton, Jeanette M.; Klotz, Martin G; Stein, Lisa Y
2008-01-01
The complete genome of the ammonia-oxidizing bacterium, Nitrosospira multiformis (ATCC 25196T), consists of a circular chromosome and three small plasmids totaling 3,234,309 bp and encoding 2827 putative proteins. Of these, 2026 proteins have predicted functions and 801 are without conserved functional domains, yet 747 of these have similarity to other predicted proteins in databases. Gene homologs from Nitrosomonas europaea and N. eutropha were the best match for 42% of the predicted genes in N. multiformis. The genome contains three nearly identical copies of amo and hao gene clusters as large repeats. Distinguishing features compared to N. europaea include: the presencemore » of gene clusters encoding urease and hydrogenase, a RuBisCO-encoding operon of distinctive structure and phylogeny, and a relatively small complement of genes related to Fe acquisition. Systems for synthesis of a pyoverdine-like siderophore and for acyl-homoserine lactone were unique to N. multiformis among the sequenced AOB genomes. Gene clusters encoding proteins associated with outer membrane and cell envelope functions including transporters, porins, exopolysaccharide synthesis, capsule formation and protein sorting/export were abundant. Numerous sensory transduction and response regulator gene systems directed towards sensing of the extracellular environment are described. Gene clusters for glycogen, polyphosphate and cyanophycin storage and utilization were identified providing mechanisms for meeting energy requirements under substrate-limited conditions. The genome of N. multiformis encodes the core pathways for chemolithoautotrophy along with adaptations for surface growth and survival in soil environments.« less
Capturing novel mouse genes encoding chromosomal and other nuclear proteins.
Tate, P; Lee, M; Tweedie, S; Skarnes, W C; Bickmore, W A
1998-09-01
The burgeoning wealth of gene sequences contrasts with our ignorance of gene function. One route to assigning function is by determining the sub-cellular location of proteins. We describe the identification of mouse genes encoding proteins that are confined to nuclear compartments by splicing endogeneous gene sequences to a promoterless betageo reporter, using a gene trap approach. Mouse ES (embryonic stem) cell lines were identified that express betageo fusions located within sub-nuclear compartments, including chromosomes, the nucleolus and foci containing splicing factors. The sequences of 11 trapped genes were ascertained, and characterisation of endogenous protein distribution in two cases confirmed the validity of the approach. Three novel proteins concentrated within distinct chromosomal domains were identified, one of which appears to be a serine/threonine kinase. The sequence of a gene whose product co-localises with splicesome components suggests that this protein may be an E3 ubiquitin-protein ligase. The majority of the other genes isolated represent novel genes. This approach is shown to be a powerful tool for identifying genes encoding novel proteins with specific sub-nuclear localisations and exposes our ignorance of the protein composition of the nucleus. Motifs in two of the isolated genes suggest new links between cellular regulatory mechanisms (ubiquitination and phosphorylation) and mRNA splicing and chromosome structure/function.
Mark, Linda; Spiller, O Brad; Okroj, Marcin; Chanas, Simon; Aitken, Jim A; Wong, Scott W; Damania, Blossom; Blom, Anna M; Blackbourn, David J
2007-04-01
The diversity of viral strategies to modulate complement activation indicates that this component of the immune system has significant antiviral potential. One example is the Kaposi's sarcoma-associated herpesvirus (KSHV) complement control protein (KCP), which inhibits progression of the complement cascade. Rhesus rhadinovirus (RRV), like KSHV, is a member of the subfamily Gammaherpesvirinae and currently provides the only in vivo model of KSHV pathobiology in primates. In the present study, we characterized the KCP homologue encoded by RRV, RRV complement control protein (RCP). Two strains of RRV have been sequenced to date (H26-95 and 17577), and the RCPs they encode differ substantially in structure: RCP from strain H26-95 has four complement control protein (CCP) domains, whereas RCP from strain 17577 has eight CCP domains. Transcriptional analyses of the RCP gene (ORF4, referred to herein as RCP) in infected rhesus macaque fibroblasts mapped the ends of the transcripts of both strains. They revealed that H26-95 encodes a full-length, unspliced RCP transcript, while 17577 RCP generates a full-length unspliced mRNA and two alternatively spliced transcripts. Western blotting confirmed that infected cells express RCP, and immune electron microscopy disclosed this protein on the surface of RRV virions. Functional studies of RCP encoded by both RRV strains revealed their ability to suppress complement activation by the classical (antibody-mediated) pathway. These data provide the foundation for studies into the biological significance of gammaherpesvirus complement regulatory proteins in a tractable, non-human primate model.
Bombyx mori nucleopolyhedrovirus ORF101 encodes a budded virus envelope associated protein.
Chen, Huiqing; Li, Mei; Huang, Guoping; Mai, Weijun; Chen, Keping; Zhou, Yajing
2014-08-01
Orf101 (Bm101) of Bombyx mori nucleopolyhedrovirus (BmNPV) is a highly conserved gene in lepidopteran nucleopolyhedroviruses, but its function remains unknown. In this study, Bm101 was characterized. Transcripts of Bm101 were detected from 24 through 96 h post infection (h p.i.) by RT-PCR. The corresponding protein was also detected from 24 to 96 h p.i. in BmNPV-infected BmN cells by Western blot analysis using a polyclonal antibody against Bm101. Western blot assay of occlusion-derived virus and budded virus (BV) preparations revealed that Bm101 encodes a 28-kDa structural protein that is associated with BV and is located in the envelope fraction of budded virions. In addition, confocal analysis showed that the protein was localized in the cytosol and cytoplasmic membrane in virus-infected cells. In conclusion, the available data suggest that Bm101 is a functional ORF of BmNPV and encodes a protein expressed in the late stage of the infection cycle that is associated with the BV envelope.
Karmi, Ola; Marjault, Henri-Baptiste; Pesce, Luca; Carloni, Paolo; Onuchic, Jose' N; Jennings, Patricia A; Mittler, Ron; Nechushtai, Rachel
2018-02-12
NEET proteins comprise a new class of [2Fe-2S] cluster proteins. In human, three genes encode for NEET proteins: cisd1 encodes mitoNEET (mNT), cisd2 encodes the Nutrient-deprivation autophagy factor-1 (NAF-1) and cisd3 encodes MiNT (Miner2). These recently discovered proteins play key roles in many processes related to normal metabolism and disease. Indeed, NEET proteins are involved in iron, Fe-S, and reactive oxygen homeostasis in cells and play an important role in regulating apoptosis and autophagy. mNT and NAF-1 are homodimeric and reside on the outer mitochondrial membrane. NAF-1 also resides in the membranes of the ER associated mitochondrial membranes (MAM) and the ER. MiNT is a monomer with distinct asymmetry in the molecular surfaces surrounding the clusters. Unlike its paralogs mNT and NAF-1, it resides within the mitochondria. NAF-1 and mNT share similar backbone folds to the plant homodimeric NEET protein (At-NEET), while MiNT's backbone fold resembles a bacterial MiNT protein. Despite the variation of amino acid composition among these proteins, all NEET proteins retained their unique CDGSH domain harboring their unique 3Cys:1His [2Fe-2S] cluster coordination through evolution. The coordinating exposed His was shown to convey the lability to the NEET proteins' [2Fe-2S] clusters. In this minireview, we discuss the NEET fold and its structural elements. Special attention is given to the unique lability of the NEETs' [2Fe-2S] cluster and the implication of the latter to the NEET proteins' cellular and systemic function in health and disease.
Chen, Y M; Zhu, Y; Lin, E C
1987-12-01
In Escherichia coli the six known genes specifying the utilization of L-fucose as carbon and energy source cluster at 60.2 min and constitute a regulon. These genes include fucP (encoding L-fucose permease), fucI (encoding L-fucose isomerase), fucK (encoding L-fuculose kinase), fucA (encoding L-fuculose 1-phosphate aldolase), fucO (encoding L-1,2-propanediol oxidoreductase), and fucR (encoding the regulatory protein). In this study the fuc genes were cloned and their positions on the chromosome were established by restriction endonuclease and complementation analyses. Clockwise, the gene order is: fucO-fucA-fucP-fucI-fucK-fucR. The operons comprising the structural genes and the direction of transcription were determined by complementation analysis and Southern blot hybridization. The fucPIK and fucA operons are transcribed clockwise. The fucO operon is transcribed counterclockwise. The fucR gene product activates the three structural operons in trans.
Mature forms of the major seed storage albumins in sunflower: A mass spectrometric approach.
Franke, Bastian; Colgrave, Michelle L; Mylne, Joshua S; Rosengren, K Johan
2016-09-16
Seed storage albumins are abundant, water-soluble proteins that are degraded to provide critical nutrients for the germinating seedling. It has been established that the sunflower albumins encoded by SEED STORAGE ALBUMIN 2 (SESA2), SESA20 and SESA3 are the major components of the albumin-rich fraction of the common sunflower Helianthus annuus. To determine the structure of sunflowers most important albumins we performed a detailed chromatographic and mass spectrometric characterization to assess what post-translational processing they receive prior to deposition in the protein storage vacuole. We found that SESA2 and SESA20 each encode two albumins. The first of the two SESA2 albumins (SESA2-1) exists as a monomer of 116 or 117 residues, differing by a threonine at the C-terminus. The second of the two SESA2 albumins (SESA2-2) is a monomer of 128 residues. SESA20 encodes the albumin SESA20-2, which is a 127-residue monomer, whereas SESA20-1 was not abundant enough to be structurally described. SESA3, which has been partly characterized previously, was found in several forms with methylation of its asparagine residues. In contrast to other dicot albumins, which are generally matured into a heterodimer, all the dominant mature sunflower albumins SESA2, SESA20-2, SESA3 and its post-translationally modified analogue SESA3-a are monomeric. Sunflower plants have been bred to thrive in various climate zones making them favored crops to meet the growing worldwide demand by humans for protein. The abundance of seed storage proteins makes them an important source of protein for animal and human nutrition. This study explores the structures of the dominant sunflower napin-type seed storage albumins to understand what structures evolution has favored in the most abundant proteins in sunflower seed. Crown Copyright © 2016. Published by Elsevier B.V. All rights reserved.
Cheng, Yuan; Zhou, Yuan; Yang, Yan; Chi, Ying-Jun; Zhou, Jie; Chen, Jian-Ye; Wang, Fei; Fan, Baofang; Shi, Kai; Zhou, Yan-Hong; Yu, Jing-Quan; Chen, Zhixiang
2012-01-01
WRKY transcription factors are encoded by a large gene superfamily with a broad range of roles in plants. Recently, several groups have reported that proteins containing a short VQ (FxxxVQxLTG) motif interact with WRKY proteins. We have recently discovered that two VQ proteins from Arabidopsis (Arabidopsis thaliana), SIGMA FACTOR-INTERACTING PROTEIN1 and SIGMA FACTOR-INTERACTING PROTEIN2, act as coactivators of WRKY33 in plant defense by specifically recognizing the C-terminal WRKY domain and stimulating the DNA-binding activity of WRKY33. In this study, we have analyzed the entire family of 34 structurally divergent VQ proteins from Arabidopsis. Yeast (Saccharomyces cerevisiae) two-hybrid assays showed that Arabidopsis VQ proteins interacted specifically with the C-terminal WRKY domains of group I and the sole WRKY domains of group IIc WRKY proteins. Using site-directed mutagenesis, we identified structural features of these two closely related groups of WRKY domains that are critical for interaction with VQ proteins. Quantitative reverse transcription polymerase chain reaction revealed that expression of a majority of Arabidopsis VQ genes was responsive to pathogen infection and salicylic acid treatment. Functional analysis using both knockout mutants and overexpression lines revealed strong phenotypes in growth, development, and susceptibility to pathogen infection. Altered phenotypes were substantially enhanced through cooverexpression of genes encoding interacting VQ and WRKY proteins. These findings indicate that VQ proteins play an important role in plant growth, development, and response to environmental conditions, most likely by acting as cofactors of group I and IIc WRKY transcription factors. PMID:22535423
Cheng, Yuan; Zhou, Yuan; Yang, Yan; Chi, Ying-Jun; Zhou, Jie; Chen, Jian-Ye; Wang, Fei; Fan, Baofang; Shi, Kai; Zhou, Yan-Hong; Yu, Jing-Quan; Chen, Zhixiang
2012-06-01
WRKY transcription factors are encoded by a large gene superfamily with a broad range of roles in plants. Recently, several groups have reported that proteins containing a short VQ (FxxxVQxLTG) motif interact with WRKY proteins. We have recently discovered that two VQ proteins from Arabidopsis (Arabidopsis thaliana), SIGMA FACTOR-INTERACTING PROTEIN1 and SIGMA FACTOR-INTERACTING PROTEIN2, act as coactivators of WRKY33 in plant defense by specifically recognizing the C-terminal WRKY domain and stimulating the DNA-binding activity of WRKY33. In this study, we have analyzed the entire family of 34 structurally divergent VQ proteins from Arabidopsis. Yeast (Saccharomyces cerevisiae) two-hybrid assays showed that Arabidopsis VQ proteins interacted specifically with the C-terminal WRKY domains of group I and the sole WRKY domains of group IIc WRKY proteins. Using site-directed mutagenesis, we identified structural features of these two closely related groups of WRKY domains that are critical for interaction with VQ proteins. Quantitative reverse transcription polymerase chain reaction revealed that expression of a majority of Arabidopsis VQ genes was responsive to pathogen infection and salicylic acid treatment. Functional analysis using both knockout mutants and overexpression lines revealed strong phenotypes in growth, development, and susceptibility to pathogen infection. Altered phenotypes were substantially enhanced through cooverexpression of genes encoding interacting VQ and WRKY proteins. These findings indicate that VQ proteins play an important role in plant growth, development, and response to environmental conditions, most likely by acting as cofactors of group I and IIc WRKY transcription factors.
Christie, Andrew E.; Fontanilla, Tiana M.; Nesbit, Katherine T.; Lenz, Petra H.
2013-01-01
Diel vertical migration and seasonal diapause are critical life history events for the copepod Calanus finmarchicus. While much is known about these behaviors phenomenologically, little is known about their molecular underpinnings. Recent studies in insects suggest that some circadian genes/proteins also contribute to the establishment of seasonal diapause. Thus, it is possible that in Calanus these distinct timing regimes share some genetic components. To begin to address this possibility, we used the well-established Drosophila melanogaster circadian system as a reference for mining clock transcripts from a 200,000+ sequence Calanus transcriptome; the proteins encoded by the identified transcripts were also deduced and characterized. Sequences encoding homologs of the Drosophila core clock proteins CLOCK, CYCLE, PERIOD and TIMELESS were identified, as was one encoding CRYPTOCHROME 2, a core clock protein in ancestral insect systems, but absent in Drosophila. Calanus transcripts encoding proteins known to modulate the Drosophila core clock were also identified and characterized, e.g. CLOCKWORK ORANGE, DOUBLETIME, SHAGGY and VRILLE. Alignment and structural analyses of the deduced Calanus proteins with their Drosophila counterparts revealed extensive sequence conservation, particularly in functional domains. Interestingly, reverse BLAST analyses of these sequences against all arthropod proteins typically revealed non-Drosophila isoforms to be most similar to the Calanus queries. This, in combination with the presence of both CRYPTOCHROME 1 (a clock input pathway protein) and CRYPTOCHROME 2 in Calanus, suggests that the organization of the copepod circadian system is an ancestral one, more similar to that of insects like Danaus plexippus than to that of Drosophila. PMID:23727418
P.A. Counce; Davidi R. Gealy; Shi-Jean Susana Sung
2002-01-01
Physiology occurs tn physical space through chemical reactions constrained by anatomy and morphology, yet guided by genetics. Physiology has been called the logic of life. Genes encode structural and fimcdonal proteins. These proteins are subsequently processed to produce enzymes that direct and govern the biomechanical processes involved in the physiology of the...
Pakdaman, Yasaman; Sanchez-Guixé, Monica; Kleppe, Rune; Erdal, Sigrid; Bustad, Helene J.; Bjørkhaug, Lise; Haugarvoll, Kristoffer; Tzoulis, Charalampos; Heimdal, Ketil; Knappskog, Per M.; Johansson, Stefan
2017-01-01
Spinocerebellar ataxia, autosomal recessive 16 (SCAR16) is caused by biallelic mutations in the STIP1 homology and U-box containing protein 1 (STUB1) gene encoding the ubiquitin E3 ligase and dimeric co-chaperone C-terminus of Hsc70-interacting protein (CHIP). It has been proposed that the disease mechanism is related to CHIP’s impaired E3 ubiquitin ligase properties and/or interaction with its chaperones. However, there is limited knowledge on how these mutations affect the stability, folding, and protein structure of CHIP itself. To gain further insight, six previously reported pathogenic STUB1 variants (E28K, N65S, K145Q, M211I, S236T, and T246M) were expressed as recombinant proteins and studied using limited proteolysis, size-exclusion chromatography (SEC), and circular dichroism (CD). Our results reveal that N65S shows increased CHIP dimerization, higher levels of α-helical content, and decreased degradation rate compared with wild-type (WT) CHIP. By contrast, T246M demonstrates a strong tendency for aggregation, a more flexible protein structure, decreased levels of α-helical structures, and increased degradation rate compared with WT CHIP. E28K, K145Q, M211I, and S236T also show defects on structural properties compared with WT CHIP, although less profound than what observed for N65S and T246M. In conclusion, our results illustrate that some STUB1 mutations known to cause recessive SCAR16 have a profound impact on the protein structure, stability, and ability of CHIP to dimerize in vitro. These results add to the growing understanding on the mechanisms behind the disorder. PMID:28396517
Pakdaman, Yasaman; Sanchez-Guixé, Monica; Kleppe, Rune; Erdal, Sigrid; Bustad, Helene J; Bjørkhaug, Lise; Haugarvoll, Kristoffer; Tzoulis, Charalampos; Heimdal, Ketil; Knappskog, Per M; Johansson, Stefan; Aukrust, Ingvild
2017-04-30
Spinocerebellar ataxia, autosomal recessive 16 (SCAR16) is caused by biallelic mutations in the STIP1 homology and U-box containing protein 1 ( STUB1 ) gene encoding the ubiquitin E3 ligase and dimeric co-chaperone C-terminus of Hsc70-interacting protein (CHIP). It has been proposed that the disease mechanism is related to CHIP's impaired E3 ubiquitin ligase properties and/or interaction with its chaperones. However, there is limited knowledge on how these mutations affect the stability, folding, and protein structure of CHIP itself. To gain further insight, six previously reported pathogenic STUB1 variants (E28K, N65S, K145Q, M211I, S236T, and T246M) were expressed as recombinant proteins and studied using limited proteolysis, size-exclusion chromatography (SEC), and circular dichroism (CD). Our results reveal that N65S shows increased CHIP dimerization, higher levels of α-helical content, and decreased degradation rate compared with wild-type (WT) CHIP. By contrast, T246M demonstrates a strong tendency for aggregation, a more flexible protein structure, decreased levels of α-helical structures, and increased degradation rate compared with WT CHIP. E28K, K145Q, M211I, and S236T also show defects on structural properties compared with WT CHIP, although less profound than what observed for N65S and T246M. In conclusion, our results illustrate that some STUB1 mutations known to cause recessive SCAR16 have a profound impact on the protein structure, stability, and ability of CHIP to dimerize in vitro. These results add to the growing understanding on the mechanisms behind the disorder. © 2017 The Author(s).
The ribosome as a missing link in the evolution of life.
Root-Bernstein, Meredith; Root-Bernstein, Robert
2015-02-21
Many steps in the evolution of cellular life are still mysterious. We suggest that the ribosome may represent one important missing link between compositional (or metabolism-first), RNA-world (or genes-first) and cellular (last universal common ancestor) approaches to the evolution of cells. We present evidence that the entire set of transfer RNAs for all twenty amino acids are encoded in both the 16S and 23S rRNAs of Escherichia coli K12; that nucleotide sequences that could encode key fragments of ribosomal proteins, polymerases, ligases, synthetases, and phosphatases are to be found in each of the six possible reading frames of the 16S and 23S rRNAs; and that every sequence of bases in rRNA has information encoding more than one of these functions in addition to acting as a structural component of the ribosome. Ribosomal RNA, in short, is not just a structural scaffold for proteins, but the vestigial remnant of a primordial genome that may have encoded a self-organizing, self-replicating, auto-catalytic intermediary between macromolecules and cellular life. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
NASA Astrophysics Data System (ADS)
Illing, Gerd; Saenger, Wolfram; Heinemann, Udo
2000-06-01
The Protein Structure Factory will be established to characterize proteins encoded by human genes or cDNAs, which will be selected by criteria of potential structural novelty or medical or biotechnological usefulness. It represents an integrative approach to structure analysis combining bioinformatics techniques, automated gene expression and purification of gene products, generation of a biophysical fingerprint of the proteins and the determination of their three-dimensional structures either by NMR spectroscopy or by X-ray diffraction. The use of synchrotron radiation will be crucial to the Protein Structure Factory: high brilliance and tunable wavelengths are prerequisites for fast data collection, the use of small crystals and multiwavelength anomalous diffraction (MAD) phasing. With the opening of BESSY II, direct access to a third-generation XUV storage ring source with excellent conditions is available nearby. An insertion device with two MAD beamlines and one constant energy station will be set up until 2001.
Islam, Nazrul; Woo, Sun-Hee; Tsujimoto, Hisashi; Kawasaki, Hiroshi; Hirano, Hisashi
2002-09-01
Changes in protein composition of wheat endosperm proteome were investigated in 39 ditelocentric chromosome lines of common wheat (Triticum aestivum L.) cv. Chinese Spring. Two-dimensional gel electrophoresis followed by Coomassie Brilliant Blue staining has resolved a total of 105 protein spots in a gel. Quantitative image analysis of protein spots was performed by PDQuest. Variations in protein spots between the euploid and the 39 ditelocentric lines were evaluated by spot number, appearance, disappearance and intensity. A specific spot present in all gels was taken as an internal standard, and the intensity of all other spots was calculated as the ratio of the internal standard. Out of the 1755 major spots detected in 39 ditelocentric lines, 1372 (78%) spots were found variable in different spot parameters: 147 (11%) disappeared, 978 (71%) up-regulated and 247 (18%) down-regulated. Correlation studies in changes in protein intensities among 24 protein spots across the ditelocentric lines were performed. High correlations in changes of protein intensities were observed among the proteins encoded by genes located in the homoeologous arms. Locations of structural genes controlling 26 spots were identified in 10 chromosomal arms. Multiple regulators of the same protein located at various chromosomal arms were also noticed. Identification of structural genes for most of the proteins was found difficult due to multiple regulators encoding the same protein. Two novel subunits (1B(Z,) 1BDz), the structure of which are very similar to the high molecular weight glutenin subunit 12, were identified, and the chromosome arm locations of these subunits were assigned.
Gene Mining for Proline Based Signaling Proteins in Cell Wall of Arabidopsis thaliana
Ihsan, Muhammad Z.; Ahmad, Samina J. N.; Shah, Zahid Hussain; Rehman, Hafiz M.; Aslam, Zubair; Ahuja, Ishita; Bones, Atle M.; Ahmad, Jam N.
2017-01-01
The cell wall (CW) as a first line of defense against biotic and abiotic stresses is of primary importance in plant biology. The proteins associated with cell walls play a significant role in determining a plant's sustainability to adverse environmental conditions. In this work, the genes encoding cell wall proteins (CWPs) in Arabidopsis were identified and functionally classified using geneMANIA and GENEVESTIGATOR with published microarrays data. This yielded 1605 genes, out of which 58 genes encoded proline-rich proteins (PRPs) and glycine-rich proteins (GRPs). Here, we have focused on the cellular compartmentalization, biological processes, and molecular functioning of proline-rich CWPs along with their expression at different plant developmental stages. The mined genes were categorized into five classes on the basis of the type of PRPs encoded in the cell wall of Arabidopsis thaliana. We review the domain structure and function of each class of protein, many with respect to the developmental stages of the plant. We have then used networks, hierarchical clustering and correlations to analyze co-expression, co-localization, genetic, and physical interactions and shared protein domains of these PRPs. This has given us further insight into these functionally important CWPs and identified a number of potentially new cell-wall related proteins in A. thaliana. PMID:28289422
The COG database: new developments in phylogenetic classification of proteins from complete genomes
Tatusov, Roman L.; Natale, Darren A.; Garkavtsev, Igor V.; Tatusova, Tatiana A.; Shankavaram, Uma T.; Rao, Bachoti S.; Kiryutin, Boris; Galperin, Michael Y.; Fedorova, Natalie D.; Koonin, Eugene V.
2001-01-01
The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt on a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs including 45 350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available, in which proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, and shared with bacteria and/or archaea were included. The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements of the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis. PMID:11125040
Kunze, Markus; Berger, Johannes
2015-01-01
The proper distribution of proteins between the cytosol and various membrane-bound compartments is crucial for the functionality of eukaryotic cells. This requires the cooperation between protein transport machineries that translocate diverse proteins from the cytosol into these compartments and targeting signal(s) encoded within the primary sequence of these proteins that define their cellular destination. The mechanisms exerting protein translocation differ remarkably between the compartments, but the predominant targeting signals for mitochondria, chloroplasts and the ER share the N-terminal position, an α-helical structural element and the removal from the core protein by intraorganellar cleavage. Interestingly, similar properties have been described for the peroxisomal targeting signal type 2 mediating the import of a fraction of soluble peroxisomal proteins, whereas other peroxisomal matrix proteins encode the type 1 targeting signal residing at the extreme C-terminus. The structural similarity of N-terminal targeting signals poses a challenge to the specificity of protein transport, but allows the generation of ambiguous targeting signals that mediate dual targeting of proteins into different compartments. Dual targeting might represent an advantage for adaptation processes that involve a redistribution of proteins, because it circumvents the hierarchy of targeting signals. Thus, the co-existence of two equally functional import pathways into peroxisomes might reflect a balance between evolutionary constant and flexible transport routes. PMID:26441678
Disabling a Type I-E CRISPR-Cas Nuclease with a Bacteriophage-Encoded Anti-CRISPR Protein.
Pawluk, April; Shah, Megha; Mejdani, Marios; Calmettes, Charles; Moraes, Trevor F; Davidson, Alan R; Maxwell, Karen L
2017-12-12
CRISPR (clustered regularly interspaced short palindromic repeat)-Cas adaptive immune systems are prevalent defense mechanisms in bacteria and archaea. They provide sequence-specific detection and neutralization of foreign nucleic acids such as bacteriophages and plasmids. One mechanism by which phages and other mobile genetic elements are able to overcome the CRISPR-Cas system is through the expression of anti-CRISPR proteins. Over 20 different families of anti-CRISPR proteins have been described, each of which inhibits a particular type of CRISPR-Cas system. In this work, we determined the structure of type I-E anti-CRISPR protein AcrE1 by X-ray crystallography. We show that AcrE1 binds to the CRISPR-associated helicase/nuclease Cas3 and that the C-terminal region of the anti-CRISPR protein is important for its inhibitory activity. We further show that AcrE1 can convert the endogenous type I-E CRISPR system into a programmable transcriptional repressor. IMPORTANCE The CRISPR-Cas immune system provides bacteria with resistance to invasion by potentially harmful viruses, plasmids, and other foreign mobile genetic elements. This study presents the first structural and mechanistic insight into a phage-encoded protein that inactivates the type I-E CRISPR-Cas system in Pseudomonas aeruginosa The interaction of this anti-CRISPR protein with the CRISPR-associated helicase/nuclease proteins Cas3 shuts down the CRISPR-Cas system and protects phages carrying this gene from destruction. This interaction also allows the repurposing of the endogenous type I-E CRISPR system into a programmable transcriptional repressor, providing a new biotechnological tool for genetic studies of bacteria encoding this type I-E CRISPR-Cas system. Copyright © 2017 Pawluk et al.
Havird, Justin C; Whitehill, Nicholas S; Snow, Christopher D; Sloan, Daniel B
2015-12-01
Interactions between nuclear and mitochondrial gene products are critical for eukaryotic cell function. Nuclear genes encoding mitochondrial-targeted proteins (N-mt genes) experience elevated rates of evolution, which has often been interpreted as evidence of nuclear compensation in response to elevated mitochondrial mutation rates. However, N-mt genes may be under relaxed functional constraints, which could also explain observed increases in their evolutionary rate. To disentangle these hypotheses, we examined patterns of sequence and structural evolution in nuclear- and mitochondrial-encoded oxidative phosphorylation proteins from species in the angiosperm genus Silene with vastly different mitochondrial mutation rates. We found correlated increases in N-mt gene evolution in species with fast-evolving mitochondrial DNA. Structural modeling revealed an overrepresentation of N-mt substitutions at positions that directly contact mutated residues in mitochondrial-encoded proteins, despite overall patterns of conservative structural evolution. These findings support the hypothesis that selection for compensatory changes in response to mitochondrial mutations contributes to the elevated rate of evolution in N-mt genes. We discuss these results in light of theories implicating mitochondrial mutation rates and mitonuclear coevolution as drivers of speciation and suggest comparative and experimental approaches that could take advantage of heterogeneity in rates of mtDNA evolution across eukaryotes to evaluate such theories. © 2015 The Author(s). Evolution © 2015 The Society for the Study of Evolution.
Ou, Horng D.; Deerinck, Thomas J.; Bushong, Eric; Ellisman, Mark H.; O’Shea, Clodagh C.
2015-01-01
Structural studies of viral proteins most often use high-resolution techniques such as X-ray crystallography, nuclear magnetic resonance, single particle negative stain, or cryo-electron microscopy (EM) to reveal atomic interactions of soluble, homogeneous viral proteins or viral protein complexes. Once viral proteins or complexes are separated from their host’s cellular environment, their natural in-situ structure and details of how they interact with other cellular components may be lost. EM has been an invaluable tool in virology since its introduction in the late 1940’s and subsequent application to cells in the 1950’s. EM studies have expanded our knowledge of viral entry, viral replication, alteration of cellular components, and viral lysis. Most of these early studies were focused on conspicuous morphological cellular changes, because classic EM metal stains were designed to highlight classes of cellular structures rather than specific molecular structures. Much later, to identify viral proteins inducing specific structural configurations at the cellular level, immunostaining with a primary antibody followed by colloidal gold secondary antibody was employed to mark the location of specific viral proteins. This technique can suffer from artifacts in cellular ultrastructure due to compromises required to provide access to the immuno-reagents. Immunolocalization methods also require the generation of highly specific antibodies, which may not be available for every viral protein. Here we discuss new methods to visualize viral proteins and structures at high resolutions in-situ using correlated light and electron microscopy (CLEM). We discuss the use of genetically encoded protein fusions that oxidize diaminobenzidine (DAB) into an osmiophilic polymer that can be visualized by EM. Detailed protocols for applying the genetically encoded photo-oxidizing protein MiniSOG to a viral protein, photo-oxidation of the fusion protein to yield DAB polymer staining, and preparation of photo-oxidized samples for TEM and serial block-face scanning EM (SBEM) for large-scale volume EM data acquisition are also presented. As an example, we discuss the recent multi-scale analysis of Adenoviral protein E4-ORF3 that reveals a new type of multi-functional polymer that disrupts multiple cellular proteins. This new capability to visualize unambiguously specific viral protein structures at high resolutions in the native cellular environment is revealing new insights into how they usurp host proteins and functions to drive pathological viral replication. PMID:26066760
Ou, Horng D; Deerinck, Thomas J; Bushong, Eric; Ellisman, Mark H; O'Shea, Clodagh C
2015-11-15
Structural studies of viral proteins most often use high-resolution techniques such as X-ray crystallography, nuclear magnetic resonance, single particle negative stain, or cryo-electron microscopy (EM) to reveal atomic interactions of soluble, homogeneous viral proteins or viral protein complexes. Once viral proteins or complexes are separated from their host's cellular environment, their natural in situ structure and details of how they interact with other cellular components may be lost. EM has been an invaluable tool in virology since its introduction in the late 1940's and subsequent application to cells in the 1950's. EM studies have expanded our knowledge of viral entry, viral replication, alteration of cellular components, and viral lysis. Most of these early studies were focused on conspicuous morphological cellular changes, because classic EM metal stains were designed to highlight classes of cellular structures rather than specific molecular structures. Much later, to identify viral proteins inducing specific structural configurations at the cellular level, immunostaining with a primary antibody followed by colloidal gold secondary antibody was employed to mark the location of specific viral proteins. This technique can suffer from artifacts in cellular ultrastructure due to compromises required to provide access to the immuno-reagents. Immunolocalization methods also require the generation of highly specific antibodies, which may not be available for every viral protein. Here we discuss new methods to visualize viral proteins and structures at high resolutions in situ using correlated light and electron microscopy (CLEM). We discuss the use of genetically encoded protein fusions that oxidize diaminobenzidine (DAB) into an osmiophilic polymer that can be visualized by EM. Detailed protocols for applying the genetically encoded photo-oxidizing protein MiniSOG to a viral protein, photo-oxidation of the fusion protein to yield DAB polymer staining, and preparation of photo-oxidized samples for TEM and serial block-face scanning EM (SBEM) for large-scale volume EM data acquisition are also presented. As an example, we discuss the recent multi-scale analysis of Adenoviral protein E4-ORF3 that reveals a new type of multi-functional polymer that disrupts multiple cellular proteins. This new capability to visualize unambiguously specific viral protein structures at high resolutions in the native cellular environment is revealing new insights into how they usurp host proteins and functions to drive pathological viral replication. Copyright © 2015 Elsevier Inc. All rights reserved.
Parallel protein secondary structure prediction based on neural networks.
Zhong, Wei; Altun, Gulsah; Tian, Xinmin; Harrison, Robert; Tai, Phang C; Pan, Yi
2004-01-01
Protein secondary structure prediction has a fundamental influence on today's bioinformatics research. In this work, binary and tertiary classifiers of protein secondary structure prediction are implemented on Denoeux belief neural network (DBNN) architecture. Hydrophobicity matrix, orthogonal matrix, BLOSUM62 and PSSM (position specific scoring matrix) are experimented separately as the encoding schemes for DBNN. The experimental results contribute to the design of new encoding schemes. New binary classifier for Helix versus not Helix ( approximately H) for DBNN produces prediction accuracy of 87% when PSSM is used for the input profile. The performance of DBNN binary classifier is comparable to other best prediction methods. The good test results for binary classifiers open a new approach for protein structure prediction with neural networks. Due to the time consuming task of training the neural networks, Pthread and OpenMP are employed to parallelize DBNN in the hyperthreading enabled Intel architecture. Speedup for 16 Pthreads is 4.9 and speedup for 16 OpenMP threads is 4 in the 4 processors shared memory architecture. Both speedup performance of OpenMP and Pthread is superior to that of other research. With the new parallel training algorithm, thousands of amino acids can be processed in reasonable amount of time. Our research also shows that hyperthreading technology for Intel architecture is efficient for parallel biological algorithms.
Discovery, SAR, and X-ray Binding Mode Study of BCATm Inhibitors from a Novel DNA-Encoded Library
2015-01-01
As a potential target for obesity, human BCATm was screened against more than 14 billion DNA encoded compounds of distinct scaffolds followed by off-DNA synthesis and activity confirmation. As a consequence, several series of BCATm inhibitors were discovered. One representative compound (R)-3-((1-(5-bromothiophene-2-carbonyl)pyrrolidin-3-yl)oxy)-N-methyl-2′-(methylsulfonamido)-[1,1′-biphenyl]-4-carboxamide (15e) from a novel compound library synthesized via on-DNA Suzuki–Miyaura cross-coupling showed BCATm inhibitory activity with IC50 = 2.0 μM. A protein crystal structure of 15e revealed that it binds to BCATm within the catalytic site adjacent to the PLP cofactor. The identification of this novel inhibitor series plus the establishment of a BCATm protein structure provided a good starting point for future structure-based discovery of BCATm inhibitors. PMID:26288694
USDA-ARS?s Scientific Manuscript database
The human mitochondrial glutamate dehydrogenase isozymes (hGDH1 and 2) are abundant matrix-localized proteins encoded by nuclear genes. The proteins are synthesized in the cytoplasm, with an atypically long N-terminal mitochondrial targeting sequence (MTS). The results of secondary structure predi...
Non-Structural Proteins of Arthropod-Borne Bunyaviruses: Roles and Functions
Eifan, Saleh; Schnettler, Esther; Dietrich, Isabelle; Kohl, Alain; Blomström, Anne-Lie
2013-01-01
Viruses within the Bunyaviridae family are tri-segmented, negative-stranded RNA viruses. The family includes several emerging and re-emerging viruses of humans, animals and plants, such as Rift Valley fever virus, Crimean-Congo hemorrhagic fever virus, La Crosse virus, Schmallenberg virus and tomato spotted wilt virus. Many bunyaviruses are arthropod-borne, so-called arboviruses. Depending on the genus, bunyaviruses encode, in addition to the RNA-dependent RNA polymerase and the different structural proteins, one or several non-structural proteins. These non-structural proteins are not always essential for virus growth and replication but can play an important role in viral pathogenesis through their interaction with the host innate immune system. In this review, we will summarize current knowledge and understanding of insect-borne bunyavirus non-structural protein function(s) in vertebrate, plant and arthropod. PMID:24100888
Self-assembled bionanostructures: proteins following the lead of DNA nanostructures
2014-01-01
Natural polymers are able to self-assemble into versatile nanostructures based on the information encoded into their primary structure. The structural richness of biopolymer-based nanostructures depends on the information content of building blocks and the available biological machinery to assemble and decode polymers with a defined sequence. Natural polypeptides comprise 20 amino acids with very different properties in comparison to only 4 structurally similar nucleotides, building elements of nucleic acids. Nevertheless the ease of synthesizing polynucleotides with selected sequence and the ability to encode the nanostructural assembly based on the two specific nucleotide pairs underlay the development of techniques to self-assemble almost any selected three-dimensional nanostructure from polynucleotides. Despite more complex design rules, peptides were successfully used to assemble symmetric nanostructures, such as fibrils and spheres. While earlier designed protein-based nanostructures used linked natural oligomerizing domains, recent design of new oligomerizing interaction surfaces and introduction of the platform for topologically designed protein fold may enable polypeptide-based design to follow the track of DNA nanostructures. The advantages of protein-based nanostructures, such as the functional versatility and cost effective and sustainable production methods provide strong incentive for further development in this direction. PMID:24491139
Structural Features of Antiviral APOBEC3 Proteins are Linked to Their Functional Activities
Kitamura, Shingo; Ode, Hirotaka; Iwatani, Yasumasa
2011-01-01
Human APOBEC3 (A3) proteins are cellular cytidine deaminases that potently restrict the replication of retroviruses by hypermutating viral cDNA and/or inhibiting reverse transcription. There are seven members of this family including A3A, B, C, DE, F, G, and H, all encoded in a tandem array on human chromosome 22. A3F and A3G are the most potent inhibitors of HIV-1, but only in the absence of the virus-encoded protein, Vif. HIV-1 utilizes Vif to abrogate A3 functions in the producer cells. More specifically, Vif, serving as a substrate receptor, facilitates ubiquitination of A3 proteins by forming a Cullin5 (Cul5)-based E3 ubiquitin ligase complex, which targets A3 proteins for rapid proteasomal degradation. The specificity of A3 degradation is determined by the ability of Vif to bind to the target. Several lines of evidence have suggested that three distinct regions of A3 proteins are involved in the interaction with Vif. Here, we review the biological functions of A3 family members with special focus on A3G and base our analysis on the available structural information. PMID:22203821
Du, Yu-Jie; Hou, Yi-Ling; Hou, Wan-Ru
2013-02-01
The Giant Panda is an endangered and valuable gene pool in genetic, its important functional gene POLR2H encodes an essential shared peptide H of RNA polymerases. The genomic DNA and cDNA sequences were cloned successfully for the first time from the Giant Panda (Ailuropoda melanoleuca) adopting touchdown-PCR and reverse transcription polymerase chain reaction (RT-PCR), respectively. The length of the genomic sequence of the Giant Panda is 3,285 bp, including five exons and four introns. The cDNA fragment cloned is 509 bp in length, containing an open reading frame of 453 bp encoding 150 amino acids. Alignment analysis indicated that both the cDNA and its deduced amino acid sequence were highly conserved. Protein structure prediction showed that there was one protein kinase C phosphorylation site, four casein kinase II phosphorylation sites and one amidation site in the POLR2H protein, further shaping advanced protein structure. The cDNA cloned was expressed in Escherichia coli, which indicated that POLR2H fusion with the N-terminally His-tagged form brought about the accumulation of an expected 20.5 kDa polypeptide in line with the predicted protein. On the basis of what has already been achieved in this study, further deep-in research will be conducted, which has great value in theory and practical significance.
Darbro, Benjamin W; Mahajan, Vinit B; Gakhar, Lokesh; Skeie, Jessica M; Campbell, Elizabeth; Wu, Shu; Bing, Xinyu; Millen, Kathleen J; Dobyns, William B; Kessler, John A; Jalali, Ali; Cremer, James; Segre, Alberto; Manak, J Robert; Aldinger, Kimerbly A; Suzuki, Satoshi; Natsume, Nagato; Ono, Maya; Hai, Huynh Dai; Viet, Le Thi; Loddo, Sara; Valente, Enza M; Bernardini, Laura; Ghonge, Nitin; Ferguson, Polly J; Bassuk, Alexander G
2013-08-01
We performed whole-exome sequencing of a family with autosomal dominant Dandy-Walker malformation and occipital cephaloceles and detected a mutation in the extracellular matrix (ECM) protein-encoding gene NID1. In a second family, protein interaction network analysis identified a mutation in LAMC1, which encodes a NID1-binding partner. Structural modeling of the NID1-LAMC1 complex demonstrated that each mutation disrupts the interaction. These findings implicate the ECM in the pathogenesis of Dandy-Walker spectrum disorders. © 2013 WILEY PERIODICALS, INC.
Pineda-Lucena, Antonio; Liao, Jack C C; Cort, John R; Yee, Adelinda; Kennedy, Michael A; Edwards, Aled M; Arrowsmith, Cheryl H
2003-05-01
As part of the Northeast Structural Genomics Consortium pilot project focused on small eukaryotic proteins and protein domains, we have determined the NMR structure of the protein encoded by ORF YML108W from Saccharomyces cerevisiae. YML108W belongs to one of the numerous structural proteomics targets whose biological function is unknown. Moreover, this protein does not have sequence similarity to any other protein. The NMR structure of YML108W consists of a four-stranded beta-sheet with strand order 2143 and two alpha-helices, with an overall topology of betabetaalphabetabetaalpha. Strand beta1 runs parallel to beta4, and beta2:beta1 and beta4:beta3 pairs are arranged in an antiparallel fashion. Although this fold belongs to the split betaalphabeta family, it appears to be unique among this family; it is a novel arrangement of secondary structure, thereby expanding the universe of protein folds.
Iwanaga, Masashi; Kurihara, Masaaki; Kobayashi, Masahiko; Kang, WonKyung
2002-05-25
All lepidopteran baculovirus genomes sequenced to date encode a homolog of the Bombyx mori nucleopolyhedrovirus (BmNPV) orf68 gene, suggesting that it performs an important role in the virus life cycle. In this article we describe the characterization of BmNPV orf68 gene. Northern and Western analyses demonstrated that orf68 gene was expressed as a late gene and encoded a structural protein of budded virus (BV). Immunohistochemical analysis by confocal microscopy showed that ORF68 protein was localized mainly in the nucleus of infected cells. To examine the function of orf68 gene, we constructed orf68 deletion mutant (BmD68) and characterized it in BmN cells and larvae of B. mori. BV production was delayed in BmD68-infected cells. The larval bioassays also demonstrated that deletion of orf68 did not reduce the infectivity, but mutant virus took 70 h longer to kill the host than wild-type BmNPV. In addition, dot-blot analysis showed viral DNA accumulated more slowly in mutant infected cells. Further examination suggested that BmD68 was less efficient in entry and budding from cells, although it seemed to possess normal attachment ability. These results suggest that ORF68 is a BV-associated protein involved in secondary infection from cell-to-cell. (c) 2002 Elsevier Science (USA).
Showalter, Aaron D; Smith, Timothy P L; Bennett, Gary L; Sloop, Kyle W; Whitsett, Julie A; Rhodes, Simon J
2002-05-29
The Prophet of Pit-1 (PROP1) gene encodes a paired class homeodomain transcription factor that is exclusively expressed in the developing mammalian pituitary gland. PROP1 function is essential for anterior pituitary organogenesis, and heritable mutations in the gene are associated with combined pituitary hormone deficiency in human patients and animals. By cloning the bovine PROP1 gene and by comparative analysis, we demonstrate that the homeodomains and carboxyl termini of mammalian PROP1 proteins are highly conserved while the amino termini are diverged. Whereas the carboxyl termini of the human and bovine PROP1 proteins contain potent transcriptional activation domains, the amino termini and homeodomains have repressive activities. The bovine PROP1 gene has four exons and three introns and maps to a region of chromosome seven carrying a quantitative trait locus affecting ovulation rate. Two alleles of the bovine gene were found that encode distinct protein products with different DNA binding and transcriptional activities. These experiments demonstrate that mammalian PROP1 genes encode proteins with complex regulatory capacities and that modest changes in protein sequence can significantly alter the activity of this pituitary developmental transcription factor.
Quiles-Puchalt, Nuria; Tormo-Más, María Ángeles; Campoy, Susana; Toledo-Arana, Alejandro; Monedero, Vicente; Lasa, Íñigo; Novick, Richard P.; Christie, Gail E.; Penadés, José R.
2013-01-01
The propagation of bacteriophages and other mobile genetic elements requires exploitation of the phage mechanisms involved in virion assembly and DNA packaging. Here, we identified and characterized four different families of phage-encoded proteins that function as activators required for transcription of the late operons (morphogenetic and lysis genes) in a large group of phages infecting Gram-positive bacteria. These regulators constitute a super-family of proteins, here named late transcriptional regulators (Ltr), which share common structural, biochemical and functional characteristics and are unique to this group of phages. They are all small basic proteins, encoded by genes present at the end of the early gene cluster in their respective phage genomes and expressed under cI repressor control. To control expression of the late operon, the Ltr proteins bind to a DNA repeat region situated upstream of the terS gene, activating its transcription. This involves the C-terminal part of the Ltr proteins, which control specificity for the DNA repeat region. Finally, we show that the Ltr proteins are the only phage-encoded proteins required for the activation of the packaging and lysis modules. In summary, we provide evidence that phage packaging and lysis is a conserved mechanism in Siphoviridae infecting a wide variety of Gram-positive bacteria. PMID:23771138
Xie, Jianming [San Diego, CA; Wang, Lei [San Diego, CA; Wu, Ning [Boston, MA; Schultz, Peter G [La Jolla, CA
2008-07-15
Translation systems and other compositions including orthogonal aminoacyl tRNA-synthetases that preferentially charge an orthogonal tRNA with an iodinated or brominated amino acid are provided. Nucleic acids encoding such synthetases are also described, as are methods and kits for producing proteins including heavy atom-containing amino acids, e.g., brominated or iodinated amino acids. Methods of determining the structure of a protein, e.g., a protein into which a heavy atom has been site-specifically incorporated through use of an orthogonal tRNA/aminoacyl tRNA-synthetase pair, are also described.
Engineering Encodable Lanthanide-Binding Tags (LBTs) into Loop Regions of Proteins
Barthelmes, Katja; Reynolds, Anne M.; Peisach, Ezra; Jonker, Hendrik R. A.; DeNunzio, Nicholas J.; Allen, Karen N.; Imperiali, Barbara; Schwalbe, Harald
2011-01-01
Lanthanide-binding-tags (LBTs) are valuable tools for investigation of protein structure, function, and dynamics by NMR spectroscopy, X-ray crystallography and luminescence studies. We have inserted LBTs into three different loop positions (denoted L, R, and S) of the model protein interleukin-1β and varied the length of the spacer between the LBT and the protein (denoted 1-3). Luminescence studies demonstrate that all nine constructs bind Tb3+ tightly in the low nanomolar range. No significant change in the fusion protein occurs from insertion of the LBT, as shown by two X-ray crystallographic structures of the IL1β-S1 and IL1β-L3 constructs and for the remaining constructs by comparing 1H-15N-HSQC NMR spectra with wild-type IL1β. Additionally, binding of LBT-loop IL1β proteins to their native binding partner in vitro remains unaltered. X-ray crystallographic phasing was successful using only the signal from the bound lanthanide. Large residual dipolar couplings (RDCs) could be determined by NMR spectroscopy for all LBT-loop-constructs and revealed that the LBT-2 series were rigidly incorporated into the interleukin-1β structure. The paramagnetic NMR spectra of loop-LBT mutant IL1β-R2 were assigned and the Δχ tensor components were calculated based on RDCs and pseudocontact shifts (PCSs). A structural model of the IL1β-R2 construct was calculated using the paramagnetic restraints. The current data provide support that encodable LBTs serve as versatile biophysical tags when inserted into loop regions of proteins of known structure or predicted via homology modelling. PMID:21182275
Comparative Analysis of Type IV Pilin in Desulfuromonadales
Shu, Chuanjun; Xiao, Ke; Yan, Qin; Sun, Xiao
2016-01-01
During anaerobic respiration, the bacteria Geobacter sulfurreducens can transfer electrons to extracellular electron accepters through its pilus. G. sulfurreducens pili have been reported to have metallic-like conductivity that is similar to doped organic semiconductors. To study the characteristics and origin of conductive pilin proteins found in the pilus structure, their genetic, structural, and phylogenetic properties were analyzed. The genetic relationships, and conserved structures and sequences that were obtained were used to predict the evolution of the pilins. Homologous genes that encode conductive pilin were found using PilFind and Cluster. Sequence characteristics and protein tertiary structures were analyzed with MAFFT and QUARK, respectively. The origin of conductive pilins was explored by building a phylogenetic tree. Truncation is a characteristic of conductive pilin. The structures of truncated pilins and their accompanying proteins were found to be similar to the N-terminal and C-terminal ends of full-length pilins respectively. The emergence of the truncated pilins can probably be ascribed to the evolutionary pressure of their extracellular electron transporting function. Genes encoding truncated pilins and proteins similar to the C-terminal of full-length pilins, which contain a group of consecutive anti-parallel beta-sheets, are adjacent in bacterial genomes. According to the genetic, structure, and phylogenetic analyses performed in this study, we inferred that the truncated pilins and their accompanying proteins probably evolved from full-length pilins by gene fission through duplication, degeneration, and separation. These findings provide new insights about the molecular mechanisms involved in long-range electron transport along the conductive pili of Geobacter species. PMID:28066394
Structural Insight into the Clostridium difficile Ethanolamine Utilisation Microcompartment
Faulds-Pain, Alexandra; Lewis, Richard J.; Marles-Wright, Jon
2012-01-01
Bacterial microcompartments form a protective proteinaceous barrier around metabolic enzymes that process unstable or toxic chemical intermediates. The genome of the virulent, multidrug-resistant Clostridium difficile 630 strain contains an operon, eut, encoding a bacterial microcompartment with genes for the breakdown of ethanolamine and its utilisation as a source of reduced nitrogen and carbon. The C. difficile eut operon displays regulatory genetic elements and protein encoding regions in common with homologous loci found in the genomes of other bacteria, including the enteric pathogens Salmonella enterica and Enterococcus faecalis. The crystal structures of two microcompartment shell proteins, CD1908 and CD1918, and an uncharacterised protein with potential enzymatic activity, CD1925, were determined by X-ray crystallography. CD1908 and CD1918 display the same protein fold, though the order of secondary structure elements is permuted in CD1908 and this protein displays an N-terminal β-strand extension. These proteins form hexamers with molecules related by crystallographic and non-crystallographic symmetry. The structure of CD1925 has a cupin β-barrel fold and a putative active site that is distinct from the metal-ion dependent catalytic cupins. Thin-section transmission electron microscopy of Escherichia coli over-expressing eut proteins indicates that CD1918 is capable of self-association into arrays, suggesting an organisational role for CD1918 in the formation of this microcompartment. The work presented provides the basis for further study of the architecture and function of the C. difficile eut microcompartment, its role in metabolism and the wider consequences of intestinal colonisation and virulence in this pathogen. PMID:23144756
Ianushevich, Iu G; Shagin, D A; Fradkov, A F; Shakhbazov, K S; Barsova, E V; Gurskaia, N G; Labas, Iu A; Matts, M V; Luk'ianov, k A; Lul'ianov, S A
2005-01-01
The cDNAs encoding the genes of new proteins homologous to the well-known Green Fluorescent Protein (GFP) from the hydroid jellyfish Aequorea victoria were cloned. Two green fluorescent proteins from one un-identified anthojellyfish, a yellow fluorescent protein from Phialidium sp., and a nonfluorescent chromoprotein from another unidentified anthojellyfish were characterized. Thus, a broad diversity of GFP-like proteins among the organisms of the class Hydrozoa in both spectral properties and primary structure was shown.
NASA Technical Reports Server (NTRS)
Biermann, B.; Johnson, E. M.; Feldman, L. J.
1990-01-01
Maize (Zea mays) roots respond to a variety of environmental stimuli which are perceived by a specialized group of cells, the root cap. We are studying the transduction of extracellular signals by roots, particularly the role of protein kinases. Protein phosphorylation by kinases is an important step in many eukaryotic signal transduction pathways. As a first phase of this research we have isolated a cDNA encoding a maize protein similar to fungal and animal protein kinases known to be involved in the transduction of extracellular signals. The deduced sequence of this cDNA encodes a polypeptide containing amino acids corresponding to 33 out of 34 invariant or nearly invariant sequence features characteristic of protein kinase catalytic domains. The maize cDNA gene product is more closely related to the branch of serine/threonine protein kinase catalytic domains composed of the cyclic-nucleotide- and calcium-phospholipid-dependent subfamilies than to other protein kinases. Sequence identity is 35% or more between the deduced maize polypeptide and all members of this branch. The high structural similarity strongly suggests that catalytic activity of the encoded maize protein kinase may be regulated by second messengers, like that of all members of this branch whose regulation has been characterized. Northern hybridization with the maize cDNA clone shows a single 2400 base transcript at roughly similar levels in maize coleoptiles, root meristems, and the zone of root elongation, but the transcript is less abundant in mature leaves. In situ hybridization confirms the presence of the transcript in all regions of primary maize root tissue.
Yong, K J; Scott, D J
2015-03-01
Directed evolution is a powerful method for engineering proteins towards user-defined goals and has been used to generate novel proteins for industrial processes, biological research and drug discovery. Typical directed evolution techniques include cellular display, phage display, ribosome display and water-in-oil compartmentalization, all of which physically link individual members of diverse gene libraries to their translated proteins. This allows the screening or selection for a desired protein function and subsequent isolation of the encoding gene from diverse populations. For biotechnological and industrial applications there is a need to engineer proteins that are functional under conditions that are not compatible with these techniques, such as high temperatures and harsh detergents. Cellular High-throughput Encapsulation Solubilization and Screening (CHESS), is a directed evolution method originally developed to engineer detergent-stable G proteins-coupled receptors (GPCRs) for structural biology. With CHESS, library-transformed bacterial cells are encapsulated in detergent-resistant polymers to form capsules, which serve to contain mutant genes and their encoded proteins upon detergent mediated solubilization of cell membranes. Populations of capsules can be screened like single cells to enable rapid isolation of genes encoding detergent-stable protein mutants. To demonstrate the general applicability of CHESS to other proteins, we have characterized the stability and permeability of CHESS microcapsules and employed CHESS to generate thermostable, sodium dodecyl sulfate (SDS) resistant green fluorescent protein (GFP) mutants, the first soluble proteins to be engineered using CHESS. © 2014 Wiley Periodicals, Inc.
Ilk, Nicola; Völlenkle, Christine; Egelseer, Eva M.; Breitwieser, Andreas; Sleytr, Uwe B.; Sára, Margit
2002-01-01
The nucleotide sequence encoding the crystalline bacterial cell surface (S-layer) protein SbpA of Bacillus sphaericus CCM 2177 was determined by a PCR-based technique using four overlapping fragments. The entire sbpA sequence indicated one open reading frame of 3,804 bp encoding a protein of 1,268 amino acids with a theoretical molecular mass of 132,062 Da and a calculated isoelectric point of 4.69. The N-terminal part of SbpA, which is involved in anchoring the S-layer subunits via a distinct type of secondary cell wall polymer to the rigid cell wall layer, comprises three S-layer-homologous motifs. For screening of amino acid positions located on the outer surface of the square S-layer lattice, the sequence encoding Strep-tag I, showing affinity to streptavidin, was linked to the 5′ end of the sequence encoding the recombinant S-layer protein (rSbpA) or a C-terminally truncated form (rSbpA31-1068). The deletion of 200 C-terminal amino acids did not interfere with the self-assembly properties of the S-layer protein but significantly increased the accessibility of Strep-tag I. Thus, the sequence encoding the major birch pollen allergen (Bet v1) was fused via a short linker to the sequence encoding the C-terminally truncated form rSpbA31-1068. Labeling of the square S-layer lattice formed by recrystallization of rSbpA31-1068/Bet v1 on peptidoglycan-containing sacculi with a Bet v1-specific monoclonal mouse antibody demonstrated the functionality of the fused protein sequence and its location on the outer surface of the S-layer lattice. The specific interactions between the N-terminal part of SbpA and the secondary cell wall polymer will be exploited for an oriented binding of the S-layer fusion protein on solid supports to generate regularly structured functional protein lattices. PMID:12089001
Complementation for an essential ancillary non-structural protein function across parvovirus genera.
Mihaylov, Ivailo S; Cotmore, Susan F; Tattersall, Peter
2014-11-01
Parvoviruses encode a small number of ancillary proteins that differ substantially between genera. Within the genus Protoparvovirus, minute virus of mice (MVM) encodes three isoforms of its ancillary protein NS2, while human bocavirus 1 (HBoV1), in the genus Bocaparvovirus, encodes an NP1 protein that is unrelated in primary sequence to MVM NS2. To search for functional overlap between NS2 and NP1, we generated murine A9 cell populations that inducibly express HBoV1 NP1. These were used to test whether NP1 expression could complement specific defects resulting from depletion of MVM NS2 isoforms. NP1 induction had little impact on cell viability or cell cycle progression in uninfected cells, and was unable to complement late defects in MVM virion production associated with low NS2 levels. However, NP1 did relocate to MVM replication centers, and supports both the normal expansion of these foci and overcomes the early paralysis of DNA replication in NS2-null infections. Copyright © 2014 Elsevier Inc. All rights reserved.
Zhao, Jie
2010-01-01
Arabinogalactan proteins (AGPs) comprise a family of hydroxyproline-rich glycoproteins that are implicated in plant growth and development. In this study, 69 AGPs are identified from the rice genome, including 13 classical AGPs, 15 arabinogalactan (AG) peptides, three non-classical AGPs, three early nodulin-like AGPs (eNod-like AGPs), eight non-specific lipid transfer protein-like AGPs (nsLTP-like AGPs), and 27 fasciclin-like AGPs (FLAs). The results from expressed sequence tags, microarrays, and massively parallel signature sequencing tags are used to analyse the expression of AGP-encoding genes, which is confirmed by real-time PCR. The results reveal that several rice AGP-encoding genes are predominantly expressed in anthers and display differential expression patterns in response to abscisic acid, gibberellic acid, and abiotic stresses. Based on the results obtained from this analysis, an attempt has been made to link the protein structures and expression patterns of rice AGP-encoding genes to their functions. Taken together, the genome-wide identification and expression analysis of the rice AGP gene family might facilitate further functional studies of rice AGPs. PMID:20423940
Genome-Wide Protein Interaction Screens Reveal Functional Networks Involving Sm-Like Proteins
Fromont-Racine, Micheline; Mayes, Andrew E.; Brunet-Simon, Adeline; Rain, Jean-Christophe; Colley, Alan; Dix, Ian; Decourty, Laurence; Joly, Nicolas; Ricard, Florence; Beggs, Jean D.
2000-01-01
A set of seven structurally related Sm proteins forms the core of the snRNP particles containing the spliceosomal U1, U2, U4 and U5 snRNAs. A search of the genomic sequence of Saccharomyces cerevisiae has identified a number of open reading frames that potentially encode structurally similar proteins termed Lsm (Like Sm) proteins. With the aim of analysing all possible interactions between the Lsm proteins and any protein encoded in the yeast genome, we performed exhaustive and iterative genomic two-hybrid screens, starting with the Lsm proteins as baits. Indeed, extensive interactions amongst eight Lsm proteins were found that suggest the existence of a Lsm complex or complexes. These Lsm interactions apparently involve the conserved Sm domain that also mediates interactions between the Sm proteins. The screens also reveal functionally significant interactions with splicing factors, in particular with Prp4 and Prp24, compatible with genetic studies and with the reported association of Lsm proteins with spliceosomal U6 and U4/U6 particles. In addition, interactions with proteins involved in mRNA turnover, such as Mrt1, Dcp1, Dcp2 and Xrn1, point to roles for Lsm complexes in distinct RNA metabolic processes, that are confirmed in independent functional studies. These results provide compelling evidence that two-hybrid screens yield functionally meaningful information about protein–protein interactions and can suggest functions for uncharacterized proteins, especially when they are performed on a genome-wide scale. PMID:10900456
Protein functional features are reflected in the patterns of mRNA translation speed.
López, Daniel; Pazos, Florencio
2015-07-09
The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.
USDA-ARS?s Scientific Manuscript database
Disease resistance (R-) genes have been isolated from many plant species. Most encode nucleotide binding leucine-rich-repeat (NLR) proteins that trigger a rapid localized programmed cell death termed the hypersensitive response (HR) upon pathogen recognition. Despite their structural similarities, d...
Gardenia jasminoides Encodes an Inhibitor-2 Protein for Protein Phosphatase Type 1
NASA Astrophysics Data System (ADS)
Gao, Lan; Li, Hao-Ming
2017-08-01
Protein phosphatase-1 (PP1) regulates diverse, essential cellular processes such as cell cycle progression, protein synthesis, muscle contraction, carbohydrate metabolism, transcription and neuronal signaling. Inhibitor-2 (I-2) can inhibit the activity of PP1 and has been found in diverse organisms. In this work, a Gardenia jasminoides fruit cDNA library was constructed, and the GjI-2 cDNA was isolated from the cDNA library by sequencing method. The GjI-2 cDNA contains a predicted 543 bp open reading frame that encodes 180 amino acids. The bioinformatics analysis suggested that the GjI-2 has conserved PP1c binding motif, and contains a conserved phosphorylation site, which is important in regulation of its activity. The three-dimensional model structure of GjI-2 was buite, its similar with the structure of I-2 from mouse. The results suggest that GjI-2 has relatively conserved RVxF, FxxR/KxR/K and HYNE motif, and these motifs are involved in interaction with PP1.
Primary structure of the Aequorea victoria green-fluorescent protein.
Prasher, D C; Eckenrode, V K; Ward, W W; Prendergast, F G; Cormier, M J
1992-02-15
Many cnidarians utilize green-fluorescent proteins (GFPs) as energy-transfer acceptors in bioluminescence. GFPs fluoresce in vivo upon receiving energy from either a luciferase-oxyluciferin excited-state complex or a Ca(2+)-activated phosphoprotein. These highly fluorescent proteins are unique due to the chemical nature of their chromophore, which is comprised of modified amino acid (aa) residues within the polypeptide. This report describes the cloning and sequencing of both cDNA and genomic clones of GFP from the cnidarian, Aequorea victoria. The gfp10 cDNA encodes a 238-aa-residue polypeptide with a calculated Mr of 26,888. Comparison of A. victoria GFP genomic clones shows three different restriction enzyme patterns which suggests that at least three different genes are present in the A. victoria population at Friday Harbor, Washington. The gfp gene encoded by the lambda GFP2 genomic clone is comprised of at least three exons spread over 2.6 kb. The nucleotide sequences of the cDNA and the gene will aid in the elucidation of structure-function relationships in this unique class of proteins.
Regulation of Glycan Structures in Animal Tissues
Nairn, Alison V.; York, William S.; Harris, Kyle; Hall, Erica M.; Pierce, J. Michael; Moremen, Kelley W.
2008-01-01
Glycan structures covalently attached to proteins and lipids play numerous roles in mammalian cells, including protein folding, targeting, recognition, and adhesion at the molecular or cellular level. Regulating the abundance of glycan structures on cellular glycoproteins and glycolipids is a complex process that depends on numerous factors. Most models for glycan regulation hypothesize that transcriptional control of the enzymes involved in glycan synthesis, modification, and catabolism determines glycan abundance and diversity. However, few broad-based studies have examined correlations between glycan structures and transcripts encoding the relevant biosynthetic and catabolic enzymes. Low transcript abundance for many glycan-related genes has hampered broad-based transcript profiling for comparison with glycan structural data. In an effort to facilitate comparison with glycan structural data and to identify the molecular basis of alterations in glycan structures, we have developed a medium-throughput quantitative real time reverse transcriptase-PCR platform for the analysis of transcripts encoding glycan-related enzymes and proteins in mouse tissues and cells. The method employs a comprehensive list of >700 genes, including enzymes involved in sugar-nucleotide biosynthesis, transporters, glycan extension, modification, recognition, catabolism, and numerous glycosylated core proteins. Comparison with parallel microarray analyses indicates a significantly greater sensitivity and dynamic range for our quantitative real time reverse transcriptase-PCR approach, particularly for the numerous low abundance glycan-related enzymes. Mapping of the genes and transcript levels to their respective biosynthetic pathway steps allowed a comparison with glycan structural data and provides support for a model where many, but not all, changes in glycan abundance result from alterations in transcript expression of corresponding biosynthetic enzymes. PMID:18411279
Meyer, Philippe; Liger, Dominique; Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Zhou, Cong-Zhao; Borel, Franck; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman
2005-12-01
We have determined the three-dimensional crystal structure of the protein encoded by the open reading frame YFL030w from Saccharomyces cerevisiae to a resolution of 2.6 A using single wavelength anomalous diffraction. YFL030w is a 385 amino-acid protein with sequence similarity to the aminotransferase family. The structure of the protein reveals a homodimer adopting the fold-type I of pyridoxal 5'-phosphate (PLP)-dependent aminotransferases. The PLP co-factor is covalently bound to the active site in the crystal structure. The protein shows close structural resemblance with the human alanine:glyoxylate aminotransferase (EC 2.6.1.44), an enzyme involved in the hereditary kidney stone disease primary hyperoxaluria type 1. In this paper we show that YFL030w codes for an alanine:glyoxylate aminotransferase, highly specific for its amino donor and acceptor substrates.
Challenges in NMR-based structural genomics
NASA Astrophysics Data System (ADS)
Sue, Shih-Che; Chang, Chi-Fon; Huang, Yao-Te; Chou, Ching-Yu; Huang, Tai-huang
2005-05-01
Understanding the functions of the vast number of proteins encoded in many genomes that have been completely sequenced recently is the main challenge for biologists in the post-genomics era. Since the function of a protein is determined by its exact three-dimensional structure it is paramount to determine the 3D structures of all proteins. This need has driven structural biologists to undertake the structural genomics project aimed at determining the structures of all known proteins. Several centers for structural genomics studies have been established throughout the world. Nuclear magnetic resonance (NMR) spectroscopy has played a major role in determining protein structures in atomic details and in a physiologically relevant solution state. Since the number of new genes being discovered daily far exceeds the number of structures determined by both NMR and X-ray crystallography, a high-throughput method for speeding up the process of protein structure determination is essential for the success of the structural genomics effort. In this article we will describe NMR methods currently being employed for protein structure determination. We will also describe methods under development which may drastically increase the throughput, as well as point out areas where opportunities exist for biophysicists to make significant contribution in this important field.
Redinbaugh, M G; Hogenhout, S A
2005-01-01
This chapter provides an overview of plant rhabdovirus structure and taxonomy, genome structure, protein function, and insect and plant infection. It is focused on recent research and unique aspects of rhabdovirus biology. Plant rhabdoviruses are transmitted by aphid, leafhopper or planthopper vectors, and the viruses replicate in both their insect and plant hosts. The two plant rhabdovirus genera, Nucleorhabdovirus and Cytorhabdovirus, can be distinguished on the basis of their intracellular site of morphogenesis in plant cells. All plant rhabdoviruses carry analogs of the five core genes: the nucleocapsid (N), phosphoprotein (P), matrix (M), glycoprotein (G) and large or polymerase (L). However, compared to vesiculoviruses that are composed of the five core genes, all plant rhabdoviruses encode more than these five genes, at least one of which is inserted between the P and M genes in the rhabdoviral genome. Interestingly, while these extra genes are not similar among plant rhabdoviruses, two encode proteins with similarity to the 30K superfamily of plant virus movement proteins. Analysis of nucleorhabdoviral protein sequences revealed nuclear localization signals for the N, P, M and L proteins, consistent with virus replication and morphogenesis of these viruses in the nucleus. Plant and insect factors that limit virus infection and transmission are discussed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wong, E.C.C.; Mullersman, J.E.; Thomas, M.L.
1993-07-01
The leukocyte common antigen-related protein tyrosine phosphatase (LRP) is a widely expressed transmembrane glycoprotein thought to be involved in cell growth and differentiation. Similar to most other transmembrane protein tyrosine phosphatases, LRP contains two tandem cytoplasmic phosphatase domains. To understand further the regulation and evolution of LRP, the authors have isolated and characterized mouse [lambda] genomic clones. Thirteen genomic clones could be divided into two non-overlapping clusters. The first cluster contained the transcription initiation site and the exon encoding most of the 5[prime] untranslated region. The second cluster contained the remaining exons encoding the protein and the 3[prime] untranslated region.more » The gene consists of 22 exons spanning over 75 kb. The distance between exon 1 and exon 2 is at least 25 kb. Characterization of the 5[prime] ends of LRP mRNA by S1 nuclease protection identifies putative initiation start sites within a G/C-rich region. The upstream region does not contain a TATA box. Comparison of the LRP gene structure to the mammalian protein tyrosine phosphatase gene, CD45, shows striking similarities in size and genomic organization. 29 refs., 5 figs., 1 tab.« less
Staphylococcal SCCmec elements encode an active MCM-like helicase and thus may be replicative
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mir-Sanchis, Ignacio; Roman, Christina A.; Misiura, Agnieszka
2016-08-29
Methicillin-resistant Staphylococcus aureus (MRSA) is a public-health threat worldwide. Although the mobile genomic island responsible for this phenotype, staphylococcal cassette chromosome (SCC), has been thought to be nonreplicative, we predicted DNA-replication-related functions for some of the conserved proteins encoded by SCC. We show that one of these, Cch, is homologous to the self-loading initiator helicases of an unrelated family of genomic islands, that it is an active 3'-to-5' helicase and that the adjacent ORF encodes a single-stranded DNA–binding protein. Our 2.9-Å crystal structure of intact Cch shows that it forms a hexameric ring. Cch, like the archaeal and eukaryotic MCM-familymore » replicative helicases, belongs to the pre–sensor II insert clade of AAA+ ATPases. Additionally, we found that SCC elements are part of a broader family of mobile elements, all of which encode a replication initiator upstream of their recombinases. Replication after excision would enhance the efficiency of horizontal gene transfer.« less
Qian, Guoliang; Zhou, Yijing; Zhao, Yancun; Song, Zhiwei; Wang, Suyan; Fan, Jiaqin; Hu, Baishi; Venturi, Vittorio; Liu, Fengquan
2013-07-05
Quorum sensing (QS) in Xanthomonas oryzae pv. oryzicola (Xoc), the causal agent of bacterial leaf streak, is mediated by the diffusible signal factor (DSF). DSF-mediating QS has been shown to control virulence and a set of virulence-related functions; however, the expression profiles and functions of extracellular proteins controlled by DSF signal remain largely unclear. In the present study, 33 DSF-regulated extracellular proteins, whose functions include small-protein mediating QS, oxidative adaptation, macromolecule metabolism, cell structure, biosynthesis of small molecules, intermediary metabolism, cellular process, protein catabolism, and hypothetical function, were identified by proteomics in Xoc. Of these, 15 protein encoding genes were in-frame deleted, and 4 of them, including three genes encoding type II secretion system (T2SS)-dependent proteins and one gene encoding an Ax21 (activator of XA21-mediated immunity)-like protein (a novel small-protein type QS signal) were determined to be required for full virulence in Xoc. The contributions of these four genes to important virulence-associated functions, including bacterial colonization, extracellular polysaccharide, cell motility, biofilm formation, and antioxidative ability, are presented. To our knowledge, our analysis is the first complete list of DSF-regulated extracellular proteins and functions in a Xanthomonas species. Our results show that DSF-type QS played critical roles in regulation of T2SS and Ax21-mediating QS, which sheds light on the role of DSF signaling in Xanthomonas.
PRIMARY STRUCTURE OF THE P450 LANOSTEROL DEMETHYLASE GENE FROM SACCHAROMYCES CEREVISIAE
We have sequenced the structural gene and flanking regions for lanosterol 14 alpha-demethylase (14DM) from Saccharomyces cerevisiae. An open reading frame of 530 codons encodes a 60.7-kDa protein. When this gene is disrupted by integrative transformation, the resulting strain req...
Plant Proteins Are Smaller Because They Are Encoded by Fewer Exons than Animal Proteins.
Ramírez-Sánchez, Obed; Pérez-Rodríguez, Paulino; Delaye, Luis; Tiessen, Axel
2016-12-01
Protein size is an important biochemical feature since longer proteins can harbor more domains and therefore can display more biological functionalities than shorter proteins. We found remarkable differences in protein length, exon structure, and domain count among different phylogenetic lineages. While eukaryotic proteins have an average size of 472 amino acid residues (aa), average protein sizes in plant genomes are smaller than those of animals and fungi. Proteins unique to plants are ∼81aa shorter than plant proteins conserved among other eukaryotic lineages. The smaller average size of plant proteins could neither be explained by endosymbiosis nor subcellular compartmentation nor exon size, but rather due to exon number. Metazoan proteins are encoded on average by ∼10 exons of small size [∼176 nucleotides (nt)]. Streptophyta have on average only ∼5.7 exons of medium size (∼230nt). Multicellular species code for large proteins by increasing the exon number, while most unicellular organisms employ rather larger exons (>400nt). Among subcellular compartments, membrane proteins are the largest (∼520aa), whereas the smallest proteins correspond to the gene ontology group of ribosome (∼240aa). Plant genes are encoded by half the number of exons and also contain fewer domains than animal proteins on average. Interestingly, endosymbiotic proteins that migrated to the plant nucleus became larger than their cyanobacterial orthologs. We thus conclude that plants have proteins larger than bacteria but smaller than animals or fungi. Compared to the average of eukaryotic species, plants have ∼34% more but ∼20% smaller proteins. This suggests that photosynthetic organisms are unique and deserve therefore special attention with regard to the evolutionary forces acting on their genomes and proteomes. Copyright © 2016 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Crystal structure of secretory protein Hcp3 from Pseudomonas aeruginosa.
Osipiuk, Jerzy; Xu, Xiaohui; Cui, Hong; Savchenko, Alexei; Edwards, Aled; Joachimiak, Andrzej
2011-03-01
The Type VI secretion pathway transports proteins across the cell envelope of Gram-negative bacteria. Pseudomonas aeruginosa, an opportunistic Gram-negative bacterial pathogen infecting humans, uses the type VI secretion pathway to export specific effector proteins crucial for its pathogenesis. The HSI-I virulence locus encodes for several proteins that has been proposed to participate in protein transport including the Hcp1 protein, which forms hexameric rings that assemble into nanotubes in vitro. Two Hcp1 paralogues have been identified in the P. aeruginosa genome, Hsp2 and Hcp3. Here, we present the structure of the Hcp3 protein from P. aeruginosa. The overall structure of the monomer resembles Hcp1 despite the lack of amino-acid sequence similarity between the two proteins. The monomers assemble into hexamers similar to Hcp1. However, instead of forming nanotubes in head-to-tail mode like Hcp1, Hcp3 stacks its rings in head-to-head mode forming double-ring structures.
Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity
Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna
2013-01-01
Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005
Natarajaseenivasan, Kalimuthusamy; Shanmughapriya, Santhanam; Velineni, Sridhar; Artiushin, Sergey C; Timoney, John F
2011-10-01
Leptospirosis is an infectious bacterial disease caused by Leptospira species. In this study, we cloned and sequenced the gene encoding the immunodominant protein GroEL from L. interrogans serovar Autumnalis strain N2, which was isolated from the urine of a patient during an outbreak of leptospirosis in Chennai, India. This groEL gene encodes a protein of 60 kDa with a high degree of homology (99% similarity) to those of other leptospiral serovars. Recombinant GroEL was overexpressed in Escherichia coli. Immunoblot analysis indicated that the sera from confirmed leptospirosis patients showed strong reactivity with the recombinant GroEL while no reactivity was observed with the sera from seronegative control patient. In addition, the 3D structure of GroEL was constructed using chaperonin complex cpn60 from Thermus thermophilus as template and validated. The results indicated a Z-score of -8.35, which is in good agreement with the expected value for a protein. The superposition of the Ca traces of cpn60 structure and predicted structure of leptospiral GroEL indicates good agreement of secondary structure elements with an RMSD value of 1.5 Å. Further study is necessary to evaluate GroEL for serological diagnosis of leptospirosis and for its potential as a vaccine component. Copyright © 2011 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.
Tempo and Mode in the Molecular Evolution of Influenza C
Gatherer, Derek
2010-01-01
Abstract: Influenza C contributes to economic damage caused by working days lost through absence or inefficiency and may occasionally cause an acute respiratory illness in a paediatric setting. All Influenza C sequences from the NCBI Influenza Virus Resource were examined to determine the date of the most recent common ancestor (t-MRCA), the average nucleotide substitution rate, and the location of residues under positive selection, for each of the seven genome segments of this virus. The segment with the deepest phylogeny was found to be segment 4, encoding the haemagglutinin-esterase protein (HE) with mean t-MRCA at 1890 of the common era (AD), at a 95% highest posterior density (HPD) of 1857-1924 AD. Other genome segments have slightly more recent common ancestors, ranging from mean t-MRCAs of 1916 AD (HPD 1891-1937) for segment 7, encoding the two non-structural proteins (NS) to 1944 AD (HPD 1940-1948) for segment 2 encoding the type 1 basic polymerase (PB1). On the basis of the Bayesian analysis a reclassification of lineages within genome segments is proposed. Some evidence for positive selection was found in the receptor-binding domain of the haemagglutinin-esterase protein. However, average ω (omega) values ranged from 0.05 for polymerase basic protein 2 (PB2) to 0.38 for non-structural protein 2 (NS2), suggesting that strong to moderate purifying selection is the main trend. Characteristic combinations of segment lineages were identified (genome constellations) and shown to have a relatively short life-span before being broken up by reassortment. PMID:21127722
Tong, Xiangjun; Xia, Zhidan; Zu, Yao; Telfer, Helena; Hu, Jing; Yu, Jingyi; Liu, Huan; Zhang, Quan; Sodmergen; Lin, Shuo; Zhang, Bo
2013-01-25
The notochord is an important organ involved in embryonic patterning and locomotion. In zebrafish, the mature notochord consists of a single stack of fully differentiated, large vacuolated cells called chordocytes, surrounded by a single layer of less differentiated notochordal epithelial cells called chordoblasts. Through genetic analysis of zebrafish lines carrying pseudo-typed retroviral insertions, a mutant exhibiting a defective notochord with a granular appearance was isolated, and the corresponding gene was identified as ngs (notochord granular surface), which was specifically expressed in the notochord. In the mutants, the notochord started to degenerate from 32 hours post-fertilization, and the chordocytes were then gradually replaced by smaller cells derived from chordoblasts. The granular notochord phenotype was alleviated by anesthetizing the mutant embryos with tricaine to prevent muscle contraction and locomotion. Phylogenetic analysis showed that ngs encodes a new type of intermediate filament (IF) family protein, which we named chordostatin based on its function. Under the transmission electron microcopy, bundles of 10-nm-thick IF-like filaments were enriched in the chordocytes of wild-type zebrafish embryos, whereas the chordocytes in ngs mutants lacked IF-like structures. Furthermore, chordostatin-enhanced GFP (EGFP) fusion protein assembled into a filamentous network specifically in chordocytes. Taken together, our work demonstrates that ngs encodes a novel type of IF protein and functions to maintain notochord integrity for larval development and locomotion. Our work sheds light on the mechanisms of notochord structural maintenance, as well as the evolution and biological function of IF family proteins.
Tong, Xiangjun; Xia, Zhidan; Zu, Yao; Telfer, Helena; Hu, Jing; Yu, Jingyi; Liu, Huan; Zhang, Quan; Sodmergen; Lin, Shuo; Zhang, Bo
2013-01-01
The notochord is an important organ involved in embryonic patterning and locomotion. In zebrafish, the mature notochord consists of a single stack of fully differentiated, large vacuolated cells called chordocytes, surrounded by a single layer of less differentiated notochordal epithelial cells called chordoblasts. Through genetic analysis of zebrafish lines carrying pseudo-typed retroviral insertions, a mutant exhibiting a defective notochord with a granular appearance was isolated, and the corresponding gene was identified as ngs (notochord granular surface), which was specifically expressed in the notochord. In the mutants, the notochord started to degenerate from 32 hours post-fertilization, and the chordocytes were then gradually replaced by smaller cells derived from chordoblasts. The granular notochord phenotype was alleviated by anesthetizing the mutant embryos with tricaine to prevent muscle contraction and locomotion. Phylogenetic analysis showed that ngs encodes a new type of intermediate filament (IF) family protein, which we named chordostatin based on its function. Under the transmission electron microcopy, bundles of 10-nm-thick IF-like filaments were enriched in the chordocytes of wild-type zebrafish embryos, whereas the chordocytes in ngs mutants lacked IF-like structures. Furthermore, chordostatin-enhanced GFP (EGFP) fusion protein assembled into a filamentous network specifically in chordocytes. Taken together, our work demonstrates that ngs encodes a novel type of IF protein and functions to maintain notochord integrity for larval development and locomotion. Our work sheds light on the mechanisms of notochord structural maintenance, as well as the evolution and biological function of IF family proteins. PMID:23132861
Transgenic cells with increased plastoquinone levels and methods of use
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sayre, Richard T.; Subramanian, Sowmya; Cahoon, Edgar
Disclosed herein are transgenic cells expressing a heterologous nucleic acid encoding a prephenate dehydrogenase (PDH) protein, a heterologous nucleic acid encoding a homogentisate solanesyl transferase (HST) protein, a heterologous nucleic acid encoding a deoxyxylulose phosphate synthase (DXS) protein, or a combination of two or more thereof. In particular examples, the disclosed transgenic cells have increased plastoquinone levels. Also disclosed are methods of increasing cell growth rates or production of biomass by cultivating transgenic cells expressing a heterologous nucleic acid encoding a PDH protein, a heterologous nucleic acid encoding an HST protein, a heterologous nucleic acid encoding a DXS protein, ormore » a combination of two or more thereof under conditions sufficient to produce cell growth or biomass.« less
Frameshifting in alphaviruses: a diversity of 3' stimulatory structures.
Chung, Betty Y-W; Firth, Andrew E; Atkins, John F
2010-03-26
Programmed ribosomal frameshifting allows the synthesis of alternative, N-terminally coincident, C-terminally distinct proteins from the same RNA. Many viruses utilize frameshifting to optimize the coding potential of compact genomes, to circumvent the host cell's canonical rule of one functional protein per mRNA, or to express alternative proteins in a fixed ratio. Programmed frameshifting is also used in the decoding of a small number of cellular genes. Recently, specific ribosomal -1 frameshifting was discovered at a conserved U_UUU_UUA motif within the sequence encoding the alphavirus 6K protein. In this case, frameshifting results in the synthesis of an additional protein, termed TF (TransFrame). This new case of frameshifting is unusual in that the -1 frame ORF is very short and completely embedded within the sequence encoding the overlapping polyprotein. The present work shows that there is remarkable diversity in the 3' sequences that are functionally important for efficient frameshifting at the U_UUU_UUA motif. While many alphavirus species utilize a 3' RNA structure such as a hairpin or pseudoknot, some species (such as Semliki Forest virus) apparently lack any intra-mRNA stimulatory structure, yet just 20 nt 3'-adjacent to the shift site stimulates up to 10% frameshifting. The analysis, both experimental and bioinformatic, significantly expands the known repertoire of -1 frameshifting stimulators in mammalian and insect systems.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Trindade, Inês B.; Fonseca, Bruno M.; Matias, Pedro M.
The gene encoding a putative siderophore-interacting protein from the marine bacterium S. frigidimarina was successfully cloned, followed by expression and purification of the gene product. Optimized crystals diffracted to 1.35 Å resolution and preliminary crystallographic analysis is promising with respect to structure determination and increased insight into the poorly understood molecular mechanisms underlying iron acquisition. Siderophore-binding proteins (SIPs) perform a key role in iron acquisition in multiple organisms. In the genome of the marine bacterium Shewanella frigidimarina NCIMB 400, the gene tagged as SFRI-RS12295 encodes a protein from this family. Here, the cloning, expression, purification and crystallization of this proteinmore » are reported, together with its preliminary X-ray crystallographic analysis to 1.35 Å resolution. The SIP crystals belonged to the monoclinic space group P2{sub 1}, with unit-cell parameters a = 48.04, b = 78.31, c = 67.71 Å, α = 90, β = 99.94, γ = 90°, and are predicted to contain two molecules per asymmetric unit. Structure determination by molecular replacement and the use of previously determined ∼2 Å resolution SIP structures with ∼30% sequence identity as templates are ongoing.« less
A chalcone isomerase-like protein enhances flavonoid production and flower pigmentation.
Morita, Yasumasa; Takagi, Kyoko; Fukuchi-Mizutani, Masako; Ishiguro, Kanako; Tanaka, Yoshikazu; Nitasaka, Eiji; Nakayama, Masayoshi; Saito, Norio; Kagami, Takashi; Hoshino, Atsushi; Iida, Shigeru
2014-04-01
Flavonoids are major pigments in plants, and their biosynthetic pathway is one of the best-studied metabolic pathways. Here we have identified three mutations within a gene that result in pale-colored flowers in the Japanese morning glory (Ipomoea nil). As the mutations lead to a reduction of the colorless flavonoid compound flavonol as well as of anthocyanins in the flower petal, the identified gene was designated enhancer of flavonoid production (EFP). EFP encodes a chalcone isomerase (CHI)-related protein classified as a type IV CHI protein. CHI is the second committed enzyme of the flavonoid biosynthetic pathway, but type IV CHI proteins are thought to lack CHI enzymatic activity, and their functions remain unknown. The spatio-temporal expression of EFP and structural genes encoding enzymes that produce flavonoids is very similar. Expression of both EFP and the structural genes is coordinately promoted by genes encoding R2R3-MYB and WD40 family proteins. The EFP gene is widely distributed in land plants, and RNAi knockdown mutants of the EFP homologs in petunia (Petunia hybrida) and torenia (Torenia hybrida) had pale-colored flowers and low amounts of anthocyanins. The flavonol and flavone contents in the knockdown petunia and torenia flowers, respectively, were also significantly decreased, suggesting that the EFP protein contributes in early step(s) of the flavonoid biosynthetic pathway to ensure production of flavonoid compounds. From these results, we conclude that EFP is an enhancer of flavonoid production and flower pigmentation, and its function is conserved among diverse land plant species. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Biogenesis of light harvesting proteins.
Dall'Osto, Luca; Bressan, Mauro; Bassi, Roberto
2015-09-01
The LHC family includes nuclear-encoded, integral thylakoid membrane proteins, most of which coordinate chlorophyll and xanthophyll chromophores. By assembling with the core complexes of both photosystems, LHCs form a flexible peripheral moiety for enhancing light-harvesting cross-section, regulating its efficiency and providing protection against photo-oxidative stress. Upon its first appearance, LHC proteins underwent evolutionary diversification into a large protein family with a complex genetic redundancy. Such differentiation appears as a crucial event in the adaptation of photosynthetic organisms to changing environmental conditions and land colonization. The structure of photosystems, including nuclear- and chloroplast-encoded subunits, presented the cell with a number of challenges for the control of the light harvesting function. Indeed, LHC-encoding messages are translated in the cytosol, and pre-proteins imported into the chloroplast, processed to their mature size and targeted to the thylakoids where are assembled with chromophores. Thus, a tight coordination between nuclear and plastid gene expression, in response to environmental stimuli, is required to adjust LHC composition during photoacclimation. In recent years, remarkable progress has been achieved in elucidating structure, function and regulatory pathways involving LHCs; however, a number of molecular details still await elucidation. In this review, we will provide an overview on the current knowledge on LHC biogenesis, ranging from organization of pigment-protein complexes to the modulation of gene expression, import and targeting to the photosynthetic membranes, and regulation of LHC assembly and turnover. Genes controlling these events are potential candidate for biotechnological applications aimed at optimizing light use efficiency of photosynthetic organisms. This article is part of a Special Issue entitled: Chloroplast biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.
Rodríguez Guilbe, María M.; Alfaro Malavé, Elisa C.; Akerboom, Jasper; Marvin, Jonathan S.; Looger, Loren L.; Schreiter, Eric R.
2008-01-01
Fluorescent proteins and their engineered variants have played an important role in the study of biology. The genetically encoded calcium-indicator protein GCaMP2 comprises a circularly permuted fluorescent protein coupled to the calcium-binding protein calmodulin and a calmodulin target peptide, M13, derived from the intracellular calmodulin target myosin light-chain kinase and has been used to image calcium transients in vivo. To aid rational efforts to engineer improved variants of GCaMP2, this protein was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution. The crystals belong to space group C2, with unit-cell parameters a = 126.1, b = 47.1, c = 68.8 Å, β = 100.5° and one GCaMP2 molecule in the asymmetric unit. The structure was phased by molecular replacement and refinement is currently under way. PMID:18607093
Enzymes and Enzyme Activity Encoded by Nonenveloped Viruses.
Azad, Kimi; Banerjee, Manidipa; Johnson, John E
2017-09-29
Viruses are obligate intracellular parasites that rely on host cell machineries for their replication and survival. Although viruses tend to make optimal use of the host cell protein repertoire, they need to encode essential enzymatic or effector functions that may not be available or accessible in the host cellular milieu. The enzymes encoded by nonenveloped viruses-a group of viruses that lack any lipid coating or envelope-play vital roles in all the stages of the viral life cycle. This review summarizes the structural, biochemical, and mechanistic information available for several classes of enzymes and autocatalytic activity encoded by nonenveloped viruses. Advances in research and development of antiviral inhibitors targeting specific viral enzymes are also highlighted.
Uversky, Vladimir N
2015-03-01
Intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs) are functional proteins or regions that do not have unique 3D structures under functional conditions. Therefore, from the viewpoint of their lack of stable 3D structure, IDPs/IDPRs are inherently unstable. As much as structure and function of normal ordered globular proteins are determined by their amino acid sequences, the lack of unique 3D structure in IDPs/IDPRs and their disorder-based functionality are also encoded in the amino acid sequences. Because of their specific sequence features and distinctive conformational behavior, these intrinsically unstable proteins or regions have several applications in biotechnology. This review introduces some of the most characteristic features of IDPs/IDPRs (such as peculiarities of amino acid sequences of these proteins and regions, their major structural features, and peculiar responses to changes in their environment) and describes how these features can be used in the biotechnology, for example for the proteome-wide analysis of the abundance of extended IDPs, for recombinant protein isolation and purification, as polypeptide nanoparticles for drug delivery, as solubilization tools, and as thermally sensitive carriers of active peptides and proteins. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Structure of faustovirus, a large dsDNA virus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klose, Thomas; Reteno, Dorine G.; Benamar, Samia
Many viruses protect their genome with a combination of a protein shell with or without a membrane layer. In this paper, we describe the structure of faustovirus, the first DNA virus (to our knowledge) that has been found to use two protein shells to encapsidate and protect its genome. The crystal structure of the major capsid protein, in combination with cryo-electron microscopy structures of two different maturation stages of the virus, shows that the outer virus shell is composed of a double jelly-roll protein that can be found in many double-stranded DNA viruses. The structure of the repeating hexameric unitmore » of the inner shell is different from all other known capsid proteins. In addition to the unique architecture, the region of the genome that encodes the major capsid protein stretches over 17,000 bp and contains a large number of introns and exons. Finally, this complexity might help the virus to rapidly adapt to new environments or hosts.« less
Structure of faustovirus, a large dsDNA virus
Klose, Thomas; Reteno, Dorine G.; Benamar, Samia; ...
2016-05-16
Many viruses protect their genome with a combination of a protein shell with or without a membrane layer. In this paper, we describe the structure of faustovirus, the first DNA virus (to our knowledge) that has been found to use two protein shells to encapsidate and protect its genome. The crystal structure of the major capsid protein, in combination with cryo-electron microscopy structures of two different maturation stages of the virus, shows that the outer virus shell is composed of a double jelly-roll protein that can be found in many double-stranded DNA viruses. The structure of the repeating hexameric unitmore » of the inner shell is different from all other known capsid proteins. In addition to the unique architecture, the region of the genome that encodes the major capsid protein stretches over 17,000 bp and contains a large number of introns and exons. Finally, this complexity might help the virus to rapidly adapt to new environments or hosts.« less
Mark, Linda; Spiller, O. Brad; Okroj, Marcin; Chanas, Simon; Aitken, Jim A.; Wong, Scott W.; Damania, Blossom; Blom, Anna M.; Blackbourn, David J.
2007-01-01
The diversity of viral strategies to modulate complement activation indicates that this component of the immune system has significant antiviral potential. One example is the Kaposi's sarcoma-associated herpesvirus (KSHV) complement control protein (KCP), which inhibits progression of the complement cascade. Rhesus rhadinovirus (RRV), like KSHV, is a member of the subfamily Gammaherpesvirinae and currently provides the only in vivo model of KSHV pathobiology in primates. In the present study, we characterized the KCP homologue encoded by RRV, RRV complement control protein (RCP). Two strains of RRV have been sequenced to date (H26-95 and 17577), and the RCPs they encode differ substantially in structure: RCP from strain H26-95 has four complement control protein (CCP) domains, whereas RCP from strain 17577 has eight CCP domains. Transcriptional analyses of the RCP gene (ORF4, referred to herein as RCP) in infected rhesus macaque fibroblasts mapped the ends of the transcripts of both strains. They revealed that H26-95 encodes a full-length, unspliced RCP transcript, while 17577 RCP generates a full-length unspliced mRNA and two alternatively spliced transcripts. Western blotting confirmed that infected cells express RCP, and immune electron microscopy disclosed this protein on the surface of RRV virions. Functional studies of RCP encoded by both RRV strains revealed their ability to suppress complement activation by the classical (antibody-mediated) pathway. These data provide the foundation for studies into the biological significance of gammaherpesvirus complement regulatory proteins in a tractable, non-human primate model. PMID:17287274
Martín, A E; Burgess, B K; Stout, C D; Cash, V L; Dean, D R; Jensen, G M; Stephens, P J
1990-01-01
Azotobacter vinelandii ferredoxin I is a small protein that contains one [4Fe-4S] cluster and one [3Fe-4S] cluster. Recently the x-ray crystal structure has been redetermined and the fdxA gene, which encodes the protein, has been cloned and sequenced. Here we report the site-directed mutation of Cys-20, which is a ligand of the [4Fe-4S] cluster in the native protein, to alanine and the characterization of the protein product by x-ray crystallographic and spectroscopic methods. The data show that the mutant protein again contains one [4Fe-4S] cluster and one [3Fe-4S] cluster. The new [4Fe-4S] cluster obtains its fourth ligand from Cys-24, a free cysteine in the native structure. The formation of this [4Fe-4S] cluster drives rearrangement of the protein structure. PMID:2153958
USDA-ARS?s Scientific Manuscript database
Plant disease resistance is often mediated by nucleotide binding-leucine rich repeat (NB-LRR or NLR) proteins, which trigger a hypersensitive response (HR), a rapid, localized cell death upon recognition of specific pathogens. The maize NLR-encoding Rp1-D21 gene is the result of an intergenic recomb...
FERM proteins in animal morphogenesis.
Tepass, Ulrich
2009-08-01
Proteins containing a FERM domain are ubiquitous components of the cytocortex of animal cells where they are engaged in structural, transport, and signaling functions. Recent years have seen a wealth of genetic studies in model organisms that explore FERM protein function in development and tissue organization. In addition, mutations in several FERM protein-encoding genes have been associated with human diseases. This review will provide a brief overview of the FERM domain structure and the FERM protein superfamily and then discuss recent advances in our understanding of the mechanism of function and developmental requirement of several FERM proteins including Moesin, Myosin-VIIA, Myosin-XV, Coracle/Band4.1 as well as Yurt and its vertebrate homologs Mosaic Eyes and EPB41L5/YMO1/Limulus.
Mayer-Jaekel, R E; Baumgartner, S; Bilbe, G; Ohkura, H; Glover, D M; Hemmings, B A
1992-01-01
cDNA clones encoding the catalytic subunit and the 65-kDa regulatory subunit of protein phosphatase 2A (PR65) from Drosophila melanogaster have been isolated by homology screening with the corresponding human cDNAs. The Drosophila clones were used to analyze the spatial and temporal expression of the transcripts encoding these two proteins. The Drosophila PR65 cDNA clones contained an open reading frame of 1773 nucleotides encoding a protein of 65.5 kDa. The predicted amino acid sequence showed 75 and 71% identity to the human PR65 alpha and beta isoforms, respectively. As previously reported for the mammalian PR65 isoforms, Drosophila PR65 is composed of 15 imperfect repeating units of approximately 39 amino acids. The residues contributing to this repeat structure show also the highest sequence conservation between species, indicating a functional importance for these repeats. The gene encoding Drosophila PR65 was located at 29B1,2 on the second chromosome. A major transcript of 2.8 kilobase (kb) encoding the PR65 subunit and two transcripts of 1.6 and 2.5 kb encoding the catalytic subunit could be detected throughout Drosophila development. All of these mRNAs were most abundant during early embryogenesis and were expressed at lower levels in larvae and adult flies. In situ hybridization of different developmental stages showed a colocalization of the PR65 and catalytic subunit transcripts. The mRNA expression is high in the nurse cells and oocytes, consistent with a high equally distributed expression in early embryos. In later embryonal development, the expression remains high in the nervous system and the gonads but the overall transcript levels decrease. In third instar larvae, high levels of mRNA could be observed in brain, imaginal discs, and in salivary glands. These results indicate that protein phosphatase 2A transcript levels change during development in a tissue and in a time-specific manner. Images PMID:1320961
Samson, Marie-Laure
2008-01-01
Background The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
Taylor, Gregory K.; Stoddard, Barry L.
2012-01-01
Homing endonucleases (HEs) are highly specific DNA-cleaving enzymes that are encoded by invasive DNA elements (usually mobile introns or inteins) within the genomes of phage, bacteria, archea, protista and eukaryotic organelles. Six unique structural HE families, that collectively span four distinct nuclease catalytic motifs, have been characterized to date. Members of each family display structural homology and functional relationships to a wide variety of proteins from various organisms. The biological functions of those proteins are highly disparate and include non-specific DNA-degradation enzymes, restriction endonucleases, DNA-repair enzymes, resolvases, intron splicing factors and transcription factors. These relationships suggest that modern day HEs share common ancestors with proteins involved in genome fidelity, maintenance and gene expression. This review summarizes the results of structural studies of HEs and corresponding proteins from host organisms that have illustrated the manner in which these factors are related. PMID:22406833
Adaptability of Protein Structures to Enable Functional Interactions and Evolutionary Implications
Haliloglu, Turkan; Bahar, Ivet
2015-01-01
Several studies in recent years have drawn attention to the ability of proteins to adapt to intermolecular interactions by conformational changes along structure-encoded collective modes of motions. These so-called soft modes, primarily driven by entropic effects, facilitate, if not enable, functional interactions. They represent excursions on the conformational space along principal low-ascent directions/paths away from the original free energy minimum, and they are accessible to the protein even prior to protein-protein/ligand interactions. An emerging concept from these studies is the evolution of structures or modular domains to favor such modes of motion that will be recruited or integrated for enabling functional interactions. Structural dynamics, including the allosteric switches in conformation that are often stabilized upon formation of complexes and multimeric assemblies, emerge as key properties that are evolutionarily maintained to accomplish biological activities, consistent with the paradigm sequence → structure → dynamics → function where ‘dynamics’ bridges structure and function. PMID:26254902
Rüping, Boris; Ernst, Antonia M; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A
2010-10-08
The phloem of dicotyledonous plants contains specialized P-proteins (phloem proteins) that accumulate during sieve element differentiation and remain parietally associated with the cisternae of the endoplasmic reticulum in mature sieve elements. Wounding causes P-protein filaments to accumulate at the sieve plates and block the translocation of photosynthate. Specialized, spindle-shaped P-proteins known as forisomes that undergo reversible calcium-dependent conformational changes have evolved exclusively in the Fabaceae. Recently, the molecular characterization of three genes encoding forisome components in the model legume Medicago truncatula (MtSEO1, MtSEO2 and MtSEO3; SEO = sieve element occlusion) was reported, but little is known about the molecular characteristics of P-proteins in non-Fabaceae. We performed a comprehensive genome-wide comparative analysis by screening the M. truncatula, Glycine max, Arabidopsis thaliana, Vitis vinifera and Solanum phureja genomes, and a Malus domestica EST library for homologs of MtSEO1, MtSEO2 and MtSEO3 and identified numerous novel SEO genes in Fabaceae and even non-Fabaceae plants, which do not possess forisomes. Even in Fabaceae some SEO genes appear to not encode forisome components. All SEO genes have a similar exon-intron structure and are expressed predominantly in the phloem. Phylogenetic analysis revealed the presence of several subgroups with Fabaceae-specific subgroups containing all of the known as well as newly identified forisome component proteins. We constructed Hidden Markov Models that identified three conserved protein domains, which characterize SEO proteins when present in combination. In addition, one common and three subgroup specific protein motifs were found in the amino acid sequences of SEO proteins. SEO genes are organized in genomic clusters and the conserved synteny allowed us to identify several M. truncatula vs G. max orthologs as well as paralogs within the G. max genome. The unexpected occurrence of forisome-like genes in non-Fabaceae plants may indicate that these proteins encode species-specific P-proteins, which is backed up by the phloem-specific expression profiles. The conservation of gene structure, the presence of specific motifs and domains and the genomic synteny argue for a common phylogenetic origin of forisomes and other P-proteins.
Bunney, Tom D.; Cole, Ambrose R.; Broncel, Malgorzata; Esposito, Diego; Tate, Edward W.; Katan, Matilda
2014-01-01
Summary Protein AMPylation, the transfer of AMP from ATP to protein targets, has been recognized as a new mechanism of host-cell disruption by some bacterial effectors that typically contain a FIC-domain. Eukaryotic genomes also encode one FIC-domain protein, HYPE, which has remained poorly characterized. Here we describe the structure of human HYPE, solved by X-ray crystallography, representing the first structure of a eukaryotic FIC-domain protein. We demonstrate that HYPE forms stable dimers with structurally and functionally integrated FIC-domains and with TPR-motifs exposed for protein-protein interactions. As HYPE also uniquely possesses a transmembrane helix, dimerization is likely to affect its positioning and function in the membrane vicinity. The low rate of autoAMPylation of the wild-type HYPE could be due to autoinhibition, consistent with the mechanism proposed for a number of putative FIC AMPylators. Our findings also provide a basis to further consider possible alternative cofactors of HYPE and distinct modes of target-recognition. PMID:25435325
Bunney, Tom D; Cole, Ambrose R; Broncel, Malgorzata; Esposito, Diego; Tate, Edward W; Katan, Matilda
2014-12-02
Protein AMPylation, the transfer of AMP from ATP to protein targets, has been recognized as a new mechanism of host-cell disruption by some bacterial effectors that typically contain a FIC-domain. Eukaryotic genomes also encode one FIC-domain protein,HYPE, which has remained poorly characterized.Here we describe the structure of human HYPE, solved by X-ray crystallography, representing the first structure of a eukaryotic FIC-domain protein. We demonstrate that HYPE forms stable dimers with structurally and functionally integrated FIC-domains and with TPR-motifs exposed for protein-protein interactions. As HYPE also uniquely possesses a transmembrane helix, dimerization is likely to affect its positioning and function in the membrane vicinity. The low rate of auto AMPylation of the wild-type HYPE could be due to autoinhibition, consistent with the mechanism proposed for a number of putative FIC AMPylators. Our findings also provide a basis to further consider possible alternative cofactors of HYPE and distinct modes of target-recognition.
Miller, Ona K; Potter, Jane A; Vijayakrishnan, Swetha; Bhella, David; Naismith, James H; Elliott, Richard M
2017-01-01
Rift Valley fever phlebovirus (RVFV) is a clinically and economically important pathogen increasingly likely to cause widespread epidemics. RVFV virulence depends on the interferon antagonist non-structural protein (NSs), which remains poorly characterized. We identified a stable core domain of RVFV NSs (residues 83–248), and solved its crystal structure, a novel all-helical fold organized into highly ordered fibrils. A hallmark of RVFV pathology is NSs filament formation in infected cell nuclei. Recombinant virus encoding the NSs core domain induced intranuclear filaments, suggesting it contains all essential determinants for nuclear translocation and filament formation. Mutations of key crystal fibril interface residues in viruses encoding full-length NSs completely abrogated intranuclear filament formation in infected cells. We propose the fibrillar arrangement of the NSs core domain in crystals reveals the molecular basis of assembly of this key virulence factor in cell nuclei. Our findings have important implications for fundamental understanding of RVFV virulence. PMID:28915104
Barski, Michal; Brennan, Benjamin; Miller, Ona K; Potter, Jane A; Vijayakrishnan, Swetha; Bhella, David; Naismith, James H; Elliott, Richard M; Schwarz-Linek, Ulrich
2017-09-15
Rift Valley fever phlebovirus (RVFV) is a clinically and economically important pathogen increasingly likely to cause widespread epidemics. RVFV virulence depends on the interferon antagonist non-structural protein (NSs), which remains poorly characterized. We identified a stable core domain of RVFV NSs (residues 83-248), and solved its crystal structure, a novel all-helical fold organized into highly ordered fibrils. A hallmark of RVFV pathology is NSs filament formation in infected cell nuclei. Recombinant virus encoding the NSs core domain induced intranuclear filaments, suggesting it contains all essential determinants for nuclear translocation and filament formation. Mutations of key crystal fibril interface residues in viruses encoding full-length NSs completely abrogated intranuclear filament formation in infected cells. We propose the fibrillar arrangement of the NSs core domain in crystals reveals the molecular basis of assembly of this key virulence factor in cell nuclei. Our findings have important implications for fundamental understanding of RVFV virulence.
Raibaud, A; Zalacain, M; Holt, T G; Tizard, R; Thompson, C J
1991-01-01
Nucleotide sequence analysis of a 5,000-bp region of the bialaphos antibiotic production (bap) gene cluster defined five open reading frames (ORFs) which predicted structural genes in the order bah, ORF1, ORF2, and ORF3 followed by the regulatory gene, brpA (H. Anzai, T. Murakami, S. Imai, A. Satoh, K. Nagaoka, and C.J. Thompson, J. Bacteriol. 169:3482-3488, 1987). The four structural genes were translationally coupled and apparently cotranscribed from an undefined promoter(s) under the positive control of the brpA gene product. S1 mapping experiments indicated that brpA was transcribed by two promoters (brpAp1 and brpAp2) which initiate transcription 150 and 157 bp upstream of brp A within an intergenic region and at least one promoter further upstream within the bap gene cluster (brpAp3). All three transcripts were present at low levels during exponential growth and increased just before the stationary phase. The levels of the brpAp3 band continued to increase at the onset of stationary phase, whereas brpAp1-and brpAp2-protected fragments showed no further change. BrpA contained a possible helix-turn-helix motif at its C terminus which was similar to the C-terminal regulatory motif found in the receiver component of a family of two-component transcriptional activator proteins. This motif was not associated with the N-terminal domain conserved in other members of the family. The structural gene cluster sequenced began with bah, encoding a bialaphos acetylhydrolase which removes the N-acetyl group from bialaphos as one of the final steps in the biosynthetic pathway. The observation that Bah was similar to a rat and to a bacterial (Acinetobacter calcoaceticus) lipase probably reflects the fact that the ester bonds of triglycerides and the amide bond linking acetate to phosphinothricin are similar and hydrolysis is catalyzed by structurally related enzymes. This was followed by two regions encoding ORF1 and ORF2 which were similar to each other (48% nucleotide identity, 31% amino acid identity), as well as to GrsT, a protein encoded by a gene located adjacent to gramicidin S synthetase in Bacillus brevis, and to vertebrate (mallard duck and rat) thioesterases. The amino acid sequence and hydrophobicity profile of ORF3 indicated that it was related to a family of membrane transport proteins. It was strikingly similar to the citrate uptake protein encoded by the transposon Tn3411. Images PMID:2066341
Cellular and molecular biology of orphan G protein-coupled receptors.
Oh, Da Young; Kim, Kyungjin; Kwon, Hyuk Bang; Seong, Jae Young
2006-01-01
The superfamily of G protein-coupled receptors (GPCRs) is the largest and most diverse group of membrane-spanning proteins. It plays a variety of roles in pathophysiological processes by transmitting extracellular signals to cells via heterotrimeric G proteins. Completion of the human genome project revealed the presence of approximately 168 genes encoding established nonsensory GPCRs, as well as 207 genes predicted to encode novel GPCRs for which the natural ligands remained to be identified, the so-called orphan GPCRs. Eighty-six of these orphans have now been paired to novel or previously known molecules, and 121 remain to be deorphaned. A better understanding of the GPCR structures and classification; knowledge of the receptor activation mechanism, either dependent on or independent of an agonist; increased understanding of the control of GPCR-mediated signal transduction; and development of appropriate ligand screening systems may improve the probability of discovering novel ligands for the remaining orphan GPCRs.
A core viral protein binds host nucleosomes to sequester immune danger signals
Avgousti, Daphne C.; Herrmann, Christin; Kulej, Katarzyna; Pancholi, Neha J.; Sekulic, Nikolina; Petrescu, Joana; Molden, Rosalynn C.; Blumenthal, Daniel; Paris, Andrew J.; Reyes, Emigdio D.; Ostapchuk, Philomena; Hearing, Patrick; Seeholzer, Steven H.; Worthen, G. Scott; Black, Ben E.; Garcia, Benjamin A.; Weitzman, Matthew D.
2016-01-01
Viral proteins mimic host protein structure and function to redirect cellular processes and subvert innate defenses1. Small basic proteins compact and regulate both viral and cellular DNA genomes. Nucleosomes are the repeating units of cellular chromatin and play an important role in innate immune responses2. Viral encoded core basic proteins compact viral genomes but their impact on host chromatin structure and function remains unexplored. Adenoviruses encode a highly basic protein called protein VII that resembles cellular histones3. Although protein VII binds viral DNA and is incorporated with viral genomes into virus particles4,5, it is unknown whether protein VII impacts cellular chromatin. Our observation that protein VII alters cellular chromatin led us to hypothesize that this impacts antiviral responses during adenovirus infection. We found that protein VII forms complexes with nucleosomes and limits DNA accessibility. We identified post-translational modifications on protein VII that are responsible for chromatin localization. Furthermore, proteomic analysis demonstrated that protein VII is sufficient to alter protein composition of host chromatin. We found that protein VII is necessary and sufficient for retention in chromatin of members of the high-mobility group protein B family (HMGB1, HMGB2, and HMGB3). HMGB1 is actively released in response to inflammatory stimuli and functions as a danger signal to activate immune responses6,7. We showed that protein VII can directly bind HMGB1 in vitro and further demonstrated that protein VII expression in mouse lungs is sufficient to decrease inflammation-induced HMGB1 content and neutrophil recruitment in the bronchoalveolar lavage fluid. Together our in vitro and in vivo results show that protein VII sequesters HMGB1 and can prevent its release. This study uncovers a viral strategy in which nucleosome binding is exploited to control extracellular immune signaling. PMID:27362237
fRMSDPred: Predicting Local RMSD Between Structural Fragments Using Sequence Information
2007-04-04
machine learning approaches for estimating the RMSD value of a pair of protein fragments. These estimated fragment-level RMSD values can be used to construct the alignment, assess the quality of an alignment, and identify high-quality alignment segments. We present algorithms to solve this fragment-level RMSD prediction problem using a supervised learning framework based on support vector regression and classification that incorporates protein profiles, predicted secondary structure, effective information encoding schemes, and novel second-order pairwise exponential kernel
Zhang, Jiaxin; Movahedi, Ali; Wang, Xiaoli; Wu, Xiaolong; Yin, Tongming; Zhuge, Qiang
2015-06-01
The increasing resistance of bacteria and fungi to currently available antibiotics is a major concern worldwide, leading to enormous efforts to develop new antibiotics with new modes of actions. In this paper, cDNA encoding cecropin A was amplified from drury (Hyphantria cunea) (dHC) pupa fatbody total RNA using RT-PCR. The full-length dHC-cecropin A cDNA encoded a protein of 63 amino acids with a predicted 26-amino acid signal peptide and a 37-amino acid functional domain. We synthesized the antibacterial peptide (ABP) from the 37-amino acid functional domain (ABP-dHC-cecropin A), and amidated it via the C-terminus. Time-of-flight mass spectrometry showed its molecular weight to be 4058.94. The ABP-dHC-cecropin A was assessed in terms of its protein structure using bioinformatics and CD spectroscopy. The protein's secondary structure was predicted to be α-helical. In an antibacterial activity analysis, the ABP-dHC-cecropin A exhibited strong antibacterial activity against E. coli K12D31 and Agrobacterium EHA105. Copyright © 2014 Elsevier Inc. All rights reserved.
Schaeffer, E; Sninsky, J J
1984-01-01
Proteins that are related evolutionarily may have diverged at the level of primary amino acid sequence while maintaining similar secondary structures. Computer analysis has been used to compare the open reading frames of the hepatitis B virus to those of the woodchuck hepatitis virus at the level of amino acid sequence, and to predict the relative hydrophilic character and the secondary structure of putative polypeptides. Similarity is seen at the levels of relative hydrophilicity and secondary structure, in the absence of sequence homology. These data reinforce the proposal that these open reading frames encode viral proteins. Computer analysis of this type can be more generally used to establish structural similarities between proteins that do not share obvious sequence homology as well as to assess whether an open reading frame is fortuitous or codes for a protein. PMID:6585835
Cytoplasmic bacteriophage display system
Studier, F.W.; Rosenberg, A.H.
1998-06-16
Disclosed are display vectors comprising DNA encoding a portion of a structural protein from a cytoplasmic bacteriophage, joined covalently to a protein or peptide of interest. Exemplified are display vectors wherein the structural protein is the T7 bacteriophage capsid protein. More specifically, in the exemplified display vectors the C-terminal amino acid residue of the portion of the capsid protein is joined to the N-terminal residue of the protein or peptide of interest. The portion of the T7 capsid protein exemplified comprises an N-terminal portion corresponding to form 10B of the T7 capsid protein. The display vectors are useful for high copy number display or lower copy number display (with larger fusion). Compositions of the type described herein are useful in connection with methods for producing a virus displaying a protein or peptide of interest. 1 fig.
Cytoplasmic bacteriophage display system
Studier, F. William; Rosenberg, Alan H.
1998-06-16
Disclosed are display vectors comprising DNA encoding a portion of a structural protein from a cytoplasmic bacteriophage, joined covalently to a protein or peptide of interest. Exemplified are display vectors wherein the structural protein is the T7 bacteriophage capsid protein. More specifically, in the exemplified display vectors the C-terminal amino acid residue of the portion of the capsid protein is joined to the N-terminal residue of the protein or peptide of interest. The portion of the T7 capsid protein exemplified comprises an N-terminal portion corresponding to form 10B of the T7 capsid protein. The display vectors are useful for high copy number display or lower copy number display (with larger fusion). Compositions of the type described herein are useful in connection with methods for producing a virus displaying a protein or peptide of interest.
Lee, Hyung Ho; Jung, Sang Taek
2013-02-01
β-N-acetylglucosaminidase (NagA) protein hs a chitin-degrading activity and chitin is one of the most abundant polymers in nature. NagA contains a family 3 glycoside (GH3)-type N-terminal domain and a unique C-terminal domain. The structurally uncharacterized C-terminal domain of NagA may be involved in substrate specificity. To provide a structural basis for the substrate specificity of NagA, structural analysis of NagA from Thermotoga maritima encoded by the Tm0809 gene was initiated. NagA from T. maritima has been overexpressed in Escherichia coli and crystallized at 296 K using ammonium sulfate as a precipitant. Crystals of T. maritima NagA diffracted to 3.80 Å resolution and belonged to the monoclinic space group C2, with unit-cell parameters a = 231.15, b = 133.62, c = 140.88 Å, β = 89.97°. The crystallization of selenomethionyl-substituted protein is in progress to solve the crystal structure of T. maritima NagA.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Love, Robert A.; Maegley, Karen A.; Yu, Xiu
Human rhinoviruses (HRV), the predominant members of the Picornaviridae family of positive-strand RNA viruses, are the major causative agents of the common cold. Given the lack of effective treatments for rhinoviral infections, virally encoded proteins have become attractive therapeutic targets. The HRV genome encodes an RNA-dependent RNA polymerase (RdRp) denoted 3D{sup pol}, which is responsible for replicating the viral genome and for synthesizing a protein primer used in the replication. Here the crystal structures for three viral serotypes (1B, 14, and 16) of HRV 3D{sup pol} have been determined. The three structures are very similar to one another, and tomore » the closely related poliovirus (PV) 3D{sup pol} enzyme. Because the reported PV crystal structure shows significant disorder, HRV 3D{sup pol} provides the first complete view of a picornaviral RdRp. The folding topology of HRV 3D{sup pol} also resembles that of RdRps from hepatitis C virus (HCV) and rabbit hemorrhagic disease virus (RHDV) despite very low sequence homology.« less
From Genomes to Protein Models and Back
NASA Astrophysics Data System (ADS)
Tramontano, Anna; Giorgetti, Alejandro; Orsini, Massimiliano; Raimondo, Domenico
2007-12-01
The alternative splicing mechanism allows genes to generate more than one product. When the splicing events occur within protein coding regions they can modify the biological function of the protein. Alternative splicing has been suggested as one way for explaining the discrepancy between the number of human genes and functional complexity. We analysed the putative structure of the alternatively spliced gene products annotated in the ENCODE pilot project and discovered that many of the potential alternative gene products will be unlikely to produce stable functional proteins.
Platica, Micsunica; Ivan, Elena; Holland, James F; Ionescu, Alin; Chen, Sheryl; Mandeli, John; Unger, Pamela D; Platica, Ovidiu
2004-02-10
A cDNA clone of 1.1 kb encoding a 108-aa polypeptide was isolated from a human pituitary cDNA library by expression cloning. This protein was named tumor differentiation factor (TDF). The recombinant TDF protein and a 20-aa peptide, P1, selected from the ORF of the gene, induced morphological and biochemical changes consistent with differentiation of human breast and prostate cancer cells. Fibroblast, kidney, hepatoma, and leukemic lymphocytic cell lines were unaffected. Breast and prostate cancer cells aggregated in spheroid-like structures within 24 h of exposure to TDF. This effect was abrogated by a specific affinity-purified rabbit polyclonal anti-P1 Ab. E-cadherin expression was increased in a dose-dependent manner by TDF. Treatment of MCF7 cells with TDF led to production of a lactalbumin-related protein. Peptide P1 significantly decreased the growth of androgen-independent DU145 prostate cancer in severe combined immunodeficient mice. The presence of TDF protein in human sera was detected by the anti-P1 Ab, suggesting a role of TDF in endocrine metabolism. The fact that all activities of TDF can be mimicked by a peptide derived from the encoding TDF sequence opens the possibility of therapeutic applications.
Blancher, C; Omri, B; Bidou, L; Pessac, B; Crisanti, P
1996-10-18
We report the isolation and characterization of a novel cDNA from quail neuroretina encoding a putative protein named nectinepsin. The nectinepsin cDNA identifies a major 2.2-kilobase mRNA that is detected from ED 5 in neuroretina and is increasingly abundant during embryonic development. A nectinepsin mRNA is also found in quail liver, brain, and intestine and in mouse retina. The deduced nectinepsin amino acid sequence contains the RGD cell binding motif of integrin ligands. Furthermore, nectinepsin shares substantial homologies with vitronectin and structural protein similarities with most of the matricial metalloproteases. However, the presence of a specific sequence and the lack of heparin and collagen binding domains of the vitronectin indicate that nectinepsin is a new extracellular matrix protein. Furthermore, genomic Southern blot studies suggest that nectinepsin and vitronectin are encoded by different genes. Western blot analysis with an anti-human vitronectin antiserum revealed, in addition to the 65- and 70-kDa vitronectin bands, an immunoreactive protein of about 54 kDa in all tissues containing nectinepsin mRNA. It seems likely that the form of vitronectin found in chick egg yolk plasma by Nagano et al. ((1992) J. Biol. Chem. 267, 24863-24870) is the protein that corresponds to the nectinepsin cDNA. This new protein could be an important molecule involved in the early steps of the development.
Batra, Ritu; Saripalli, Gautam; Mohan, Amita; Gupta, Saurabh; Gill, Kulvinder S.; Varadwaj, Pritish K.; Balyan, Harindra S.; Gupta, Pushpendra K.
2017-01-01
ADP-glucose pyrophosphorylase (AGPase) is a heterotetrameric enzyme with two large subunits (LS) and two small subunits (SS). It plays a critical role in starch biosynthesis. We are reporting here detailed structure, function and evolution of the genes encoding the LS and the SS among monocots and dicots. “True” orthologs of maize Sh2 (AGPase LS) and Bt2 (AGPase SS) were identified in seven other monocots and three dicots; structure of the enzyme at protein level was also studied. Novel findings of the current study include the following: (i) at the DNA level, the genes controlling the SS are more conserved than those controlling the LS; the variation in both is mainly due to intron number, intron length and intron phase distribution; (ii) at protein level, the SS genes are more conserved relative to those for LS; (iii) “QTCL” motif present in SS showed evolutionary differences in AGPase belonging to wheat 7BS, T. urartu, rice and sorghum, while “LGGG” motif in LS was present in all species except T. urartu and chickpea; SS provides thermostability to AGPase, while LS is involved in regulation of AGPase activity; (iv) heterotetrameric structure of AGPase was predicted and analyzed in real time environment through molecular dynamics simulation for all the species; (v) several cis-acting regulatory elements were identified in the AGPase promoters with their possible role in regulating spatial and temporal expression (endosperm and leaf tissue) and also the expression, in response to abiotic stresses; and (vi) expression analysis revealed downregulation of both subunits under conditions of heat and drought stress. The results of the present study have allowed better understanding of structure and evolution of the genes and the encoded proteins and provided clues for exploitation of variability in these genes for engineering thermostable AGPase. PMID:28174576
Ndungu, John Maina; Suponitsky-Kroyter, Irena; Cavett, Valerie J.; McEnaney, Patrick J.; MacConnell, Andrew B.; Doran, Todd. M.; Ronacher, Katharina; Stanley, Kim; Utset, Ofelia; Walzl, Gerhard; Paegel, Brian M.; Kodadek, Thomas
2017-01-01
The circulating antibody repertoire encodes a patient's health status and pathogen exposure history, but identifying antibodies with diagnostic potential usually requires knowledge of the antigen(s). We previously circumvented this problem by screening libraries of bead-displayed small molecules against case and control serum samples to discover “epitope surrogates” (ligands of IgGs enriched in the case sample). Here, we describe an improved version of this technology that employs DNA-encoded libraries and high-throughput FACS-based screening to discover epitope surrogates that differentiate noninfectious/latent (LTB) patients from infectious/active TB (ATB) patients, which is imperative for proper treatment selection and antibiotic stewardship. Normal control/LTB (10 patients each, NCL) and ATB (10 patients) serum pools were screened against a library (5 × 106 beads, 448k unique compounds) using fluorescent anti-human IgG to label hit compound beads for FACS. Deep sequencing decoded all hit structures and each hit's occurrence frequencies. ATB hits were pruned of NCL hits and prioritized for resynthesis based on occurrence and homology. Several structurally homologous families were identified and 16/21 resynthesized representative hits validated as selective ligands of ATB serum IgGs (p < 0.005). The native secreted TB protein Ag85B (though not the E. coli recombinant form) competed with one of the validated ligands for binding to antibodies, suggesting that it mimics a native Ag85B epitope. The use of DNA-encoded libraries and FACS-based screening in epitope surrogate discovery reveals thousands of potential hit structures. Distilling this list down to several consensus chemical structures yielded a diagnostic panel for ATB composed of thermally stable and economically produced small molecule ligands in place of protein antigens. PMID:27957856
Fast and Forceful Refolding of Stretched α-Helical Solenoid Proteins
Kim, Minkyu; Abdi, Khadar; Lee, Gwangrog; Rabbi, Mahir; Lee, Whasil; Yang, Ming; Schofield, Christopher J.; Bennett, Vann; Marszalek, Piotr E.
2010-01-01
Abstract Anfinsen's thermodynamic hypothesis implies that proteins can encode for stretching through reversible loss of structure. However, large in vitro extensions of proteins that occur through a progressive unfolding of their domains typically dissipate a significant amount of energy, and therefore are not thermodynamically reversible. Some coiled-coil proteins have been found to stretch nearly reversibly, although their extension is typically limited to 2.5 times their folded length. Here, we report investigations on the mechanical properties of individual molecules of ankyrin-R, β-catenin, and clathrin, which are representative examples of over 800 predicted human proteins composed of tightly packed α-helical repeats (termed ANK, ARM, or HEAT repeats, respectively) that form spiral-shaped protein domains. Using atomic force spectroscopy, we find that these polypeptides possess unprecedented stretch ratios on the order of 10–15, exceeding that of other proteins studied so far, and their extension and relaxation occurs with minimal energy dissipation. Their sequence-encoded elasticity is governed by stepwise unfolding of small repeats, which upon relaxation of the stretching force rapidly and forcefully refold, minimizing the hysteresis between the stretching and relaxing parts of the cycle. Thus, we identify a new class of proteins that behave as highly reversible nanosprings that have the potential to function as mechanosensors in cells and as building blocks in springy nanostructures. Our physical view of the protein component of cells as being comprised of predominantly inextensible structural elements under tension may need revision to incorporate springs. PMID:20550922
NASA Astrophysics Data System (ADS)
Mahmood, Zakaria N.; Mahmuddin, Massudi; Mahmood, Mohammed Nooraldeen
Encoding proteins of amino acid sequence to predict classified into their respective families and subfamilies is important research area. However for a given protein, knowing the exact action whether hormonal, enzymatic, transmembranal or nuclear receptors does not depend solely on amino acid sequence but on the way the amino acid thread folds as well. This study provides a prototype system that able to predict a protein tertiary structure. Several methods are used to develop and evaluate the system to produce better accuracy in protein 3D structure prediction. The Bees Optimization algorithm which inspired from the honey bees food foraging method, is used in the searching phase. In this study, the experiment is conducted on short sequence proteins that have been used by the previous researches using well-known tools. The proposed approach shows a promising result.
Pritham, Ellen J; Putliwala, Tasneem; Feschotte, Cédric
2007-04-01
We previously identified a group of atypical mobile elements designated Mavericks from the nematodes Caenorhabditis elegans and C. briggsae and the zebrafish Danio rerio. Here we present the results of comprehensive database searches of the genome sequences available, which reveal that Mavericks are widespread in invertebrates and non-mammalian vertebrates but show a patchy distribution in non-animal species, being present in the fungi Glomus intraradices and Phakopsora pachyrhizi and in several single-celled eukaryotes such as the ciliate Tetrahymena thermophila, the stramenopile Phytophthora infestans and the trichomonad Trichomonas vaginalis, but not detectable in plants. This distribution, together with comparative and phylogenetic analyses of Maverick-encoded proteins, is suggestive of an ancient origin of these elements in eukaryotes followed by lineage-specific losses and/or recurrent episodes of horizontal transmission. In addition, we report that Maverick elements have amplified recently to high copy numbers in T. vaginalis where they now occupy as much as 30% of the genome. Sequence analysis confirms that most Mavericks encode a retroviral-like integrase, but lack other open reading frames typically found in retroelements. Nevertheless, the length and conservation of the target site duplication created upon Maverick insertion (5- or 6-bp) is consistent with a role of the integrase-like protein in the integration of a double-stranded DNA transposition intermediate. Mavericks also display long terminal-inverted repeats but do not contain ORFs similar to proteins encoded by DNA transposons. Instead, Mavericks encode a conserved set of 5 to 9 genes (in addition to the integrase) that are predicted to encode proteins with homology to replication and packaging proteins of some bacteriophages and diverse eukaryotic double-stranded DNA viruses, including a DNA polymerase B homolog and putative capsid proteins. Based on these and other structural similarities, we speculate that Mavericks represent an evolutionary missing link between seemingly disparate invasive DNA elements that include bacteriophages, adenoviruses and eukaryotic linear plasmids.
Trabanino, Rene J; Vaidehi, Nagarajan; Hall, Spencer E; Goddard, William A; Floriano, Wely
2013-02-05
The invention provides computer-implemented methods and apparatus implementing a hierarchical protocol using multiscale molecular dynamics and molecular modeling methods to predict the presence of transmembrane regions in proteins, such as G-Protein Coupled Receptors (GPCR), and protein structural models generated according to the protocol. The protocol features a coarse grain sampling method, such as hydrophobicity analysis, to provide a fast and accurate procedure for predicting transmembrane regions. Methods and apparatus of the invention are useful to screen protein or polynucleotide databases for encoded proteins with transmembrane regions, such as GPCRs.
Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G.; Gelly, Jean-Christophe
2016-01-01
Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation —with Protein Blocks—, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the ‘Hard’ category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/. PMID:27319297
Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G; Gelly, Jean-Christophe
2016-06-20
Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation -with Protein Blocks-, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the 'Hard' category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/.
Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A
1997-07-01
Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lactin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation after which mRNA levels declined, while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled the phloem lactin, further suggesting an interaction between these phloem-specific proteins.
USDA-ARS?s Scientific Manuscript database
Molecular epidemiology and evolution of foot-and-mouth disease virus (FMDV) are widely studied using genomic sequences encoding VP1, the capsid protein containing the most relevant antigenic domains. Although sequencing of the full viral genome is not used as a routine diagnostic or surveillance too...
Van Damme, Els J.M.; Hao, Qiang; Barre, Annick; Rougé, Pierre; Van Leuven, Fred; Peumans, Willy J.
2000-01-01
The most abundant protein of resting rhizomes of Calystegia sepium (L.) R.Br. (hedge bindweed) has been isolated and its corresponding cDNA cloned. The native protein consists of a single polypeptide of 212 amino acid residues and occurs as a mixture of glycosylated and unglycosylated isoforms. Both forms are derived from the same preproprotein containing a signal peptide and a C-terminal propeptide. Analysis of the deduced amino acid sequence indicated that the C. sepium protein shows high sequence identity and structural similarity with plant RNases. However, no RNase activity could be detected in highly purified preparations of the protein. This apparent lack of activity results most probably from the replacement of a conserved His residue, which is essential for the catalytic activity of plant RNases. Our findings not only demonstrate the occurrence of a catalytically inactive variant of an S-like RNase, but also provide further evidence that genes encoding storage proteins may have evolved from genes encoding enzymes or other biologically active proteins. PMID:10677436
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-11-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM.
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-01-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM. Images PMID:1656067
Nirasawa, Satoru; Nakahara, Kazuhiko; Takahashi, Saori
2018-02-27
Paenidase is the first microorganism-derived D-aspartyl endopeptidase that specifically recognizes an internal D-Asp residue to cleave [D-Asp]-X peptide bonds. Using peptide sequences obtained from the protein, we performed PCR with degenerate primers to amplify the paenidase I-encoding gene. Nucleotide sequencing revealed that mature paenidase I consists of 322 amino acid residues and that the protein is encoded as a pro-protein with a 197-amino-acid N-terminal extension compared to the mature protein. Paenidase I exhibits amino acid sequence similarity to several penicillin-binding proteins. In addition, paenidase I was classified into peptidase family S12 based on a MEROPS database search. Family S12 contains serine-type D-Ala-D-Ala carboxypeptidases that have three active site residues (Ser, Lys, and Tyr) in the conserved motifs Ser-Xaa-Thr-Lys and Tyr-Xaa-Asn. These motifs were conserved in the primary structure of paenidase I, and the role of these residues was confirmed by site-directed mutagenesis.
Yan, Y; Xu, W; Chen, H; Ma, Z; Zhu, Y; Cai, S
1994-01-01
The partial structure gene encoding ES antigen derived from Trichinella spiralis (TSP) muscle larvae was cloned, characterized, and expressed in E. coli. The target DNA (0.7 kb) was directly obtained from the TSP total RNA by using RNA PCR technique. Based on the analysis with the RE digestion, the fragment was cloned into the fusion expression vector pEX31C. It was shown that a kind of 37kDa fusion protein was expressed in E. coli containing the recombinant plasmid by SDS-PAGE electrophoresis. The expressed protein was over 22% of the total cell protein, and it was aggregated in the form of inclusion bodies in E. coli. The purified protein could be recognized in ELISA both by sera from swine-infected with TSP and by the monoclonal antibody against TSP. These findings suggest that the recombinant protein is a potentially valuable antigen both for immunodiagnosis and vaccine development of trichinellosis.
Comprehensive assessment of cancer missense mutation clustering in protein structures.
Kamburov, Atanas; Lawrence, Michael S; Polak, Paz; Leshchiner, Ignaty; Lage, Kasper; Golub, Todd R; Lander, Eric S; Getz, Gad
2015-10-06
Large-scale tumor sequencing projects enabled the identification of many new cancer gene candidates through computational approaches. Here, we describe a general method to detect cancer genes based on significant 3D clustering of mutations relative to the structure of the encoded protein products. The approach can also be used to search for proteins with an enrichment of mutations at binding interfaces with a protein, nucleic acid, or small molecule partner. We applied this approach to systematically analyze the PanCancer compendium of somatic mutations from 4,742 tumors relative to all known 3D structures of human proteins in the Protein Data Bank. We detected significant 3D clustering of missense mutations in several previously known oncoproteins including HRAS, EGFR, and PIK3CA. Although clustering of missense mutations is often regarded as a hallmark of oncoproteins, we observed that a number of tumor suppressors, including FBXW7, VHL, and STK11, also showed such clustering. Beside these known cases, we also identified significant 3D clustering of missense mutations in NUF2, which encodes a component of the kinetochore, that could affect chromosome segregation and lead to aneuploidy. Analysis of interaction interfaces revealed enrichment of mutations in the interfaces between FBXW7-CCNE1, HRAS-RASA1, CUL4B-CAND1, OGT-HCFC1, PPP2R1A-PPP2R5C/PPP2R2A, DICER1-Mg2+, MAX-DNA, SRSF2-RNA, and others. Together, our results indicate that systematic consideration of 3D structure can assist in the identification of cancer genes and in the understanding of the functional role of their mutations.
Comprehensive assessment of cancer missense mutation clustering in protein structures
Kamburov, Atanas; Lawrence, Michael S.; Polak, Paz; Leshchiner, Ignaty; Lage, Kasper; Golub, Todd R.; Lander, Eric S.; Getz, Gad
2015-01-01
Large-scale tumor sequencing projects enabled the identification of many new cancer gene candidates through computational approaches. Here, we describe a general method to detect cancer genes based on significant 3D clustering of mutations relative to the structure of the encoded protein products. The approach can also be used to search for proteins with an enrichment of mutations at binding interfaces with a protein, nucleic acid, or small molecule partner. We applied this approach to systematically analyze the PanCancer compendium of somatic mutations from 4,742 tumors relative to all known 3D structures of human proteins in the Protein Data Bank. We detected significant 3D clustering of missense mutations in several previously known oncoproteins including HRAS, EGFR, and PIK3CA. Although clustering of missense mutations is often regarded as a hallmark of oncoproteins, we observed that a number of tumor suppressors, including FBXW7, VHL, and STK11, also showed such clustering. Beside these known cases, we also identified significant 3D clustering of missense mutations in NUF2, which encodes a component of the kinetochore, that could affect chromosome segregation and lead to aneuploidy. Analysis of interaction interfaces revealed enrichment of mutations in the interfaces between FBXW7-CCNE1, HRAS-RASA1, CUL4B-CAND1, OGT-HCFC1, PPP2R1A-PPP2R5C/PPP2R2A, DICER1-Mg2+, MAX-DNA, SRSF2-RNA, and others. Together, our results indicate that systematic consideration of 3D structure can assist in the identification of cancer genes and in the understanding of the functional role of their mutations. PMID:26392535
Protein structure similarity from Principle Component Correlation analysis.
Zhou, Xiaobo; Chou, James; Wong, Stephen T C
2006-01-25
Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD) in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC) analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum eigenvalues can be highly effective in clustering structurally or topologically similar proteins. We believe that the PCC analysis of interaction matrix is highly flexible in adopting various structural parameters for protein structure comparison.
Judelson, Howard S; Ah-Fong, Audrey M V; Aux, George; Avrova, Anna O; Bruce, Catherine; Cakir, Cahid; da Cunha, Luis; Grenville-Briggs, Laura; Latijnhouwers, Maita; Ligterink, Wilco; Meijer, Harold J G; Roberts, Samuel; Thurber, Carrie S; Whisson, Stephen C; Birch, Paul R J; Govers, Francine; Kamoun, Sophien; van West, Pieter; Windass, John
2008-04-01
Much of the pathogenic success of Phytophthora infestans, the potato and tomato late blight agent, relies on its ability to generate from mycelia large amounts of sporangia, which release zoospores that encyst and form infection structures. To better understand these stages, Affymetrix GeneChips based on 15,650 unigenes were designed and used to profile the life cycle. Approximately half of P. infestans genes were found to exhibit significant differential expression between developmental transitions, with approximately (1)/(10) being stage-specific and most changes occurring during zoosporogenesis. Quantitative reverse-transcription polymerase chain reaction assays confirmed the robustness of the array results and showed that similar patterns of differential expression were obtained regardless of whether hyphae were from laboratory media or infected tomato. Differentially expressed genes encode potential cellular regulators, especially protein kinases; metabolic enzymes such as those involved in glycolysis, gluconeogenesis, or the biosynthesis of amino acids or lipids; regulators of DNA synthesis; structural proteins, including predicted flagellar proteins; and pathogenicity factors, including cell-wall-degrading enzymes, RXLR effector proteins, and enzymes protecting against plant defense responses. Curiously, some stage-specific transcripts do not appear to encode functional proteins. These findings reveal many new aspects of oomycete biology, as well as potential targets for crop protection chemicals.
Genetic and Functional Diversification of Small RNA Pathways in Plants
Gustafson, Adam M; Kasschau, Kristin D; Lellis, Andrew D; Zilberman, Daniel; Jacobsen, Steven E
2004-01-01
Multicellular eukaryotes produce small RNA molecules (approximately 21–24 nucleotides) of two general types, microRNA (miRNA) and short interfering RNA (siRNA). They collectively function as sequence-specific guides to silence or regulate genes, transposons, and viruses and to modify chromatin and genome structure. Formation or activity of small RNAs requires factors belonging to gene families that encode DICER (or DICER-LIKE [DCL]) and ARGONAUTE proteins and, in the case of some siRNAs, RNA-dependent RNA polymerase (RDR) proteins. Unlike many animals, plants encode multiple DCL and RDR proteins. Using a series of insertion mutants of Arabidopsis thaliana, unique functions for three DCL proteins in miRNA (DCL1), endogenous siRNA (DCL3), and viral siRNA (DCL2) biogenesis were identified. One RDR protein (RDR2) was required for all endogenous siRNAs analyzed. The loss of endogenous siRNA in dcl3 and rdr2 mutants was associated with loss of heterochromatic marks and increased transcript accumulation at some loci. Defects in siRNA-generation activity in response to turnip crinkle virus in dcl2 mutant plants correlated with increased virus susceptibility. We conclude that proliferation and diversification of DCL and RDR genes during evolution of plants contributed to specialization of small RNA-directed pathways for development, chromatin structure, and defense. PMID:15024409
Conserved herpesvirus protein kinases
Gershburg, Edward; Pagano, Joseph S.
2008-01-01
Conserved herpesviral protein kinases (CHPKs) are a group of enzymes conserved throughout all subfamilies of Herpesviridae. Members of this group are serine/threonine protein kinases that are likely to play a conserved role in viral infection by interacting with common host cellular and viral factors; however along with a conserved role, individual kinases may have unique functions in the context of viral infection in such a way that they are only partially replaceable even by close homologues. Recent studies demonstrated that CHPKs are crucial for viral infection and suggested their involvement in regulation of numerous processes at various infection steps (primary infection, nuclear egress, tegumentation), although the mechanisms of this regulation remain unknown. Notwithstanding, recent advances in discovery of new CHPK targets, and studies of CHPK knockout phenotypes have raised their attractiveness as targets for antiviral therapy. A number of compounds have been shown to inhibit the activity of human cytomegalovirus (HCMV)-encoded UL97 protein kinase and exhibit a pronounced antiviral effect, although the same compounds are inactive against Epstein-Barr Virus (EBV)-encoded protein kinase BGLF4, illustrating the fact that low homology between the members of this group complicates development of compounds targeting the whole group, and suggesting that individualized, structure-based inhibitor design will be more effective. Determination of CHPK structures will greatly facilitate this task. PMID:17881303
Cobessi, David; Dumas, Renaud; Pautre, Virginie; Meinguet, Céline; Ferrer, Jean-Luc; Alban, Claude
2012-01-01
Diaminopelargonic acid aminotransferase (DAPA-AT) and dethiobiotin synthetase (DTBS) catalyze the antepenultimate and the penultimate steps, respectively, of biotin synthesis. Whereas DAPA-AT and DTBS are encoded by distinct genes in bacteria, in biotin-synthesizing eukaryotes (plants and most fungi), both activities are carried out by a single enzyme encoded by a bifunctional gene originating from the fusion of prokaryotic monofunctional ancestor genes. In few angiosperms, including Arabidopsis thaliana, this chimeric gene (named BIO3-BIO1) also produces a bicistronic transcript potentially encoding separate monofunctional proteins that can be produced following an alternative splicing mechanism. The functional significance of the occurrence of a bifunctional enzyme in biotin synthesis pathway in eukaryotes and the relative implication of each of the potential enzyme forms (bifunctional versus monofunctional) in the plant biotin pathway are unknown. In this study, we demonstrate that the BIO3-BIO1 fusion protein is the sole protein form produced by the BIO3-BIO1 locus in Arabidopsis. The enzyme catalyzes both DAPA-AT and DTBS reactions in vitro and is targeted to mitochondria in vivo. Our biochemical and kinetic characterizations of the pure recombinant enzyme show that in the course of the reaction, the DAPA intermediate is directly transferred from the DAPA-AT active site to the DTBS active site. Analysis of several structures of the enzyme crystallized in complex with and without its ligands reveals key structural elements involved for acquisition of bifunctionality and brings, together with mutagenesis experiments, additional evidences for substrate channeling. PMID:22547782
Phan, Isabelle Q. H.; Scheib, Holger; Subramanian, Sandhya; Edwards, Thomas E.; Lehman, Stephanie S.; Piitulainen, Hanna; Sayeedur Rahman, M.; Rennoll-Bankert, Kristen E.; Staker, Bart L.; Taira, Suvi; Stacy, Robin; Myler, Peter J.; Azad, Abdu F.
2015-01-01
ABSTRACT Prokaryotes use type IV secretion systems (T4SSs) to translocate substrates (e.g., nucleoprotein, DNA, and protein) and/or elaborate surface structures (i.e., pili or adhesins). Bacterial genomes may encode multiple T4SSs, e.g., there are three functionally divergent T4SSs in some Bartonella species (vir, vbh, and trw). In a unique case, most rickettsial species encode a T4SS (rvh) enriched with gene duplication. Within single genomes, the evolutionary and functional implications of cross-system interchangeability of analogous T4SS protein components remains poorly understood. To lend insight into cross-system interchangeability, we analyzed the VirB8 family of T4SS channel proteins. Crystal structures of three VirB8 and two TrwG Bartonella proteins revealed highly conserved C-terminal periplasmic domain folds and dimerization interfaces, despite tremendous sequence divergence. This implies remarkable structural constraints for VirB8 components in the assembly of a functional T4SS. VirB8/TrwG heterodimers, determined via bacterial two-hybrid assays and molecular modeling, indicate that differential expression of trw and vir systems is the likely barrier to VirB8-TrwG interchangeability. We also determined the crystal structure of Rickettsia typhi RvhB8-II and modeled its coexpressed divergent paralog RvhB8-I. Remarkably, while RvhB8-I dimerizes and is structurally similar to other VirB8 proteins, the RvhB8-II dimer interface deviates substantially from other VirB8 structures, potentially preventing RvhB8-I/RvhB8-II heterodimerization. For the rvh T4SS, the evolution of divergent VirB8 paralogs implies a functional diversification that is unknown in other T4SSs. Collectively, our data identify two different constraints (spatiotemporal for Bartonella trw and vir T4SSs and structural for rvh T4SSs) that mediate the functionality of multiple divergent T4SSs within a single bacterium. PMID:26646013
Galloway-Peña, Jessica R.; Liang, Xiaowen; Singh, Kavindra V.; Yadav, Puja; Chang, Chungyu; La Rosa, Sabina Leanti; Shelburne, Samuel; Ton-That, Hung; Höök, Magnus
2014-01-01
The WxL domain recently has been identified as a novel cell wall binding domain found in numerous predicted proteins within multiple Gram-positive bacterial species. However, little is known about the function of proteins containing this novel domain. Here, we identify and characterize 6 Enterococcus faecium proteins containing the WxL domain which, by reverse transcription-PCR (RT-PCR) and genomic analyses, are located in three similarly organized operons, deemed WxL loci A, B, and C. Western blotting, electron microscopy, and enzyme-linked immunosorbent assays (ELISAs) determined that genes of WxL loci A and C encode antigenic, cell surface proteins exposed at higher levels in clinical isolates than in commensal isolates. Secondary structural analyses of locus A recombinant WxL domain-containing proteins found they are rich in β-sheet structure and disordered segments. Using Biacore analyses, we discovered that recombinant WxL proteins from locus A bind human extracellular matrix proteins, specifically type I collagen and fibronectin. Proteins encoded by locus A also were found to bind to each other, suggesting a novel cell surface complex. Furthermore, bile salt survival assays and animal models using a mutant from which all three WxL loci were deleted revealed the involvement of WxL operons in bile salt stress and endocarditis pathogenesis. In summary, these studies extend our understanding of proteins containing the WxL domain and their potential impact on colonization and virulence in E. faecium and possibly other Gram-positive bacterial species. PMID:25512313
Lorentsen, R H; Graversen, J H; Caterer, N R; Thogersen, H C; Etzerodt, M
2000-01-01
Tetranectin is a homotrimeric plasma and extracellular-matrix protein that binds plasminogen and complex sulphated polysaccharides including heparin. In terms of primary and tertiary structure, tetranectin is related to the collectin family of Ca(2+)-binding C-type lectins. Tetranectin is encoded in three exons. Exon 3 encodes the carbohydrate recognition domain, which binds to kringle 4 in plasminogen at low levels of Ca(2+). Exon 2 encodes an alpha-helix, which is necessary and sufficient to govern the trimerization of tetranectin by assembling into a triple-helical coiled-coil structural element. Here we show that the heparin-binding site in tetranectin resides not in the carbohydrate recognition domain but within the N-terminal region, comprising the 16 amino acid residues encoded by exon 1. In particular, the lysine residues in the decapeptide segment KPKKIVNAKK (tetranectin residues 6-15) are shown to be of primary importance in heparin binding. PMID:10727405
Lorentsen, R H; Graversen, J H; Caterer, N R; Thogersen, H C; Etzerodt, M
2000-04-01
Tetranectin is a homotrimeric plasma and extracellular-matrix protein that binds plasminogen and complex sulphated polysaccharides including heparin. In terms of primary and tertiary structure, tetranectin is related to the collectin family of Ca(2+)-binding C-type lectins. Tetranectin is encoded in three exons. Exon 3 encodes the carbohydrate recognition domain, which binds to kringle 4 in plasminogen at low levels of Ca(2+). Exon 2 encodes an alpha-helix, which is necessary and sufficient to govern the trimerization of tetranectin by assembling into a triple-helical coiled-coil structural element. Here we show that the heparin-binding site in tetranectin resides not in the carbohydrate recognition domain but within the N-terminal region, comprising the 16 amino acid residues encoded by exon 1. In particular, the lysine residues in the decapeptide segment KPKKIVNAKK (tetranectin residues 6-15) are shown to be of primary importance in heparin binding.
Tula hantavirus NSs protein accumulates in the perinuclear area in infected and transfected cells.
Virtanen, Jussi Oskari; Jääskeläinen, Kirsi Maria; Djupsjöbacka, Janica; Vaheri, Antti; Plyusnin, Alexander
2010-01-01
The small RNA segment of some hantaviruses (family Bunyaviridae) encodes two proteins: the nucleocapsid protein and, in an overlapping reading frame, a non-structural (NSs) protein. The hantavirus NSs protein, like those of orthobunya- and phleboviruses, counteracts host innate immunity. Here, for the first time, the NSs protein of a hantavirus (Tula virus) has been observed in infected cells and shown to localize in the perinuclear area. Transiently expressed NSs protein showed similar localization, although the kinetics was slightly different, suggesting that to reach its proper location in the infected cell, the NSs protein does not have to cooperate with other viral proteins.
2004-01-01
Numerous invertebrate species belonging to several phyla cannot synthesize sterols de novo and rely on a dietary source of the compound. SCPx (sterol carrier protein 2/3-oxoacyl-CoA thiolase) is a protein involved in the trafficking of sterols and oxidation of branched-chain fatty acids. We have isolated SCPx protein from Spodoptera littoralis (cotton leafworm) and have subjected it to limited amino acid sequencing. A reverse-transcriptase PCR-based approach has been used to clone the cDNA (1.9 kb), which encodes a 57 kDa protein. Northern blotting detected two mRNA transcripts, one of 1.9 kb, encoding SCPx, and one of 0.95 kb, presumably encoding SCP2 (sterol carrier protein 2). The former mRNA was highly expressed in midgut and Malpighian tubules during the last larval instar. Furthermore, constitutive expression of the gene was detected in the prothoracic glands, which are the main tissue producing the insect moulting hormone. There was no significant change in the 1.9 kb mRNA in midgut throughout development, but slightly higher expression in the early stages. Conceptual translation of the cDNA and a database search revealed that the gene includes the SCP2 sequence and a putative peroxisomal targeting signal in the C-terminal region. Also a cysteine residue at the putative active site for the 3-oxoacyl-CoA thiolase is conserved. Southern blotting showed that SCPx is likely to be encoded by a single-copy gene. The mRNA expression pattern and the gene structure suggest that SCPx from S. littoralis (a lepidopteran) is evolutionarily closer to that of mammals than to that of dipterans. PMID:15149283
Kim, Do Jin; Bitto, Eduard; Bingman, Craig A; Kim, Hyun-Jung; Han, Byung Woo; Phillips, George N
2015-07-01
Members of the universal stress protein (USP) family are conserved in a phylogenetically diverse range of prokaryotes, fungi, protists, and plants and confer abilities to respond to a wide range of environmental stresses. Arabidopsis thaliana contains 44 USP domain-containing proteins, and USP domain is found either in a small protein with unknown physiological function or in an N-terminal portion of a multi-domain protein, usually a protein kinase. Here, we report the first crystal structure of a eukaryotic USP-like protein encoded from the gene At3g01520. The crystal structure of the protein At3g01520 was determined by the single-wavelength anomalous dispersion method and refined to an R factor of 21.8% (Rfree = 26.1%) at 2.5 Å resolution. The crystal structure includes three At3g01520 protein dimers with one AMP molecule bound to each protomer, comprising a Rossmann-like α/β overall fold. The bound AMP and conservation of residues in the ATP-binding loop suggest that the protein At3g01520 also belongs to the ATP-binding USP subfamily members. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
Heiman, Erica M.; McDonald, Sarah M.; Barro, Mario; Taraporewala, Zenobia F.; Bar-Magen, Tamara; Patton, John T.
2008-01-01
Group A human rotaviruses (HRVs) are the major cause of severe viral gastroenteritis in infants and young children. To gain insight into the level of genetic variation among HRVs, we determined the genome sequences for 10 strains belonging to different VP7 serotypes (G types). The HRVs chosen for this study, D, DS-1, P, ST3, IAL28, Se584, 69M, WI61, A64, and L26, were isolated from infected persons and adapted to cell culture to use as serotype references. Our sequencing results revealed that most of the individual proteins from each HRV belong to one of three genotypes (1, 2, or 3) based on their similarities to proteins of genogroup strains (Wa, DS-1, or AU-1, respectively). Strains D, P, ST3, IAL28, and WI61 encode genotype 1 (Wa-like) proteins, whereas strains DS-1 and 69M encode genotype 2 (DS-1-like) proteins. Of the 10 HRVs sequenced, 3 of them (Se584, A64, and L26) encode proteins belonging to more than one genotype, indicating that they are intergenogroup reassortants. We used amino acid sequence alignments to identify residues that distinguish proteins belonging to HRV genotype 1, 2, or 3. These genotype-specific changes cluster in definitive regions within each viral protein, many of which are sites of known protein-protein interactions. For the intermediate viral capsid protein (VP6), the changes map onto the atomic structure at the VP2-VP6, VP4-VP6, and VP7-VP6 interfaces. The results of this study provide evidence that group A HRV gene constellations exist and may be influenced by interactions among viral proteins during replication. PMID:18786998
Auto-FPFA: An Automated Microscope for Characterizing Genetically Encoded Biosensors.
Nguyen, Tuan A; Puhl, Henry L; Pham, An K; Vogel, Steven S
2018-05-09
Genetically encoded biosensors function by linking structural change in a protein construct, typically tagged with one or more fluorescent proteins, to changes in a biological parameter of interest (such as calcium concentration, pH, phosphorylation-state, etc.). Typically, the structural change triggered by alterations in the bio-parameter is monitored as a change in either fluorescent intensity, or lifetime. Potentially, other photo-physical properties of fluorophores, such as fluorescence anisotropy, molecular brightness, concentration, and lateral and/or rotational diffusion could also be used. Furthermore, while it is likely that multiple photo-physical attributes of a biosensor might be altered as a function of the bio-parameter, standard measurements monitor only a single photo-physical trait. This limits how biosensors are designed, as well as the accuracy and interpretation of biosensor measurements. Here we describe the design and construction of an automated multimodal-microscope. This system can autonomously analyze 96 samples in a micro-titer dish and for each sample simultaneously measure intensity (photon count), fluorescence lifetime, time-resolved anisotropy, molecular brightness, lateral diffusion time, and concentration. We characterize the accuracy and precision of this instrument, and then demonstrate its utility by characterizing three types of genetically encoded calcium sensors as well as a negative control.
Arboretum and Puerto Almendras viruses: two novel rhabdoviruses isolated from mosquitoes in Peru.
Vasilakis, Nikos; Castro-Llanos, Fanny; Widen, Steven G; Aguilar, Patricia V; Guzman, Hilda; Guevara, Carolina; Fernandez, Roberto; Auguste, Albert J; Wood, Thomas G; Popov, Vsevolod; Mundal, Kirk; Ghedin, Elodie; Kochel, Tadeusz J; Holmes, Edward C; Walker, Peter J; Tesh, Robert B
2014-04-01
Arboretum virus (ABTV) and Puerto Almendras virus (PTAMV) are two mosquito-associated rhabdoviruses isolated from pools of Psorophora albigenu and Ochlerotattus fulvus mosquitoes, respectively, collected in the Department of Loreto, Peru, in 2009. Initial tests suggested that both viruses were novel rhabdoviruses and this was confirmed by complete genome sequencing. Analysis of their 11 482 nt (ABTV) and 11 876 (PTAMV) genomes indicates that they encode the five canonical rhabdovirus structural proteins (N, P, M, G and L) with an additional gene (U1) encoding a small hydrophobic protein. Evolutionary analysis of the L protein indicates that ABTV and PTAMV are novel and phylogenetically distinct rhabdoviruses that cannot be classified as members of any of the eight currently recognized genera within the family Rhabdoviridae, highlighting the vast diversity of this virus family.
Arboretum and Puerto Almendras viruses: two novel rhabdoviruses isolated from mosquitoes in Peru
Castro-Llanos, Fanny; Widen, Steven G.; Aguilar, Patricia V.; Guzman, Hilda; Guevara, Carolina; Fernandez, Roberto; Auguste, Albert J.; Wood, Thomas G.; Popov, Vsevolod; Mundal, Kirk; Ghedin, Elodie; Kochel, Tadeusz J.; Holmes, Edward C.; Walker, Peter J.; Tesh, Robert B.
2014-01-01
Arboretum virus (ABTV) and Puerto Almendras virus (PTAMV) are two mosquito-associated rhabdoviruses isolated from pools of Psorophora albigenu and Ochlerotattus fulvus mosquitoes, respectively, collected in the Department of Loreto, Peru, in 2009. Initial tests suggested that both viruses were novel rhabdoviruses and this was confirmed by complete genome sequencing. Analysis of their 11 482 nt (ABTV) and 11 876 (PTAMV) genomes indicates that they encode the five canonical rhabdovirus structural proteins (N, P, M, G and L) with an additional gene (U1) encoding a small hydrophobic protein. Evolutionary analysis of the L protein indicates that ABTV and PTAMV are novel and phylogenetically distinct rhabdoviruses that cannot be classified as members of any of the eight currently recognized genera within the family Rhabdoviridae, highlighting the vast diversity of this virus family. PMID:24421116
Genetic structure and regulation of isoprene synthase in Poplar (Populus spp.).
Vickers, Claudia E; Possell, Malcolm; Nicholas Hewitt, C; Mullineaux, Philip M
2010-07-01
Isoprene is a volatile 5-carbon hydrocarbon derived from the chloroplastic methylerythritol 2-C-methyl-D: -erythritol 4-phosphate isoprenoid pathway. In plants, isoprene emission is controlled by the enzyme isoprene synthase; however, there is still relatively little known about the genetics and regulation of this enzyme. Isoprene synthase gene structure was analysed in three poplar species. It was found that genes encoding stromal isoprene synthase exist as a small gene family, the members of which encode virtually identical proteins and are differentially regulated. Accumulation of isoprene synthase protein is developmentally regulated, but does not differ between sun and shade leaves and does not increase when heat stress is applied. Our data suggest that, in mature leaves, isoprene emission rates are primarily determined by substrate (dimethylallyl diphosphate, DMADP) availability. In immature leaves, where isoprene synthase levels are variable, emission levels are also influenced by the amount of isoprene synthase protein. No thylakoid isoforms could be identified in Populus alba or in Salix babylonica. Together, these data show that control of isoprene emission at the genetic level is far more complicated than previously assumed.
NASA Astrophysics Data System (ADS)
Mozhdehi, Davoud; Luginbuhl, Kelli M.; Simon, Joseph R.; Dzuricky, Michael; Berger, Rüdiger; Varol, H. Samet; Huang, Fred C.; Buehne, Kristen L.; Mayne, Nicholas R.; Weitzhandler, Isaac; Bonn, Mischa; Parekh, Sapun H.; Chilkoti, Ashutosh
2018-05-01
Post-translational modification of proteins is a strategy widely used in biological systems. It expands the diversity of the proteome and allows for tailoring of both the function and localization of proteins within cells as well as the material properties of structural proteins and matrices. Despite their ubiquity in biology, with a few exceptions, the potential of post-translational modifications in biomaterials synthesis has remained largely untapped. As a proof of concept to demonstrate the feasibility of creating a genetically encoded biohybrid material through post-translational modification, we report here the generation of a family of three stimulus-responsive hybrid materials—fatty-acid-modified elastin-like polypeptides—using a one-pot recombinant expression and post-translational lipidation methodology. These hybrid biomaterials contain an amphiphilic domain, composed of a β-sheet-forming peptide that is post-translationally functionalized with a C14 alkyl chain, fused to a thermally responsive elastin-like polypeptide. They exhibit temperature-triggered hierarchical self-assembly across multiple length scales with varied structure and material properties that can be controlled at the sequence level.
Mitochondrial disease associated with complex I (NADH-CoQ oxidoreductase) deficiency.
Scheffler, Immo E
2015-05-01
Mitochondrial diseases due to a reduced capacity for oxidative phosphorylation were first identified more than 20 years ago, and their incidence is now recognized to be quite significant. In a large proportion of cases the problem can be traced to a complex I (NADH-CoQ oxidoreductase) deficiency (Phenotype MIM #252010). Because the complex consists of 44 subunits, there are many potential targets for pathogenic mutations, both on the nuclear and mitochondrial genomes. Surprisingly, however, almost half of the complex I deficiencies are due to defects in as yet unidentified genes that encode proteins other than the structural proteins of the complex. This review attempts to summarize what we know about the molecular basis of complex I deficiencies: mutations in the known structural genes, and mutations in an increasing number of genes encoding "assembly factors", that is, proteins required for the biogenesis of a functional complex I that are not found in the final complex I. More such genes must be identified before definitive genetic counselling can be applied in all cases of affected families.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Deka, Ranjit K.; Brautigam, Chad A.; Goldberg, Martin
2012-05-25
Treponema pallidum, the bacterial agent of syphilis, is predicted to encode one tripartite ATP-independent periplasmic transporter (TRAP-T). TRAP-Ts typically employ a periplasmic substrate-binding protein (SBP) to deliver the cognate ligand to the transmembrane symporter. Herein, we demonstrate that the genes encoding the putative TRAP-T components from T. pallidum, tp0957 (the SBP), and tp0958 (the symporter), are in an operon with an uncharacterized third gene, tp0956. We determined the crystal structure of recombinant Tp0956; the protein is trimeric and perforated by a pore. Part of Tp0956 forms an assembly similar to those of 'tetratricopeptide repeat' (TPR) motifs. The crystal structure ofmore » recombinant Tp0957 was also determined; like the SBPs of other TRAP-Ts, there are two lobes separated by a cleft. In these other SBPs, the cleft binds a negatively charged ligand. However, the cleft of Tp0957 has a strikingly hydrophobic chemical composition, indicating that its ligand may be substantially different and likely hydrophobic. Analytical ultracentrifugation of the recombinant versions of Tp0956 and Tp0957 established that these proteins associate avidly. This unprecedented interaction was confirmed for the native molecules using in vivo cross-linking experiments. Finally, bioinformatic analyses suggested that this transporter exemplifies a new subfamily of TPATs (TPR-protein-associated TRAP-Ts) that require the action of a TPR-containing accessory protein for the periplasmic transport of a potentially hydrophobic ligand(s).« less
Ni, Lisheng; Jensen, Slade O; Ky Tonthat, Nam; Berg, Tracey; Kwong, Stephen M; Guan, Fiona H X; Brown, Melissa H; Skurray, Ronald A; Firth, Neville; Schumacher, Maria A
2009-11-01
Plasmids harbored by Staphylococcus aureus are a major contributor to the spread of bacterial multi-drug resistance. Plasmid conjugation and partition are critical to the dissemination and inheritance of such plasmids. Here, we demonstrate that the ArtA protein encoded by the S. aureus multi-resistance plasmid pSK41 is a global transcriptional regulator of pSK41 genes, including those involved in conjugation and segregation. ArtA shows no sequence homology to any structurally characterized DNA-binding protein. To elucidate the mechanism by which it specifically recognizes its DNA site, we obtained the structure of ArtA bound to its cognate operator, ACATGACATG. The structure reveals that ArtA is representative of a new family of ribbon-helix-helix (RHH) DNA-binding proteins that contain extended, N-terminal basic motifs. Strikingly, unlike most well-studied RHH proteins ArtA binds its cognate operators as a dimer. However, we demonstrate that it is also able to recognize an atypical operator site by binding as a dimer-of-dimers and the extended N-terminal regions of ArtA were shown to be essential for this dimer-of-dimer binding mode. Thus, these data indicate that ArtA is a master regulator of genes critical for both horizontal and vertical transmission of pSK41 and that it can recognize DNA utilizing alternate binding modes.
Ni, Lisheng; Jensen, Slade O.; Ky Tonthat, Nam; Berg, Tracey; Kwong, Stephen M.; Guan, Fiona H. X.; Brown, Melissa H.; Skurray, Ronald A.; Firth, Neville; Schumacher, Maria A.
2009-01-01
Plasmids harbored by Staphylococcus aureus are a major contributor to the spread of bacterial multi-drug resistance. Plasmid conjugation and partition are critical to the dissemination and inheritance of such plasmids. Here, we demonstrate that the ArtA protein encoded by the S. aureus multi-resistance plasmid pSK41 is a global transcriptional regulator of pSK41 genes, including those involved in conjugation and segregation. ArtA shows no sequence homology to any structurally characterized DNA-binding protein. To elucidate the mechanism by which it specifically recognizes its DNA site, we obtained the structure of ArtA bound to its cognate operator, ACATGACATG. The structure reveals that ArtA is representative of a new family of ribbon–helix–helix (RHH) DNA-binding proteins that contain extended, N-terminal basic motifs. Strikingly, unlike most well-studied RHH proteins ArtA binds its cognate operators as a dimer. However, we demonstrate that it is also able to recognize an atypical operator site by binding as a dimer-of-dimers and the extended N-terminal regions of ArtA were shown to be essential for this dimer-of-dimer binding mode. Thus, these data indicate that ArtA is a master regulator of genes critical for both horizontal and vertical transmission of pSK41 and that it can recognize DNA utilizing alternate binding modes. PMID:19759211
Schwartz, N B; Pirok, E W; Mensch, J R; Domowicz, M S
1999-01-01
Proteoglycans are complex macromolecules, consisting of a polypeptide backbone to which are covalently attached one or more glycosaminoglycan chains. Molecular cloning has allowed identification of the genes encoding the core proteins of various proteoglycans, leading to a better understanding of the diversity of proteoglycan structure and function, as well as to the evolution of a classification of proteoglycans on the basis of emerging gene families that encode the different core proteins. One such family includes several proteoglycans that have been grouped with aggrecan, the large aggregating chondroitin sulfate proteoglycan of cartilage, based on a high number of sequence similarities within the N- and C-terminal domains. Thus far these proteoglycans include versican, neurocan, and brevican. It is now apparent that these proteins, as a group, are truly a gene family with shared structural motifs on the protein and nucleotide (mRNA) levels, and with nearly identical genomic organizations. Clearly a common ancestral origin is indicated for the members of the aggrecan family of proteoglycans. However, differing patterns of amplification and divergence have also occurred within certain exons across species and family members, leading to the class-characteristic protein motifs in the central carbohydrate-rich region exclusively. Thus the overall domain organization strongly suggests that sequence conservation in the terminal globular domains underlies common functions, whereas differences in the central portions of the genes account for functional specialization among the members of this gene family.
Structure of the Bacillus subtilis phage SPO1-encoded type II DNA-binding protein TF1 in solution.
Jia, X; Grove, A; Ivancic, M; Hsu, V L; Geiduscheck, E P; Kearns, D R
1996-10-25
The solution structure of a type II DNA-binding protein, the bacteriophage SPO1-encoded transcription factor 1 (TF1), was determined using NMR spectroscopy. Selective 2H-labeling, 13C-labeling and isotopic heterodimers were used to distinguish contacts between and within monomers of the dimeric protein. A total of 1914 distance and dihedral angle constraints derived from NMR experiments were used in structure calculations using restrained molecular dynamics and simulated annealing protocols. The ensemble of 30 calculated structures has a root-mean-square deviation (r.m.s.d.) of 0.9 A, about the average structure for the backbone atoms, and 1.2 A for all heavy-atoms of the dimeric core (helices 1 and 2) and the beta-sheets. A severe helix distortion at residues 92-93 in the middle of helix 3 is associated with r.m.s.d. of approximately 1.5 A for the helix 3 backbone. Deviations of approximately 5 A or larger are noted for the very flexible beta-ribbon arms that constitute part of a proposed DNA-binding region. A structural model of TF1 has been calculated based on the previously reported crystal structure of the homologous HU protein and this model was used as the starting structure for calculations. A comparison between the calculated average solution structure of TF1 and a solution structure of HU indicates a similarity in the dimeric core (excluding the nine amino acid residue tail) with pairwise deviations of 2 to 3 A. The largest deviations between the average structure and the HU solution structure were found in the beta-ribbon arms, as expected. A 4 A deviation is found at residue 15 of TF1 which is in a loop connecting two helical segments; it has been reported that substitution of Glu15 by Gly increases the thermostability of TF1. The homology between TF1 and other proteins of this family leads us to anticipate similar tertiary structures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weeks, S.; Grasty, K; Hernandez-Cuebas, L
2009-01-01
The covalent attachment of different types of poly-ubiquitin chains signal different outcomes for the proteins so targeted. For example, a protein modified with Lys-48-linked poly-ubiquitin chains is targeted for proteasomal degradation, whereas Lys-63-linked chains encode nondegradative signals. The structural features that enable these different types of chains to encode different signals have not yet been fully elucidated. We report here the X-ray crystal structures of Lys-63-linked tri- and di-ubiquitin at resolutions of 2.3 and 1.9 {angstrom}, respectively. The tri- and di-ubiquitin species adopt essentially identical structures. In both instances, the ubiquitin chain assumes a highly extended conformation with a left-handedmore » helical twist; the helical chain contains four ubiquitin monomers per turn and has a repeat length of {approx}110 {angstrom}. Interestingly, Lys-48 ubiquitin chains also adopt a left-handed helical structure with a similar repeat length. However, the Lys-63 architecture is much more open than that of Lys-48 chains and exposes much more of the ubiquitin surface for potential recognition events. These new crystal structures are consistent with the results of solution studies of Lys-63 chain conformation, and reveal the structural basis for differential recognition of Lys-63 versus Lys-48 chains.« less
Meyer, Irmtraud M
2017-05-01
RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive. Copyright © 2017. Published by Elsevier Inc.
Inhibiting autophagy reduces retinal degeneration caused by protein misfolding.
Yao, Jingyu; Qiu, Yaoyan; Frontera, Eric; Jia, Lin; Khan, Naheed W; Klionsky, Daniel J; Ferguson, Thomas A; Thompson, Debra A; Zacks, David N
2018-06-25
Mutations in the genes necessary for the structure and function of vertebrate photoreceptor cells are associated with multiple forms of inherited retinal degeneration. Mutations in the gene encoding RHO (rhodopsin) are a common cause of autosomal dominant retinitis pigmentosa (adRP), with the Pro23His variant of RHO resulting in a misfolded protein that activates endoplasmic reticulum stress and the unfolded protein response. Stimulating macroautophagy/autophagy has been proposed as a strategy for clearing misfolded RHO and reducing photoreceptor death. We found that retinas from mice heterozygous for the gene encoding the RHO P23H variant (hereafter called P23H) exhibited elevated levels of autophagy flux, and that pharmacological stimulation of autophagy accelerated retinal degeneration. In contrast, reducing autophagy flux pharmacologically or by rod-specific deletion of the autophagy-activating gene Atg5, improved photoreceptor structure and function. Furthermore, proteasome levels and activity were reduced in the P23H retina, and increased when Atg5 was deleted. Our findings suggest that autophagy contributes to photoreceptor cell death in P23H mice, and that decreasing autophagy shifts the degradation of misfolded RHO protein to the proteasome and is protective. These observations suggest that modulating the flux of misfolded proteins from autophagy to the proteasome may represent an important therapeutic strategy for reducing proteotoxicity in adRP and other diseases caused by protein folding defects.
Rocha, Antônio J; Sousa, Bruno L; Girão, Matheus S; Barroso-Neto, Ito L; Monteiro-Júnior, José E; Oliveira, José T A; Nagano, Celso S; Carneiro, Rômulo F; Monteiro-Moreira, Ana C O; Rocha, Bruno A M; Freire, Valder N; Grangeiro, Thalles B
2018-05-27
Vicilins are 7S globulins which constitute the major seed storage proteins in leguminous species. Variant vicilins showing differential binding affinities for chitin have been implicated in the resistance and susceptibility of cowpea to the bruchid Callosobruchus maculatus. These proteins are members of the cupin superfamily, which includes a wide variety of enzymes and non-catalytic seed storage proteins. The cupin fold does not share similarity with any known chitin-biding domain. Therefore, it is poorly understood how these storage proteins bind to chitin. In this work, partial cDNA sequences encoding β-vignin, the major component of cowpea vicilins, were obtained from developing seeds. Three-dimensional molecular models of β-vignin showed the characteristic cupin fold and computational simulations revealed that each vicilin trimer contained 3 chitin-binding sites. Interaction models showed that chito-oligosaccharides bound to β-vignin were stabilized mainly by hydrogen bonds, a common structural feature of typical carbohydrate-binding proteins. Furthermore, many of the residues involved in the chitin-binding sites of β-vignin are conserved in other 7S globulins. These results support previous experimental evidences on the ability of vicilin-like proteins from cowpea and other leguminous species to bind in vitro to chitin as well as in vivo to chitinous structures of larval C. maculatus midgut. Copyright © 2018. Published by Elsevier B.V.
Elphick, Maurice R; Rowe, Matthew L
2009-04-01
The myoactive neuropeptide NGIWYamide was originally isolated from the holothurian (sea cucumber) Apostichopus japonicus but there is evidence that NGIWYamide-like peptides also occur in other echinoderms. Here we report the discovery of a gene in the sea urchin Strongylocentrotus purpuratus that encodes two copies of an NGIWYamide-like peptide: Asn-Gly-Phe-Phe-Phe-(NH(2)) or NGFFFamide. Interestingly, the C-terminal region of the NGFFFamide precursor shares sequence similarity with neurophysins, carrier proteins hitherto uniquely associated with precursors of vasopressin/oxytocin-like neuropeptides. Thus, the NGFFFamide precursor is the first neurophysin-containing neuropeptide precursor to be discovered that does not contain a vasopressin/oxytocin-like peptide. However, it remains to be determined whether neurophysin acts as a carrier protein for NGFFFamide. The S. purpuratus genome also contains a gene encoding a precursor comprising a neurophysin polypeptide and 'echinotocin' (CFISNCPKGamide) - the first vasopressin/oxytocin-like peptide to be identified in an echinoderm. Therefore, in S. purpuratus there are two genes encoding precursors that have a neurophysin domain but which encode neuropeptides that are structurally unrelated. Furthermore, both NGFFFamide and echinotocin cause contraction of tube foot and oesophagus preparations from the sea urchin Echinus esculentus, consistent with the myoactivity of NGIWYamide in sea cucumbers and the myoactivity of vasopressin/oxytocin-like peptides in other animal phyla. Presumably the NGFFFamide precursor acquired its neurophysin domain following partial or complete duplication of a gene encoding a vasopressin/oxytocin-like peptide, but it remains to be determined when in evolutionary history this occurred.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Sequencing proteins with transverse ionic transport in nanochannels.
Boynton, Paul; Di Ventra, Massimiliano
2016-05-03
De novo protein sequencing is essential for understanding cellular processes that govern the function of living organisms and all sequence modifications that occur after a protein has been constructed from its corresponding DNA code. By obtaining the order of the amino acids that compose a given protein one can then determine both its secondary and tertiary structures through structure prediction, which is used to create models for protein aggregation diseases such as Alzheimer's Disease. Here, we propose a new technique for de novo protein sequencing that involves translocating a polypeptide through a synthetic nanochannel and measuring the ionic current of each amino acid through an intersecting perpendicular nanochannel. We find that the distribution of ionic currents for each of the 20 proteinogenic amino acids encoded by eukaryotic genes is statistically distinct, showing this technique's potential for de novo protein sequencing.
Sinz, Andrea
2014-12-01
During the last 15 years, chemical cross-linking combined with mass spectrometry (MS) and computational modeling has advanced from investigating 3D-structures of isolated proteins to deciphering protein interaction networks. In this article, the author discusses the advent, the development and the current status of the chemical cross-linking/MS strategy in the context of recent technological developments. A direct way to probe in vivo protein-protein interactions is by site-specific incorporation of genetically encoded photo-reactive amino acids or by non-directed incorporation of photo-reactive amino acids. As the chemical cross-linking/MS approach allows the capture of transient and weak interactions, it has the potential to become a routine technique for unraveling protein interaction networks in their natural cellular environment.
Structural basis for activity of highly efficient RNA mimics of green fluorescent protein
Warner, Katherine Deigan; Chen, Michael C.; Song, Wenjiao; Strack, Rita L.; Thorn, Andrea; Jaffrey, Samie R.; Ferré-D’Amaré, Adrian R.
2014-01-01
Green fluorescent protein (GFP) and its derivatives revolutionized the study of proteins. Spinach is a recently reported in vitro evolved RNA mimic of GFP, which as genetically encoded fusions, makes possible live-cell, real-time imaging of biological RNAs, without resorting to large RNA-binding protein-GFP fusions. To elucidate the molecular basis of Spinach fluorescence, we have solved its co-crystal structure bound to its cognate exogenous chromophore, revealing that Spinach activates the small molecule by immobilizing it between a base triple, a G-quadruplex, and an unpaired guanine. Mutational and NMR analyses indicate that the G-quadruplex is essential for Spinach fluorescence, is also present in other fluorogenic RNAs, and may represent a general strategy for RNAs to induce fluorescence of chromophores. The structure has guided the design of a miniaturized 'Baby Spinach', and provides the foundation for structure-driven design and tuning of fluorescent RNAs. PMID:25026079
The role of protein structural analysis in the next generation sequencing era.
Yue, Wyatt W; Froese, D Sean; Brennan, Paul E
2014-01-01
Proteins are macromolecules that serve a cell's myriad processes and functions in all living organisms via dynamic interactions with other proteins, small molecules and cellular components. Genetic variations in the protein-encoding regions of the human genome account for >85% of all known Mendelian diseases, and play an influential role in shaping complex polygenic diseases. Proteins also serve as the predominant target class for the design of small molecule drugs to modulate their activity. Knowledge of the shape and form of proteins, by means of their three-dimensional structures, is therefore instrumental to understanding their roles in disease and their potentials for drug development. In this chapter we outline, with the wide readership of non-structural biologists in mind, the various experimental and computational methods available for protein structure determination. We summarize how the wealth of structure information, contributed to a large extent by the technological advances in structure determination to date, serves as a useful tool to decipher the molecular basis of genetic variations for disease characterization and diagnosis, particularly in the emerging era of genomic medicine, and becomes an integral component in the modern day approach towards rational drug development.
Designed Proteins Induce the Formation of Nanocage-containing Extracellular Vesicles
Votteler, Jörg; Ogohara, Cassandra; Yi, Sue; Hsia, Yang; Nattermann, Una; Belnap, David M.; King, Neil P.; Sundquist, Wesley I.
2017-01-01
Complex biological processes are often performed by self-organizing nanostructures comprising multiple classes of macromolecules, such as ribosomes (proteins and RNA) or enveloped viruses (proteins, nucleic acids, and lipids). Approaches have been developed for designing self-assembling structures consisting of either nucleic acids1,2 or proteins3–5, but strategies for engineering hybrid biological materials are only beginning to emerge6,7. Here, we describe the design of self-assembling protein nanocages that direct their own release from human cells inside small vesicles in a manner that resembles some viruses. We refer to these hybrid biomaterials as Enveloped Protein Nanocages (EPNs). Robust EPN biogenesis required protein sequence elements that encode three distinct functions: membrane binding, self-assembly, and recruitment of the Endosomal Sorting Complexes Required for Transport (ESCRT) machinery8. A variety of synthetic proteins with these functional elements induced EPN biogenesis, highlighting the modularity and generality of the design strategy. Biochemical and electron cryomicroscopic (cryo-EM) analyses revealed that one design, EPN-01, comprised small (~100 nm) vesicles containing multiple protein nanocages that closely matched the structure of the designed 60-subunit self-assembling scaffold9. EPNs that incorporated the vesicular stomatitis viral glycoprotein (VSV-G) could fuse with target cells and deliver their contents, thereby transferring cargoes from one cell to another. These studies show how proteins can be programmed to direct the formation of hybrid biological materials that perform complex tasks, and establish EPNs as a novel class of designed, modular, genetically-encoded nanomaterials that can transfer molecules between cells. PMID:27919066
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kozbial, Piotr; Xu, Qingping; Chiu, Hsiu-Ju
2009-08-28
To extend the structural coverage of proteins with unknown functions, we targeted a novel protein family (Pfam accession number PF08807, DUF1798) for which we proposed and determined the structures of two representative members. The MW1337R gene of Staphylococcus aureus subsp. aureus Rosenbach (Wood 46) encodes a protein with a molecular weight of 13.8 kDa (residues 1-116) and a calculated isoelectric point of 5.15. The lin2004 gene of the nonspore-forming bacterium Listeria innocua Clip11262 encodes a protein with a molecular weight of 14.6 kDa (residues 1-121) and a calculated isoelectric point of 5.45. MW1337R and lin2004, as well as their homologs,more » which, so far, have been found only in Bacillus, Staphylococcus, Listeria, and related genera (Geobacillus, Exiguobacterium, and Oceanobacillus), have unknown functions and are annotated as hypothetical proteins. The genomic contexts of MW1337R and lin2004 are similar and conserved in related species. In prokaryotic genomes, most often, functionally interacting proteins are coded by genes, which are colocated in conserved operons. Proteins from the same operon as MW1337R and lin2004 either have unknown functions (i.e., belong to DUF1273, Pfam accession number PF06908) or are similar to ypsB from Bacillus subtilis. The function of ypsB is unclear, although it has a strong similarity to the N-terminal region of DivIVA, which was characterized as a bifunctional protein with distinct roles during vegetative growth and sporulation. In addition, members of the DUF1273 family display distant sequence similarity with the DprA/Smf protein, which acts downstream of the DNA uptake machinery, possibly in conjunction with RecA. The RecA activities in Bacillus subtilis are modulated by RecU Holliday-junction resolvase. In all analyzed cases, the gene coding for RecU is in the vicinity of MW1337R, lin2004, or their orthologs, but on a different operon located in the complementary DNA strand. Here, we report the crystal structures of MW1337R and lin2004, which were determined using the semiautomated, high-throughput pipeline of the Joint Center for Structural Genomics (JCSG), part of the National Institute of General Medical Sciences Protein Structure Initiative.« less
Buried and accessible surface area control intrinsic protein flexibility.
Marsh, Joseph A
2013-09-09
Proteins experience a wide variety of conformational dynamics that can be crucial for facilitating their diverse functions. How is the intrinsic flexibility required for these motions encoded in their three-dimensional structures? Here, the overall flexibility of a protein is demonstrated to be tightly coupled to the total amount of surface area buried within its fold. A simple proxy for this, the relative solvent-accessible surface area (Arel), therefore shows excellent agreement with independent measures of global protein flexibility derived from various experimental and computational methods. Application of Arel on a large scale demonstrates its utility by revealing unique sequence and structural properties associated with intrinsic flexibility. In particular, flexibility as measured by Arel shows little correspondence with intrinsic disorder, but instead tends to be associated with multiple domains and increased α-helical structure. Furthermore, the apparent flexibility of monomeric proteins is found to be useful for identifying quaternary-structure errors in published crystal structures. There is also a strong tendency for the crystal structures of more flexible proteins to be solved to lower resolutions. Finally, local solvent accessibility is shown to be a primary determinant of local residue flexibility. Overall, this work provides both fundamental mechanistic insight into the origin of protein flexibility and a simple, practical method for predicting flexibility from protein structures. © 2013 Elsevier Ltd. All rights reserved.
Human Autoantibodies Reveal Titin as a Chromosomal Protein
Machado, Cristina; Sunkel, Claudio E.; Andrew, Deborah J.
1998-01-01
Assembly of the higher-order structure of mitotic chromosomes is a prerequisite for proper chromosome condensation, segregation and integrity. Understanding the details of this process has been limited because very few proteins involved in the assembly of chromosome structure have been discovered. Using a human autoimmune scleroderma serum that identifies a chromosomal protein in human cells and Drosophila embryos, we cloned the corresponding Drosophila gene that encodes the homologue of vertebrate titin based on protein size, sequence similarity, developmental expression and subcellular localization. Titin is a giant sarcomeric protein responsible for the elasticity of striated muscle that may also function as a molecular scaffold for myofibrillar assembly. Molecular analysis and immunostaining with antibodies to multiple titin epitopes indicates that the chromosomal and muscle forms of titin may vary in their NH2 termini. The identification of titin as a chromosomal component provides a molecular basis for chromosome structure and elasticity. PMID:9548712
Structure of a group II intron in complex with its reverse transcriptase.
Qu, Guosheng; Kaushal, Prem Singh; Wang, Jia; Shigematsu, Hideki; Piazza, Carol Lyn; Agrawal, Rajendra Kumar; Belfort, Marlene; Wang, Hong-Wei
2016-06-01
Bacterial group II introns are large catalytic RNAs related to nuclear spliceosomal introns and eukaryotic retrotransposons. They self-splice, yielding mature RNA, and integrate into DNA as retroelements. A fully active group II intron forms a ribonucleoprotein complex comprising the intron ribozyme and an intron-encoded protein that performs multiple activities including reverse transcription, in which intron RNA is copied into the DNA target. Here we report cryo-EM structures of an endogenously spliced Lactococcus lactis group IIA intron in its ribonucleoprotein complex form at 3.8-Å resolution and in its protein-depleted form at 4.5-Å resolution, revealing functional coordination of the intron RNA with the protein. Remarkably, the protein structure reveals a close relationship between the reverse transcriptase catalytic domain and telomerase, whereas the active splicing center resembles the spliceosomal Prp8 protein. These extraordinary similarities hint at intricate ancestral relationships and provide new insights into splicing and retromobility.
Co-opting sulphur-carrier proteins from primary metabolic pathways for 2-thiosugar biosynthesis.
Sasaki, Eita; Zhang, Xuan; Sun, He G; Lu, Mei-yeh Jade; Liu, Tsung-lin; Ou, Albert; Li, Jeng-yi; Chen, Yu-hsiang; Ealick, Steven E; Liu, Hung-wen
2014-06-19
Sulphur is an essential element for life and is ubiquitous in living systems. Yet how the sulphur atom is incorporated into many sulphur-containing secondary metabolites is poorly understood. For bond formation between carbon and sulphur in primary metabolites, the major ionic sulphur sources are the persulphide and thiocarboxylate groups on sulphur-carrier (donor) proteins. Each group is post-translationally generated through the action of a specific activating enzyme. In all reported bacterial cases, the gene encoding the enzyme that catalyses the carbon-sulphur bond formation reaction and that encoding the cognate sulphur-carrier protein exist in the same gene cluster. To study the production of the 2-thiosugar moiety in BE-7585A, an antibiotic from Amycolatopsis orientalis, we identified a putative 2-thioglucose synthase, BexX, whose protein sequence and mode of action seem similar to those of ThiG, the enzyme that catalyses thiazole formation in thiamine biosynthesis. However, no gene encoding a sulphur-carrier protein could be located in the BE-7585A cluster. Subsequent genome sequencing uncovered a few genes encoding sulphur-carrier proteins that are probably involved in the biosynthesis of primary metabolites but only one activating enzyme gene in the A. orientalis genome. Further experiments showed that this activating enzyme can adenylate each of these sulphur-carrier proteins and probably also catalyses the subsequent thiolation, through its rhodanese domain. A proper combination of these sulphur-delivery systems is effective for BexX-catalysed 2-thioglucose production. The ability of BexX to selectively distinguish sulphur-carrier proteins is given a structural basis using X-ray crystallography. This study is, to our knowledge, the first complete characterization of thiosugar formation in nature and also demonstrates the receptor promiscuity of the A. orientalis sulphur-delivery system. Our results also show that co-opting the sulphur-delivery machinery of primary metabolism for the biosynthesis of sulphur-containing natural products is probably a general strategy found in nature.
The host outer membrane proteins OmpA and OmpC are associated with the Shigella phage Sf6 virion
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhao Haiyan, E-mail: zhaohy@ku.ed; Sequeira, Reuben D., E-mail: sequen@ku.ed; Galeva, Nadezhda A., E-mail: galeva@ku.ed
2011-01-20
Assembly of dsDNA bacteriophage is a precisely programmed process. Potential roles of host cell components in phage assembly haven't been well understood. It was previously reported that two unidentified proteins were present in bacteriophage Sf6 virion (Casjens et al, 2004, J.Mol.Biol. 339, 379-394, Fig. 2A). Using tandem mass spectrometry, we have identified the two proteins as outer membrane proteins (OMPs) OmpA and OmpC from its host Shigella flexneri. The transmission electron cryo-microscopy structure of Sf6 shows significant density at specific sites at the phage capsid inner surface. This density fit well with the characteristic beta-barrel domains of OMPs, thus maymore » be due to the two host proteins. Locations of this density suggest a role in Sf6 morphogenesis reminiscent of phage-encoded cementing proteins. These data indicate a new, OMP-related phage:host linkage, adding to previous knowledge that some lambdoid bacteriophage genomes contain OmpC-like genes that express phage-encoded porins in the lysogenic state.« less
Semova, Natalia; Kapanadze, Bagrat; Corcoran, Martin; Kutsenko, Alexei; Baranova, Ancha; Semov, Alexandre
2003-09-01
IRLB was originally identified as a partial cDNA clone, encoding a 191-aa protein binding the interferon-stimulated response element (ISRE) in the P2 promoter of human MYC. Here, we cloned the full-size IRLB using different bioinformatics tools and an RT-PCR approach. The full-size gene encompasses 131 kb within chromosome 15q22 and consists of 32 exons. IRLB is transcribed as a 6.6-kb mRNA encoding a protein of 1865 aa. IRLB is ubiquitously expressed and its expression is regulated in a growth- and cell cycle-dependent manner. In addition to the ISRE-binding domain IRLB contains a tripartite DENN domain, a nuclear localization signal, two PPRs, and a calmodulin-binding domain. The presence of DENN domains predicts possible interactions of IRLB with GTPases from the Rab family or regulation of growth-induced MAPKs. Strongly homologous proteins were identified in all available vertebrate genomes as well as in Caenorhabditis elegans and Drosophila melanogaster. In human and mouse a family of IRLB proteins exists, consisting of at least three members.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Claus, Claudia; Tzeng, W.-P.; Liebert, Uwe Gerd
During serial passaging of rubella virus (RUB) in cell culture, the dominant species of defective-interfering RNA (DI) generated contains an in-frame deletion between the capsid protein (C) gene and E1 glycoprotein gene resulting in production of a C-E1 fusion protein that is necessary for the maintenance of the DI [Tzeng, W.P., Frey, T.K. (2006). C-E1 fusion protein synthesized by rubella virus DI RNAs maintained during serial passage. Virology 356 198-207.]. A BHK cell line stably expressing the RUB structural proteins was established which was used to package DIs into virus particles following transfection with in vitro transcripts from DI infectiousmore » cDNA constructs. Packaging of a DI encoding an in-frame C-GFP-E1 reporter fusion protein corresponding to the C-E1 fusion protein expressed in a native DI was only marginally more efficient than packaging of a DI encoding GFP, indicating that the C-E1 fusion protein did not function by enhancing packaging. However, infection with the DI encoding the C-GFP-E1 fusion protein (in the absence of wt RUB helper virus) resulted in formation of clusters of GFP-positive cells and the percentage of GFP-positive cells in the culture following infection remained relatively constant. In contrast, a DI encoding GFP did not form GFP-positive clusters and the percentage of GFP-positive cells declined by roughly half from 2 to 4 days post-infection. Cluster formation and sustaining the percentage of infected (GFP-positive) cells required the C part of the fusion protein, including the downstream but not the upstream of two arginine clusters (both of which are associated with RNA binding and association with mitochondrial p32 protein) and the E1 part through the transmembrane sequence, but not the C-terminal cytoplasmic tail. Among a collection of mutant DI constructs, cluster formation and sustaining infected cell percentage correlated with maintenance during serial passage with wt RUB. We hypothesize that cluster formation and sustaining infected cell percentage increase the likelihood of co-infection by a DI and wt RUB during serial passage thus enhancing maintenance of the DI. Cluster formation and sustaining infected cell percentage were found to be due to a combination of attenuated cytopathogenicity of DIs that express the C-E1 fusion protein and cell-to-cell movement of the DI. In infected cells, the C-GFP-E1 fusion protein was localized to potentially novel vesicular structures that appear to originate from ER-Golgi transport vacuoles. This species of DI expressing a C-E1 fusion protein that exhibits attenuated cytopathogenicity and the ability to increase the number of infected cells through cell-to-cell movement could be the basis for development of an attractive vaccine vector.« less
Structural Heterogeneity and Functional Domains of Murine Immunoglobulin G Fc Receptors
NASA Astrophysics Data System (ADS)
Ravetch, Jeffrey V.; Luster, Andrew D.; Weinshank, Richard; Kochan, Jarema; Pavlovec, Amalia; Portnoy, Daniel A.; Hulmes, Jeffrey; Pan, Yu-Ching E.; Unkeless, Jay C.
1986-11-01
Binding of antibodies to effector cells by way of receptors to their constant regions (Fc receptors) is central to the pathway that leads to clearance of antigens by the immune system. The structure and function of this important class of receptors on immune cells is addressed through the molecular characterization of Fc receptors (FcR) specific for the murine immunoglobulin G isotype. Structural diversity is encoded by two genes that by alternative splicing result in expression of molecules with highly conserved extracellular domains and different transmembrane and intracytoplasmic domains. The proteins encoded by these genes are members of the immunoglobulin supergene family, most homologous to the major histocompatibility complex molecule Eβ. Functional reconstitution of ligand binding by transfection of individual FcR genes demonstrates that the requirements for ligand binding are encoded in a single gene. These studies demonstrate the molecular basis for the functional heterogeneity of FcR's, accounting for the possible transduction of different signals in response to a single ligand.
Walia, Rasna R; Caragea, Cornelia; Lewis, Benjamin A; Towfic, Fadi; Terribilini, Michael; El-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant
2012-05-10
RNA molecules play diverse functional and structural roles in cells. They function as messengers for transferring genetic information from DNA to proteins, as the primary genetic material in many viruses, as catalysts (ribozymes) important for protein synthesis and RNA processing, and as essential and ubiquitous regulators of gene expression in living organisms. Many of these functions depend on precisely orchestrated interactions between RNA molecules and specific proteins in cells. Understanding the molecular mechanisms by which proteins recognize and bind RNA is essential for comprehending the functional implications of these interactions, but the recognition 'code' that mediates interactions between proteins and RNA is not yet understood. Success in deciphering this code would dramatically impact the development of new therapeutic strategies for intervening in devastating diseases such as AIDS and cancer. Because of the high cost of experimental determination of protein-RNA interfaces, there is an increasing reliance on statistical machine learning methods for training predictors of RNA-binding residues in proteins. However, because of differences in the choice of datasets, performance measures, and data representations used, it has been difficult to obtain an accurate assessment of the current state of the art in protein-RNA interface prediction. We provide a review of published approaches for predicting RNA-binding residues in proteins and a systematic comparison and critical assessment of protein-RNA interface residue predictors trained using these approaches on three carefully curated non-redundant datasets. We directly compare two widely used machine learning algorithms (Naïve Bayes (NB) and Support Vector Machine (SVM)) using three different data representations in which features are encoded using either sequence- or structure-based windows. Our results show that (i) Sequence-based classifiers that use a position-specific scoring matrix (PSSM)-based representation (PSSMSeq) outperform those that use an amino acid identity based representation (IDSeq) or a smoothed PSSM (SmoPSSMSeq); (ii) Structure-based classifiers that use smoothed PSSM representation (SmoPSSMStr) outperform those that use PSSM (PSSMStr) as well as sequence identity based representation (IDStr). PSSMSeq classifiers, when tested on an independent test set of 44 proteins, achieve performance that is comparable to that of three state-of-the-art structure-based predictors (including those that exploit geometric features) in terms of Matthews Correlation Coefficient (MCC), although the structure-based methods achieve substantially higher Specificity (albeit at the expense of Sensitivity) compared to sequence-based methods. We also find that the expected performance of the classifiers on a residue level can be markedly different from that on a protein level. Our experiments show that the classifiers trained on three different non-redundant protein-RNA interface datasets achieve comparable cross-validation performance. However, we find that the results are significantly affected by differences in the distance threshold used to define interface residues. Our results demonstrate that protein-RNA interface residue predictors that use a PSSM-based encoding of sequence windows outperform classifiers that use other encodings of sequence windows. While structure-based methods that exploit geometric features can yield significant increases in the Specificity of protein-RNA interface residue predictions, such increases are offset by decreases in Sensitivity. These results underscore the importance of comparing alternative methods using rigorous statistical procedures, multiple performance measures, and datasets that are constructed based on several alternative definitions of interface residues and redundancy cutoffs as well as including evaluations on independent test sets into the comparisons.
Krupska, Izabela; Bruford, Elspeth A; Chaqour, Brahim
2015-09-23
"CCN" is an acronym referring to the first letter of each of the first three members of this original group of mammalian functionally and phylogenetically distinct extracellular matrix (ECM) proteins [i.e., cysteine-rich 61 (CYR61), connective tissue growth factor (CTGF), and nephroblastoma-overexpressed (NOV)]. Although "CCN" genes are unlikely to have arisen from a common ancestral gene, their encoded proteins share multimodular structures in which most cysteine residues are strictly conserved in their positions within several structural motifs. The CCN genes can be subdivided into members developmentally indispensable for embryonic viability (e.g., CCN1, 2 and 5), each assuming unique tissue-specific functions, and members not essential for embryonic development (e.g., CCN3, 4 and 6), probably due to a balance of functional redundancy and specialization during evolution. The temporo-spatial regulation of the CCN genes and the structural information contained within the sequences of their encoded proteins reflect diversity in their context and tissue-specific functions. Genetic association studies and experimental anomalies, replicated in various animal models, have shown that altered CCN gene structure or expression is associated with "injury" stimuli--whether mechanical (e.g., trauma, shear stress) or chemical (e.g., ischemia, hyperglycemia, hyperlipidemia, inflammation). Consequently, increased organ-specific susceptibility to structural damages ensues. These data underscore the critical functions of CCN proteins in the dynamics of tissue repair and regeneration and in the compensatory responses preceding organ failure. A better understanding of the regulation and mode of action of each CCN member will be useful in developing specific gain- or loss-of-function strategies for therapeutic purposes.
Mechanism of Resilin Elasticity
Qin, Guokui; Hu, Xiao; Cebe, Peggy; Kaplan, David L.
2012-01-01
Resilin is critical in the flight and jumping systems of insects as a polymeric rubber-like protein with outstanding elasticity. However, insight into the underlying molecular mechanisms responsible for resilin elasticity remains undefined. Here we report the structure and function of resilin from Drosophila CG15920. A reversible beta-turn transition was identified in the peptide encoded by exon III and for full length resilin during energy input and release, features that correlate to the rapid deformation of resilin during functions in vivo. Micellar structures and nano-porous patterns formed after beta-turn structures were present via changes in either the thermal or mechanical inputs. A model is proposed to explain the super elasticity and energy conversion mechanisms of resilin, providing important insight into structure-function relationships for this protein. Further, this model offers a view of elastomeric proteins in general where beta-turn related structures serve as fundamental units of the structure and elasticity. PMID:22893127
da Fonseca, Néli José; Lima Afonso, Marcelo Querino; Pedersolli, Natan Gonçalves; de Oliveira, Lucas Carrijo; Andrade, Dhiego Souto; Bleicher, Lucas
2017-10-28
Flaviviruses are responsible for serious diseases such as dengue, yellow fever, and zika fever. Their genomes encode a polyprotein which, after cleavage, results in three structural and seven non-structural proteins. Homologous proteins can be studied by conservation and coevolution analysis as detected in multiple sequence alignments, usually reporting positions which are strictly necessary for the structure and/or function of all members in a protein family or which are involved in a specific sub-class feature requiring the coevolution of residue sets. This study provides a complete conservation and coevolution analysis on all flaviviruses non-structural proteins, with results mapped on all well-annotated available sequences. A literature review on the residues found in the analysis enabled us to compile available information on their roles and distribution among different flaviviruses. Also, we provide the mapping of conserved and coevolved residues for all sequences currently in SwissProt as a supplementary material, so that particularities in different viruses can be easily analyzed. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Rauf, Muhammad; Saeed, Nasir A.; Habib, Imran; Ahmed, Moddassir; Shahzad, Khurram; Mansoor, Shahid; Ali, Rashid
2017-02-01
Structure prediction can provide information about function and active sites of protein which helps to design new functional proteins. H+-pyrophosphatase is transmembrane protein involved in establishing proton motive force for active transport of Na+ across membrane by Na+/H+ antiporters. A full length novel H+-pyrophosphatase gene was isolated from halophytic grass Leptochloa fusca using RT-PCR and RACE method. Full length LfVP1 gene sequence of 2292 nucleotides encodes protein of 764 amino acids. DNA and protein sequences were used for characterization using bioinformatics tools. Various important potential sites were predicted by PROSITE webserver. Primary structural analysis showed LfVP1 as stable protein and Grand average hydropathy (GRAVY) indicated that LfVP1 protein has good hydrosolubility. Secondary structure analysis showed that LfVP1 protein sequence contains significant proportion of alpha helix and random coil. Protein membrane topology suggested the presence of 14 transmembrane domains and presence of catalytic domain in TM3. Three dimensional structure from LfVP1 protein sequence also indicated the presence of 14 transmembrane domains and hydrophobicity surface model showed amino acid hydrophobicity. Ramachandran plot showed that 98% amino acid residues were predicted in the favored region.
The Popeye Domain Containing Genes and Their Function as cAMP Effector Proteins in Striated Muscle.
Brand, Thomas
2018-03-13
The Popeye domain containing (POPDC) genes encode transmembrane proteins, which are abundantly expressed in striated muscle cells. Hallmarks of the POPDC proteins are the presence of three transmembrane domains and the Popeye domain, which makes up a large part of the cytoplasmic portion of the protein and functions as a cAMP-binding domain. Interestingly, despite the prediction of structural similarity between the Popeye domain and other cAMP binding domains, at the protein sequence level they strongly differ from each other suggesting an independent evolutionary origin of POPDC proteins. Loss-of-function experiments in zebrafish and mouse established an important role of POPDC proteins for cardiac conduction and heart rate adaptation after stress. Loss-of function mutations in patients have been associated with limb-girdle muscular dystrophy and AV-block. These data suggest an important role of these proteins in the maintenance of structure and function of striated muscle cells.
Binding ligand prediction for proteins using partial matching of local surface patches.
Sael, Lee; Kihara, Daisuke
2010-01-01
Functional elucidation of uncharacterized protein structures is an important task in bioinformatics. We report our new approach for structure-based function prediction which captures local surface features of ligand binding pockets. Function of proteins, specifically, binding ligands of proteins, can be predicted by finding similar local surface regions of known proteins. To enable partial comparison of binding sites in proteins, a weighted bipartite matching algorithm is used to match pairs of surface patches. The surface patches are encoded with the 3D Zernike descriptors. Unlike the existing methods which compare global characteristics of the protein fold or the global pocket shape, the local surface patch method can find functional similarity between non-homologous proteins and binding pockets for flexible ligand molecules. The proposed method improves prediction results over global pocket shape-based method which was previously developed by our group.
Binding Ligand Prediction for Proteins Using Partial Matching of Local Surface Patches
Sael, Lee; Kihara, Daisuke
2010-01-01
Functional elucidation of uncharacterized protein structures is an important task in bioinformatics. We report our new approach for structure-based function prediction which captures local surface features of ligand binding pockets. Function of proteins, specifically, binding ligands of proteins, can be predicted by finding similar local surface regions of known proteins. To enable partial comparison of binding sites in proteins, a weighted bipartite matching algorithm is used to match pairs of surface patches. The surface patches are encoded with the 3D Zernike descriptors. Unlike the existing methods which compare global characteristics of the protein fold or the global pocket shape, the local surface patch method can find functional similarity between non-homologous proteins and binding pockets for flexible ligand molecules. The proposed method improves prediction results over global pocket shape-based method which was previously developed by our group. PMID:21614188
In silico modeling of the yeast protein and protein family interaction network
NASA Astrophysics Data System (ADS)
Goh, K.-I.; Kahng, B.; Kim, D.
2004-03-01
Understanding of how protein interaction networks of living organisms have evolved or are organized can be the first stepping stone in unveiling how life works on a fundamental ground. Here we introduce an in silico ``coevolutionary'' model for the protein interaction network and the protein family network. The essential ingredient of the model includes the protein family identity and its robustness under evolution, as well as the three previously proposed: gene duplication, divergence, and mutation. This model produces a prototypical feature of complex networks in a wide range of parameter space, following the generalized Pareto distribution in connectivity. Moreover, we investigate other structural properties of our model in detail with some specific values of parameters relevant to the yeast Saccharomyces cerevisiae, showing excellent agreement with the empirical data. Our model indicates that the physical constraints encoded via the domain structure of proteins play a crucial role in protein interactions.
Naranjo, Yandi; Pons, Miquel; Konrat, Robert
2012-01-01
The number of existing protein sequences spans a very small fraction of sequence space. Natural proteins have overcome a strong negative selective pressure to avoid the formation of insoluble aggregates. Stably folded globular proteins and intrinsically disordered proteins (IDPs) use alternative solutions to the aggregation problem. While in globular proteins folding minimizes the access to aggregation prone regions, IDPs on average display large exposed contact areas. Here, we introduce the concept of average meta-structure correlation maps to analyze sequence space. Using this novel conceptual view we show that representative ensembles of folded and ID proteins show distinct characteristics and respond differently to sequence randomization. By studying the way evolutionary constraints act on IDPs to disable a negative function (aggregation) we might gain insight into the mechanisms by which function-enabling information is encoded in IDPs.
Functional analysis of the Helicobacter pullorum N-linked protein glycosylation system.
Jervis, Adrian J; Wood, Alison G; Cain, Joel A; Butler, Jonathan A; Frost, Helen; Lord, Elizabeth; Langdon, Rebecca; Cordwell, Stuart J; Wren, Brendan W; Linton, Dennis
2018-04-01
N-linked protein glycosylation systems operate in species from all three domains of life. The model bacterial N-linked glycosylation system from Campylobacter jejuni is encoded by pgl genes present at a single chromosomal locus. This gene cluster includes the pglB oligosaccharyltransferase responsible for transfer of glycan from lipid carrier to protein. Although all genomes from species of the Campylobacter genus contain a pgl locus, among the related Helicobacter genus only three evolutionarily related species (H. pullorum, H. canadensis and H. winghamensis) potentially encode N-linked protein glycosylation systems. Helicobacter putative pgl genes are scattered in five chromosomal loci and include two putative oligosaccharyltransferase-encoding pglB genes per genome. We have previously demonstrated the in vitro N-linked glycosylation activity of H. pullorum resulting in transfer of a pentasaccharide to a peptide at asparagine within the sequon (D/E)XNXS/T. In this study, we identified the first H. pullorum N-linked glycoprotein, termed HgpA. Production of histidine-tagged HgpA in the background of insertional knockout mutants of H. pullorum pgl/wbp genes followed by analysis of HgpA glycan structures demonstrated the role of individual gene products in the PglB1-dependent N-linked protein glycosylation pathway. Glycopeptide purification by zwitterionic-hydrophilic interaction liquid chromatography coupled with tandem mass spectrometry identified six glycosites from five H. pullorum proteins, which was consistent with proteins reactive with a polyclonal antiserum generated against glycosylated HgpA. This study demonstrates functioning of a H. pullorum N-linked general protein glycosylation system.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tzeng, W.-P.; Frey, Teryl K.
Rubella virus (RUB) replicons are derivatives of the RUB infectious cDNA clone that retain the nonstructural open reading frame (NS-ORF) that encodes the replicase proteins but not the structural protein ORF (SP-ORF) that encodes the virion proteins. RUB defective interfering (DI) RNAs contain deletions within the SP-ORF and thus resemble replicons. DI RNAs often retain the 5' end of the capsid protein (C) gene that has been shown to modulate virus-specific RNA synthesis. However, when replicons either with or without the C gene were passaged serially in the presence of wt RUB as a source of the virion proteins, itmore » was found that neither replicon was maintained and DI RNAs were generated. The majority DI RNA species contained in-frame deletions in the SP-ORF leading to a fusion between the 5' end of the C gene and the 3' end of the E1 glycoprotein gene. DI infectious cDNA clones were constructed and transcripts from these DI infectious cDNA clones were maintained during serial passage with wt RUB. The C-E1 fusion protein encoded by the DI RNAs was synthesized and was required for maintenance of the DI RNA during serial passage. This is the first report of a functional novel gene product resulting from deletion during DI RNA generation. Thus far, the role of the C-E1 fusion protein in maintenance of DI RNAs during serial passage remained elusive as it was found that the fusion protein diminished rather than enhanced DI RNA synthesis and was not incorporated into virus particles.« less
Sequence heuristics to encode phase behaviour in intrinsically disordered protein polymers
Quiroz, Felipe García; Chilkoti, Ashutosh
2015-01-01
Proteins and synthetic polymers that undergo aqueous phase transitions mediate self-assembly in nature and in man-made material systems. Yet little is known about how the phase behaviour of a protein is encoded in its amino acid sequence. Here, by synthesizing intrinsically disordered, repeat proteins to test motifs that we hypothesized would encode phase behaviour, we show that the proteins can be designed to exhibit tunable lower or upper critical solution temperature (LCST and UCST, respectively) transitions in physiological solutions. We also show that mutation of key residues at the repeat level abolishes phase behaviour or encodes an orthogonal transition. Furthermore, we provide heuristics to identify, at the proteome level, proteins that might exhibit phase behaviour and to design novel protein polymers consisting of biologically active peptide repeats that exhibit LCST or UCST transitions. These findings set the foundation for the prediction and encoding of phase behaviour at the sequence level. PMID:26390327
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bae, Euiyoung; Bingman, Craig A.; Aceti, David J.
LOC79017 (MW 21.0 kDa, residues 1-188) was annotated as a hypothetical protein encoded by Homo sapiens chromosome 7 open reading frame 24. It was selected as a target by the Center for Eukaryotic Structural Genomics (CESG) because it did not share more than 30% sequence identity with any protein for which the three-dimensional structure is known. The biological function of the protein has not been established yet. Parts of LOC79017 were identified as members of uncharacterized Pfam families (residues 1-95 as PB006073 and residues 104-180 as PB031696). BLAST searches revealed homologues of LOC79017 in many eukaryotes, but none of themmore » have been functionally characterized. Here, we report the crystal structure of H. sapiens protein LOC79017 (UniGene code Hs.530024, UniProt code O75223, CESG target number go.35223).« less
Hulshof, Janneke W; Casarosa, Paola; Menge, Wiro M P B; Kuusisto, Leena M S; van der Goot, Henk; Smit, Martine J; de Esch, Iwan J P; Leurs, Rob
2005-10-06
US28 is a human cytomegalovirus (HCMV) encoded G-protein-coupled receptor that signals in a constitutively active manner. Recently, we identified 1 [5-(4-(4-chlorophenyl)-4-hydroxypiperidin-1-yl)-2,2-diphenylpentanenitrile] as the first reported nonpeptidergic inverse agonist for a viral-encoded chemokine receptor. Interestingly, this compound is able to partially inhibit the viral entry of HIV-1. In this study we describe the synthesis of 1 and several of its analogues and unique structure-activity relationships for this first class of small-molecule ligands for the chemokine receptor US28. Moreover, the compounds have been pharmacologically characterized as inverse agonists on US28. By modification of lead structure 1, it is shown that a 4-phenylpiperidine moiety is essential for affinity and activity. Other structural features of 1 are shown to be of less importance. These compounds define the first SAR of ligands on a viral GPCR (US28) and may therefore serve as important tools to investigate the significance of US28-mediated constitutive activity during viral infection.
van de Ven, W J; Vermorken, A J; Onnekink, C; Bloemers, H P; Bloemendal, H
1978-01-01
A preparative method for isolating pure viral envelopes from a type-C RNA tumor virus, Rauscher murine leukemia virus, is described. Fractionation of virions of Rauscher murine leukemia virus was studied after disruption of the virions with the detergents sodium dodecyl sulfate of Nonidet P-40 in combination with ether. Fractionation was performed through flotation in a discontinuous sucrose gradient and, as appeared from electron microscopic examination, a pure viral envelope fraction was obtained in this way. By use of sensitive competition radioimmunoassays or sodium dodecyl sulfate-polyacrylamide gel electrophoresis after immunoprecipitation with polyvalent and monospecific antisera directed against Rauscher murine leukemia virus proteins, the amount of the gag and env gene-encoded structural polypeptides in the virions and the isolated envelope fraction was compared. The predominant viral structural polypeptides in the purified envelope fraction were the env gene-encoded polypeptides gp70, p15(E), and p12(E), whereas, except for p15, there was only a relatively small amount of the gag gene-encoded structural polypeptides in this fraction. Images PMID:702639
NASA Astrophysics Data System (ADS)
Simon, Joseph R.; Carroll, Nick J.; Rubinstein, Michael; Chilkoti, Ashutosh; López, Gabriel P.
2017-06-01
Dynamic protein-rich intracellular structures that contain phase-separated intrinsically disordered proteins (IDPs) composed of sequences of low complexity (SLC) have been shown to serve a variety of important cellular functions, which include signalling, compartmentalization and stabilization. However, our understanding of these structures and our ability to synthesize models of them have been limited. We present design rules for IDPs possessing SLCs that phase separate into diverse assemblies within droplet microenvironments. Using theoretical analyses, we interpret the phase behaviour of archetypal IDP sequences and demonstrate the rational design of a vast library of multicomponent protein-rich structures that ranges from uniform nano-, meso- and microscale puncta (distinct protein droplets) to multilayered orthogonally phase-separated granular structures. The ability to predict and program IDP-rich assemblies in this fashion offers new insights into (1) genetic-to-molecular-to-macroscale relationships that encode hierarchical IDP assemblies, (2) design rules of such assemblies in cell biology and (3) molecular-level engineering of self-assembled recombinant IDP-rich materials.
Ruano-Gallego, David; Álvarez, Beatriz; Fernández, Luis Ángel
2015-09-18
Bacterial pathogens containing type III protein secretion systems (T3SS) assemble large needle-like protein complexes in the bacterial envelope, called injectisomes, for translocation of protein effectors into host cells. The application of these "molecular syringes" for the injection of proteins into mammalian cells is hindered by their structural and genomic complexity, requiring multiple polypeptides encoded along with effectors in various transcriptional units (TUs) with intricate regulation. In this work, we have rationally designed the controlled expression of the filamentous injectisomes found in enteropathogenic Escherichia coli (EPEC) in the nonpathogenic strain E. coli K-12. All structural components of EPEC injectisomes, encoded in a genomic island called the locus of enterocyte effacement (LEE), were engineered in five TUs (eLEEs) excluding effectors, promoters and transcriptional regulators. These eLEEs were placed under the control of the IPTG-inducible promoter Ptac and integrated into specific chromosomal sites of E. coli K-12 using a marker-less strategy. The resulting strain, named synthetic injector E. coli (SIEC), assembles filamentous injectisomes similar to those in EPEC. SIEC injectisomes form pores in the host plasma membrane and are able to translocate T3-substrate proteins (e.g., translocated intimin receptor, Tir) into the cytoplasm of HeLa cells reproducing the phenotypes of intimate attachment and polymerization of actin-pedestals elicited by EPEC bacteria. Hence, SIEC strain allows the controlled expression of functional filamentous injectisomes for efficient translocation of proteins with T3S-signals into mammalian cells.
NASA Astrophysics Data System (ADS)
Yusof, Nik Yusnoraini; Bakar, Farah Diba Abu; Mahadi, Nor Muhammad; Raih, Mohd Firdaus; Murad, Abdul Munir Abdul
2015-09-01
A cDNA encoding Fe(II) 2-oxoglutarate (2OG) dependent dioxygenases was isolated from psychrophilic yeast, Glaciozyma antarctica PI12. We have successfully amplified 1,029 bp cDNA sequence that encodes 342 amino acid with predicted molecular weight 38 kDa. The prediction protein was analysed using various bioinformatics tools to explore the properties of the protein. Based on a BLAST search analysis, the Fe2OX amino acid sequence showed 61% identity to the sequence of oxoglutarate/iron-dependent oxygenase from Rhodosporidium toruloides NP11. SignalP prediction showed that the Fe2OX protein contains no putative signal peptide, which suggests that this enzyme most probably localised intracellularly.The structure of Fe2OX was predicted by homology modelling using MODELLER9v11. The model with the lowest objective function was selected from hundred models generated using MODELLER9v11. Analysis of the structure revealed the longer loop at Fe2OX from G.antarctica that might be responsible for the flexibility of the structure, which contributes to its adaptation to low temperatures. Fe2OX hold a highly conserved Fe(II) binding HXD/E…H triad motif. The binding site for 2-oxoglutarate was found conserved for Arg280 among reported studies, however the Phe268 was found to be different in Fe2OX.
Root-Bernstein, Robert; Root-Bernstein, Meredith
2016-05-21
We have proposed that the ribosome may represent a missing link between prebiotic chemistries and the first cells. One of the predictions that follows from this hypothesis, which we test here, is that ribosomal RNA (rRNA) must have encoded the proteins necessary for ribosomal function. In other words, the rRNA also functioned pre-biotically as mRNA. Since these ribosome-binding proteins (rb-proteins) must bind to the rRNA, but the rRNA also functioned as mRNA, it follows that rb-proteins should bind to their own mRNA as well. This hypothesis can be contrasted to a "null" hypothesis in which rb-proteins evolved independently of the rRNA sequences and therefore there should be no necessary similarity between the rRNA to which rb-proteins bind and the mRNA that encodes the rb-protein. Five types of evidence reported here support the plausibility of the hypothesis that the mRNA encoding rb-proteins evolved from rRNA: (1) the ubiquity of rb-protein binding to their own mRNAs and autogenous control of their own translation; (2) the higher-than-expected incidence of Arginine-rich modules associated with RNA binding that occurs in rRNA-encoded proteins; (3) the fact that rRNA-binding regions of rb-proteins are homologous to their mRNA binding regions; (4) the higher than expected incidence of rb-protein sequences encoded in rRNA that are of a high degree of homology to their mRNA as compared with a random selection of other proteins; and (5) rRNA in modern prokaryotes and eukaryotes encodes functional proteins. None of these results can be explained by the null hypothesis that assumes independent evolution of rRNA and the mRNAs encoding ribosomal proteins. Also noteworthy is that very few proteins bind their own mRNAs that are not associated with ribosome function. Further tests of the hypothesis are suggested: (1) experimental testing of whether rRNA-encoded proteins bind to rRNA at their coding sites; (2) whether tRNA synthetases, which are also known to bind to their own mRNAs, are encoded by the tRNA sequences themselves; (3) and the prediction that archaeal and prokaryotic (DNA-based) genomes were built around rRNA "genes" so that rRNA-related sequences will be found to make up an unexpectedly high proportion of these genomes. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Fiévet, Anouchka; My, Laetitia; Cascales, Eric; Ansaldi, Mireille; Pauleta, Sofia R.; Moura, Isabel; Dermoun, Zorah; Bernard, Christophe S.; Dolla, Alain; Aubert, Corinne
2011-01-01
Analysis of sequenced bacterial genomes revealed that the genomes encode more than 30% hypothetical and conserved hypothetical proteins of unknown function. Among proteins of unknown function that are conserved in anaerobes, some might be determinants of the anaerobic way of life. This study focuses on two divergent clusters specifically found in anaerobic microorganisms and mainly composed of genes encoding conserved hypothetical proteins. We show that the two gene clusters DVU2103-DVU2104-DVU2105 (orp2) and DVU2107-DVU2108-DVU2109 (orp1) form two divergent operons transcribed by the σ54-RNA polymerase. We further demonstrate that the σ54-dependent transcriptional regulator DVU2106, located between orp1 and orp2, collaborates with σ54-RNA polymerase to orchestrate the simultaneous expression of the divergent orp operons. DVU2106, whose structural gene is transcribed by the σ70-RNA polymerase, negatively retrocontrols its own expression. By using an endogenous pulldown strategy, we identify a physiological complex composed of DVU2103, DVU2104, DVU2105, DVU2108, and DVU2109. Interestingly, inactivation of DVU2106, which is required for orp operon transcription, induces morphological defects that are likely linked to the absence of the ORP complex. A putative role of the ORP proteins in positioning the septum during cell division is discussed. PMID:21531797
Macaulay, Iain C; Tijssen, Marloes R; Thijssen-Timmer, Daphne C; Gusnanto, Arief; Steward, Michael; Burns, Philippa; Langford, Cordelia F; Ellis, Peter D; Dudbridge, Frank; Zwaginga, Jaap-Jan; Watkins, Nicholas A; van der Schoot, C Ellen; Ouwehand, Willem H
2007-04-15
To identify previously unknown platelet receptors we compared the transcriptomes of in vitro differentiated megakaryocytes (MKs) and erythroblasts (EBs). RNA was obtained from purified, biologically paired MK and EB cultures and compared using cDNA microarrays. Bioinformatical analysis of MK-up-regulated genes identified 151 transcripts encoding transmembrane domain-containing proteins. Although many of these were known platelet genes, a number of previously unidentified or poorly characterized transcripts were also detected. Many of these transcripts, including G6b, G6f, LRRC32, LAT2, and the G protein-coupled receptor SUCNR1, encode proteins with structural features or functions that suggest they may be involved in the modulation of platelet function. Immunoblotting on platelets confirmed the presence of the encoded proteins, and flow cytometric analysis confirmed the expression of G6b, G6f, and LRRC32 on the surface of platelets. Through comparative analysis of expression in platelets and other blood cells we demonstrated that G6b, G6f, and LRRC32 are restricted to the platelet lineage, whereas LAT2 and SUCNR1 were also detected in other blood cells. The identification of the succinate receptor SUCNR1 in platelets is of particular interest, because physiologically relevant concentrations of succinate were shown to potentiate the effect of low doses of a variety of platelet agonists.
Extraordinary Structured Noncoding RNAs Revealed by Bacterial Metagenome Analysis
Weinberg, Zasha; Perreault, Jonathan; Meyer, Michelle M.; Breaker, Ronald R.
2012-01-01
Estimates of the total number of bacterial species1-3 suggest that existing DNA sequence databases carry only a tiny fraction of the total amount of DNA sequence space represented by this division of life. Indeed, environmental DNA samples have been shown to encode many previously unknown classes of proteins4 and RNAs5. Bioinformatics searches6-10 of genomic DNA from bacteria commonly identify novel noncoding RNAs (ncRNAs)10-12 such as riboswitches13,14. In rare instances, RNAs that exhibit more extensive sequence and structural conservation across a wide range of bacteria are encountered15,16. Given that large structured RNAs are known to carry out complex biochemical functions such as protein synthesis and RNA processing reactions, identifying more RNAs of great size and intricate structure is likely to reveal additional biochemical functions that can be achieved by RNA. We applied an updated computational pipeline17 to discover ncRNAs that rival the known large ribozymes in size and structural complexity or that are among the most abundant RNAs in bacteria that encode them. These RNAs would have been difficult or impossible to detect without examining environmental DNA sequences, suggesting that numerous RNAs with extraordinary size, structural complexity, or other exceptional characteristics remain to be discovered in unexplored sequence space. PMID:19956260
Simon, Matthew J; Murchison, Charles; Iliff, Jeffrey J
2018-02-01
Astrocytes play a critical role in regulating the interface between the cerebral vasculature and the central nervous system. Contributing to this is the astrocytic endfoot domain, a specialized structure that ensheathes the entirety of the vasculature and mediates signaling between endothelial cells, pericytes, and neurons. The astrocytic endfoot has been implicated as a critical element of the glymphatic pathway, and changes in protein expression profiles in this cellular domain are linked to Alzheimer's disease pathology. Despite this, basic physiological properties of this structure remain poorly understood including the developmental timing of its formation, and the protein components that localize there to mediate its functions. Here we use human transcriptome data from male and female subjects across several developmental stages and brain regions to characterize the gene expression profile of the dystrophin-associated complex (DAC), a known structural component of the astrocytic endfoot that supports perivascular localization of the astroglial water channel aquaporin-4. Transcriptomic profiling is also used to define genes exhibiting parallel expression profiles to DAC elements, generating a pool of candidate genes that encode gene products that may contribute to the physiological function of the perivascular astrocytic endfoot domain. We found that several genes encoding transporter proteins are transcriptionally associated with DAC genes. © 2017 Wiley Periodicals, Inc.
Xu, Aishi; Li, Guang; Yang, Dong; Wu, Songfeng; Ouyang, Hongsheng; Xu, Ping; He, Fuchu
2015-12-04
Although the "missing protein" is a temporary concept in C-HPP, the biological information for their "missing" could be an important clue in evolutionary studies. Here we classified missing-protein-encoding genes into two groups, the genes encoding PE2 proteins (with transcript evidence) and the genes encoding PE3/4 proteins (with no transcript evidence). These missing-protein-encoding genes distribute unevenly among different chromosomes, chromosomal regions, or gene clusters. In the view of evolutionary features, PE3/4 genes tend to be young, spreading at the nonhomology chromosomal regions and evolving at higher rates. Interestingly, there is a higher proportion of singletons in PE3/4 genes than the proportion of singletons in all genes (background) and OTCSGs (organ, tissue, cell type-specific genes). More importantly, most of the paralogous PE3/4 genes belong to the newly duplicated members of the paralogous gene groups, which mainly contribute to special biological functions, such as "smell perception". These functions are heavily restricted into specific type of cells, tissues, or specific developmental stages, acting as the new functional requirements that facilitated the emergence of the missing-protein-encoding genes during evolution. In addition, the criteria for the extremely special physical-chemical proteins were first set up based on the properties of PE2 proteins, and the evolutionary characteristics of those proteins were explored. Overall, the evolutionary analyses of missing-protein-encoding genes are expected to be highly instructive for proteomics and functional studies in the future.
The ABC protein turned chloride channel whose failure causes cystic fibrosis
NASA Astrophysics Data System (ADS)
Gadsby, David C.; Vergani, Paola; Csanády, László
2006-03-01
CFTR chloride channels are encoded by the gene mutated in patients with cystic fibrosis. These channels belong to the superfamily of ABC transporter ATPases. ATP-driven conformational changes, which in other ABC proteins fuel uphill substrate transport across cellular membranes, in CFTR open and close a gate to allow transmembrane flow of anions down their electrochemical gradient. New structural and biochemical information from prokaryotic ABC proteins and functional information from CFTR channels has led to a unifying mechanism explaining those ATP-driven conformational changes.
Toward a unified nomenclature for mammalian ADP-ribosyltransferases.
Hottiger, Michael O; Hassa, Paul O; Lüscher, Bernhard; Schüler, Herwig; Koch-Nolte, Friedrich
2010-04-01
ADP-ribosylation is a post-translational modification of proteins catalyzed by ADP-ribosyltransferases. It comprises the transfer of the ADP-ribose moiety from NAD+ to specific amino acid residues on substrate proteins or to ADP-ribose itself. Currently, 22 human genes encoding proteins that possess an ADP-ribosyltransferase catalytic domain are known. Recent structural and enzymological evidence of poly(ADP-ribose)polymerase (PARP) family members demonstrate that earlier proposed names and classifications of these proteins are no longer accurate. Here we summarize these new findings and propose a new consensus nomenclature for all ADP-ribosyltransferases (ARTs) based on the catalyzed reaction and on structural features. A unified nomenclature would facilitate communication between researchers both inside and outside the ADP-ribosylation field. 2009 Elsevier Ltd. All rights reserved.
Discovery of new enzymes and metabolic pathways by using structure and genome context.
Zhao, Suwen; Kumar, Ritesh; Sakai, Ayano; Vetting, Matthew W; Wood, B McKay; Brown, Shoshana; Bonanno, Jeffery B; Hillerich, Brandan S; Seidel, Ronald D; Babbitt, Patricia C; Almo, Steven C; Sweedler, Jonathan V; Gerlt, John A; Cronan, John E; Jacobson, Matthew P
2013-10-31
Assigning valid functions to proteins identified in genome projects is challenging: overprediction and database annotation errors are the principal concerns. We and others are developing computation-guided strategies for functional discovery with 'metabolite docking' to experimentally derived or homology-based three-dimensional structures. Bacterial metabolic pathways often are encoded by 'genome neighbourhoods' (gene clusters and/or operons), which can provide important clues for functional assignment. We recently demonstrated the synergy of docking and pathway context by 'predicting' the intermediates in the glycolytic pathway in Escherichia coli. Metabolite docking to multiple binding proteins and enzymes in the same pathway increases the reliability of in silico predictions of substrate specificities because the pathway intermediates are structurally similar. Here we report that structure-guided approaches for predicting the substrate specificities of several enzymes encoded by a bacterial gene cluster allowed the correct prediction of the in vitro activity of a structurally characterized enzyme of unknown function (PDB 2PMQ), 2-epimerization of trans-4-hydroxy-L-proline betaine (tHyp-B) and cis-4-hydroxy-D-proline betaine (cHyp-B), and also the correct identification of the catabolic pathway in which Hyp-B 2-epimerase participates. The substrate-liganded pose predicted by virtual library screening (docking) was confirmed experimentally. The enzymatic activities in the predicted pathway were confirmed by in vitro assays and genetic analyses; the intermediates were identified by metabolomics; and repression of the genes encoding the pathway by high salt concentrations was established by transcriptomics, confirming the osmolyte role of tHyp-B. This study establishes the utility of structure-guided functional predictions to enable the discovery of new metabolic pathways.
2010-01-01
Background The phloem of dicotyledonous plants contains specialized P-proteins (phloem proteins) that accumulate during sieve element differentiation and remain parietally associated with the cisternae of the endoplasmic reticulum in mature sieve elements. Wounding causes P-protein filaments to accumulate at the sieve plates and block the translocation of photosynthate. Specialized, spindle-shaped P-proteins known as forisomes that undergo reversible calcium-dependent conformational changes have evolved exclusively in the Fabaceae. Recently, the molecular characterization of three genes encoding forisome components in the model legume Medicago truncatula (MtSEO1, MtSEO2 and MtSEO3; SEO = sieve element occlusion) was reported, but little is known about the molecular characteristics of P-proteins in non-Fabaceae. Results We performed a comprehensive genome-wide comparative analysis by screening the M. truncatula, Glycine max, Arabidopsis thaliana, Vitis vinifera and Solanum phureja genomes, and a Malus domestica EST library for homologs of MtSEO1, MtSEO2 and MtSEO3 and identified numerous novel SEO genes in Fabaceae and even non-Fabaceae plants, which do not possess forisomes. Even in Fabaceae some SEO genes appear to not encode forisome components. All SEO genes have a similar exon-intron structure and are expressed predominantly in the phloem. Phylogenetic analysis revealed the presence of several subgroups with Fabaceae-specific subgroups containing all of the known as well as newly identified forisome component proteins. We constructed Hidden Markov Models that identified three conserved protein domains, which characterize SEO proteins when present in combination. In addition, one common and three subgroup specific protein motifs were found in the amino acid sequences of SEO proteins. SEO genes are organized in genomic clusters and the conserved synteny allowed us to identify several M. truncatula vs G. max orthologs as well as paralogs within the G. max genome. Conclusions The unexpected occurrence of forisome-like genes in non-Fabaceae plants may indicate that these proteins encode species-specific P-proteins, which is backed up by the phloem-specific expression profiles. The conservation of gene structure, the presence of specific motifs and domains and the genomic synteny argue for a common phylogenetic origin of forisomes and other P-proteins. PMID:20932300
[Entification of the Rubella virus genotype 1H in Western Siberia].
Seregin, S V; Babkin, I V; Petrova, I D; Iashina, L N; Malkova, E M; Petrov, V S
2011-01-01
Molecular epidemiological study of novel strain of Rubella virus isolated during the outbreak in Western Siberia in 2004 was described. Detailed phylogenetic analysis performed based upon entire SP-region, which encodes all three Rubella structural proteins (C, E2, and E1), was implemented. This analysis provides characterization of this strain and classifies it as 1H genotype, thereby correcting previous classification of this strain based upon shorter nucleotide sequence, only encoding E1 protein. Therefore, this study identified the genotype of the Rubella virus not previously detected in Western Siberia (and even entire Russian Federation), which highlights the importance of more extensive characterization of genetic variability of the Rubella virus, especially with regard to potential influence of vaccination on the Rubella virus mutagenesis.
Simmons, Michael J; Peterson, Mark P; Thorp, Michael W; Buschette, Jared T; DiPrima, Stephanie N; Harter, Christine L; Skolnick, Matthew J
2015-03-01
Transposons, especially retrotransposons, are abundant in the genome of Drosophila melanogaster. These mobile elements are regulated by small RNAs that interact with the Piwi family of proteins-the piwi-interacting or piRNAs. The Piwi proteins are encoded by the genes argonaute3 (ago3), aubergine (aub), and piwi. Heterochromatin Protein 1 (HP1), a chromatin-organizing protein encoded by the Suppressor of variegation 205 [Su(var)205] gene, also plays a role in this regulation. To assess the mutational impact of weakening the system for transposon regulation, we measured the frequency of recessive X-linked lethal mutations occurring in the germ lines of males from stocks that were heterozygous for mutant alleles of the ago3, aub, piwi, or Su(var)205 genes. These mutant alleles are expected to deplete the wild-type proteins encoded by these genes by as much as 50%. The mutant alleles of piwi and Su(var)205 significantly increased the X-linked lethal mutation frequency, whereas the mutant alleles of ago3 did not. An increased mutation frequency was also observed in males from one of two mutant aub stocks, but this increase may not have been due to the aub mutant. The increased mutation frequency caused by depleting Piwi or HP1suggests that chromatin-organizing proteins play important roles in minimizing the germ-line mutation rate, possibly by stabilizing the structure of the heterochromatin in which many transposons are situated. Copyright © 2015 Elsevier B.V. All rights reserved.
Kopylov, Artur T; Ilgisonis, Ekaterina V; Moysa, Alexander A; Tikhonova, Olga V; Zavialova, Maria G; Novikova, Svetlana E; Lisitsa, Andrey V; Ponomarenko, Elena A; Moshkovskii, Sergei A; Markin, Andrey A; Grigoriev, Anatoly I; Zgoda, Victor G; Archakov, Alexander I
2016-11-04
This work was aimed at estimating the concentrations of proteins encoded by human chromosome 18 (Chr 18) in plasma samples of 54 healthy male volunteers (aged 20-47). These young persons have been certified by the medical evaluation board as healthy subjects ready for space flight training. Over 260 stable isotope-labeled peptide standards (SIS) were synthesized to perform the measurements of proteins encoded by Chr 18. Selected reaction monitoring (SRM) with SIS allowed an estimate of the levels of 84 of 276 proteins encoded by Chr 18. These proteins were quantified in whole and depleted plasma samples. Concentration of the proteins detected varied from 10 -6 M (transthyretin, P02766) to 10 -11 M (P4-ATPase, O43861). A minor part of the proteins (mostly representing intracellular proteins) was characterized by extremely high inter individual variations. The results provide a background for studies of a potential biomarker in plasma among proteins encoded by Chr 18. The SRM raw data are available in ProteomeXchange repository (PXD004374).
Structural and evolutionary analysis of Leishmania Alba proteins.
da Costa, Kauê Santana; Galúcio, João Marcos Pereira; Leonardo, Elvis Santos; Cardoso, Guelber; Leal, Élcio; Conde, Guilherme; Lameira, Jerônimo
2017-10-01
The Alba superfamily proteins share a common RNA-binding domain. These proteins participate in a variety of regulatory pathways by controlling developmental gene expression. They also interact with ribosomal subunits, translation factors, and other RNA-binding proteins. The Leishmania infantum genome encodes two Alba-domain proteins, LiAlba1 and LiAlba3. In this work, we used homology modeling, protein-protein docking, and molecular dynamics (MD) simulations to explore the details of the Alba1-Alba3-RNA complex from Leishmania infantum at the molecular level. In addition, we compared the structure of LiAlba3 with the human ribonuclease P component, Rpp20. We also mapped the ligand-binding residues on the Alba3 surface to analyze its druggability and performed mutational analyses in Alba3 using alanine scanning to identify residues involved in its function and structural stability. These results suggest that the RGG-box motif of LiAlba1 is important for protein function and stability. Finally, we discuss the function of Alba proteins in the context of pathogen adaptation to host cells. The data provided herein will facilitate further translational research regarding Alba structure and function. Copyright © 2017 Elsevier B.V. All rights reserved.
Imaging intracellular pH in live cells with a genetically encoded red fluorescent protein sensor.
Tantama, Mathew; Hung, Yin Pun; Yellen, Gary
2011-07-06
Intracellular pH affects protein structure and function, and proton gradients underlie the function of organelles such as lysosomes and mitochondria. We engineered a genetically encoded pH sensor by mutagenesis of the red fluorescent protein mKeima, providing a new tool to image intracellular pH in live cells. This sensor, named pHRed, is the first ratiometric, single-protein red fluorescent sensor of pH. Fluorescence emission of pHRed peaks at 610 nm while exhibiting dual excitation peaks at 440 and 585 nm that can be used for ratiometric imaging. The intensity ratio responds with an apparent pK(a) of 6.6 and a >10-fold dynamic range. Furthermore, pHRed has a pH-responsive fluorescence lifetime that changes by ~0.4 ns over physiological pH values and can be monitored with single-wavelength two-photon excitation. After characterizing the sensor, we tested pHRed's ability to monitor intracellular pH by imaging energy-dependent changes in cytosolic and mitochondrial pH.
Primary structure, expression and chromosomal locus of a human homolog of rat ERK3.
Meloche, S; Beatty, B G; Pellerin, J
1996-10-03
We report the cloning and characterization of a human cDNA encoding a novel homolog of rat extracellular signal-regulated kinase 3 (ERK3). The cDNA encodes a predicted protein of 721 amino acids which shares 92% amino acid identity with rat ERK3 over their shared length. Interestingly, the human protein contains a unique extension of 178 amino acids at its carboxy terminal extremity. The human ERK3 protein also displays various degrees of homology to other members of the MAP kinases family, but does not contain the typical TXY regulatory motif between subdomains VII and VIII. Northern blot analysis revealed that ERK3 mRNA is widely distributed in human tissues, with the highest expression detected in skeletal muscle. The human ERK3 gene was mapped by fluorescence in situ hybridization to chromosome 15q21, a region associated with chromosomal abnormalities in acute nonlymphoblastic leukemias. This information should prove valuable in designing studies to define the cellular function of the ERK3 protein kinase.
Patent protection for structural genomics-related inventions.
Vinarov, Sara D
2003-01-01
Recently there have been some important developments with respect to the patentability of inventions in the field of structural genomics. The leaders of the European Patent Office (EPO), Japan Patent Office (JPO) and the United States Patent Office (USPTO) came together for a trilateral meeting to conduct a comparative study on protein 3-dimensional (3-D) structure related claims in an effort to come to a mutual understanding about the examination of such inventions. The three patent offices were presented with eight different cases: 1) 3-D structural data of a protein per se; 2) computer-readable storage medium encoded with structural data of a protein; 3) protein defined by its tertiary structure; 4) crystals of known proteins; 5) binding pockets and protein domains; 6) and 7) are both directed to in silico screening methods directed to a specific protein; and 8) pharmacophores. The preliminary conclusions reached at the trilateral meeting provide clarity regarding the types of inventions that may be patentable given a specific set of scientific facts in a patent application. Therefore, the guidance provided by this study will help inventors, attorneys and other patent practitioners who file for patent protection on structural genomics-based inventions both here and abroad comply with the patentability requirements of each office.
Alicea, Ismael; Marvin, Jonathan S; Miklos, Aleksandr E; Ellington, Andrew D; Looger, Loren L; Schreiter, Eric R
2011-12-02
The phnD gene of Escherichia coli encodes the periplasmic binding protein of the phosphonate (Pn) uptake and utilization pathway. We have crystallized and determined structures of E. coli PhnD (EcPhnD) in the absence of ligand and in complex with the environmentally abundant 2-aminoethylphosphonate (2AEP). Similar to other bacterial periplasmic binding proteins, 2AEP binds near the center of mass of EcPhnD in a cleft formed between two lobes. Comparison of the open, unliganded structure with the closed 2AEP-bound structure shows that the two lobes pivot around a hinge by ~70° between the two states. Extensive hydrogen bonding and electrostatic interactions stabilize 2AEP, which binds to EcPhnD with low nanomolar affinity. These structures provide insight into Pn uptake by bacteria and facilitated the rational design of high signal-to-noise Pn biosensors based on both coupled small-molecule dyes and autocatalytic fluorescent proteins. Copyright © 2011 Elsevier Ltd. All rights reserved.
Alicea, Ismael; Marvin, Jonathan S.; Miklos, Aleksandr E.; Ellington, Andrew D.; Looger, Loren L.; Schreiter, Eric R.
2012-01-01
The phnD gene of Escherichia coli encodes the periplasmic binding protein of the phosphonate uptake and utilization pathway. We have crystallized and determined structures of E. coli PhnD (EcPhnD) in the absence of ligand and in complex with the environmentally abundant 2-aminoethylphosphonate (2AEP). Similar to other bacterial periplasmic binding proteins, 2AEP binds near the center of mass of EcPhnD in a cleft formed between two lobes. Comparison of the open, unliganded structure with the closed 2AEP-bound structure shows that the two lobes pivot around a hinge by ~70° between the two states. Extensive hydrogen bonding and electrostatic interactions stabilize 2AEP, which binds to EcPhnD with low nanomolar affinity. These structures provide insight into phosphonate uptake by bacteria and facilitated the rational design of high signal-to-noise phosphonate biosensors based both on coupled small molecule dyes and autocatalytic fluorescent proteins. PMID:22019591
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alicea, Ismael; Marvin, Jonathan S.; Miklos, Aleksandr E.
2012-09-17
The phnD gene of Escherichia coli encodes the periplasmic binding protein of the phosphonate (Pn) uptake and utilization pathway. We have crystallized and determined structures of E. coli PhnD (EcPhnD) in the absence of ligand and in complex with the environmentally abundant 2-aminoethylphosphonate (2AEP). Similar to other bacterial periplasmic binding proteins, 2AEP binds near the center of mass of EcPhnD in a cleft formed between two lobes. Comparison of the open, unliganded structure with the closed 2AEP-bound structure shows that the two lobes pivot around a hinge by {approx}70{sup o} between the two states. Extensive hydrogen bonding and electrostatic interactionsmore » stabilize 2AEP, which binds to EcPhnD with low nanomolar affinity. These structures provide insight into Pn uptake by bacteria and facilitated the rational design of high signal-to-noise Pn biosensors based on both coupled small-molecule dyes and autocatalytic fluorescent proteins.« less
The sieve element occlusion gene family in dicotyledonous plants
Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Noll, Gundula A
2011-01-01
Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae. PMID:21422825
Construction of a filamentous phage display peptide library.
Fagerlund, Annette; Myrset, Astrid Hilde; Kulseth, Mari Ann
2014-01-01
The concept of phage display is based on insertion of random oligonucleotides at an appropriate location within a structural gene of a bacteriophage. The resulting phage will constitute a library of random peptides displayed on the surface of the bacteriophages, with the encoding genotype packaged within each phage particle. Using a phagemid/helper phage system, the random peptides are interspersed between wild-type coat proteins. Libraries of phage-expressed peptides may be used to search for novel peptide ligands to target proteins. The success of finding a peptide with a desired property in a given library is highly dependent on the diversity and quality of the library. The protocols in this chapter describe the construction of a high-diversity library of phagemid vector encoding fusions of the phage coat protein pVIII with random peptides, from which a phage library displaying random peptides can be prepared.
The sieve element occlusion gene family in dicotyledonous plants.
Ernst, Antonia M; Rüping, Boris; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A
2011-01-01
Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae.
Kozak, Natalia A; Buss, Meghan; Lucas, Claressa E; Frace, Michael; Govil, Dhwani; Travis, Tatiana; Olsen-Rasmussen, Melissa; Benson, Robert F; Fields, Barry S
2010-02-01
Legionella longbeachae causes most cases of legionellosis in Australia and may be underreported worldwide due to the lack of L. longbeachae-specific diagnostic tests. L. longbeachae displays distinctive differences in intracellular trafficking, caspase 1 activation, and infection in mouse models compared to Legionella pneumophila, yet these two species have indistinguishable clinical presentations in humans. Unlike other legionellae, which inhabit freshwater systems, L. longbeachae is found predominantly in moist soil. In this study, we sequenced and annotated the genome of an L. longbeachae clinical isolate from Oregon, isolate D-4968, and compared it to the previously published genomes of L. pneumophila. The results revealed that the D-4968 genome is larger than the L. pneumophila genome and has a gene order that is different from that of the L. pneumophila genome. Genes encoding structural components of type II, type IV Lvh, and type IV Icm/Dot secretion systems are conserved. In contrast, only 42/140 homologs of genes encoding L. pneumophila Icm/Dot substrates have been found in the D-4968 genome. L. longbeachae encodes numerous proteins with eukaryotic motifs and eukaryote-like proteins unique to this species, including 16 ankyrin repeat-containing proteins and a novel U-box protein. We predict that these proteins are secreted by the L. longbeachae Icm/Dot secretion system. In contrast to the L. pneumophila genome, the L. longbeachae D-4968 genome does not contain flagellar biosynthesis genes, yet it contains a chemotaxis operon. The lack of a flagellum explains the failure of L. longbeachae to activate caspase 1 and trigger pyroptosis in murine macrophages. These unique features of L. longbeachae may reflect adaptation of this species to life in soil.
Live-cell imaging of cell signaling using genetically encoded fluorescent reporters.
Ni, Qiang; Mehta, Sohum; Zhang, Jin
2018-01-01
Synergistic advances in fluorescent protein engineering and live-cell imaging techniques in recent years have fueled the concurrent development and application of genetically encoded fluorescent reporters that are tailored for tracking signaling dynamics in living systems over multiple length and time scales. These biosensors are uniquely suited for this challenging task, owing to their specificity, sensitivity, and versatility, as well as to the noninvasive and nondestructive nature of fluorescence and the power of genetic encoding. Over the past 10 years, a growing number of fluorescent reporters have been developed for tracking a wide range of biological signals in living cells and animals, including second messenger and metabolite dynamics, enzyme activation and activity, and cell cycle progression and neuronal activity. Many of these biosensors are gaining wide use and are proving to be indispensable for unraveling the complex biological functions of individual signaling molecules in their native environment, the living cell, shedding new light on the structural and molecular underpinnings of cell signaling. In this review, we highlight recent advances in protein engineering that are likely to help expand and improve the design and application of these valuable tools. We then turn our focus to specific examples of live-cell imaging using genetically encoded fluorescent reporters as an important platform for advancing our understanding of G protein-coupled receptor signaling and neuronal activity. © 2017 Federation of European Biochemical Societies.
Functional analysis of the Brassica napus L. phytoene synthase (PSY) gene family.
López-Emparán, Ada; Quezada-Martinez, Daniela; Zúñiga-Bustos, Matías; Cifuentes, Víctor; Iñiguez-Luy, Federico; Federico, María Laura
2014-01-01
Phytoene synthase (PSY) has been shown to catalyze the first committed and rate-limiting step of carotenogenesis in several crop species, including Brassica napus L. Due to its pivotal role, PSY has been a prime target for breeding and metabolic engineering the carotenoid content of seeds, tubers, fruits and flowers. In Arabidopsis thaliana, PSY is encoded by a single copy gene but small PSY gene families have been described in monocot and dicotyledonous species. We have recently shown that PSY genes have been retained in a triplicated state in the A- and C-Brassica genomes, with each paralogue mapping to syntenic locations in each of the three "Arabidopsis-like" subgenomes. Most importantly, we have shown that in B. napus all six members are expressed, exhibiting overlapping redundancy and signs of subfunctionalization among photosynthetic and non photosynthetic tissues. The question of whether this large PSY family actually encodes six functional enzymes remained to be answered. Therefore, the objectives of this study were to: (i) isolate, characterize and compare the complete protein coding sequences (CDS) of the six B. napus PSY genes; (ii) model their predicted tridimensional enzyme structures; (iii) test their phytoene synthase activity in a heterologous complementation system and (iv) evaluate their individual expression patterns during seed development. This study further confirmed that the six B. napus PSY genes encode proteins with high sequence identity, which have evolved under functional constraint. Structural modeling demonstrated that they share similar tridimensional protein structures with a putative PSY active site. Significantly, all six B. napus PSY enzymes were found to be functional. Taking into account the specific patterns of expression exhibited by these PSY genes during seed development and recent knowledge of PSY suborganellar localization, the selection of transgene candidates for metabolic engineering the carotenoid content of oilseeds is discussed.
Li, Na; Yan, Yunhuan; Zhang, Angke; Gao, Jiming; Zhang, Chong; Wang, Xue; Hou, Gaopeng; Zhang, Gaiping; Jia, Jinbu; Zhou, En-Min; Xiao, Shuqi
2016-12-13
Many viruses encode microRNAs (miRNAs) that are small non-coding single-stranded RNAs which play critical roles in virus-host interactions. Porcine reproductive and respiratory syndrome virus (PRRSV) is one of the most economically impactful viruses in the swine industry. The present study sought to determine whether PRRSV encodes miRNAs that could regulate PRRSV replication. Four viral small RNAs (vsRNAs) were mapped to the stem-loop structures in the ORF1a, ORF1b and GP2a regions of the PRRSV genome by bioinformatics prediction and experimental verification. Of these, the structures with the lowest minimum free energy (MFE) values predicted for PRRSV-vsRNA1 corresponded to typical stem-loop, hairpin structures. Inhibition of PRRSV-vsRNA1 function led to significant increases in viral replication. Transfection with PRRSV-vsRNA1 mimics significantly inhibited PRRSV replication in primary porcine alveolar macrophages (PAMs). The time-dependent increase in the abundance of PRRSV-vsRNA1 mirrored the gradual upregulation of PRRSV RNA expression. Knockdown of proteins associated with cellular miRNA biogenesis demonstrated that Drosha and Argonaute (Ago2) are involved in PRRSV-vsRNA1 biogenesis. Moreover, PRRSV-vsRNA1 bound specifically to the nonstructural protein 2 (NSP2)-coding sequence of PRRSV genome RNA. Collectively, the results reveal that PRRSV encodes a functional PRRSV-vsRNA1 which auto-regulates PRRSV replication by directly targeting and suppressing viral NSP2 gene expression. These findings not only provide new insights into the mechanism of the pathogenesis of PRRSV, but also explore a potential avenue for controlling PRRSV infection using viral small RNAs.
2017-01-01
Normal aging is associated with a decline in episodic memory and also with aggregation of the β-amyloid (Aβ) and tau proteins and atrophy of medial temporal lobe (MTL) structures crucial to memory formation. Although some evidence suggests that Aβ is associated with aberrant neural activity, the relationships among these two aggregated proteins, neural function, and brain structure are poorly understood. Using in vivo human Aβ and tau imaging, we demonstrate that increased Aβ and tau are both associated with aberrant fMRI activity in the MTL during memory encoding in cognitively normal older adults. This pathological neural activity was in turn associated with worse memory performance and atrophy within the MTL. A mediation analysis revealed that the relationship with regional atrophy was explained by MTL tau. These findings broaden the concept of cognitive aging to include evidence of Alzheimer's disease-related protein aggregation as an underlying mechanism of age-related memory impairment. SIGNIFICANCE STATEMENT Alterations in episodic memory and the accumulation of Alzheimer's pathology are common in cognitively normal older adults. However, evidence of pathological effects on episodic memory has largely been limited to β-amyloid (Aβ). Because Aβ and tau often cooccur in older adults, previous research offers an incomplete understanding of the relationship between pathology and episodic memory. With the recent development of in vivo tau PET radiotracers, we show that Aβ and tau are associated with different aspects of memory encoding, leading to aberrant neural activity that is behaviorally detrimental. In addition, our results provide evidence linking Aβ- and tau-associated neural dysfunction to brain atrophy. PMID:28213439
Gambetta, Gregory A; Matthews, Mark A; Syvanen, Michael
2018-05-04
Xylella fastidiosa (Xf) is a gram negative bacterium inhabiting the plant vascular system. In most species this bacterium lives as a benign symbiote, but in several agriculturally important plants (e.g. coffee, citrus, grapevine) Xf is pathogenic. Xf has four loci encoding homologues to hemolysin RTX proteins, virulence factors involved in a wide range of plant pathogen interactions. We show that all four genes are expressed during pathogenesis in grapevine. The sequences from these four genes have a complex repetitive structure. At the C-termini, sequence diversity between strains is what would be expected from orthologous genes. However, within strains there is no N-terminal homology, indicating these loci encode RTXs of different functions and/or specificities. More striking is that many of the orthologous loci between strains share this extreme variation at the N-termini. Thus these RTX orthologues are most easily visualized as fusions between the orthologous C-termini and different N-termini. Further, the four genes are found in operons having a peculiar structure with an extensively duplicated module encoding a small protein with homology to the N-terminal region of the full length RTX. Surprisingly, some of these small peptides are most similar not to their corresponding full length RTX, but to the N-termini of RTXs from other Xf strains, and even other remotely related species. These results demonstrate that these genes are expressed in planta during pathogenesis. Their structure suggests extensive evolutionary restructuring through horizontal gene transfers and heterologous recombination mechanisms. The sum of the evidence suggests these repetitive modules are a novel kind of mobile genetic element.
Makeyev, Aleksandr V.; Erdenechimeg, Lkhamsuren; Mungunsukh, Ognoon; Roth, Jutta J.; Enkhmandakh, Badam; Ruddle, Frank H.; Bayarsaihan, Dashzeveg
2004-01-01
Williams–Beuren syndrome (also known as Williams syndrome) is caused by a deletion of a 1.55- to 1.84-megabase region from chromosome band 7q11.23. GTF2IRD1 and GTF2I, located within this critical region, encode proteins of the TFII-I family with multiple helix–loop–helix domains known as I repeats. In the present work, we characterize a third member, GTF2IRD2, which has sequence and structural similarity to the GTF2I and GTF2IRD1 paralogs. The ORF encodes a protein with several features characteristic of regulatory factors, including two I repeats, two leucine zippers, and a single Cys-2/His-2 zinc finger. The genomic organization of human, baboon, rat, and mouse genes is well conserved. Our exon-by-exon comparison has revealed that GTF2IRD2 is more closely related to GTF2I than to GTF2IRD1 and apparently is derived from the GTF2I sequence. The comparison of GTF2I and GTF2IRD2 genes revealed two distinct regions of homology, indicating that the helix–loop–helix domain structure of the GTF2IRD2 gene has been generated by two independent genomic duplications. We speculate that GTF2I is derived from GTF2IRD1 as a result of local duplication and the further evolution of its structure was associated with its functional specialization. Comparison of genomic sequences surrounding GTF2IRD2 genes in mice and humans allows refinement of the centromeric breakpoint position of the primate-specific inversion within the Williams–Beuren syndrome critical region. PMID:15243160
Makeyev, Aleksandr V; Erdenechimeg, Lkhamsuren; Mungunsukh, Ognoon; Roth, Jutta J; Enkhmandakh, Badam; Ruddle, Frank H; Bayarsaihan, Dashzeveg
2004-07-27
Williams-Beuren syndrome (also known as Williams syndrome) is caused by a deletion of a 1.55- to 1.84-megabase region from chromosome band 7q11.23. GTF2IRD1 and GTF2I, located within this critical region, encode proteins of the TFII-I family with multiple helix-loop-helix domains known as I repeats. In the present work, we characterize a third member, GTF2IRD2, which has sequence and structural similarity to the GTF2I and GTF2IRD1 paralogs. The ORF encodes a protein with several features characteristic of regulatory factors, including two I repeats, two leucine zippers, and a single Cys-2/His-2 zinc finger. The genomic organization of human, baboon, rat, and mouse genes is well conserved. Our exon-by-exon comparison has revealed that GTF2IRD2 is more closely related to GTF2I than to GTF2IRD1 and apparently is derived from the GTF2I sequence. The comparison of GTF2I and GTF2IRD2 genes revealed two distinct regions of homology, indicating that the helix-loop-helix domain structure of the GTF2IRD2 gene has been generated by two independent genomic duplications. We speculate that GTF2I is derived from GTF2IRD1 as a result of local duplication and the further evolution of its structure was associated with its functional specialization. Comparison of genomic sequences surrounding GTF2IRD2 genes in mice and humans allows refinement of the centromeric breakpoint position of the primate-specific inversion within the Williams-Beuren syndrome critical region.
Lieutaud, Philippe; Uversky, Alexey V.; Uversky, Vladimir N.; Longhi, Sonia
2016-01-01
ABSTRACT In the last 2 decades it has become increasingly evident that a large number of proteins are either fully or partially disordered. Intrinsically disordered proteins lack a stable 3D structure, are ubiquitous and fulfill essential biological functions. Their conformational heterogeneity is encoded in their amino acid sequences, thereby allowing intrinsically disordered proteins or regions to be recognized based on properties of these sequences. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to structural determination with X-ray crystallization. This article discusses a comprehensive selection of databases and methods currently employed to disseminate experimental and putative annotations of disorder, predict disorder and identify regions involved in induced folding. It also provides a set of detailed instructions that should be followed to perform computational analysis of disorder. PMID:28232901
Paterno, Gary D; Ding, Zhihu; Lew, Yuan-Y; Nash, Gord W; Mercer, F Corinne; Gillespie, Laura L
2002-07-24
mi-er1 (previously called er1) is a fibroblast growth factor-inducible early response gene activated during mesoderm induction in Xenopus embryos and encoding a nuclear protein that functions as a transcriptional activator. The human orthologue of mi-er1 was shown to be upregulated in breast carcinoma cell lines and breast tumours when compared to normal breast cells. In this report, we investigate the structure of the human mi-er1 (hmi-er1) gene and characterize the alternatively spliced transcripts and protein isoforms. hmi-er1 is a single copy gene located at 1p31.2 and spanning 63 kb. It contains 17 exons and includes one skipped exon, a facultative intron and three polyadenylation signals to produce 12 transcripts encoding six distinct proteins. hmi-er1 transcripts were expressed at very low levels in most human adult tissues and the mRNA isoform pattern varied with the tissue. The 12 transcripts encode proteins containing a common internal sequence with variable N- and C-termini. Three distinct N- and two distinct C-termini were identified, giving rise to six protein isoforms. The two C-termini differ significantly in size and sequence and arise from alternate use of a facultative intron to produce hMI-ER1alpha and hMI-ER1beta. In all tissues except testis, transcripts encoding the beta isoform were predominant. hMI-ER1alpha lacks the predicted nuclear localization signal and transfection assays revealed that, unlike hMI-ER1beta, it is not a nuclear protein, but remains in the cytoplasm. Our results demonstrate that alternate use of a facultative intron regulates the subcellular localization of hMI-ER1 proteins and this may have important implications for hMI-ER1 function.
Grussendorf, Kelly A.; Trezza, Christopher J.; Salem, Alexander T.; Al-Hashimi, Hikmat; Mattingly, Brendan C.; Kampmeyer, Drew E.; Khan, Liakot A.; Hall, David H.; Göbel, Verena; Ackley, Brian D.; Buechner, Matthew
2016-01-01
Determination of luminal diameter is critical to the function of small single-celled tubes. A series of EXC proteins, including EXC-1, prevent swelling of the tubular excretory canals in Caenorhabditis elegans. In this study, cloning of exc-1 reveals it to encode a homolog of mammalian IRG proteins, which play roles in immune response and autophagy and are associated with Crohn’s disease. Mutants in exc-1 accumulate early endosomes, lack recycling endosomes, and exhibit abnormal apical cytoskeletal structure in regions of enlarged tubules. EXC-1 interacts genetically with two other EXC proteins that also affect endosomal trafficking. In yeast two-hybrid assays, wild-type and putative constitutively active EXC-1 binds to the LIM-domain protein EXC-9, whose homolog, cysteine-rich intestinal protein, is enriched in mammalian intestine. These results suggest a model for IRG function in forming and maintaining apical tubule structure via regulation of endosomal recycling. PMID:27334269
Pietrowski, D; Durante, M J; Liebstein, A; Schmitt-John, T; Werner, T; Graw, J
1994-07-08
The promoter of the murine gamma E-crystallin (gamma E-Cry) encoding gene (gamma E-cry) was analyzed for specific interactions with lenticular proteins in a gel-retardation assay. A 21-bp fragment immediately downstream of the transcription initiation site (DOTIS) is demonstrated to be responsible for specific interactions with lens extracts. The DOTIS-binding protein(s) accept only the sense DNA strand as target; anti-sense or double-stranded DNA do not interact with these proteins. The DOTIS sequence element is highly conserved among the murine gamma D-, gamma E- and gamma F-cry and is present at comparable positions in the orthologous rat genes. Only a weak or even no protein-binding activity is observed if a few particular bases are changed, as in the rat gamma A-, gamma C- and gamma E-cry elements. DOTIS-binding proteins were found in commercially available bovine alpha-Cry preparations. The essential participation of alpha-Cry in the DNA-binding protein complex was confirmed using alpha-Cry-specific monoclonal antibody. The results reported here point to a novel function of alpha-Cry besides the structural properties in the lens.
Accelerating Biomedical Research in Designing Diagnostic Assays, Drugs, and Vaccines
2010-10-01
biodefense. For example, USAMRIID researchers are using Dovis to initiate drug discovery efforts against the ricin A-chain toxin and the Ebola virus...in host cell invasion and bacterial toxin production). Traditional experimental methods to determine the functions of proteins encoded in genomic...readily modeled. A second study involved determining the pro- tein structure of VP24, the smallest protein in the Ebola and Marburg virus genomes.9
Steigemann, Birthe; Schulz, Annina; Werten, Sebastiaan
2013-11-15
The RNA polymerase II cofactor PC4 globally regulates transcription of protein-encoding genes through interactions with unwinding DNA, the basal transcription machinery and transcription activators. Here, we report the surprising identification of PC4 homologs in all sequenced representatives of the T5 family of bacteriophages, as well as in an archaeon and seven phyla of eubacteria. We have solved the crystal structure of the full-length T5 protein at 1.9Å, revealing a striking resemblance to the characteristic single-stranded DNA (ssDNA)-binding core domain of PC4. Intriguing novel structural features include a potential regulatory region at the N-terminus and a C-terminal extension of the homodimerisation interface. The genome organisation of T5-related bacteriophages points at involvement of the PC4 homolog in recombination-dependent DNA replication, strongly suggesting that the protein corresponds to the hitherto elusive replicative ssDNA-binding protein of the T5 family. Our findings imply that PC4-like factors intervene in multiple unwinding-related processes by acting as versatile modifiers of nucleic acid conformation and raise the possibility that the eukaryotic transcription coactivator derives from ancestral DNA replication, recombination and repair factors. © 2013.
Iovinella, Immacolata; Dani, Francesca Romana; Niccolini, Alberto; Sagona, Simona; Michelucci, Elena; Gazzano, Angelo; Turillazzi, Stefano; Felicioli, Antonio; Pelosi, Paolo
2011-08-05
Odorant-binding proteins (OBPs) and chemosensory proteins (CSPs) mediate both perception and release of chemical stimuli in insects. The genome of the honey bee contains 21 genes encoding OBPs and 6 encoding CSPs. Using a proteomic approach, we have investigated the expression of OBPs and CSPs in the mandibular glands of adult honey bees in relation to caste and age. OBP13 is mostly expressed in young individuals and in virgin queens, while OBP21 is abundant in older bees and is prevalent in mated queens. OBP14, which had been found in larvae, is produced in hive workers' glands. Quite unexpectedly, the mandibular glands of drones also contain OBPs, mainly OBP18 and OBP21. We have expressed three of the most represented OBPs and studied their binding properties. OBP13 binds with good specificity oleic acid and some structurally related compounds, OBP14 is better tuned to monoterpenoid structures, while OBP21 binds the main components of queen mandibular pheromone as well as farnesol, a compound used as a trail pheromone in the honey bee and other hymenopterans. The high expression of different OBPs in the mandibular glands suggests that such proteins could be involved in solubilization and release of semiochemicals.
A genetically encoded fluorescent tRNA is active in live-cell protein synthesis
Masuda, Isao; Igarashi, Takao; Sakaguchi, Reiko; Nitharwal, Ram G.; Takase, Ryuichi; Han, Kyu Young; Leslie, Benjamin J.; Liu, Cuiping; Gamper, Howard; Ha, Taekjip; Sanyal, Suparna
2017-01-01
Abstract Transfer RNAs (tRNAs) perform essential tasks for all living cells. They are major components of the ribosomal machinery for protein synthesis and they also serve in non-ribosomal pathways for regulation and signaling metabolism. We describe the development of a genetically encoded fluorescent tRNA fusion with the potential for imaging in live Escherichia coli cells. This tRNA fusion carries a Spinach aptamer that becomes fluorescent upon binding of a cell-permeable and non-toxic fluorophore. We show that, despite having a structural framework significantly larger than any natural tRNA species, this fusion is a viable probe for monitoring tRNA stability in a cellular quality control mechanism that degrades structurally damaged tRNA. Importantly, this fusion is active in E. coli live-cell protein synthesis allowing peptidyl transfer at a rate sufficient to support cell growth, indicating that it is accommodated by translating ribosomes. Imaging analysis shows that this fusion and ribosomes are both excluded from the nucleoid, indicating that the fusion and ribosomes are in the cytosol together possibly engaged in protein synthesis. This fusion methodology has the potential for developing new tools for live-cell imaging of tRNA with the unique advantage of both stoichiometric labeling and broader application to all cells amenable to genetic engineering. PMID:27956502
Krol, Kamil; Jendrysek, Justyna; Debski, Janusz; Skoneczny, Marek; Kurlandzka, Anna; Kaminska, Joanna; Dadlez, Michal; Skoneczna, Adrianna
2017-04-11
Ribosomal RNA-encoding genes (rDNA) are the most abundant genes in eukaryotic genomes. To meet the high demand for rRNA, rDNA genes are present in multiple tandem repeats clustered on a single or several chromosomes and are vastly transcribed. To facilitate intensive transcription and prevent rDNA destabilization, the rDNA-encoding portion of the chromosome is confined in the nucleolus. However, the rDNA region is susceptible to recombination and DNA damage, accumulating mutations, rearrangements and atypical DNA structures. Various sophisticated techniques have been applied to detect these abnormalities. Here, we present a simple method for the evaluation of the activity and integrity of an rDNA region called a "DNA cloud assay". We verified the efficacy of this method using yeast mutants lacking genes important for nucleolus function and maintenance (RAD52, SGS1, RRM3, PIF1, FOB1 and RPA12). The DNA cloud assay permits the evaluation of nucleolus status and is compatible with downstream analyses, such as the chromosome comet assay to identify DNA structures present in the cloud and mass spectrometry of agarose squeezed proteins (ASPIC-MS) to detect nucleolar DNA-bound proteins, including Las17, the homolog of human Wiskott-Aldrich Syndrome Protein (WASP).
Krol, Kamil; Jendrysek, Justyna; Debski, Janusz; Skoneczny, Marek; Kurlandzka, Anna; Kaminska, Joanna; Dadlez, Michal; Skoneczna, Adrianna
2017-01-01
Ribosomal RNA-encoding genes (rDNA) are the most abundant genes in eukaryotic genomes. To meet the high demand for rRNA, rDNA genes are present in multiple tandem repeats clustered on a single or several chromosomes and are vastly transcribed. To facilitate intensive transcription and prevent rDNA destabilization, the rDNA-encoding portion of the chromosome is confined in the nucleolus. However, the rDNA region is susceptible to recombination and DNA damage, accumulating mutations, rearrangements and atypical DNA structures. Various sophisticated techniques have been applied to detect these abnormalities. Here, we present a simple method for the evaluation of the activity and integrity of an rDNA region called a “DNA cloud assay”. We verified the efficacy of this method using yeast mutants lacking genes important for nucleolus function and maintenance (RAD52, SGS1, RRM3, PIF1, FOB1 and RPA12). The DNA cloud assay permits the evaluation of nucleolus status and is compatible with downstream analyses, such as the chromosome comet assay to identify DNA structures present in the cloud and mass spectrometry of agarose squeezed proteins (ASPIC-MS) to detect nucleolar DNA-bound proteins, including Las17, the homolog of human Wiskott-Aldrich Syndrome Protein (WASP). PMID:28212567
Neuenfeldt, Anne; Lorber, Bernard; Ennifar, Eric; Gaudry, Agnès; Sauter, Claude; Sissler, Marie; Florentz, Catherine
2013-02-01
In the mammalian mitochondrial translation apparatus, the proteins and their partner RNAs are coded by two genomes. The proteins are nuclear-encoded and resemble their homologs, whereas the RNAs coming from the rapidly evolving mitochondrial genome have lost critical structural information. This raises the question of molecular adaptation of these proteins to their peculiar partner RNAs. The crystal structure of the homodimeric bacterial-type human mitochondrial aspartyl-tRNA synthetase (DRS) confirmed a 3D architecture close to that of Escherichia coli DRS. However, the mitochondrial enzyme distinguishes by an enlarged catalytic groove, a more electropositive surface potential and an alternate interaction network at the subunits interface. It also presented a thermal stability reduced by as much as 12°C. Isothermal titration calorimetry analyses revealed that the affinity of the mitochondrial enzyme for cognate and non-cognate tRNAs is one order of magnitude higher, but with different enthalpy and entropy contributions. They further indicated that both enzymes bind an adenylate analog by a cooperative allosteric mechanism with different thermodynamic contributions. The larger flexibility of the mitochondrial synthetase with respect to the bacterial enzyme, in combination with a preserved architecture, may represent an evolutionary process, allowing nuclear-encoded proteins to cooperate with degenerated organelle RNAs.
Structural requirements of oleosin domains for subcellular targeting to the oil body.
van Rooijen, G J; Moloney, M M
1995-01-01
We have investigated the protein domains responsible for the correct subcellular targeting of plant seed oleosins. We have attempted to study this targeting in vivo using "tagged" oleosins in transgenic plants. Different constructs were prepared lacking gene sequences encoding one of three structural domains of natural oleosins. Each was fused in frame to the Escherichia coli uid A gene encoding beta-glucuronidase (GUS). These constructs were introduced into Brassica napus using Agrobacterium-mediated transformation. GUS activity was measured in washed oil bodies and in the soluble protein fraction of the transgenic seeds. It was found that complete Arabidopsis oleosin-GUS fusions undergo correct subcellular targeting in transgenic Brassica seeds. Removal of the C-terminal domain of the Arabidopsis oleosin comprising the last 48 amino acids had no effect on overall subcellular targeting. In contrast, loss of the first 47 amino acids (N terminus) or amino acids 48 to 113 (which make up a lipophilic core) resulted in impaired targeting of the fusion protein to the oil bodies and greatly reduced accumulation of the fusion protein. Northern blotting revealed that this reduction is not due to differences in mRNA accumulation. Results from these measurements indicated that both the N-terminal and central oleosin domain are important for targeting to the oil body and show that there is a direct correlation between the inability to target to the oil body and protein stability. PMID:8539295
Primary structure and mapping of the hupA gene of Salmonella typhimurium.
Higgins, N P; Hillyard, D
1988-01-01
In bacteria, the complex nucleoid structure is folded and maintained by negative superhelical tension and a set of type II DNA-binding proteins, also called histonelike proteins. The most abundant type II DNA-binding protein is HU. Southern blot analysis showed that Salmonella typhimurium contained two HU genes that corresponded to Escherichia coli genes hupA (encoding HU-2 protein) and hupB (encoding HU-1). Salmonella hupA was cloned, and the nucleotide sequence of the gene was determined. Comparison of hupA of E. coli and S. typhimurium revealed that the HU-2 proteins were identical and that there was high conservation of nucleotide sequences outside the coding frames of the genes. A 300-member genomic library of S. typhimurium was constructed by using random transposition of MudP, a specialized chimeric P22-Mu phage that packages chromosomal DNA unidirectionally from its insertion point. Oligonucleotide hybridization against the library identified one MudP insertion that lies within 28 kilobases of hupA; the MudP was 12% linked to purH at 90.5 min on the standard map. Plasmids expressing HU-2 had a surprising phenotype; they caused growth arrest when they were introduced into E. coli strains bearing a himA or hip mutation. These results suggest that IHF and HU have interactive roles in bacteria. Images PMID:3056912
Antigenic Determinants of Alpha-Like Proteins of Streptococcus agalactiae
Maeland, Johan A.; Bevanger, Lars; Lyng, Randi Valsoe
2004-01-01
The majority of group B streptococcus (GBS) isolates express one or more of a family of surface-anchored proteins that vary by strain and that form ladder-like patterns on Western blotting due to large repeat units. These proteins, which are important as GBS serotype markers and as inducers of protective antibodies, include the alpha C (Cα) and R4 proteins and the recently described alpha-like protein 2 (Alp2), encoded by alp2, and Alp3, encoded by alp3. In this study, we examined antigenic determinants possessed by Alp2 and Alp3 by testing of antibodies raised in rabbits, mainly by using enzyme-linked immunosorbent assays (ELISA) and an ELISA absorption test. The results showed that Alp2 and Alp3 shared an antigenic determinant, which may be a unique immunological marker of the Alp variants of GBS proteins. Alp2, in addition, possessed an antigenic determinant which showed specificity for Alp2 and a third determinant which showed serological cross-reactivity with Cα. Alp3, in addition to the determinant common to Alp2 and Alp3, harbored an antigenic site which also was present in the R4 protein, whereas no Alp3-specific antigenic site was detected. These ELISA-based results were confirmed by Western blotting and a fluorescent-antibody test. The results are consistent with highly complex antigenic structures of the alpha-like proteins in a fashion which is in agreement with the recently described structural mosaicism of the alp2 and alp3 genes. The results are expected to influence GBS serotyping, immunoprotection studies, and GBS vaccine developments. PMID:15539502
Principles of protein folding--a perspective from simple exact models.
Dill, K. A.; Bromberg, S.; Yue, K.; Fiebig, K. M.; Yee, D. P.; Thomas, P. D.; Chan, H. S.
1995-01-01
General principles of protein structure, stability, and folding kinetics have recently been explored in computer simulations of simple exact lattice models. These models represent protein chains at a rudimentary level, but they involve few parameters, approximations, or implicit biases, and they allow complete explorations of conformational and sequence spaces. Such simulations have resulted in testable predictions that are sometimes unanticipated: The folding code is mainly binary and delocalized throughout the amino acid sequence. The secondary and tertiary structures of a protein are specified mainly by the sequence of polar and nonpolar monomers. More specific interactions may refine the structure, rather than dominate the folding code. Simple exact models can account for the properties that characterize protein folding: two-state cooperativity, secondary and tertiary structures, and multistage folding kinetics--fast hydrophobic collapse followed by slower annealing. These studies suggest the possibility of creating "foldable" chain molecules other than proteins. The encoding of a unique compact chain conformation may not require amino acids; it may require only the ability to synthesize specific monomer sequences in which at least one monomer type is solvent-averse. PMID:7613459
A Novel Kinesin-Like Protein with a Calmodulin-Binding Domain
NASA Technical Reports Server (NTRS)
Wang, W.; Takezawa, D.; Narasimhulu, S. B.; Reddy, A. S. N.; Poovaiah, B. W.
1996-01-01
Calcium regulates diverse developmental processes in plants through the action of calmodulin. A cDNA expression library from developing anthers of tobacco was screened with S-35-labeled calmodulin to isolate cDNAs encoding calmodulin-binding proteins. Among several clones isolated, a kinesin-like gene (TCK1) that encodes a calmodulin-binding kinesin-like protein was obtained. The TCK1 cDNA encodes a protein with 1265 amino acid residues. Its structural features are very similar to those of known kinesin heavy chains and kinesin-like proteins from plants and animals, with one distinct exception. Unlike other known kinesin-like proteins, TCK1 contains a calmodulin-binding domain which distinguishes it from all other known kinesin genes. Escherichia coli-expressed TCK1 binds calmodulin in a Ca(2+)-dependent manner. In addition to the presence of a calmodulin-binding domain at the carboxyl terminal, it also has a leucine zipper motif in the stalk region. The amino acid sequence at the carboxyl terminal of TCK1 has striking homology with the mechanochemical motor domain of kinesins. The motor domain has ATPase activity that is stimulated by microtubules. Southern blot analysis revealed that TCK1 is coded by a single gene. Expression studies indicated that TCKI is expressed in all of the tissues tested. Its expression is highest in the stigma and anther, especially during the early stages of anther development. Our results suggest that Ca(2+)/calmodulin may play an important role in the function of this microtubule-associated motor protein and may be involved in the regulation of microtubule-based intracellular transport.
Niiranen, Laila; Lian, Kjersti; Johnson, Kenneth A; Moe, Elin
2015-02-27
Deinococcus radiodurans is an extremely radiation and desiccation resistant bacterium which can tolerate radiation doses up to 5,000 Grays without losing viability. We are studying the role of DNA repair and replication proteins for this unusual phenotype by a structural biology approach. The DNA polymerase III β subunit (β-clamp) acts as a sliding clamp on DNA, promoting the binding and processivity of many DNA-acting proteins, and here we report the crystal structure of D. radiodurans β-clamp (Drβ-clamp) at 2.0 Å resolution. The sequence verification process revealed that at the time of the study the gene encoding Drβ-clamp was wrongly annotated in the genome database, encoding a protein of 393 instead of 362 amino acids. The short protein was successfully expressed, purified and used for crystallisation purposes in complex with Cy5-labeled DNA. The structure, which was obtained from blue crystals, shows a typical ring-shaped bacterial β-clamp formed of two monomers, each with three domains of identical topology, but with no visible DNA in electron density. A visualisation of the electrostatic surface potential reveals a highly negatively charged outer surface while the inner surface and the dimer forming interface have a more even charge distribution. The structure of Drβ-clamp was determined to 2.0 Å resolution and shows an evenly distributed electrostatic surface charge on the DNA interacting side. We hypothesise that this charge distribution may facilitate efficient movement on encircled DNA and help ensure efficient DNA metabolism in D. radiodurans upon exposure to high doses of ionizing irradiation or desiccation.
Thionin-D4E1 chimeric protein protects plants against bacterial infections
Stover, Eddie W; Gupta, Goutam; Hao, Guixia
2017-08-08
The generation of a chimeric protein containing a first domain encoding either a pro-thionon or thionin, a second domain encoding D4E1 or pro-D4E1, and a third domain encoding a peptide linker located between the first domain and second domain is described. Either the first domain or the second domain is located at the amino terminal of the chimeric protein and the other domain (second domain or first domain, respectively) is located at the carboxyl terminal. The chimeric protein has antibacterial activity. Genetically altered plants and their progeny expressing a polynucleotide encoding the chimeric protein resist diseases caused by bacteria.
Reuther, Peter; Göpfert, Kristina; Dudek, Alexandra H.; Heiner, Monika; Herold, Susanne; Schwemmle, Martin
2015-01-01
Influenza A viruses (IAV) pose a constant threat to the human population and therefore a better understanding of their fundamental biology and identification of novel therapeutics is of upmost importance. Various reporter-encoding IAV were generated to achieve these goals, however, one recurring difficulty was the genetic instability especially of larger reporter genes. We employed the viral NS segment coding for the non-structural protein 1 (NS1) and nuclear export protein (NEP) for stable expression of diverse reporter proteins. This was achieved by converting the NS segment into a single open reading frame (ORF) coding for NS1, the respective reporter and NEP. To allow expression of individual proteins, the reporter genes were flanked by two porcine Teschovirus-1 2A peptide (PTV-1 2A)-coding sequences. The resulting viruses encoding luciferases, fluorescent proteins or a Cre recombinase are characterized by a high genetic stability in vitro and in mice and can be readily employed for antiviral compound screenings, visualization of infected cells or cells that survived acute infection. PMID:26068081
Partial Gene Cloning and Enzyme Structure Modeling of Exolevanase Fragment from Bacillus subtilis
NASA Astrophysics Data System (ADS)
Azhar, M.; Natalia, D.; Syukur, S.; Andriani, N.; Jamsari, J.
2018-04-01
Inulin hydrolysis thermophilic and thermotolerant bacteria are potential sources of inulin hydrolysis enzymes. Partial gene that encodes inulin hydrolysis enzymes had been isolated from Bacillus subtilis using polymerase chain reaction (PCR) method with the DPE.slFandDPE.eR degenerative primers. The partial gene was cloned into pGEM-T Easy vector with E. coli as host cells and analyzed using BLASTx, CrustalW2, and Phyre2 programs. Size of thepartial gene had been found539 bp that encoded 179aminoacid residues of protein fragment. The sequences of protein fragment was more similar to exolevanase than exoinulinase. The protein fragment had conserved motif FSGS, and specific hits GH32 β-fructosidase. It had three residues of active site and five residues of substrate binding. The active site on the protein fragment were D (1-WLNDP-5), D (125-FRDPK-129) and E (177-WEC-179). Substrate binding on the protein fragment were ND (1-WLNDP-5), Q (18-FYQY-21), FS (60-FSGS-63) RD (125-FRDPK-129) and E (177-WEC-179).
Structure-Function Analysis of Chloroplast Proteins via Random Mutagenesis Using Error-Prone PCR.
Dumas, Louis; Zito, Francesca; Auroy, Pascaline; Johnson, Xenie; Peltier, Gilles; Alric, Jean
2018-06-01
Site-directed mutagenesis of chloroplast genes was developed three decades ago and has greatly advanced the field of photosynthesis research. Here, we describe a new approach for generating random chloroplast gene mutants that combines error-prone polymerase chain reaction of a gene of interest with chloroplast complementation of the knockout Chlamydomonas reinhardtii mutant. As a proof of concept, we targeted a 300-bp sequence of the petD gene that encodes subunit IV of the thylakoid membrane-bound cytochrome b 6 f complex. By sequencing chloroplast transformants, we revealed 149 mutations in the 300-bp target petD sequence that resulted in 92 amino acid substitutions in the 100-residue target subunit IV sequence. Our results show that this method is suited to the study of highly hydrophobic, multisubunit, and chloroplast-encoded proteins containing cofactors such as hemes, iron-sulfur clusters, and chlorophyll pigments. Moreover, we show that mutant screening and sequencing can be used to study photosynthetic mechanisms or to probe the mutational robustness of chloroplast-encoded proteins, and we propose that this method is a valuable tool for the directed evolution of enzymes in the chloroplast. © 2018 American Society of Plant Biologists. All rights reserved.
Bell, Andrew; Moreau, Carol; Chinoy, Catherine; Spanner, Rebecca; Dalmais, Marion; Le Signor, Christine; Bendahmane, Abdel; Klenell, Markus; Domoney, Claire
2015-12-01
Among a set of genes in pea (Pisum sativum L.) that were induced under drought-stress growth conditions, one encoded a protein with significant similarity to a regulator of chlorophyll catabolism, SGR. This gene, SGRL, is distinct from SGR in genomic location, encoded carboxy-terminal motif, and expression through plant and seed development. Divergence of the two encoded proteins is associated with a loss of similarity in intron/exon gene structure. Transient expression of SGRL in leaves of Nicotiana benthamiana promoted the degradation of chlorophyll, in a manner that was distinct from that shown by SGR. Removal of a predicted transmembrane domain from SGRL reduced its activity in transient expression assays, although variants with and without this domain reduced SGR-induced chlorophyll degradation, indicating that the effects of the two proteins are not additive. The combined data suggest that the function of SGRL during growth and development is in chlorophyll re-cycling, and its mode of action is distinct from that of SGR. Studies of pea sgrL mutants revealed that plants had significantly lower stature and yield, a likely consequence of reduced photosynthetic efficiencies in mutant compared with control plants under conditions of high light intensity.
Rice Ribosomal Protein Large Subunit Genes and Their Spatio-temporal and Stress Regulation
Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Madhav, Sheshu M.; Kirti, P. B.
2016-01-01
Ribosomal proteins (RPs) are well-known for their role in mediating protein synthesis and maintaining the stability of the ribosomal complex, which includes small and large subunits. In the present investigation, in a genome-wide survey, we predicted that the large subunit of rice ribosomes is encoded by at least 123 genes including individual gene copies, distributed throughout the 12 chromosomes. We selected 34 candidate genes, each having 2–3 identical copies, for a detailed characterization of their gene structures, protein properties, cis-regulatory elements and comprehensive expression analysis. RPL proteins appear to be involved in interactions with other RP and non-RP proteins and their encoded RNAs have a higher content of alpha-helices in their predicted secondary structures. The majority of RPs have binding sites for metal and non-metal ligands. Native expression profiling of 34 ribosomal protein large (RPL) subunit genes in tissues covering the major stages of rice growth shows that they are predominantly expressed in vegetative tissues and seedlings followed by meiotically active tissues like flowers. The putative promoter regions of these genes also carry cis-elements that respond specifically to stress and signaling molecules. All the 34 genes responded differentially to the abiotic stress treatments. Phytohormone and cold treatments induced significant up-regulation of several RPL genes, while heat and H2O2 treatments down-regulated a majority of them. Furthermore, infection with a bacterial pathogen, Xanthomonas oryzae, which causes leaf blight also induced the expression of 80% of the RPL genes in leaves. Although the expression of RPL genes was detected in all the tissues studied, they are highly responsive to stress and signaling molecules indicating that their encoded proteins appear to have roles in stress amelioration besides house-keeping. This shows that the RPL gene family is a valuable resource for manipulation of stress tolerance in rice and other crops, which may be achieved by overexpressing and raising independent transgenic plants carrying the genes that became up-regulated significantly and instantaneously. PMID:27605933
Recombinant vaccinia/Venezuelan equine encephalitis (VEE) virus expresses VEE structural proteins.
Kinney, R M; Esposito, J J; Johnson, B J; Roehrig, J T; Mathews, J H; Barrett, A D; Trent, D W
1988-12-01
cDNA molecules encoding the structural proteins of the virulent Trinidad donkey and the TC-83 vaccine strains of Venezuelan equine encephalitis (VEE) virus were inserted under control of the vaccinia virus 7.5K promoter into the thymidine kinase gene of vaccinia virus. Synthesis of the capsid protein and glycoproteins E2 and E1 of VEE virus was demonstrated by immunoblotting of lysates of CV-1 cells infected with recombinant vaccinia/VEE viruses. VEE glycoproteins were detected in recombinant virus-infected cells by fluorescent antibody (FA) analysis performed with a panel of VEE-specific monoclonal antibodies. Seven E2-specific epitopes and two of four E1-specific epitopes were demonstrated by FA.
Iida, Satoko; Kobiyama, Atsushi; Ogata, Takehiko; Murakami, Akio
2008-01-01
Plastid encoded genes of the dinoflagellates are rapidly evolving and most divergent. The importance of unusually accumulated mutations on structure of PSII core protein and photosynthetic function was examined in the dinoflagellates, Symbiodinium sp. and Alexandrium tamarense. Full-length cDNA sequences of psbA (D1 protein) and psbD (D2 protein) were obtained and compared with the other oxygen-evolving photoautotrophs. Twenty-three amino acid positions (7%) for the D1 protein and 34 positions (10%) for the D2 were mutated in the dinoflagellates, although amino acid residues at these positions were conserved in cyanobacteria, the other algae, and plant. Many mutations were likely to distribute in the N-terminus and the D-E interhelical loop of the D1 protein and helix B of D2 protein, while the remaining regions were well conserved. The different structural properties in these mutated regions were supported by hydropathy profiles. The chlorophyll fluorescence kinetics of the dinoflagellates was compared with Synechocystis sp. PCC6803 in relation to the altered protein structure.
Novel RepA-MCM proteins encoded in plasmids pTAU4, pORA1 and pTIK4 from Sulfolobus neozealandicus
Greve, Bo; Jensen, Susanne; Phan, Hoa; Brügger, Kim; Zillig, Wolfram; She, Qunxin; Garrett, Roger A.
2005-01-01
Three plasmids isolated from the crenarchaeal thermoacidophile Sulfolobus neozealandicus were characterized. Plasmids pTAU4 (7,192 bp), pORA1 (9,689 bp) and pTIK4 (13,638 bp) show unusual properties that distinguish them from previously characterized cryptic plasmids of the genus Sulfolobus. Plasmids pORA1 and pTIK4 encode RepA proteins, only the former of which carries the novel polymerase–primase domain of other known Sulfolobus plasmids. Plasmid pTAU4 encodes a mini-chromosome maintenance protein homolog and no RepA protein; the implications for DNA replication are considered. Plasmid pORA1 is the first Sulfolobus plasmid to be characterized that does not encode the otherwise highly conserved DNA-binding PlrA protein. Another encoded protein appears to be specific for the New Zealand plasmids. The three plasmids should provide useful model systems for functional studies of these important crenarchaeal proteins. PMID:15876565
Maitip, Jakkrawut; Trueman, Holly E; Kaehler, Benjamin D; Huttley, Gavin A; Chantawannakul, Panuwan; Sutherland, Tara D
2015-04-01
Multiple gene duplication events in the precursor of the Aculeata (bees, ants, hornets) gave rise to four silk genes. Whilst these homologs encode proteins with similar amino acid composition and coiled coil structure, the retention of all four homologs implies they each are important. In this study we identified, produced and characterized the four silk proteins from Apis dorsata, the giant Asian honeybee. The proteins were readily purified, allowing us to investigate the folding behavior of solutions of individual proteins in comparison to mixtures of all four proteins at concentrations where they assemble into their native coiled coil structure. In contrast to solutions of any one protein type, solutions of a mixture of the four proteins formed coiled coils that were stable against dilution and detergent denaturation. The results are consistent with the formation of a heteromeric coiled coil protein complex. The mechanism of silk protein coiled coil formation and evolution is discussed in light of these results. Copyright © 2015 Elsevier Ltd. All rights reserved.
Crystal structure of bacillus subtilis YdaF protein : a putative ribosomal N-acetyltransferase.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brunzelle, J. S.; Wu, R.; Korolev, S. V.
2004-12-01
Comparative sequence analysis suggests that the ydaF gene encodes a protein (YdaF) that functions as an N-acetyltransferase, more specifically, a ribosomal N-acetyltransferase. Sequence analysis using basic local alignment search tool (BLAST) suggests that YdaF belongs to a large family of proteins (199 proteins found in 88 unique species of bacteria, archaea, and eukaryotes). YdaF also belongs to the COG1670, which includes the Escherichia coli RimL protein that is known to acetylate ribosomal protein L12. N-acetylation (NAT) has been found in all kingdoms. NAT enzymes catalyze the transfer of an acetyl group from acetyl-CoA (AcCoA) to a primary amino group. Formore » example, NATs can acetylate the N-terminal {alpha}-amino group, the {epsilon}-amino group of lysine residues, aminoglycoside antibiotics, spermine/speridine, or arylalkylamines such as serotonin. The crystal structure of the alleged ribosomal NAT protein, YdaF, from Bacillus subtilis presented here was determined as a part of the Midwest Center for Structural Genomics. The structure maintains the conserved tertiary structure of other known NATs and a high sequence similarity in the presumed AcCoA binding pocket in spite of a very low overall level of sequence identity to other NATs of known structure.« less
Designing pH induced fold switch in proteins
NASA Astrophysics Data System (ADS)
Baruah, Anupaul; Biswas, Parbati
2015-05-01
This work investigates the computational design of a pH induced protein fold switch based on a self-consistent mean-field approach by identifying the ensemble averaged characteristics of sequences that encode a fold switch. The primary challenge to balance the alternative sets of interactions present in both target structures is overcome by simultaneously optimizing two foldability criteria corresponding to two target structures. The change in pH is modeled by altering the residual charge on the amino acids. The energy landscape of the fold switch protein is found to be double funneled. The fold switch sequences stabilize the interactions of the sites with similar relative surface accessibility in both target structures. Fold switch sequences have low sequence complexity and hence lower sequence entropy. The pH induced fold switch is mediated by attractive electrostatic interactions rather than hydrophobic-hydrophobic contacts. This study may provide valuable insights to the design of fold switch proteins.
Cooperative Subunit Refolding of a Light-Harvesting Protein through a Self-Chaperone Mechanism.
Laos, Alistair J; Dean, Jacob C; Toa, Zi S D; Wilk, Krystyna E; Scholes, Gregory D; Curmi, Paul M G; Thordarson, Pall
2017-07-10
The fold of a protein is encoded by its amino acid sequence, but how complex multimeric proteins fold and assemble into functional quaternary structures remains unclear. Here we show that two structurally different phycobiliproteins refold and reassemble in a cooperative manner from their unfolded polypeptide subunits, without biological chaperones. Refolding was confirmed by ultrafast broadband transient absorption and two-dimensional electronic spectroscopy to probe internal chromophores as a marker of quaternary structure. Our results demonstrate a cooperative, self-chaperone refolding mechanism, whereby the β-subunits independently refold, thereby templating the folding of the α-subunits, which then chaperone the assembly of the native complex, quantitatively returning all coherences. Our results indicate that subunit self-chaperoning is a robust mechanism for heteromeric protein folding and assembly that could also be applied in self-assembled synthetic hierarchical systems. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
USDA-ARS?s Scientific Manuscript database
Plant resistance (R) genes typically encode proteins with nucleotide binding site-leucine rich repeat (NLR) domains. We identified a novel, broad-spectrum rice blast R gene, Ptr, encoding a non-NLR protein with four Armadillo repeats. Ptr was originally identified by fast neutron mutagenesis as a ...
Alternative intronic promoters in development and disease.
Vacik, Tomas; Raska, Ivan
2017-05-01
Approximately 20,000 mammalian genes are estimated to encode between 250 thousand and 1 million different proteins. This enormous diversity of the mammalian proteome is caused by the ability of a single-gene locus to encode multiple protein isoforms. Protein isoforms encoded by one gene locus can be functionally distinct, and they can even have antagonistic functions. One of the mechanisms involved in creating this proteome complexity is alternative promoter usage. Alternative intronic promoters are located downstream from their canonical counterparts and drive the expression of alternative RNA isoforms that lack upstream exons. These upstream exons can encode some important functional domains, and proteins encoded by alternative mRNA isoforms can be thus functionally distinct from the full-length protein encoded by canonical mRNA isoforms. Since any misbalance of functionally distinct protein isoforms is likely to have detrimental consequences for the cell and the whole organism, their expression must be precisely regulated. Misregulation of alternative intronic promoters is frequently associated with various developmental defects and diseases including cancer, and it is becoming increasingly clear that this phenomenon deserves more attention.
Bassi, Maria R; Larsen, Mads A B; Kongsgaard, Michael; Rasmussen, Michael; Buus, Søren; Stryhn, Anette; Thomsen, Allan R; Christensen, Jan P
2016-02-01
The live attenuated yellow fever vaccine (YF-17D) has been successfully used for more than 70 years. It is generally considered a safe vaccine, however, recent reports of serious adverse events following vaccination have raised concerns and led to suggestions that even safer YF vaccines should be developed. Replication deficient adenoviruses (Ad) have been widely evaluated as recombinant vectors, particularly in the context of prophylactic vaccination against viral infections in which induction of CD8+ T-cell mediated immunity is crucial, but potent antibody responses may also be elicited using these vectors. In this study, we present two adenobased vectors targeting non-structural and structural YF antigens and characterize their immunological properties. We report that a single immunization with an Ad-vector encoding the non-structural protein 3 from YF-17D could elicit a strong CD8+ T-cell response, which afforded a high degree of protection from subsequent intracranial challenge of vaccinated mice. However, full protection was only observed using a vector encoding the structural proteins from YF-17D. This vector elicited virus-specific CD8+ T cells as well as neutralizing antibodies, and both components were shown to be important for protection thus mimicking the situation recently uncovered in YF-17D vaccinated mice. Considering that Ad-vectors are very safe, easy to produce and highly immunogenic in humans, our data indicate that a replication deficient adenovector-based YF vaccine may represent a safe and efficient alternative to the classical live attenuated YF vaccine and should be further tested.
Szczuka, Ewa; Urbańska, Katarzyna; Pietryka, Marta; Kaznowski, Adam
2013-01-01
Many serious diseases caused by Staphylococcus aureus appear to be associated with biofilms. Therefore, we investigated the biofilm-forming ability of the methicillin-resistant S. aureus (MRSA) isolates collected from hospitalized patients. As many as 96 % strains had the ability to form biofilm in vitro. The majority of S. aureus strains formed biofilm in ica-dependent mechanism. However, 23 % of MRSA isolates formed biofilm in ica-independent mechanism. Half of these strains carried fnbB genes encoding surface proteins fibronectin-binding protein B involved in intercellular accumulation and biofilm development in S. aureus strains. The biofilm structures were examined via confocal laser scanning microscopy (CLSM) and three-dimensional structures were reconstructed. The images obtained in CLSM revealed that the biofilm created by ica-positive strains was different from biofilm formed by ica-negative strains. The MRSA population showed a large genetic diversity and we did not find a single clone that occurred preferentially in hospital environment. Our results demonstrated the variation in genes encoding adhesins for the host matrix proteins (elastin, laminin, collagen, fibronectin, and fibrinogen) and in the gene involved in biofilm formation (icaA) within the majority of S. aureus clones.
Homez, a homeobox leucine zipper gene specific to the vertebrate lineage.
Bayarsaihan, Dashzeveg; Enkhmandakh, Badam; Makeyev, Aleksandr; Greally, John M; Leckman, James F; Ruddle, Frank H
2003-09-02
This work describes a vertebrate homeobox gene, designated Homez (homeodomain leucine zipper-encoding gene), that encodes a protein with an unusual structural organization. There are several regions within Homez, including three atypical homeodomains, two leucine zipper-like motifs, and an acidic domain. The gene is ubiquitously expressed in human and murine tissues, although the expression pattern is more restricted during mouse development. Genomic analysis revealed that human and mouse genes are located at 14q11.2 and 14C, respectively, and are composed of two exons. The zebrafish and pufferfish homologs share high similarity to mammalian sequences, particularly within the homeodomain sequences. Based on homology of homeodomains and on the similarity in overall protein structure, we delineate Homez and members of ZHX family of zinc finger homeodomain factors as a subset within the superfamily of homeobox-containing proteins. The type and composition of homeodomains in the Homez subfamily are vertebrate-specific. Phylogenetic analysis indicates that Homez lineage was separated from related genes >400 million years ago before separation of ray- and lobe-finned fishes. We apply a duplication-degeneration-complementation model to explain how this family of genes has evolved.
Photocontrollable Fluorescent Proteins for Superresolution Imaging
Shcherbakova, Daria M.; Sengupta, Prabuddha; Lippincott-Schwartz, Jennifer; Verkhusha, Vladislav V.
2014-01-01
Superresolution fluorescence microscopy permits the study of biological processes at scales small enough to visualize fine subcellular structures that are unresolvable by traditional diffraction-limited light microscopy. Many superresolution techniques, including those applicable to live cell imaging, utilize genetically encoded photocontrollable fluorescent proteins. The fluorescence of these proteins can be controlled by light of specific wavelengths. In this review, we discuss the biochemical and photophysical properties of photocontrollable fluorescent proteins that are relevant to their use in superresolution microscopy. We then describe the recently developed photoactivatable, photoswitchable, and reversibly photoswitchable fluorescent proteins, and we detail their particular usefulness in single-molecule localization–based and nonlinear ensemble–based superresolution techniques. Finally, we discuss recent applications of photocontrollable proteins in superresolution imaging, as well as how these applications help to clarify properties of intracellular structures and processes that are relevant to cell and developmental biology, neuroscience, cancer biology and biomedicine. PMID:24895855
Structure and assembly of a paramyxovirus matrix protein
Battisti, Anthony J.; Meng, Geng; Winkler, Dennis C.; McGinnes, Lori W.; Plevka, Pavel; Steven, Alasdair C.; Morrison, Trudy G.; Rossmann, Michael G.
2012-01-01
Many pleomorphic, lipid-enveloped viruses encode matrix proteins that direct their assembly and budding, but the mechanism of this process is unclear. We have combined X-ray crystallography and cryoelectron tomography to show that the matrix protein of Newcastle disease virus, a paramyxovirus and relative of measles virus, forms dimers that assemble into pseudotetrameric arrays that generate the membrane curvature necessary for virus budding. We show that the glycoproteins are anchored in the gaps between the matrix proteins and that the helical nucleocapsids are associated in register with the matrix arrays. About 90% of virions lack matrix arrays, suggesting that, in agreement with previous biological observations, the matrix protein needs to dissociate from the viral membrane during maturation, as is required for fusion and release of the nucleocapsid into the host’s cytoplasm. Structure and sequence conservation imply that other paramyxovirus matrix proteins function similarly. PMID:22891297
Structure and assembly of a paramyxovirus matrix protein.
Battisti, Anthony J; Meng, Geng; Winkler, Dennis C; McGinnes, Lori W; Plevka, Pavel; Steven, Alasdair C; Morrison, Trudy G; Rossmann, Michael G
2012-08-28
Many pleomorphic, lipid-enveloped viruses encode matrix proteins that direct their assembly and budding, but the mechanism of this process is unclear. We have combined X-ray crystallography and cryoelectron tomography to show that the matrix protein of Newcastle disease virus, a paramyxovirus and relative of measles virus, forms dimers that assemble into pseudotetrameric arrays that generate the membrane curvature necessary for virus budding. We show that the glycoproteins are anchored in the gaps between the matrix proteins and that the helical nucleocapsids are associated in register with the matrix arrays. About 90% of virions lack matrix arrays, suggesting that, in agreement with previous biological observations, the matrix protein needs to dissociate from the viral membrane during maturation, as is required for fusion and release of the nucleocapsid into the host's cytoplasm. Structure and sequence conservation imply that other paramyxovirus matrix proteins function similarly.
Chen, Hsu-Hsin; Luche, Ralf; Wei, Bo; Tonks, Nicholas K
2004-10-01
Dual specificity phosphatases (DSPs) are members of the protein-tyrosine phosphatase superfamily that dephosphorylate both phosphotyrosine and phosphoserine/threonine residues in vitro. Many DSPs have been found to play important roles in various aspects of cellular function and to be involved in human disease. We have identified a gene located on human chromosome 10q22.2, which utilizes alternative open reading frames (ORFs) to encode the following two distinct DSPs: the previously described testis and skeletal muscle-specific dual specificity phosphatase (TMDP) and a novel DSP, muscle-restricted dual specificity phosphatase (MDSP). Use of alternative ORFs encoding distinct proteins from a single gene is extremely rare in eukaryotes, and in all previously reported cases the two proteins produced from one gene are unrelated. To our knowledge this is the first example of a gene from which two distinct proteins of the same family are expressed using alternative ORFs. Here we provide evidence that both MDSP and TMDP proteins are expressed in vivo and are restricted to specific tissues, skeletal muscle and testis, respectively. Most interestingly, the protein expression profiles of both MDSP and TMDP during mouse postnatal development are strikingly similar. MDSP is expressed at very low levels in myotubes and early postnatal muscle. TMDP is not detectable in testis lysate in the first 3 weeks of life. The expression of both MDSP and TMDP proteins was markedly increased at approximately the 3rd week after birth and continued to increase gradually into adulthood, implying that the physiological functions of both DSPs are specific to the mature/late-developing organs. The conserved gene structure and the similarity in postnatal expression profile of these two proteins suggest biological significance of the unusual gene arrangement.
P-proteins in Arabidopsis are heteromeric structures involved in rapid sieve tube sealing.
Jekat, Stephan B; Ernst, Antonia M; von Bohl, Andreas; Zielonka, Sascia; Twyman, Richard M; Noll, Gundula A; Prüfer, Dirk
2013-01-01
Structural phloem proteins (P-proteins) are characteristic components of the sieve elements in all dicotyledonous and many monocotyledonous angiosperms. Tobacco P-proteins were recently confirmed to be encoded by the widespread sieve element occlusion (SEO) gene family, and tobacco SEO proteins were shown to be directly involved in sieve tube sealing thus preventing the loss of photosynthate. Analysis of the two Arabidopsis SEO proteins (AtSEOa and AtSEOb) indicated that the corresponding P-protein subunits do not act in a redundant manner. However, there are still pending questions regarding the interaction properties and specific functions of AtSEOa and AtSEOb as well as the general function of structural P-proteins in Arabidopsis. In this study, we characterized the Arabidopsis P-proteins in more detail. We used in planta bimolecular fluorescence complementation assays to confirm the predicted heteromeric interactions between AtSEOa and AtSEOb. Arabidopsis mutants depleted for one or both AtSEO proteins lacked the typical P-protein structures normally found in sieve elements, underlining the identity of AtSEO proteins as P-proteins and furthermore providing the means to determine the role of Arabidopsis P-proteins in sieve tube sealing. We therefore developed an assay based on phloem exudation. Mutants with reduced AtSEO expression levels lost twice as much photosynthate following injury as comparable wild-type plants, confirming that Arabidopsis P-proteins are indeed involved in sieve tube sealing.
Borlee, Bradley R; Goldman, Aaron D; Murakami, Keiji; Samudrala, Ram; Wozniak, Daniel J; Parsek, Matthew R
2010-01-01
Pseudomonas aeruginosa, the principal pathogen of cystic fibrosis patients, forms antibiotic-resistant biofilms promoting chronic colonization of the airways. The extracellular (EPS) matrix is a crucial component of biofilms that provides the community multiple benefits. Recent work suggests that the secondary messenger, cyclic-di-GMP, promotes biofilm formation. An analysis of factors specifically expressed in P. aeruginosa under conditions of elevated c-di-GMP, revealed functions involved in the production and maintenance of the biofilm extracellular matrix. We have characterized one of these components, encoded by the PA4625 gene, as a putative adhesin and designated it cdrA. CdrA shares structural similarities to extracellular adhesins that belong to two-partner secretion systems. The cdrA gene is in a two gene operon that also encodes a putative outer membrane transporter, CdrB. The cdrA gene encodes a 220 KDa protein that is predicted to be rod-shaped protein harbouring a β-helix structural motif. Western analysis indicates that the CdrA is produced as a 220 kDa proprotein and processed to 150 kDa before secretion into the extracellular medium. We demonstrated that cdrAB expression is minimal in liquid culture, but is elevated in biofilm cultures. CdrAB expression was found to promote biofilm formation and auto-aggregation in liquid culture. Aggregation mediated by CdrA is dependent on the Psl polysaccharide and can be disrupted by adding mannose, a key structural component of Psl. Immunoprecipitation of Psl present in culture supernatants resulted in co-immunoprecipitation of CdrA, providing additional evidence that CdrA directly binds to Psl. A mutation in cdrA caused a decrease in biofilm biomass and resulted in the formation of biofilms exhibiting decreased structural integrity. Psl-specific lectin staining suggests that CdrA either cross-links Psl polysaccharide polymers and/or tethers Psl to the cells, resulting in increased biofilm structural stability. Thus, this study identifies a key protein structural component of the P. aeruginosa EPS matrix. PMID:20088866
Molecular Genetics of Mitochondrial Disorders
ERIC Educational Resources Information Center
Wong, Lee-Jun C.
2010-01-01
Mitochondrial respiratory chain (RC) disorders (RCDs) are a group of genetically and clinically heterogeneous diseases because of the fact that protein components of the RC are encoded by both mitochondrial and nuclear genomes and are essential in all cells. In addition, the biogenesis, structure, and function of mitochondria, including DNA…
The Molecules of the Immune System.
ERIC Educational Resources Information Center
Tonegawa, Susumu
1985-01-01
The immune system includes the most diverse proteins known because they are encoded by hundreds of scattered gene fragments which can be combined in millions or billions of ways. Events of immune response, binding of antigens, antibody structure, T-cell receptors, and other immunologically-oriented topics are discussed. (DH)
Large protein as a potential target for use in rabies diagnostics.
Santos Katz, I S; Dias, M H; Lima, I F; Chaves, L B; Ribeiro, O G; Scheffer, K C; Iwai, L K
Rabies is a zoonotic viral disease that remains a serious threat to public health worldwide. The rabies lyssavirus (RABV) genome encodes five structural proteins, multifunctional and significant for pathogenicity. The large protein (L) presents well-conserved genomic regions, which may be a good alternative to generate informative datasets for development of new methods for rabies diagnosis. This paper describes the development of a technique for the identification of L protein in several RABV strains from different hosts, demonstrating that MS-based proteomics is a potential method for antigen identification and a good alternative for rabies diagnosis.
Ulvsbäck, M; Lindström, C; Weiber, H; Abrahamsson, P A; Lilja, H; Lundwall, A
1989-11-15
In order to study the gene expression of the seminal plasma protein beta-microseminoprotein, also known as PSP94 and beta-inhibin, clones encoding this protein were isolated from a cDNA library constructed in lambda gt11. Nucleotide sequencing confirmed the structure of a previously cloned cDNA. By northern blot analysis identical sized transcripts were demonstrated in the prostate, the respiratory (tracheal, bronchial and lung) tissues and the antrum part of the gastric mucosa. Thus, the protein is not primarily associated with male reproductive function. Although probably of no physiological significance, a slight structural similarity to the ovarian inhibin beta-chains was identified in the C-terminal half of the molecule.
The group II intron maturase: a reverse transcriptase and splicing factor go hand in hand.
Zhao, Chen; Pyle, Anna Marie
2017-12-01
The splicing of group II introns in vivo requires the assistance of a multifunctional intron encoded protein (IEP, or maturase). Each IEP is also a reverse-transcriptase enzyme that enables group II introns to behave as mobile genetic elements. During splicing or retro-transposition, each group II intron forms a tight, specific complex with its own encoded IEP, resulting in a highly reactive holoenzyme. This review focuses on the structural basis for IEP function, as revealed by recent crystal structures of an IEP reverse transcriptase domain and cryo-EM structures of an IEP-intron complex. These structures explain how the same IEP scaffold is utilized for intron recognition, splicing and reverse transcription, while providing a physical basis for understanding the evolutionary transformation of the IEP into the eukaryotic splicing factor Prp8. Copyright © 2017 Elsevier Ltd. All rights reserved.
Polypeptide p41 of a Norwalk-Like Virus Is a Nucleic Acid-Independent Nucleoside Triphosphatase
Pfister, Thomas; Wimmer, Eckard
2001-01-01
Southampton virus (SHV) is a member of the Norwalk-like viruses (NLVs), one of four genera of the family Caliciviridae. The genome of SHV contains three open reading frames (ORFs). ORF 1 encodes a polyprotein that is autocatalytically processed into six proteins, one of which is p41. p41 shares sequence motifs with protein 2C of picornaviruses and superfamily 3 helicases. We have expressed p41 of SHV in bacteria. Purified p41 exhibited nucleoside triphosphate (NTP)-binding and NTP hydrolysis activities. The NTPase activity was not stimulated by single-stranded nucleic acids. SHV p41 had no detectable helicase activity. Protein sequence comparison between the consensus sequences of NLV p41 and enterovirus protein 2C revealed regions of high similarity. According to secondary structure prediction, the conserved regions were located within a putative central domain of alpha helices and beta strands. This study reveals for the first time an NTPase activity associated with a calicivirus-encoded protein. Based on enzymatic properties and sequence information, a functional relationship between NLV p41 and enterovirus 2C is discussed in regard to the role of 2C-like proteins in virus replication. PMID:11160659
Odors regulate Arc expression in neuronal ensembles engaged in odor processing.
Guthrie, K; Rayhanabad, J; Kuhl, D; Gall, C
2000-06-26
Synaptic activity is critical to developmental and plastic processes that produce long-term changes in neuronal connectivity and function. Genes expressed by neurons in an activity-dependent fashion are of particular interest since the proteins they encode may mediate neuronal plasticity. One such gene encodes the activity-regulated cytoskeleton-associated protein, Arc. The present study evaluated the effects of odor stimulation on Arc expression in rat olfactory bulb. Arc mRNA was rapidly increased in functionally linked cohorts of neurons topographically activated by odor stimuli. These included neurons surrounding individual glomeruli, mitral cells and transynaptically activated granule cells. Dendritic Arc immunoreactivity was also increased in odor-activated glomeruli. Our results suggest that odor regulation of Arc expression may contribute to activity-dependent structural changes associated with olfactory experience.
Silk Materials Functionalized via Genetic Engineering for Biomedical Applications.
Deptuch, Tomasz; Dams-Kozlowska, Hanna
2017-12-12
The great mechanical properties, biocompatibility and biodegradability of silk-based materials make them applicable to the biomedical field. Genetic engineering enables the construction of synthetic equivalents of natural silks. Knowledge about the relationship between the structure and function of silk proteins enables the design of bioengineered silks that can serve as the foundation of new biomaterials. Furthermore, in order to better address the needs of modern biomedicine, genetic engineering can be used to obtain silk-based materials with new functionalities. Sequences encoding new peptides or domains can be added to the sequences encoding the silk proteins. The expression of one cDNA fragment indicates that each silk molecule is related to a functional fragment. This review summarizes the proposed genetic functionalization of silk-based materials that can be potentially useful for biomedical applications.
Covering complete proteomes with X-ray structures: A current snapshot
Mizianty, Marcin J.; Fan, Xiao; Yan, Jing; ...
2014-10-23
Structural genomics programs have developed and applied structure-determination pipelines to a wide range of protein targets, facilitating the visualization of macromolecular interactions and the understanding of their molecular and biochemical functions. The fundamental question of whether three-dimensional structures of all proteins and all functional annotations can be determined using X-ray crystallography is investigated. A first-of-its-kind large-scale analysis of crystallization propensity for all proteins encoded in 1953 fully sequenced genomes was performed. It is shown that current X-ray crystallographic knowhow combined with homology modeling can provide structures for 25% of modeling families (protein clusters for which structural models can be obtainedmore » through homology modeling), with at least one structural model produced for each Gene Ontology functional annotation. The coverage varies between superkingdoms, with 19% for eukaryotes, 35% for bacteria and 49% for archaea, and with those of viruses following the coverage values of their hosts. It is shown that the crystallization propensities of proteomes from the taxonomic superkingdoms are distinct. The use of knowledge-based target selection is shown to substantially increase the ability to produce X-ray structures. It is demonstrated that the human proteome has one of the highest attainable coverage values among eukaryotes, and GPCR membrane proteins suitable for X-ray structure determination were determined.« less
Encoding of contextual fear memory requires de novo proteins in the prelimbic cortex
Rizzo, Valerio; Touzani, Khalid; Raveendra, Bindu L.; Swarnkar, Supriya; Lora, Joan; Kadakkuzha, Beena M.; Liu, Xin-An; Zhang, Chao; Betel, Doron; Stackman, Robert W.; Puthanveettil, Sathyanarayanan V.
2016-01-01
Background Despite our understanding of the significance of the prefrontal cortex in the consolidation of long-term memories (LTM), its role in the encoding of LTM remains elusive. Here we investigated the role of new protein synthesis in the mouse medial prefrontal cortex (mPFC) in encoding contextual fear memory. Methods Because a change in the association of mRNAs to polyribosomes is an indicator of new protein synthesis, we assessed the changes in polyribosome-associated mRNAs in the mPFC following contextual fear conditioning (CFC) in the mouse. Differential gene expression in mPFC was identified by polyribosome profiling (n = 18). The role of new protein synthesis in mPFC was determined by focal inhibition of protein synthesis (n = 131) and by intra-prelimbic cortex manipulation (n = 56) of Homer 3, a candidate identified from polyribosome profiling. Results We identified several mRNAs that are differentially and temporally recruited to polyribosomes in the mPFC following CFC. Inhibition of protein synthesis in the prelimbic (PL), but not in the anterior cingulate cortex (ACC) region of the mPFC immediately after CFC disrupted encoding of contextual fear memory. Intriguingly, inhibition of new protein synthesis in the PL 6 hours after CFC did not impair encoding. Furthermore, expression of Homer 3, an mRNA enriched in polyribosomes following CFC, in the PL constrained encoding of contextual fear memory. Conclusions Our studies identify several molecular substrates of new protein synthesis in the mPFC and establish that encoding of contextual fear memories require new protein synthesis in PL subregion of mPFC. PMID:28503670
GlycomeDB – integration of open-access carbohydrate structure databases
Ranzinger, René; Herget, Stephan; Wetter, Thomas; von der Lieth, Claus-Wilhelm
2008-01-01
Background Although carbohydrates are the third major class of biological macromolecules, after proteins and DNA, there is neither a comprehensive database for carbohydrate structures nor an established universal structure encoding scheme for computational purposes. Funding for further development of the Complex Carbohydrate Structure Database (CCSD or CarbBank) ceased in 1997, and since then several initiatives have developed independent databases with partially overlapping foci. For each database, different encoding schemes for residues and sequence topology were designed. Therefore, it is virtually impossible to obtain an overview of all deposited structures or to compare the contents of the various databases. Results We have implemented procedures which download the structures contained in the seven major databases, e.g. GLYCOSCIENCES.de, the Consortium for Functional Glycomics (CFG), the Kyoto Encyclopedia of Genes and Genomes (KEGG) and the Bacterial Carbohydrate Structure Database (BCSDB). We have created a new database called GlycomeDB, containing all structures, their taxonomic annotations and references (IDs) for the original databases. More than 100000 datasets were imported, resulting in more than 33000 unique sequences now encoded in GlycomeDB using the universal format GlycoCT. Inconsistencies were found in all public databases, which were discussed and corrected in multiple feedback rounds with the responsible curators. Conclusion GlycomeDB is a new, publicly available database for carbohydrate sequences with a unified, all-encompassing structure encoding format and NCBI taxonomic referencing. The database is updated weekly and can be downloaded free of charge. The JAVA application GlycoUpdateDB is also available for establishing and updating a local installation of GlycomeDB. With the advent of GlycomeDB, the distributed islands of knowledge in glycomics are now bridged to form a single resource. PMID:18803830
McBride, Ruth; Fielding, Burtram C.
2012-01-01
A respiratory disease caused by a novel coronavirus, termed the severe acute respiratory syndrome coronavirus (SARS-CoV), was first reported in China in late 2002. The subsequent efficient human-to-human transmission of this virus eventually affected more than 30 countries worldwide, resulting in a mortality rate of ~10% of infected individuals. The spread of the virus was ultimately controlled by isolation of infected individuals and there has been no infections reported since April 2004. However, the natural reservoir of the virus was never identified and it is not known if this virus will re-emerge and, therefore, research on this virus continues. The SARS-CoV genome is about 30 kb in length and is predicted to contain 14 functional open reading frames (ORFs). The genome encodes for proteins that are homologous to known coronavirus proteins, such as the replicase proteins (ORFs 1a and 1b) and the four major structural proteins: nucleocapsid (N), spike (S), membrane (M) and envelope (E). SARS-CoV also encodes for eight unique proteins, called accessory proteins, with no known homologues. This review will summarize the current knowledge on SARS-CoV accessory proteins and will include: (i) expression and processing; (ii) the effects on cellular processes; and (iii) functional studies. PMID:23202509
Hussain, Razak; Kumari, Indu; Sharma, Shikha; Ahmed, Mushtaq; Khan, Tabreiz Ahmad; Akhter, Yusuf
2017-12-01
Trichothecenes are the secondary metabolites produced by Trichoderma spp. Some of these molecules have been reported for their ability to stimulate plant growth by suppressing plant diseases and hence enabling Trichoderma spp. to be efficiently used as biocontrol agents in modern agriculture. Many of the proteins involved in the trichothecenes biosynthetic pathway in Trichoderma spp. are encoded by the genes present in the tri cluster. Tri4 protein catalyzes three consecutive oxygenation reaction steps during biosynthesis of isotrichodiol in the trichothecenes biosynthetic pathway, while tri11 protein catalyzes the C4 hydroxylation of 12, 13-epoxytrichothec-9-ene to produce trichodermol. In the present study, we have homology modelled the three-dimensional structures of tri4 and tri11 proteins. Furthermore, molecular dynamics simulations were carried out to elucidate the mechanism of their action. Both tri4 and tri11 encode for cytochrome P450 monooxygenase like proteins. These data also revealed effector-induced allosteric changes on substrate binding at an alternative binding site and showed potential homotropic negative cooperativity. These analyses also showed that their catalytic mechanism relies on protein-ligand and protein-heme interactions controlled by hydrophobic and hydrogen-bonding interactions which orient the complex in optimal conformation within the active sites.
Prangishvili, David; Vestergaard, Gisle; Häring, Monika; Aramayo, Ricardo; Basta, Tamara; Rachel, Reinhard; Garrett, Roger A
2006-06-23
A novel virus, ATV, of the hyperthermophilic archaeal genus Acidianus has the unique property of undergoing a major morphological development outside of, and independently of, the host cell. Virions are extruded from host cells as lemon-shaped tail-less particles, after which they develop long tails at each pointed end, at temperatures close to that of the natural habitat, 85 degrees C. The extracellularly developed tails constitute tubes, which terminate in an anchor-like structure that is not observed in the tail-less particles. A thin filament is located within the tube, which exhibits a periodic structure. Tail development produces a one half reduction in the volume of the virion, concurrent with a slight expansion of the virion surface. The circular, double-stranded DNA genome contains 62,730 bp and is exceptional for a crenarchaeal virus in that it carries four putative transposable elements as well as genes, which previously have been associated only with archaeal self-transmissable plasmids. In total, it encodes 72 predicted proteins, including 11 structural proteins with molecular masses in the range of 12 to 90 kDa. Several of the larger proteins are rich in coiled coil and/or low complexity sequence domains, which are unusual for archaea. One protein, in particular P800, resembles an intermediate filament protein in its structural properties. It is modified in the two-tailed, but not in the tail-less, virion particles and it may contribute to viral tail development. Exceptionally for a crenarchaeal virus, infection with ATV results either in viral replication and subsequent cell lysis or in conversion of the infected cell to a lysogen. The lysogenic cycle involves integration of the viral genome into the host chromosome, probably facilitated by the virus-encoded integrase and this process can be interrupted by different stress factors.
Maier, Lisa-Katharina; Benz, Juliane; Fischer, Susan; Alstetter, Martina; Jaschinski, Katharina; Hilker, Rolf; Becker, Anke; Allers, Thorsten; Soppa, Jörg; Marchfelder, Anita
2015-10-01
Members of the Sm protein family are important for the cellular RNA metabolism in all three domains of life. The family includes archaeal and eukaryotic Lsm proteins, eukaryotic Sm proteins and archaeal and bacterial Hfq proteins. While several studies concerning the bacterial and eukaryotic family members have been published, little is known about the archaeal Lsm proteins. Although structures for several archaeal Lsm proteins have been solved already more than ten years ago, we still do not know much about their biological function, however one can confidently propose that the archaeal Lsm proteins will also be involved in RNA metabolism. Therefore, we investigated this protein in the halophilic archaeon Haloferax volcanii. The Haloferax genome encodes a single Lsm protein, the lsm gene overlaps and is co-transcribed with the gene for the ribosomal L37.eR protein. Here, we show that the reading frame of the lsm gene contains a promoter which regulates expression of the overlapping rpl37R gene. This rpl37R specific promoter ensures high expression of the rpl37R gene in exponential growth phase. To investigate the biological function of the Lsm protein we generated a lsm deletion mutant that had the coding sequence for the Sm1 motif removed but still contained the internal promoter for the downstream rpl37R gene. The transcriptome of this deletion mutant was compared to the wild type transcriptome, revealing that several genes are down-regulated and many genes are up-regulated in the deletion strain. Northern blot analyses confirmed down-regulation of two genes. In addition, the deletion strain showed a gain of function in swarming, in congruence with the up-regulation of transcripts encoding proteins required for motility. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
The alphabet of intrinsic disorder
Theillet, Francois-Xavier; Kalmar, Lajos; Tompa, Peter; Han, Kyou-Hoon; Selenko, Philipp; Dunker, A. Keith; Daughdrill, Gary W.; Uversky, Vladimir N
2013-01-01
A significant fraction of every proteome is occupied by biologically active proteins that do not form unique three-dimensional structures. These intrinsically disordered proteins (IDPs) and IDP regions (IDPRs) have essential biological functions and are characterized by extensive structural plasticity. Such structural and functional behavior is encoded in the amino acid sequences of IDPs/IDPRs, which are enriched in disorder-promoting residues and depleted in order-promoting residues. In fact, amino acid residues can be arranged according to their disorder-promoting tendency to form an alphabet of intrinsic disorder that defines the structural complexity and diversity of IDPs/IDPRs. This review is the first in a series of publications dedicated to the roles that different amino acid residues play in defining the phenomenon of protein intrinsic disorder. We start with proline because data suggests that of the 20 common amino acid residues, this one is the most disorder-promoting. PMID:28516008
Biomaterials Made from Coiled-Coil Peptides.
Conticello, Vincent; Hughes, Spencer; Modlin, Charles
The development of biomaterials designed for specific applications is an important objective in personalized medicine. While the breadth and prominence of biomaterials have increased exponentially over the past decades, critical challenges remain to be addressed, particularly in the development of biomaterials that exhibit highly specific functions. These functional properties are often encoded within the molecular structure of the component molecules. Proteins, as a consequence of their structural specificity, represent useful substrates for the construction of functional biomaterials through rational design. This chapter provides an in-depth survey of biomaterials constructed from coiled-coils, one of the best-understood protein structural motifs. We discuss the utility of this structurally diverse and functionally tunable class of proteins for the creation of novel biomaterials. This discussion illustrates the progress that has been made in the development of coiled-coil biomaterials by showcasing studies that bridge the gap between the academic science and potential technological impact.
Hoyer, Lois L.; Cota, Ernesto
2016-01-01
Approximately two decades have passed since the description of the first gene in the Candida albicans ALS (agglutinin-like sequence) family. Since that time, much has been learned about the composition of the family and the function of its encoded cell-surface glycoproteins. Solution of the structure of the Als adhesive domain provides the opportunity to evaluate the molecular basis for protein function. This review article is formatted as a series of fundamental questions and explores the diversity of the Als proteins, as well as their role in ligand binding, aggregative effects, and attachment to abiotic surfaces. Interaction of Als proteins with each other, their functional equivalence, and the effects of protein abundance on phenotypic conclusions are also examined. Structural features of Als proteins that may facilitate invasive function are considered. Conclusions that are firmly supported by the literature are presented while highlighting areas that require additional investigation to reveal basic features of the Als proteins, their relatedness to each other, and their roles in C. albicans biology. PMID:27014205
2008-10-13
Furthermore, the encoded protein of this gene is only 30 kDa. A potential GTG start codon at position 625 also encodes a protein that is too small...horizontal bar and putative alternate translation initiation sites (ATG, GTG , and TTG) are indicated. The sizes and locations of the proteins encoded... gray line with rounded rectangles showing sequence features and motifs, including the Ala- and Pro-rich N-terminal region and the C-terminal Cys and
T4-Like Genome Organization of the Escherichia coli O157:H7 Lytic Phage AR1▿†
Liao, Wei-Chao; Ng, Wailap Victor; Lin, I-Hsuan; Syu, Wan-Jr; Liu, Tze-Tze; Chang, Chuan-Hsiung
2011-01-01
We report the genome organization and analysis of the first completely sequenced T4-like phage, AR1, of Escherichia coli O157:H7. Unlike most of the other sequenced phages of O157:H7, which belong to the temperate Podoviridae and Siphoviridae families, AR1 is a T4-like phage known to efficiently infect this pathogenic bacterial strain. The 167,435-bp AR1 genome is currently the largest among all the sequenced E. coli O157:H7 phages. It carries a total of 281 potential open reading frames (ORFs) and 10 putative tRNA genes. Of these, 126 predicted proteins could be classified into six viral orthologous group categories, with at least 18 proteins of the structural protein category having been detected by tandem mass spectrometry. Comparative genomic analysis of AR1 and four other completely sequenced T4-like genomes (RB32, RB69, T4, and JS98) indicated that they share a well-organized and highly conserved core genome, particularly in the regions encoding DNA replication and virion structural proteins. The major diverse features between these phages include the modules of distal tail fibers and the types and numbers of internal proteins, tRNA genes, and mobile elements. Codon usage analysis suggested that the presence of AR1-encoded tRNAs may be relevant to the codon usage of structural proteins. Furthermore, protein sequence analysis of AR1 gp37, a potential receptor binding protein, indicated that eight residues in the C terminus are unique to O157:H7 T4-like phages AR1 and PP01. These residues are known to be located in the T4 receptor recognition domain, and they may contribute to specificity for adsorption to the O157:H7 strain. PMID:21507986
Chen, Zhong-Yuan; Gao, Xiao-Chan; Zhang, Qi-Ya
2015-08-03
Aquareoviruses are serious pathogens of aquatic animals. Here, genome characterization and functional gene analysis of a novel aquareovirus, largemouth bass Micropterus salmoides reovirus (MsReV), was described. It comprises 11 dsRNA segments (S1-S11) covering 24,024 bp, and encodes 12 putative proteins including the inclusion forming-related protein NS87 and the fusion-associated small transmembrane (FAST) protein NS22. The function of NS22 was confirmed by expression in fish cells. Subsequently, MsReV was compared with two representative aquareoviruses, saltwater fish turbot Scophthalmus maximus reovirus (SMReV) and freshwater fish grass carp reovirus strain 109 (GCReV-109). MsReV NS87 and NS22 genes have the same structure and function with those of SMReV, whereas GCReV-109 is either missing the coiled-coil region in NS79 or the gene-encoding NS22. Significant similarities are also revealed among equivalent genome segments between MsReV and SMReV, but a difference is found between MsReV and GCReV-109. Furthermore, phylogenetic analysis showed that 13 aquareoviruses could be divided into freshwater and saline environments subgroups, and MsReV was closely related to SMReV in saline environments. Consequently, these viruses from hosts in saline environments have more genomic structural similarities than the viruses from hosts in freshwater. This is the first study of the relationships between aquareovirus genomic structure and their host environments.
Miller, Matthew S; Furlong, Wendy E; Pennell, Leesa; Geadah, Marc; Hertel, Laura
2010-07-01
The products of numerous open reading frames (ORFs) present in the genome of human cytomegalovirus (CMV) have not been characterized. Here, we describe the identification of a new CMV protein localizing to the nuclear envelope and in cytoplasmic vesicles at late times postinfection. Based on this distinctive localization pattern, we called this new protein nuclear rim-associated cytomegaloviral protein, or RASCAL. Two RASCAL isoforms exist, a short version of 97 amino acids encoded by the majority of CMV strains and a longer version of 176 amino acids encoded by the Towne, Toledo, HAN20, and HAN38 strains. Both isoforms colocalize with lamin B in deep intranuclear invaginations of the inner nuclear membrane (INM) and in novel cytoplasmic vesicular structures possibly derived from the nuclear envelope. INM infoldings have been previously described as sites of nucleocapsid egress, which is mediated by the localized disruption of the nuclear lamina, promoted by the activities of viral and cellular kinases recruited by the lamina-associated proteins UL50 and UL53. RASCAL accumulation at the nuclear membrane required the presence of UL50 but not of UL53. RASCAL and UL50 also appeared to specifically interact, suggesting that RASCAL is a new component of the nuclear egress complex (NEC) and possibly involved in mediating nucleocapsid egress from the nucleus. Finally, the presence of RASCAL within cytoplasmic vesicles raises the intriguing possibility that this protein might participate in additional steps of virion maturation occurring after capsid release from the nucleus.
Zhang, Gang; Li, Yi-Min; Li, Biao; Zhang, Da-Wei; Guo, Shun-Xing
2015-01-01
The zinc-regulated transporters (ZRT), iron-regulated transporter (IRT)-like protein (ZIP) plays an important role in the growth and development of plant. In this study, a full length cDNA of ZIP encoding gene, designed as DoZIP1 (GenBank accession KJ946203), was identified from Dendrobium officinale using RT-PCR and RACE. Bioinformatics analysis showed that DoZIP1 consisted of a 1,056 bp open reading frame (ORF) encoded a 351-aa protein with a molecular weight of 37.57 kDa and an isoelectric point (pI) of 6.09. The deduced DoZIP1 protein contained the conserved ZIP domain, and its secondary structure was composed of 50.71% alpha helix, 11.11% extended strand, 36.18% random coil, and beta turn 1.99%. DoZIP1 protein exhibited a signal peptide and eight transmembrane domains, presumably locating in cell membrane. The amino acid sequence had high homology with ZIP proteins from Arabidopsis, alfalfa and rice. A phylogenetic tree analysis demonstrated that DoZIP1 was closely related to AtZIP10 and OsZIP3, and they were clustered into one clade. Real time quantitative PCR analysis demonstrated that the transcription level of DoZIP1 in D. officinale roots was the highest (4.19 fold higher than that of stems), followed by that of leaves (1.12 fold). Molecular characters of DoZIP1 will be useful for further functional determination of the gene involving in the growth and development of D. officinale.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bajaj, R. Alexandra; Arbing, Mark A.; Shin, Annie
The structure of Msmeg_6760, a protein of unknown function, has been determined. Biochemical and bioinformatics analyses determined that Msmeg_6760 interacts with a protein encoded in the same operon, Msmeg_6762, and predicted that the operon is a toxin–antitoxin (TA) system. Structural comparison of Msmeg_6760 with proteins of known function suggests that Msmeg_6760 binds a hydrophobic ligand in a buried cavity lined by large hydrophobic residues. Access to this cavity could be controlled by a gate–latch mechanism. The function of the Msmeg_6760 toxin is unknown, but structure-based predictions revealed that Msmeg_6760 and Msmeg_6762 are homologous to Rv2034 and Rv2035, a predicted novelmore » TA system involved inMycobacterium tuberculosislatency during macrophage infection. The Msmeg_6760 toxin fold has not been previously described for bacterial toxins and its unique structural features suggest that toxin activation is likely to be mediated by a novel mechanism.« less
Bayram, Özgür; Biesemann, Christoph; Krappmann, Sven; Galland, Paul
2008-01-01
Cryptochromes are blue-light receptors that have presumably evolved from the DNA photolyase protein family, and the genomes of many organisms contain genes for both types of molecules. Both protein structures resemble each other, which suggests that light control and light protection share a common ancient origin. In the genome of the filamentous fungus Aspergillus nidulans, however, only one cryptochrome/photolyase-encoding gene, termed cryA, was identified. Deletion of the cryA gene triggers sexual differentiation under inappropriate culture conditions and results in up-regulation of transcripts encoding regulators of fruiting body formation. CryA is a protein whose N- and C-terminal synthetic green fluorescent protein fusions localize to the nucleus. CryA represses sexual development under UVA350-370 nm light both on plates and in submerged culture. Strikingly, CryA exhibits photorepair activity as demonstrated by heterologous complementation of a DNA repair-deficient Escherichia coli strain as well as overexpression in an A. nidulans uvsBΔ genetic background. This is in contrast to the single deletion cryAΔ strain, which does not show increased sensitivity toward UV-induced damage. In A. nidulans, cryA encodes a novel type of cryptochrome/photolyase that exhibits a regulatory function during light-dependent development and DNA repair activity. This represents a paradigm for the evolutionary transition between photolyases and cryptochromes. PMID:18495868
Pinto-Santini, Delia M.; Salama, Nina R.
2009-01-01
Helicobacter pylori strains harboring the cag pathogenicity island (PAI) have been associated with more severe gastric disease in infected humans. The cag PAI encodes a type IV secretion (T4S) system required for CagA translocation into host cells as well as induction of proinflammatory cytokines, such as interleukin-8 (IL-8). cag PAI genes sharing sequence similarity with T4S components from other bacteria are essential for Cag T4S function. Other cag PAI-encoded genes are also essential for Cag T4S, but lack of sequence-based or structural similarity with genes in existing databases has precluded a functional assignment for the encoded proteins. We have studied the role of one such protein, Cag3 (HP0522), in Cag T4S and determined Cag3 subcellular localization and protein interactions. Cag3 is membrane associated and copurifies with predicted inner and outer membrane Cag T4S components that are essential for Cag T4S as well as putative accessory factors. Coimmunoprecipitation and cross-linking experiments revealed specific interactions with HpVirB7 and CagM, suggesting Cag3 is a new component of the Cag T4S outer membrane subcomplex. Finally, lack of Cag3 lowers HpVirB7 steady-state levels, further indicating Cag3 makes a subcomplex with this protein. PMID:19801411
Complete Mitochondrial Genome of the Medicinal Mushroom Ganoderma lucidum
Chen, Haimei; Chen, Xiangdong; Lan, Jin; Liu, Chang
2013-01-01
Ganoderma lucidum is one of the well-known medicinal basidiomycetes worldwide. The mitochondrion, referred to as the second genome, is an organelle found in most eukaryotic cells and participates in critical cellular functions. Elucidating the structure and function of this genome is important to understand completely the genetic contents of G. lucidum. In this study, we assembled the mitochondrial genome of G. lucidum and analyzed the differential expressions of its encoded genes across three developmental stages. The mitochondrial genome is a typical circular DNA molecule of 60,630 bp with a GC content of 26.67%. Genome annotation identified genes that encode 15 conserved proteins, 27 tRNAs, small and large rRNAs, four homing endonucleases, and two hypothetical proteins. Except for genes encoding trnW and two hypothetical proteins, all genes were located on the positive strand. For the repeat structure analysis, eight forward, two inverted, and three tandem repeats were detected. A pair of fragments with a total length around 5.5 kb was found in both the nuclear and mitochondrial genomes, which suggests the possible transfer of DNA sequences between two genomes. RNA-Seq data for samples derived from three stages, namely, mycelia, primordia, and fruiting bodies, were mapped to the mitochondrial genome and qualified. The protein-coding genes were expressed higher in mycelia or primordial stages compared with those in the fruiting bodies. The rRNA abundances were significantly higher in all three stages. Two regions were transcribed but did not contain any identified protein or tRNA genes. Furthermore, three RNA-editing sites were detected. Genome synteny analysis showed that significant genome rearrangements occurred in the mitochondrial genomes. This study provides valuable information on the gene contents of the mitochondrial genome and their differential expressions at various developmental stages of G. lucidum. The results contribute to the understanding of the functions and evolution of fungal mitochondrial DNA. PMID:23991034
Protein-based materials, toward a new level of structural control.
van Hest, J C; Tirrell, D A
2001-10-07
Through billions of years of evolution nature has created and refined structural proteins for a wide variety of specific purposes. Amino acid sequences and their associated folding patterns combine to create elastic, rigid or tough materials. In many respects, nature's intricately designed products provide challenging examples for materials scientists, but translation of natural structural concepts into bio-inspired materials requires a level of control of macromolecular architecture far higher than that afforded by conventional polymerization processes. An increasingly important approach to this problem has been to use biological systems for production of materials. Through protein engineering, artificial genes can be developed that encode protein-based materials with desired features. Structural elements found in nature, such as beta-sheets and alpha-helices, can be combined with great flexibility, and can be outfitted with functional elements such as cell binding sites or enzymatic domains. The possibility of incorporating non-natural amino acids increases the versatility of protein engineering still further. It is expected that such methods will have large impact in the field of materials science, and especially in biomedical materials science, in the future.
Dohi, Koji; Mise, Kazuyuki; Furusawa, Iwao; Okuno, Tetsuro
2002-11-01
Viral RNA-dependent RNA polymerase (RdRp) plays crucial roles in the genomic replication and subgenomic transcription of Brome mosaic virus (BMV), a positive-stranded RNA plant virus. BMV RdRp is a complex of virus-encoded 1a and 2a proteins and some cellular factors, and associates with the endoplasmic reticulum at an infection-specific structure in the cytoplasm of host cells. In this study, we investigate the gross structure of the active BMV RdRp complex using monoclonal antibodies raised against the 1a and 2a proteins. Immunoprecipitation experiments showed that the intermediate region between the N-terminal methyltransferase-like domain and the C-terminal helicase-like domain of 1a protein, and the N terminus region of 2a protein are exposed on the surface of the solubilized RdRp complex. Inhibition assays for membrane-bound RdRp suggested that the intermediate region between the methyltransferase-like and the helicase-like domains of 1a protein is located at the border of the region buried within a membrane structure or with membrane-associated material.
Kariithi, Henry M; Ince, Ikbal A; Boeren, Sjef; Abd-Alla, Adly M M; Parker, Andrew G; Aksoy, Serap; Vlak, Just M; Oers, Monique M van
2011-11-01
The competence of the tsetse fly Glossina pallidipes (Diptera; Glossinidae) to acquire salivary gland hypertrophy virus (SGHV), to support virus replication and successfully transmit the virus depends on complex interactions between Glossina and SGHV macromolecules. Critical requisites to SGHV transmission are its replication and secretion of mature virions into the fly's salivary gland (SG) lumen. However, secretion of host proteins is of equal importance for successful transmission and requires cataloging of G. pallidipes secretome proteins from hypertrophied and non-hypertrophied SGs. After electrophoretic profiling and in-gel trypsin digestion, saliva proteins were analyzed by nano-LC-MS/MS. MaxQuant/Andromeda search of the MS data against the non-redundant (nr) GenBank database and a G. morsitans morsitans SG EST database, yielded a total of 521 hits, 31 of which were SGHV-encoded. On a false discovery rate limit of 1% and detection threshold of least 2 unique peptides per protein, the analysis resulted in 292 Glossina and 25 SGHV MS-supported proteins. When annotated by the Blast2GO suite, at least one gene ontology (GO) term could be assigned to 89.9% (285/317) of the detected proteins. Five (∼1.8%) Glossina and three (∼12%) SGHV proteins remained without a predicted function after blast searches against the nr database. Sixty-five of the 292 detected Glossina proteins contained an N-terminal signal/secretion peptide sequence. Eight of the SGHV proteins were predicted to be non-structural (NS), and fourteen are known structural (VP) proteins. SGHV alters the protein expression pattern in Glossina. The G. pallidipes SG secretome encompasses a spectrum of proteins that may be required during the SGHV infection cycle. These detected proteins have putative interactions with at least 21 of the 25 SGHV-encoded proteins. Our findings opens venues for developing novel SGHV mitigation strategies to block SGHV infections in tsetse production facilities such as using SGHV-specific antibodies and phage display-selected gut epithelia-binding peptides.
D'Antonio, Matteo; Masseroli, Marco
2009-01-01
Background Alternative splicing has been demonstrated to affect most of human genes; different isoforms from the same gene encode for proteins which differ for a limited number of residues, thus yielding similar structures. This suggests possible correlations between alternative splicing and protein structure. In order to support the investigation of such relationships, we have developed the Alternative Splicing and Protein Structure Scrutinizer (PASS), a Web application to automatically extract, integrate and analyze human alternative splicing and protein structure data sparsely available in the Alternative Splicing Database, Ensembl databank and Protein Data Bank. Primary data from these databases have been integrated and analyzed using the Protein Identifier Cross-Reference, BLAST, CLUSTALW and FeatureMap3D software tools. Results A database has been developed to store the considered primary data and the results from their analysis; a system of Perl scripts has been implemented to automatically create and update the database and analyze the integrated data; a Web interface has been implemented to make the analyses easily accessible; a database has been created to manage user accesses to the PASS Web application and store user's data and searches. Conclusion PASS automatically integrates data from the Alternative Splicing Database with protein structure data from the Protein Data Bank. Additionally, it comprehensively analyzes the integrated data with publicly available well-known bioinformatics tools in order to generate structural information of isoform pairs. Further analysis of such valuable information might reveal interesting relationships between alternative splicing and protein structure differences, which may be significantly associated with different functions. PMID:19828075
The Caulobacter crescentus phage phiCbK: genomics of a canonical phage
2012-01-01
Background The bacterium Caulobacter crescentus is a popular model for the study of cell cycle regulation and senescence. The large prolate siphophage phiCbK has been an important tool in C. crescentus biology, and has been studied in its own right as a model for viral morphogenesis. Although a system of some interest, to date little genomic information is available on phiCbK or its relatives. Results Five novel phiCbK-like C. crescentus bacteriophages, CcrMagneto, CcrSwift, CcrKarma, CcrRogue and CcrColossus, were isolated from the environment. The genomes of phage phiCbK and these five environmental phage isolates were obtained by 454 pyrosequencing. The phiCbK-like phage genomes range in size from 205 kb encoding 318 proteins (phiCbK) to 280 kb encoding 448 proteins (CcrColossus), and were found to contain nonpermuted terminal redundancies of 10 to 17 kb. A novel method of terminal ligation was developed to map genomic termini, which confirmed termini predicted by coverage analysis. This suggests that sequence coverage discontinuities may be useable as predictors of genomic termini in phage genomes. Genomic modules encoding virion morphogenesis, lysis and DNA replication proteins were identified. The phiCbK-like phages were also found to encode a number of intriguing proteins; all contain a clearly T7-like DNA polymerase, and five of the six encode a possible homolog of the C. crescentus cell cycle regulator GcrA, which may allow the phage to alter the host cell’s replicative state. The structural proteome of phage phiCbK was determined, identifying the portal, major and minor capsid proteins, the tail tape measure and possible tail fiber proteins. All six phage genomes are clearly related; phiCbK, CcrMagneto, CcrSwift, CcrKarma and CcrRogue form a group related at the DNA level, while CcrColossus is more diverged but retains significant similarity at the protein level. Conclusions Due to their lack of any apparent relationship to other described phages, this group is proposed as the founding cohort of a new phage type, the phiCbK-like phages. This work will serve as a foundation for future studies on morphogenesis, infection and phage-host interactions in C. crescentus. PMID:23050599
Kuan, Lisa; Schaffer, Jessica N.; Zouzias, Christos D.
2014-01-01
Proteus mirabilis is a Gram-negative enteric bacterium that causes complicated urinary tract infections, particularly in patients with indwelling catheters. Sequencing of clinical isolate P. mirabilis HI4320 revealed the presence of 17 predicted chaperone-usher fimbrial operons. We classified these fimbriae into three groups by their genetic relationship to other chaperone-usher fimbriae. Sixteen of these fimbriae are encoded by all seven currently sequenced P. mirabilis genomes. The predicted protein sequence of the major structural subunit for 14 of these fimbriae was highly conserved (≥95 % identity), whereas three other structural subunits (Fim3A, UcaA and Fim6A) were variable. Further examination of 58 clinical isolates showed that 14 of the 17 predicted major structural subunit genes of the fimbriae were present in most strains (>85 %). Transcription of the predicted major structural subunit genes for all 17 fimbriae was measured under different culture conditions designed to mimic conditions in the urinary tract. The majority of the fimbrial genes were induced during stationary phase, static culture or colony growth when compared to exponential-phase aerated culture. Major structural subunit proteins for six of these fimbriae were detected using MS of proteins sheared from the surface of broth-cultured P. mirabilis, demonstrating that this organism may produce multiple fimbriae within a single culture. The high degree of conservation of P. mirabilis fimbriae stands in contrast to uropathogenic Escherichia coli and Salmonella enterica, which exhibit greater variability in their fimbrial repertoires. These findings suggest there may be evolutionary pressure for P. mirabilis to maintain a large fimbrial arsenal. PMID:24809384
Revolting Developments in Our Understanding of the Organization of the Eukaryotic Genome.
ERIC Educational Resources Information Center
Krider, Hallie M.
1984-01-01
Various typs of DNA are discussed. Areas considered include highly repetitive and satellite sequences, genes encoding, ribosomal RNA, histone protein genes, and dispersed repeated genes that jump. Regulated genetic misbehavior, structure and use of unique genes, and higher order complexities of chromosomes are also discussed. (JN)
Pathogenicity evaluation of different Newcastle disease virus chimeras in 4-week-old chickens
USDA-ARS?s Scientific Manuscript database
Infection with a virulent strain of Newcastle disease virus is considered one of the most important threats to the poultry industry worldwide. The causative virus, Newcastle disease virus, belongs to the Paramyxoviridae family, genus Avulavirus, and its genome encodes for 6 structural proteins: nu...
Protein synthesis in sperm: dialog between mitochondria and cytoplasm.
Gur, Yael; Breitbart, Haim
2008-01-30
Ejaculated sperm are capable of using mRNAs transcripts for protein translation during the final maturation steps before fertilization. In a capacitation-dependent process, nuclear-encoded mRNAs are translated by mitochondrial-type ribosomes while the cytoplasmic translation machinery is not involved. Our findings suggest that new proteins are synthesized to replace degraded proteins while swimming and waiting in the female reproductive tract before fertilization, or produced due to the specific needs of the capacitating spermatozoa. In addition, a growing number of articles have reported evidence for the correlation of nuclear-encoded mRNA and protein synthesis in somatic mitochondria. It is known that all of the proteins necessary for the replication, transcription and translation of the genes encoded in mtDNA are now encoded in the nuclear genome. This genetic investment is far out of proportion to the number of proteins involved, as there have been multiple movements and duplications of genes. However, the evolutionary retention (or secondary uptake) of the mitochondrial machinery for translation of nuclear-encoded mRNAs may shed light on this paradox.
Corsaro, M Michela; Parrilli, Ermenegilda; Lanzetta, Rosa; Naldi, Teresa; Pieretti, Giuseppina; Lindner, Buko; Carpentieri, Andrea; Parrilli, Michelangelo; Tutino, M Luisa
2009-08-01
The role of lipopolysaccharides (LPSs) in the biogenesis of outer membrane proteins have been investigated in several studies. Some of these analyses showed that LPS is required for correct and efficient folding of outer membrane proteins; other studies support the idea of independence of outer membrane proteins biogenesis from LPS structure. In this article, we investigated the involvement of LPS structure in the anomalous aggregation of outer membrane proteins in a E. coli mutant strain (S17-1(lambdapir)). To achieve this aim, the LPS structure of the mutant strain was carefully determined and compared with the E. coli K-12 one. It turned out that LPS of these two strains differs in the inner core for the absence of a heptose residue (HepIII). We demonstrated that this difference is due to a mutation in waaQ, a gene encoding the transferase for the branch heptose HepIII residue. The mutation was complemented to find out if the restoration of LPS structure influenced the observed outer membrane proteins aggregation. Data reported in this work demonstrated that, in E. coli S17-1(lambdapir) there is no influence of LPS structure on the outer membrane proteins inclusion bodies formation.
Human AZU-1 gene, variants thereof and expressed gene products
Chen, Huei-Mei; Bissell, Mina
2004-06-22
A human AZU-1 gene, mutants, variants and fragments thereof. Protein products encoded by the AZU-1 gene and homologs encoded by the variants of AZU-1 gene acting as tumor suppressors or markers of malignancy progression and tumorigenicity reversion. Identification, isolation and characterization of AZU-1 and AZU-2 genes localized to a tumor suppressive locus at chromosome 10q26, highly expressed in nonmalignant and premalignant cells derived from a human breast tumor progression model. A recombinant full length protein sequences encoded by the AZU-1 gene and nucleotide sequences of AZU-1 and AZU-2 genes and variant and fragments thereof. Monoclonal or polyclonal antibodies specific to AZU-1, AZU-2 encoded protein and to AZU-1, or AZU-2 encoded protein homologs.
Crystal Structure of Menin Reveals Binding Site for Mixed Lineage Leukemia (MLL) Protein
DOE Office of Scientific and Technical Information (OSTI.GOV)
Murai, Marcelo J.; Chruszcz, Maksymilian; Reddy, Gireesh
2014-10-02
Menin is a tumor suppressor protein that is encoded by the MEN1 (multiple endocrine neoplasia 1) gene and controls cell growth in endocrine tissues. Importantly, menin also serves as a critical oncogenic cofactor of MLL (mixed lineage leukemia) fusion proteins in acute leukemias. Direct association of menin with MLL fusion proteins is required for MLL fusion protein-mediated leukemogenesis in vivo, and this interaction has been validated as a new potential therapeutic target for development of novel anti-leukemia agents. Here, we report the first crystal structure of menin homolog from Nematostella vectensis. Due to a very high sequence similarity, the Nematostellamore » menin is a close homolog of human menin, and these two proteins likely have very similar structures. Menin is predominantly an {alpha}-helical protein with the protein core comprising three tetratricopeptide motifs that are flanked by two {alpha}-helical bundles and covered by a {beta}-sheet motif. A very interesting feature of menin structure is the presence of a large central cavity that is highly conserved between Nematostella and human menin. By employing site-directed mutagenesis, we have demonstrated that this cavity constitutes the binding site for MLL. Our data provide a structural basis for understanding the role of menin as a tumor suppressor protein and as an oncogenic co-factor of MLL fusion proteins. It also provides essential structural information for development of inhibitors targeting the menin-MLL interaction as a novel therapeutic strategy in MLL-related leukemias.« less
USDA-ARS?s Scientific Manuscript database
Bean pod mottle virus (BPMV) is a bipartite, positive sense (+) RNA plant virus in the Secoviridae family. Its RNA1 encodes proteins required for genome replication, whereas RNA2 primarily encodes proteins needed for virion assembly and cell-to-cell movement. However, the function of a 58 kilo-dalto...
USDA-ARS?s Scientific Manuscript database
The members of Capillovirus genus encode two overlapping open reading frames (ORFs): ORF1 encodes a large polyprotein containing the domains of replication-associated proteins plus a coat protein (CP), and ORF2 encodes a movement protein, located within ORF1 in a different reading frame. Organizatio...
DOE Office of Scientific and Technical Information (OSTI.GOV)
S Kim; S Reddy; B Nelson
The Rv0948c gene from Mycobacterium tuberculosis H{sub 37}R{sub v} encodes a 90 amino acid protein as the natural gene product with chorismate mutase (CM) activity. The protein, 90-MtCM, exhibits Michaelis-Menten kinetics with a k{sub cat} of 5.5 {+-} 0.2 s{sup -1} and a K{sub m} of 1500 {+-} 100 {micro}m at 37 C and pH 7.5. The 2.0 {angstrom} X-ray structure shows that 90-MtCM is an all {alpha}-helical homodimer (Protein Data Bank ID: 2QBV) with the topology of Escherichia coli CM (EcCM), and that both protomers contribute to each catalytic site. Superimposition onto the structure of EcCM and the sequencemore » alignment shows that the C-terminus helix 3 is shortened. The absence of two residues in the active site of 90-MtCM corresponding to Ser84 and Gln88 of EcCM appears to be one reason for the low k{sub cat}. Hence, 90-MtCM belongs to a subfamily of {alpha}-helical AroQ CMs termed AroQ{delta}. The CM gene (y2828) from Yersinia pestis encodes a 186 amino acid protein with an N-terminal signal peptide that directs the protein to the periplasm. The mature protein, *YpCM, exhibits Michaelis-Menten kinetics with a k{sub cat} of 70 {+-} 5 s{sup -1} and Km of 500 {+-} 50 {micro}m at 37 C and pH 7.5. The 2.1 {angstrom} X-ray structure shows that *YpCM is an all {alpha}-helical protein, and functions as a homodimer, and that each protomer has an independent catalytic unit (Protein Data Bank ID: 2GBB). *YpCM belongs to the AroQ{gamma} class of CMs, and is similar to the secreted CM (Rv1885c, *MtCM) from M. tuberculosis.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, S.K.; Robinson, H.; Reddy, S. K.
2008-10-01
The Rv0948c gene from Mycobacterium tuberculosis H{sub 37}R{sub v} encodes a 90 amino acid protein as the natural gene product with chorismate mutase (CM) activity. The protein, 90-MtCM, exhibits Michaelis-Menten kinetics with a k{sub cat} of 5.5 {+-} 0.2 s{sup -1} and a K{sub m} of 1500 {+-} 100 {mu}m at 37 C and pH 7.5. The 2.0 {angstrom} X-ray structure shows that 90-MtCM is an all {alpha}-helical homodimer (Protein Data Bank ID: 2QBV) with the topology of Escherichia coli CM (EcCM), and that both protomers contribute to each catalytic site. Superimposition onto the structure of EcCM and the sequencemore » alignment shows that the C-terminus helix 3 is shortened. The absence of two residues in the active site of 90-MtCM corresponding to Ser84 and Gln88 of EcCM appears to be one reason for the low k{sub cat}. Hence, 90-MtCM belongs to a subfamily of {alpha}-helical AroQ CMs termed AroQ{sub {delta}}. The CM gene (y2828) from Yersinia pestis encodes a 186 amino acid protein with an N-terminal signal peptide that directs the protein to the periplasm. The mature protein, *YpCM, exhibits Michaelis-Menten kinetics with a k{sub cat} of 70 {+-} 5 s{sup -1} and K{sub m} of 500 {+-} 50 {mu}m at 37 C and pH 7.5. The 2.1 {angstrom} X-ray structure shows that *YpCM is an all {alpha}-helical protein, and functions as a homodimer, and that each protomer has an independent catalytic unit (Protein Data Bank ID: 2GBB). *YpCM belongs to the AroQ{sub {gamma}} class of CMs, and is similar to the secreted CM (Rv1885c, *MtCM) from M. tuberculosis.« less
NASA Astrophysics Data System (ADS)
Drillien, Robert; Spehner, Daniele; Kirn, Andre; Giraudon, Pascale; Buckland, Robin; Wild, Fabian; Lecocq, Jean-Pierre
1988-02-01
Vaccinia virus recombinants encoding the hemagglutinin or fusion protein of measles virus have been constructed. Infection of cell cultures with the recombinants led to the synthesis of authentic measles proteins as judged by their electrophoretic mobility, recognition by antibodies, glycosylation, proteolytic cleavage, and presentation on the cell surface. Mice vaccinated with a single dose of the recombinant encoding the hemagglutinin protein developed antibodies capable of both inhibiting hemagglutination activity and neutralizing measles virus, whereas animals vaccinated with the recombinant encoding the fusion protein developed measles neutralizing antibodies. Mice vaccinated with either of the recombinants resisted a normally lethal intracerebral inoculation of a cell-associated measles virus subacute sclerosing panencephalitis strain.
Woolford, Lucy; Rector, Annabel; Van Ranst, Marc; Ducki, Andrea; Bennett, Mark D.; Nicholls, Philip K.; Warren, Kristin S.; Swan, Ralph A.; Wilcox, Graham E.; O'Hara, Amanda J.
2007-01-01
Conservation efforts to prevent the extinction of the endangered western barred bandicoot (Perameles bougainville) are currently hindered by a progressively debilitating cutaneous and mucocutaneous papillomatosis and carcinomatosis syndrome observed in captive and wild populations. In this study, we detected a novel virus, designated the bandicoot papillomatosis carcinomatosis virus type 1 (BPCV1), in lesional tissue from affected western barred bandicoots using multiply primed rolling-circle amplification and PCR with the cutaneotropic papillomavirus primer pairs FAP59/FAP64 and AR-L1F8/AR-L1R9. Sequencing of the BPCV1 genome revealed a novel prototype virus exhibiting genomic properties of both the Papillomaviridae and the Polyomaviridae. Papillomaviral properties included a large genome size (∼7.3 kb) and the presence of open reading frames (ORFs) encoding canonical L1 and L2 structural proteins. The genomic organization in which structural and nonstructural proteins were encoded on different strands of the double-stranded genome and the presence of ORFs encoding the nonstructural proteins large T and small t antigens were, on the other hand, typical polyomaviral features. BPCV1 may represent the first member of a novel virus family, descended from a common ancestor of the papillomaviruses and polyomaviruses recognized today. Alternatively, it may represent the product of ancient recombination between members of these two virus families. The discovery of this virus could have implications for the current taxonomic classification of Papillomaviridae and Polyomaviridae and can provide further insight into the evolution of these ancient virus families. PMID:17898069
Identification of the centromere-specific histone H3 variant in Lotus japonicus.
Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka
2014-03-15
The centromere is a structurally and functionally specialized region present on every eukaryotic chromosome. Lotus japonicus is a model legume species for which there is very limited information on the centromere structure. Here we cloned and characterized the L. japonicus homolog of the centromere-specific histone H3 gene (LjCenH3) encoding a 159-amino acid protein. Using an Agrobacterium-based transformation system, LjCenH3 tagged with a green fluorescent protein was transferred into L. japonicus cells. The centromeric position of LjCENH3 protein was revealed on L. japonicus metaphase chromosomes by an immunofluorescence assay. The identification of LjCenH3 as a critical centromere landmark could pave the way for a better understanding of centromere structure in this model and other agriculturally important legume species. Published by Elsevier B.V.
Purification and characterization of human pancreatic polypeptide expressed in E. coli.
Griko, Y V; Kapanadze, M D
1995-08-04
The region of cDNA encoding human pancreatic polypeptide (hPP) was obtained by polymerase chain reaction (PCR) and subcloned into an expression vector. The pancreatic polypeptide gene was expressed in Escherichia coli in two versions: as a cleavable fusion protein with IgG-binding synthetic ZZ domains of protein A from Staphylococcus aureus or with the 1-48 fragment of lambda Cro repressor. Site-specific hydrolysis by hydroxylamine was used to cleave the fusion protein, releasing the human polypeptide. The structure of the obtained hPP has been studied by scanning microcalorimetry and circular dichroism spectrometry. It has been shown that hPP in solutions close to neutral has a compact and unique spatial structure with an extended hydrophobic core. This structure is stable at 20 degrees C and co-operatively breaks down upon heating from this temperature.
Murakami, Taira; Kanai, Tamotsu; Takata, Hiroki; Kuriki, Takashi; Imanaka, Tadayuki
2006-01-01
Branching enzyme (BE) catalyzes formation of the branch points in glycogen and amylopectin by cleavage of the α-1,4 linkage and its subsequent transfer to the α-1,6 position. We have identified a novel BE encoded by an uncharacterized open reading frame (TK1436) of the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1. TK1436 encodes a conserved protein showing similarity to members of glycoside hydrolase family 57 (GH-57 family). At the C terminus of the TK1436 protein, two copies of a helix-hairpin-helix (HhH) motif were found. TK1436 orthologs are distributed in archaea of the order Thermococcales, cyanobacteria, some actinobacteria, and a few other bacterial species. When recombinant TK1436 protein was incubated with amylose used as the substrate, a product peak was detected by high-performance anion-exchange chromatography, eluting more slowly than the substrate. Isoamylase treatment of the reaction mixture significantly increased the level of short-chain α-glucans, indicating that the reaction product contained many α-1,6 branching points. The TK1436 protein showed an optimal pH of 7.0, an optimal temperature of 70°C, and thermostability up to 90°C, as determined by the iodine-staining assay. These properties were the same when a protein devoid of HhH motifs (the TK1436ΔH protein) was used. The average molecular weight of branched glucan after reaction with the TK1436ΔH protein was over 100 times larger than that of the starting substrate. These results clearly indicate that TK1436 encodes a structurally novel BE belonging to the GH-57 family. Identification of an overlooked BE species provides new insights into glycogen biosynthesis in microorganisms. PMID:16885460
Wang, Pingyang; Qiu, Zhiyong; Xia, Dingguo; Tang, Shunming; Shen, Xingjia; Zhao, Qiaoling
2017-01-01
A new purple quail-like (q-lp) mutant found from the plain silkworm strain 932VR has pigment dots on the epidermis similar to the pigment mutant quail (q). In addition, q-lp mutant larvae are inactive, consume little and grow slowly, with a high death rate and other developmental abnormalities. Pigmentation of the silkworm epidermis consists of melanin, ommochrome and pteridine. Silkworm development is regulated by ecdysone and juvenile hormone. In this study, we performed RNA-Seq on the epidermis of the q-lp mutant in the 4th instar during molting, with 932VR serving as the control. The results showed 515 differentially expressed genes, of which 234 were upregulated and 281 downregulated in q-lp. BLASTGO analysis indicated that the downregulated genes mainly encode protein-binding proteins, membrane components, oxidation/reduction enzymes, and proteolytic enzymes, whereas the upregulated genes largely encode cuticle structural constituents, membrane components, transport related proteins, and protein-binding proteins. Quantitative reverse transcription PCR was used to verify the accuracy of the RNA-Seq data, focusing on key genes for biosynthesis of the three pigments and chitin as well as genes encoding cuticular proteins and several related nuclear receptors, which are thought to play key roles in the q-lp mutant. We drew three conclusions based on the results: 1) melanin, ommochrome and pteridine pigments are all increased in the q-lp mutant; 2) more cuticle proteins are expressed in q-lp than in 932VR, and the number of upregulated cuticular genes is significantly greater than downregulated genes; 3) the downstream pathway regulated by ecdysone is blocked in the q-lp mutant. Our research findings lay the foundation for further research on the developmental changes responsible for the q-lp mutant.
Bryson, Steve; Thomson, Christy A; Risnes, Louise F; Dasgupta, Somnath; Smith, Kenneth; Schrader, John W; Pai, Emil F
2016-06-01
The human Ab response to certain pathogens is oligoclonal, with preferred IgV genes being used more frequently than others. A pair of such preferred genes, IGVK3-11 and IGVH3-30, contributes to the generation of protective Abs directed against the 23F serotype of the pneumonococcal capsular polysaccharide of Streptococcus pneumoniae and against the AD-2S1 peptide of the gB membrane protein of human CMV. Structural analyses of Fab fragments of mAbs 023.102 and pn132p2C05 in complex with portions of the 23F polysaccharide revealed five germline-encoded residues in contact with the key component, l-rhamnose. In the case of the AD-2S1 peptide, the KE5 Fab fragment complex identified nine germline-encoded contact residues. Two of these germline-encoded residues, Arg91L and Trp94L, contact both the l-rhamnose and the AD-2S1 peptide. Comparison of the respective paratopes that bind to carbohydrate and protein reveals that stochastic diversity in both CDR3 loops alone almost exclusively accounts for their divergent specificity. Combined evolutionary pressure by human CMV and the 23F serotype of S. pneumoniae acted on the IGVK3-11 and IGVH3-30 genes as demonstrated by the multiple germline-encoded amino acids that contact both l-rhamnose and AD-2S1 peptide. Copyright © 2016 by The American Association of Immunologists, Inc.
Cytochrome b5 gene and protein of Candida tropicalis and methods relating thereto
Craft, David L.; Madduri, Krishna M.; Loper, John C.
2003-01-01
A novel gene has been isolated which encodes cytochrome b5 (CYTb5) protein of the .omega.-hydroxylase complex of C. tropicalis 20336. Vectors including this gene, and transformed host cells are provided. Methods of increasing the production of a CYTb5 protein are also provided which involve transforming a host cell with a gene encoding this protein and culturing the cells. Methods of increasing the production of a dicarboxylic acid are also provided which involve increasing in the host cell the number of genes encoding this protein.
Elucidation of the structure of retroviral proteases: a reminiscence.
Jaskolski, Mariusz; Miller, Maria; Mohana Rao, J K; Gustchina, Alla; Wlodawer, Alexander
2015-11-01
Determinations of only a very few protein structures had consequences comparable to the impact exerted by the structure of the protease encoded by HIV-1, published just over 25 years ago. The structure of this relatively small protein and its cousins from other retroviruses provided a clear target for a spectacularly successful structure-assisted drug design effort that offered new hope for controlling the then-escalating AIDS epidemic. This reminiscence is limited primarily to work conducted at the National Cancer Institute, and is not meant to be a comprehensive history of the field, but is rather an attempt to provide a very personal account of how the structures of this most thoroughly studied crystallographic target were determined. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.
Advances in RNA Structure Determination | Center for Cancer Research
The recent years have witnessed a revolution in the field of RNA structure and function. Until recently the main contribution of RNA in cellular and disease functions was considered to be a role defined by the central dogma, namely DNA codes for mRNAs, which in turn encode for proteins, a notion facilitated by non-coding ribosomal RNA and tRNA. It was also assumed at the time
Structure-Based Annotation of a Novel Sugar Isomerase from the Pathogenic E. coli O157:H7
DOE Office of Scientific and Technical Information (OSTI.GOV)
van Staalduinen, L.; Park, C; Yeom, S
2010-01-01
Prokaryotes can use a variety of sugars as carbon sources in order to provide a selective survival advantage. The gene z5688 found in the pathogenic Escherichia coli O157:H7 encodes a 'hypothetical' protein of unknown function. Sequence analysis identified the gene product as a putative member of the cupin superfamily of proteins, but no other functional information was known. We have determined the crystal structure of the Z5688 protein at 1.6 {angstrom} resolution and identified the protein as a novel E. coli sugar isomerase (EcSI) through overall fold analysis and secondary-structure matching. Extensive substrate screening revealed that EcSI is capable ofmore » acting on D-lyxose and D-mannose. The complex structure of EcSI with fructose allowed the identification of key active-site residues, and mutagenesis confirmed their importance. The structure of EcSI also suggested a novel mechanism for substrate binding and product release in a cupin sugar isomerase. Supplementation of a nonpathogenic E. coli strain with EcSI enabled cell growth on the rare pentose d-lyxose.« less
Click strategies for single-molecule protein fluorescence.
Milles, Sigrid; Tyagi, Swati; Banterle, Niccolò; Koehler, Christine; VanDelinder, Virginia; Plass, Tilman; Neal, Adrian P; Lemke, Edward A
2012-03-21
Single-molecule methods have matured into central tools for studies in biology. Foerster resonance energy transfer (FRET) techniques, in particular, have been widely applied to study biomolecular structure and dynamics. The major bottleneck for a facile and general application of these studies arises from the need to label biological samples site-specifically with suitable fluorescent dyes. In this work, we present an optimized strategy combining click chemistry and the genetic encoding of unnatural amino acids (UAAs) to overcome this limitation for proteins. We performed a systematic study with a variety of clickable UAAs and explored their potential for high-resolution single-molecule FRET (smFRET). We determined all parameters that are essential for successful single-molecule studies, such as accessibility of the probes, expression yield of proteins, and quantitative labeling. Our multiparameter fluorescence analysis allowed us to gain new insights into the effects and photophysical properties of fluorescent dyes linked to various UAAs for smFRET measurements. This led us to determine that, from the extended tool set that we now present, genetically encoding propargyllysine has major advantages for state-of-the-art measurements compared to other UAAs. Using this optimized system, we present a biocompatible one-step dual-labeling strategy of the regulatory protein RanBP3 with full labeling position freedom. Our technique allowed us then to determine that the region encompassing two FxFG repeat sequences adopts a disordered but collapsed state. RanBP3 serves here as a prototypical protein that, due to its multiple cysteines, size, and partially disordered structure, is not readily accessible to any of the typical structure determination techniques such as smFRET, NMR, and X-ray crystallography.
Extreme disorder in an ultrahigh-affinity protein complex
NASA Astrophysics Data System (ADS)
Borgia, Alessandro; Borgia, Madeleine B.; Bugge, Katrine; Kissling, Vera M.; Heidarsson, Pétur O.; Fernandes, Catarina B.; Sottini, Andrea; Soranno, Andrea; Buholzer, Karin J.; Nettels, Daniel; Kragelund, Birthe B.; Best, Robert B.; Schuler, Benjamin
2018-03-01
Molecular communication in biology is mediated by protein interactions. According to the current paradigm, the specificity and affinity required for these interactions are encoded in the precise complementarity of binding interfaces. Even proteins that are disordered under physiological conditions or that contain large unstructured regions commonly interact with well-structured binding sites on other biomolecules. Here we demonstrate the existence of an unexpected interaction mechanism: the two intrinsically disordered human proteins histone H1 and its nuclear chaperone prothymosin-α associate in a complex with picomolar affinity, but fully retain their structural disorder, long-range flexibility and highly dynamic character. On the basis of closely integrated experiments and molecular simulations, we show that the interaction can be explained by the large opposite net charge of the two proteins, without requiring defined binding sites or interactions between specific individual residues. Proteome-wide sequence analysis suggests that this interaction mechanism may be abundant in eukaryotes.
The poly(C)-binding proteins: a multiplicity of functions and a search for mechanisms.
Makeyev, Aleksandr V; Liebhaber, Stephen A
2002-01-01
The poly(C) binding proteins (PCBPs) are encoded at five dispersed loci in the mouse and human genomes. These proteins, which can be divided into two groups, hnRNPs K/J and the alphaCPs (alphaCP1-4), are linked by a common evolutionary history, a shared triple KH domain configuration, and by their poly(C) binding specificity. Given these conserved characteristics it is remarkable to find a substantial diversity in PCBP functions. The roles of these proteins in mRNA stabilization, translational activation, and translational silencing suggest a complex and diverse set of post-transcriptional control pathways. Their additional putative functions in transcriptional control and as structural components of important DNA-protein complexes further support their remarkable structural and functional versatility. Clearly the identification of additional binding targets and delineation of corresponding control mechanisms and effector pathways will establish highly informative models for further exploration. PMID:12003487
The poly(C)-binding proteins: a multiplicity of functions and a search for mechanisms.
Makeyev, Aleksandr V; Liebhaber, Stephen A
2002-03-01
The poly(C) binding proteins (PCBPs) are encoded at five dispersed loci in the mouse and human genomes. These proteins, which can be divided into two groups, hnRNPs K/J and the alphaCPs (alphaCP1-4), are linked by a common evolutionary history, a shared triple KH domain configuration, and by their poly(C) binding specificity. Given these conserved characteristics it is remarkable to find a substantial diversity in PCBP functions. The roles of these proteins in mRNA stabilization, translational activation, and translational silencing suggest a complex and diverse set of post-transcriptional control pathways. Their additional putative functions in transcriptional control and as structural components of important DNA-protein complexes further support their remarkable structural and functional versatility. Clearly the identification of additional binding targets and delineation of corresponding control mechanisms and effector pathways will establish highly informative models for further exploration.
Qiu, T; Lu, R H; Zhang, J; Zhu, Z Y
2001-07-01
The complete nucleotide sequence of M6 gene of grass carp hemorrhage virus (GCHV) was determined. It is 2039 nucleotides in length and contains a single large open reading frame that could encode a protein of 648 amino acids with predicted molecular mass of 68.7 kDa. Amino acid sequence comparison revealed that the protein encoded by GCHV M6 is closely related to the protein mu1 of mammalian reovirus. The M6 gene, encoding the major outer-capsid protein, was expressed using the pET fusion protein vector in Escherichia coli and detected by Western blotting using chicken anti-GCHV immunoglobulin (IgY). The result indicates that the protein encoded by M6 may share a putative Asn-42-Pro-43 proteolytic cleavage site with mu1.
Halbleib, Jennifer M.; Sääf, Annika M.
2007-01-01
Although there is considerable evidence implicating posttranslational mechanisms in the development of epithelial cell polarity, little is known about the patterns of gene expression and transcriptional regulation during this process. We characterized the temporal program of gene expression during cell–cell adhesion–initiated polarization of human Caco-2 cells in tissue culture, which develop structural and functional polarity similar to that of enterocytes in vivo. A distinctive switch in gene expression patterns occurred upon formation of cell–cell contacts between neighboring cells. Expression of genes involved in cell proliferation was down-regulated concomitant with induction of genes necessary for functional specialization of polarized epithelial cells. Transcriptional up-regulation of these latter genes correlated with formation of important structural and functional features in enterocyte differentiation and establishment of structural and functional cell polarity; components of the apical microvilli were induced as the brush border formed during polarization; as barrier function was established, expression of tight junction transmembrane proteins peaked; transcripts encoding components of the apical, but not the basal-lateral trafficking machinery were increased during polarization. Coordinated expression of genes encoding components of functional cell structures were often observed indicating temporal control of expression and assembly of multiprotein complexes. PMID:17699590
Viral and Cellular mRNA Translation in Coronavirus-Infected Cells
Nakagawa, K.; Lokugamage, K.G.; Makino, S.
2017-01-01
Coronaviruses have large positive-strand RNA genomes that are 5′ capped and 3′ polyadenylated. The 5′-terminal two-thirds of the genome contain two open reading frames (ORFs), 1a and 1b, that together make up the viral replicase gene and encode two large polyproteins that are processed by viral proteases into 15–16 nonstructural proteins, most of them being involved in viral RNA synthesis. ORFs located in the 3′-terminal one-third of the genome encode structural and accessory proteins and are expressed from a set of 5′ leader-containing subgenomic mRNAs that are synthesized by a process called discontinuous transcription. Coronavirus protein synthesis not only involves cap-dependent translation mechanisms but also employs regulatory mechanisms, such as ribosomal frameshifting. Coronavirus replication is known to affect cellular translation, involving activation of stress-induced signaling pathways, and employing viral proteins that affect cellular mRNA translation and RNA stability. This chapter describes our current understanding of the mechanisms involved in coronavirus mRNA translation and changes in host mRNA translation observed in coronavirus-infected cells. PMID:27712623
Porcine parvovirus: DNA sequence and genome organization.
Ranz, A I; Manclús, J J; Díaz-Aroca, E; Casal, J I
1989-10-01
We have determined the nucleotide sequence of an almost full-length clone of porcine parvovirus (PPV). The sequence is 4973 nucleotides (nt) long. The 3' end of virion DNA shows a Y-shaped configuration homologous to rodent parvoviruses. The 5' end of virion DNA shows a repetition of 127 nt at the carboxy terminus of the capsid proteins. The overall organization of the PPV genome is similar to those of other autonomous parvoviruses. There are two large open reading frames (ORFs) that almost entirely cover the genome, both located in the same frame of the complementary strand. The left ORF encodes the non-structural protein NS1 and the right ORF encodes the capsid proteins (VP1, VP2 and VP3). Promoter analysis, location of splicing sites and putative amino acid sequences for the viral proteins show a high homology of PPV with feline panleukopenia virus and canine parvoviruses (FPV and CPV) and rodent parvovirus. Therefore we conclude that PPV is related to the Kilham rat virus (KRV) group of autonomous parvoviruses formed by KRV, minute virus of mice, Lu III, H-1, FPV and CPV.
A genome-wide survey on basic helix-loop-helix transcription factors in giant panda.
Dang, Chunwang; Wang, Yong; Zhang, Debao; Yao, Qin; Chen, Keping
2011-01-01
The giant panda (Ailuropoda melanoleuca) is a critically endangered mammalian species. Studies on functions of regulatory proteins involved in developmental processes would facilitate understanding of specific behavior in giant panda. The basic helix-loop-helix (bHLH) proteins play essential roles in a wide range of developmental processes in higher organisms. bHLH family members have been identified in over 20 organisms, including fruit fly, zebrafish, mouse and human. Our present study identified 107 bHLH family members being encoded in giant panda genome. Phylogenetic analyses revealed that they belong to 44 bHLH families with 46, 25, 15, 4, 11 and 3 members in group A, B, C, D, E and F, respectively, while the remaining 3 members were assigned into "orphan". Compared to mouse, the giant panda does not encode seven bHLH proteins namely Beta3a, Mesp2, Sclerax, S-Myc, Hes5 (or Hes6), EBF4 and Orphan 1. These results provide useful background information for future studies on structure and function of bHLH proteins in the regulation of giant panda development.
Weil, D; Levy, G; Sahly, I; Levi-Acobas, F; Blanchard, S; El-Amraoui, A; Crozet, F; Philippe, H; Abitbol, M; Petit, C
1996-04-16
The gene encoding human myosin VIIA is responsible for Usher syndrome type III (USH1B), a disease which associates profound congenital sensorineural deafness, vestibular dysfunction, and retinitis pigmentosa. The reconstituted cDNA sequence presented here predicts a 2215 amino acid protein with a typical unconventional myosin structure. This protein is expected to dimerize into a two-headed molecule. The C terminus of its tail shares homology with the membrane-binding domain of the band 4.1 protein superfamily. The gene consists of 48 coding exons. It encodes several alternatively spliced forms. In situ hybridization analysis in human embryos demonstrates that the myosin VIIA gene is expressed in the pigment epithelium and the photoreceptor cells of the retina, thus indicating that both cell types may be involved in the USH1B retinal degenerative process. In addition, the gene is expressed in the human embryonic cochlear and vestibular neuroepithelia. We suggest that deafness and vestibular dysfunction in USH1B patients result from a defect in the morphogenesis of the inner ear sensory cell stereocilia.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lassila, JK; Bernstein, SL; Kinney, JN
Bacterial microconnpartnnents (BMCs) sequester enzymes from the cytoplasmic environment by encapsulation inside a selectively permeable protein shell. Bioinformatic analyses indicate that many bacteria encode BMC clusters of unknown function and with diverse combinations of shell proteins. The genome of the halophilic myxobacterium Haliangium ochraceum encodes one of the most atypical sets of shell proteins in terms of composition and primary structure. We found that microconnpartnnent shells could be purified in high yield when all seven H. ochraceum BMC shell genes were expressed from a synthetic operon in Escherichia coll. These shells differ substantially from previously isolated shell systems in thatmore » they are considerably smaller and more homogeneous, with measured diameters of 39 2 nm. The size and nearly uniform geometry allowed the development of a structural model for the shells composed of 260 hexagonal units and 13 hexagons per icosahedral face. We found that new proteins could be recruited to the shells by fusion to a predicted targeting peptide sequence, setting the stage for the use of these remarkably homogeneous shells for applications such as three-dimensional scaffolding and the construction of synthetic BMCs. Our results demonstrate the value of selecting from the diversity of BMC shell building blocks found in genomic sequence data for the construction of novel compartments. (C) 2014 Elsevier Ltd. All rights reserved.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.
1988-01-05
In hyaline membrane disease of premature infants, lack of surfactant leads to pulmonary atelectasis and respiratory distress. Hydrophobic surfactant proteins of M/sub r/ = 5000-14,000 have been isolated from mammalian surfactants which enhance the rate of spreading and the surface tension lowering properties of phospholipids during dynamic compression. The authors have characterized the amino-terminal amino acid sequence of pulmonary proteolipids from ether/ethanol extracts of bovine, canine, and human surfactant. Two distinct peptides were identified and termed SPL(pVal) and SPL(Phe). An oligonucleotide probe based on the valine-rich amino-terminal amino acid sequence of SPL(pVal) was utilized to isolate cDNA and genomic DNAmore » encoding the human protein, termed surfactant proteolipid SPL(pVal) on the basis of its unique polyvaline domain. The primary structure of a precursor protein of 20,870 daltons, containing the SPL(pVal) peptide, was deduced from the nucleotide sequence of the cDNAs. Hybrid-arrested translation and immunoprecipitation of labeled translation products of human mRNA demonstrated a precursor protein, the active hydrophobic peptide being produced by proteolytic processing. Two classes of cDNAs encoding SPL(pVal) were identified. Human SPL(pVal) mRNA was more abundant in the adult than in fetal lung. The SPL(pVal) gene locus was assigned to chromosome 8.« less
Serum amyloid A1: Structure, function and gene polymorphism
Sun, Lei; Ye, Richard D.
2017-01-01
Inducible expression of serum amyloid A (SAA) is a hallmark of the acute-phase response, which is a conserved reaction of vertebrates to environmental challenges such as tissue injury, infection and surgery. Human SAA1 is encoded by one of the four SAA genes and is the best-characterized SAA protein. Initially known as a major precursor of amyloid A (AA), SAA1 has been found to play an important role in lipid metabolism and contributes to bacterial clearance, the regulation of inflammation and tumor pathogenesis. SAA1 has five polymorphic coding alleles (SAA1.1 – SAA1.5) that encode distinct proteins with minor amino acid substitutions. Single nucleotide polymorphism (SNP) has been identified in both the coding and non-coding regions of human SAA1. Despite high levels of sequence homology among these variants, SAA1 polymorphisms have been reported as risk factors of cardiovascular diseases and several types of cancer. A recently solved crystal structure of SAA1.1 reveals a hexameric bundle with each of the SAA1 subunits assuming a 4-helix structure stabilized by the C-terminal tail. Analysis of the native SAA1.1 structure has led to the identification of a competing site for high-density lipoprotein (HDL) and heparin, thus providing the structural basis for a role of heparin and heparan sulfate in the conversion of SAA1 to AA. In this brief review, we compares human SAA1 with other forms of human and mouse SAAs, and discuss how structural and genetic studies of SAA1 have advanced our understanding of the physiological functions of the SAA proteins. PMID:26945629
DNA encoding a DNA repair protein
Petrini, John H.; Morgan, William Francis; Maser, Richard Scott; Carney, James Patrick
2006-08-15
An isolated and purified DNA molecule encoding a DNA repair protein, p95, is provided, as is isolated and purified p95. Also provided are methods of detecting p95 and DNA encoding p95. The invention further provides p95 knock-out mice.
Structure of human POFUT2: insights into thrombospondin type 1 repeat fold and O-fucosylation
Chen, Chun-I; Keusch, Jeremy J; Klein, Dominique; Hess, Daniel; Hofsteenge, Jan; Gut, Heinz
2012-01-01
Protein O-fucosylation is a post-translational modification found on serine/threonine residues of thrombospondin type 1 repeats (TSR). The fucose transfer is catalysed by the enzyme protein O-fucosyltransferase 2 (POFUT2) and >40 human proteins contain the TSR consensus sequence for POFUT2-dependent fucosylation. To better understand O-fucosylation on TSR, we carried out a structural and functional analysis of human POFUT2 and its TSR substrate. Crystal structures of POFUT2 reveal a variation of the classical GT-B fold and identify sugar donor and TSR acceptor binding sites. Structural findings are correlated with steady-state kinetic measurements of wild-type and mutant POFUT2 and TSR and give insight into the catalytic mechanism and substrate specificity. By using an artificial mini-TSR substrate, we show that specificity is not primarily encoded in the TSR protein sequence but rather in the unusual 3D structure of a small part of the TSR. Our findings uncover that recognition of distinct conserved 3D fold motifs can be used as a mechanism to achieve substrate specificity by enzymes modifying completely folded proteins of very wide sequence diversity and biological function. PMID:22588082
A transthyretin-related protein is functionally expressed in Herbaspirillum seropedicae.
Matiollo, Camila; Vernal, Javier; Ecco, Gabriela; Bertoldo, Jean Borges; Razzera, Guilherme; de Souza, Emanuel M; Pedrosa, Fábio O; Terenzi, Hernán
2009-10-02
Transthyretin-related proteins (TRPs) constitute a family of proteins structurally related to transthyretin (TTR) and are found in a large range of bacterial, fungal, plant, invertebrate, and vertebrate species. However, it was recently recognized that both prokaryotic and eukaryotic members of this family are not functionally related to transthyretins. TRPs are in fact involved in the purine catabolic pathway and function as hydroxyisourate hydrolases. An open reading frame encoding a protein similar to the Escherichia coli TRP was identified in Herbaspirillum seropedicae genome (Hs_TRP). It was cloned, overexpressed in E. coli, and purified to homogeneity. Mass spectrometry data confirmed the identity of this protein, and circular dichroism spectrum indicated a predominance of beta-sheet structure, as expected for a TRP. We have demonstrated that Hs_TRP is a 5-hydroxyisourate hydrolase and by site-directed mutagenesis the importance of three conserved catalytic residues for Hs_TRP activity was further confirmed. The production of large quantities of this recombinant protein opens up the possibility of obtaining its 3D-structure and will help further investigations into purine catabolism.
Diab, Ahmed; Foca, Adrien; Zoulim, Fabien; Durantel, David; Andrisani, Ourania
2018-01-01
Virally encoded proteins have evolved to perform multiple functions, and the core protein (HBc) of the hepatitis B virus (HBV) is a perfect example. While HBc is the structural component of the viral nucleocapsid, additional novel functions for the nucleus-localized HBc have recently been described. These results extend for HBc, beyond its structural role, a regulatory function in the viral life cycle and potentially a role in pathogenesis. In this article, we review the diverse roles of HBc in HBV replication and pathogenesis, emphasizing how the unique structure of this protein is key to its various functions. We focus in particular on recent advances in understanding the significance of HBc phosphorylations, its interaction with host proteins and the role of HBc in regulating the transcription of host genes. We also briefly allude to the emerging niche for new direct-acting antivirals targeting HBc, known as Core (protein) Allosteric Modulators (CAMs). Copyright © 2017 Elsevier B.V. All rights reserved.
Crystal structure of the extracellular domain of human myelin protein zero
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Zhigang; Wang, Yong; Yedidi, Ravikiran S.
2012-03-27
Charcot-Marie-Tooth disease (CMT), a hereditary motor and sensory neuropathy, is the most common genetic neuropathy with an incidence of 1 in 2600. Several forms of CMT have been identified arising from different genomic abnormalities such as CMT1 including CMT1A, CMT1B, and CMTX. CMT1 with associated peripheral nervous system (PNS) demyelination, the most frequent diagnosis, demonstrates slowed nerve conduction velocities and segmental demyelination upon nerve biopsy. One of its subtypes, CMT1A, presents a 1.5-Mb duplication in the p11-p12 region of the human chromosome 17 which encodes peripheral myelin protein 22 (PMP22). CMT1B, a less common form, arises from the mutations inmore » the myelin protein zero (MPZ) gene on chromosome 1, region q22-q23, which encodes the major structural component of the peripheral myelin. A rare type of CMT1 has been found recently and is caused by point mutations in early growth response gene 2 (EGR2), encoding a zinc finger transcription factor in Schwann cells. In addition, CMTX, an X-linked form of CMT, arises from a mutation in the connexin-32 gene. Myelin protein zero, associated with CMT1B, is a transmembrane protein of 219 amino acid residues. Human MPZ consists of three domains: 125 residues constitute the glycosylated immunoglobulin-like extracellular domain; 27 residues span the membrane; and 67 residues comprise the highly basic intracellular domain. MPZ makes up approximately 50% of the protein content of myelin, and is expressed predominantly in Schwann cells, the myelinating cell of the PNS. Myelin protein zero, a homophilic adhesion molecule, is a member of the immunoglobulin super-family and is essential for normal myelin structure and function. In addition, MPZ knockout mice displayed abnormal myelin that severely affects the myelination pathway, and overexpression of MPZ causes congenital hypomyelination of peripheral nerves. Myelin protein zero mutations account for {approx}5% of patients with CMT. To date, over 125 different mutations in the MPZ gene leading to peripheral neuropathy in patients have been reported worldwide (http://www.molgen. ua.ac.be/CMTMutations). All identified mutations resulting in a change or deletion of amino acid residues in MPZ give rise to neuropathy with the exception of R215L, which instead causes a benign polymorphism. Furthermore, more detailed analysis has classified the MPZ mutations into two major groups. In the first group, the mutations disrupt the intracellular processing of MPZ and are primarily associated with early onset neuropathy. It has been proposed that the mutated MPZ is trapped inside the cell rather than being transported to the plasma membrane. However, other evidence suggests that the mutated MPZ protein is expressed on the plasma membrane, but dominant-negatively disrupts the structure of myelin. In the second group, the MPZ mutations are associated with late onset neuropathy as these mutations cause only mild demyelination. The underlying mechanism is elusive with the hypothesis being that the second group of mutations cause minor abnormalities in the myelin sheath that over time may lead to aberrant Schwann cell-axon interactions and subsequently to axonal degeneration. The crystal structure of the extracellular domain of human MPZ (hP0ex) fused with maltose binding protein (MBP) is reported at 2.1 {angstrom} resolution. While the crystal structure of rat MPZ extracellular domain (rP0ex) is available, the crystal structure of the human counterpart is useful for the analysis of the two homologs as well as a comparison between the two species. The hP0ex molecule reveals subtle structural variations between two homologs allowing comparison of the human myelin protein zero to that of the rat protein. The alignment of these homologs is shown in Figure 1(a).« less
The λ Integrase Site-specific Recombination Pathway
LANDY, ARTHUR
2017-01-01
The site-specific recombinase encoded by bacteriophage λ (Int) is responsible for integrating and excising the viral chromosome into and out of the chromosome of its Escherichia coli host. Int carries out a reaction that is highly directional, tightly regulated, and depends upon an ensemble of accessory DNA bending proteins acting on 240 bp of DNA encoding 16 protein binding sites. This additional complexity enables two pathways, integrative and excisive recombination, whose opposite, and effectively irreversible, directions are dictated by different physiological and environmental signals. Int recombinase is a heterobivalent DNA binding protein and each of the four Int protomers, within a multiprotein 400 kDa recombinogenic complex, is thought to bind and, with the aid of DNA bending proteins, bridge one arm- and one core-type DNA site. In the 12 years since the publication of the last review focused solely on the λ site-specific recombination pathway in Mobile DNA II, there has been a great deal of progress in elucidating the molecular details of this pathway. The most dramatic advances in our understanding of the reaction have been in the area of X-ray crystallography where protein-DNA structures have now been determined for of all of the DNA-protein interfaces driving the Int pathway. Building on this foundation of structures, it has been possible to derive models for the assembly of components that determine the regulatory apparatus in the P-arm, and for the overall architectures that define excisive and integrative recombinogenic complexes. The most fundamental additional mechanistic insights derive from the application of hexapeptide inhibitors and single molecule kinetics. PMID:26104711
Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun
2016-04-01
The ubiquitin-protein ligases (E3s) directly participate in ubiquitin (Ub) transferring to the target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3s, is characterized as containing a conserved HECT domain of approximately 350 amino acids in the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT gene) in apple. The 13 genes (named as MdUPL1-MdUPL13) from ten different chromosomes were divided into four groups by phylogenetic analysis. Among these groups, the encoding genes in the intron-exon structure and the included additional functional domains were quite different. Notably, the F-box domain was first found in MdUPL7 in plant UPLs. The HECT domain in different MdUPL groups also presented different spatial features and three types of conservative motifs were identified. The promoters of each MdUPL member carried multiple stress-response related elements by cis-acting element analysis. Experimental results demonstrated that the expressions of several MdUPLs were quite sensitive to cold-, drought-, and salt-stresses by qRT-PCR assay. The results of this study helped to elucidate the functions of HECT proteins, especially in Rosaceae plants.
Identification and characterization of vp7 gene in Bombyx mori cytoplasmic polyhedrosis virus.
He, Lei; Hu, Xiaolong; Zhu, Min; Liang, Zi; Chen, Fei; Zhu, Liyuan; Kuang, Sulan; Cao, Guangli; Xue, Renyu; Gong, Chengliang
2017-09-05
The genome of Bombyx mori cytoplasmic polyhedrosis virus (BmCPV) contains 10 double stranded RNA segments (S1-S10). The segment 7 (S7) encodes 50kDa protein which is considered as a structural protein. The expression pattern and function of p50 in the virus life cycle are still unclear. In this study, the viral structural protein 7 (VP7) polyclonal antibody was prepared with immunized mouse to explore the presence of small VP7 gene-encoded proteins in Bombyx mori cytoplasmic polyhedrosis virus. The expression pattern of vp7 gene was investigated by its overexpression in BmN cells. In addition to VP7, supplementary band was identified with western blotting technique. The virion, BmCPV infected cells and midguts were also examined using western blotting technique. 4, 2 and 5 bands were detected in the corresponding samples, respectively. The replication of BmCPV genome in the cultured cells and midgut of silkworm was decreased by reducing the expression level of vp7 gene using RNA interference. In immunoprecipitation experiments, using a polyclonal antiserum directed against the VP7, one additional shorter band in BmCPV infected midguts was detected, and then the band was analyzed with mass spectrum (MS), the MS results showed thatone candidate interacted protein (VP7 voltage-dependent anion-selective channel-like isoform, VDAC) was identified from silkworm. We concluded that the novel viral product was generated with a leaky scanning mechanism and the VDAC may be an interacted protein with VP7. Copyright © 2017 Elsevier B.V. All rights reserved.
van Dijk, Sabine J; Bezold Kooiker, Kristina; Mazzalupo, Stacy; Yang, Yuanzhang; Kostyukova, Alla S; Mustacich, Debbie J; Hoye, Elaine R; Stern, Joshua A; Kittleson, Mark D; Harris, Samantha P
2016-07-01
Mutations in MYBPC3, the gene encoding cardiac myosin binding protein C (cMyBP-C), are a major cause of hypertrophic cardiomyopathy (HCM). While most mutations encode premature stop codons, missense mutations causing single amino acid substitutions are also common. Here we investigated effects of a single proline for alanine substitution at amino acid 31 (A31P) in the C0 domain of cMyBP-C, which was identified as a natural cause of HCM in cats. Results using recombinant proteins showed that the mutation disrupted C0 structure, altered sensitivity to trypsin digestion, and reduced recognition by an antibody that preferentially recognizes N-terminal domains of cMyBP-C. Western blots detecting A31P cMyBP-C in myocardium of cats heterozygous for the mutation showed a reduced amount of A31P mutant protein relative to wild-type cMyBP-C, but the total amount of cMyBP-C was not different in myocardium from cats with or without the A31P mutation indicating altered rates of synthesis/degradation of A31P cMyBP-C. Also, the mutant A31P cMyBP-C was properly localized in cardiac sarcomeres. These results indicate that reduced protein expression (haploinsufficiency) cannot account for effects of the A31P cMyBP-C mutation and instead suggest that the A31P mutation causes HCM through a poison polypeptide mechanism that disrupts cMyBP-C or myocyte function. Copyright © 2016 Elsevier Inc. All rights reserved.
Feltes, Bruno César; Bonatto, Diego
2015-01-01
The xeroderma pigmentosum complementation group proteins (XPs), which include XPA through XPG, play a critical role in coordinating and promoting global genome and transcription-coupled nucleotide excision repair (GG-NER and TC-NER, respectively) pathways in eukaryotic cells. GG-NER and TC-NER are both required for the repair of bulky DNA lesions, such as those induced by UV radiation. Mutations in genes that encode XPs lead to the clinical condition xeroderma pigmentosum (XP). Although the roles of XPs in the GG-NER/TC-NER subpathways have been extensively studied, complete knowledge of their three-dimensional structure is only beginning to emerge. Hence, this review aims to summarize the current knowledge of mapped mutations and other structural information on XP proteins that influence their function and protein-protein interactions. We also review the possible post-translational modifications for each protein and the impact of these modifications on XP protein functions. Copyright © 2014 Elsevier B.V. All rights reserved.
Tubulin homolog TubZ in a phage-encoded partition system
Oliva, María A.; Martin-Galiano, Antonio J.; Sakaguchi, Yoshihiko; Andreu, José M.
2012-01-01
Partition systems are responsible for the process whereby large and essential plasmids are accurately positioned to daughter cells during bacterial division. They are typically made of three components: a centromere-like DNA zone, an adaptor protein, and an assembling protein that is either a Walker-box ATPase (type I) or an actin-like ATPase (type II). A recently described type III segregation system has a tubulin/FtsZ-like protein, called TubZ, for plasmid movement. Here, we present the 2.3 Å structure and dynamic assembly of a TubZ tubulin homolog from a bacteriophage and unravel the Clostridium botulinum phage c-st type III partition system. Using biochemical and biophysical approaches, we prove that a gene upstream from tubZ encodes the partner TubR and localize the centromeric region (tubS), both of which are essential for anchoring phage DNA to the motile TubZ filaments. Finally, we describe a conserved fourth component, TubY, which modulates the TubZ-R-S complex interaction. PMID:22538818
Sun, Yan-Lin; Hong, Soon-Kwan
2012-08-01
Sea buckthorn (Hippophae rhamnoides L.) is naturally distributed from Asia to Europe. It has been widely planted as an ornamental shrub and is rich in nutritional and medicinal compounds. Fungal pathogens that cause diseases such as dried-shrink disease are threats to the production of this plant. In this study, we isolated the dried-shrink disease pathogen from bark and total chitinase protein from leaves of infected plants. The results of the Oxford Cup experiment suggested that chitinase protein inhibited the growth of this pathogen. To improve pathogen resistance, we cloned chitinase Class I and III genes in H. rhamnoides, designated Hrchi1 and Hrchi3. The full-length cDNA of the open reading frame region of Hrchi1 contained 903 bp encoding 300 amino acids and Hrchi3 contained 894 bp encoding 297 amino acids. Active domain analysis, protein types, and secondary and 3D structures were predicted using online software.
The Complex Transcriptional Response of Acaryochloris marina to Different Oxygen Levels.
Hernández-Prieto, Miguel A; Lin, Yuankui; Chen, Min
2017-02-09
Ancient oxygenic photosynthetic prokaryotes produced oxygen as a waste product, but existed for a long time under an oxygen-free (anoxic) atmosphere, before an oxic atmosphere emerged. The change in oxygen levels in the atmosphere influenced the chemistry and structure of many enzymes that contained prosthetic groups that were inactivated by oxygen. In the genome of Acaryochloris marina , multiple gene copies exist for proteins that are normally encoded by a single gene copy in other cyanobacteria. Using high throughput RNA sequencing to profile transcriptome responses from cells grown under microoxic and hyperoxic conditions, we detected 8446 transcripts out of the 8462 annotated genes in the Cyanobase database. Two-thirds of the 50 most abundant transcripts are key proteins in photosynthesis. Microoxic conditions negatively affected the levels of expression of genes encoding photosynthetic complexes, with the exception of some subunits. In addition to the known regulation of the multiple copies of psbA , we detected a similar transcriptional pattern for psbJ and psbU , which might play a key role in the altered components of photosystem II. Furthermore, regulation of genes encoding proteins important for reactive oxygen species-scavenging is discussed at genome level, including, for the first time, specific small RNAs having possible regulatory roles under varying oxygen levels. Copyright © 2017 Hernandez-Prieto et al.
Genetically-encoded Molecular Probes to Study G Protein-coupled Receptors
Naganathan, Saranga; Grunbeck, Amy; Tian, He; Huber, Thomas; Sakmar, Thomas P.
2013-01-01
To facilitate structural and dynamic studies of G protein-coupled receptor (GPCR) signaling complexes, new approaches are required to introduce informative probes or labels into expressed receptors that do not perturb receptor function. We used amber codon suppression technology to genetically-encode the unnatural amino acid, p-azido-L-phenylalanine (azF) at various targeted positions in GPCRs heterologously expressed in mammalian cells. The versatility of the azido group is illustrated here in different applications to study GPCRs in their native cellular environment or under detergent solubilized conditions. First, we demonstrate a cell-based targeted photocrosslinking technology to identify the residues in the ligand-binding pocket of GPCR where a tritium-labeled small-molecule ligand is crosslinked to a genetically-encoded azido amino acid. We then demonstrate site-specific modification of GPCRs by the bioorthogonal Staudinger-Bertozzi ligation reaction that targets the azido group using phosphine derivatives. We discuss a general strategy for targeted peptide-epitope tagging of expressed membrane proteins in-culture and its detection using a whole-cell-based ELISA approach. Finally, we show that azF-GPCRs can be selectively tagged with fluorescent probes. The methodologies discussed are general, in that they can in principle be applied to any amino acid position in any expressed GPCR to interrogate active signaling complexes. PMID:24056801
The dhnA gene of Escherichia coli encodes a class I fructose bisphosphate aldolase.
Thomson, G J; Howlett, G J; Ashcroft, A E; Berry, A
1998-01-01
The gene encoding the Escherichia coli Class I fructose-1, 6-bisphosphate aldolase (FBP aldolase) has been cloned and the protein overproduced in high amounts. This gene sequence has previously been identified as encoding an E. coli dehydrin in the GenBanktrade mark database [gene dhnA; entry code U73760; Close and Choi (1996) Submission to GenBanktrade mark]. However, the purified protein overproduced from the dhnA gene shares all its properties with those known for the E. coli Class I FBP aldolase. The protein is an 8-10-mer with a native molecular mass of approx. 340 kDa, each subunit consisting of 349 amino acids. The Class I enzyme shows low sequence identity with other known FBP aldolases, both Class I and Class II (in the order of 20%), which may be reflected by some novel properties of this FBP aldolase. The active-site peptide has been isolated and the Schiff-base-forming lysine residue (Lys236) has been identified by a combination of site-directed mutagenesis, kinetics and electrospray-ionization MS. A second lysine residue (Lys238) has been implicated in substrate binding. The cloning of this gene and the high levels of overexpression obtained will facilitate future structure-function studies. PMID:9531482
The Complex Transcriptional Response of Acaryochloris marina to Different Oxygen Levels
Hernández-Prieto, Miguel A.; Lin, Yuankui; Chen, Min
2016-01-01
Ancient oxygenic photosynthetic prokaryotes produced oxygen as a waste product, but existed for a long time under an oxygen-free (anoxic) atmosphere, before an oxic atmosphere emerged. The change in oxygen levels in the atmosphere influenced the chemistry and structure of many enzymes that contained prosthetic groups that were inactivated by oxygen. In the genome of Acaryochloris marina, multiple gene copies exist for proteins that are normally encoded by a single gene copy in other cyanobacteria. Using high throughput RNA sequencing to profile transcriptome responses from cells grown under microoxic and hyperoxic conditions, we detected 8446 transcripts out of the 8462 annotated genes in the Cyanobase database. Two-thirds of the 50 most abundant transcripts are key proteins in photosynthesis. Microoxic conditions negatively affected the levels of expression of genes encoding photosynthetic complexes, with the exception of some subunits. In addition to the known regulation of the multiple copies of psbA, we detected a similar transcriptional pattern for psbJ and psbU, which might play a key role in the altered components of photosystem II. Furthermore, regulation of genes encoding proteins important for reactive oxygen species-scavenging is discussed at genome level, including, for the first time, specific small RNAs having possible regulatory roles under varying oxygen levels. PMID:27974439
The versatile DNA nucleotide excision repair (NER) and its medical significance.
Falik-Zaccai, Tzipora C; Keren, Zohar; Slor, Hanoch
2009-12-01
Two of DNA's worst enemies, ultraviolet light and chemical carcinogens, can cause damage to the molecule by mutating individual nucleotides or changing its physical structure. In most cases, genomic integrity is restored by specialized suites of proteins dedicated to repairing specific types of injuries. One restoration mechanism, called nucleotide excision repair (NER), recruits and coordinates the services of 20-30 proteins to recognize and remove structure-impairing lesions, including those induced by ultraviolet (UV) light. Mutations in a gene that encodes a protein from the NER machinery might cause a wide variety of rare inherited human disorders. Sun sensitivity, cancer, developmental retardation, neurodegeneration and premature aging characterize these syndromes. Identification of the causative genes and proteins in affected families in Israel allowed us to establish accurate molecular diagnosis of couples at risk, and provide them with better genetic counseling.
Haataja, Tatu J K; Koski, M Kristian; Hiltunen, J Kalervo; Glumoff, Tuomo
2011-05-01
All of the peroxisomal β-oxidation pathways characterized thus far house at least one MFE (multifunctional enzyme) catalysing two out of four reactions of the spiral. MFE type 2 proteins from various species display great variation in domain composition and predicted substrate preference. The gene CG3415 encodes for Drosophila melanogaster MFE-2 (DmMFE-2), complements the Saccharomyces cerevisiae MFE-2 deletion strain, and the recombinant protein displays both MFE-2 enzymatic activities in vitro. The resolved crystal structure is the first one for a full-length MFE-2 revealing the assembly of domains, and the data can also be transferred to structure-function studies for other MFE-2 proteins. The structure explains the necessity of dimerization. The lack of substrate channelling is proposed based on both the structural features, as well as by the fact that hydration and dehydrogenation activities of MFE-2, if produced as separate enzymes, are equally efficient in catalysis as the full-length MFE-2.
Plett, Jonathan M.; Yin, Hengfu; Mewalal, Ritesh; ...
2017-03-23
During symbiosis, organisms use a range of metabolic and protein-based signals to communicate. Of these protein signals, one class is defined as ‘effectors’, i.e., small secreted proteins (SSPs) that cause phenotypical and physiological changes in another organism. To date, protein-based effectors have been described in aphids, nematodes, fungi and bacteria. Using RNA sequencing of Populus trichocarpa roots in mutualistic symbiosis with the ectomycorrhizal fungus Laccaria bicolor, we sought to determine if host plants also contain genes encoding effector-like proteins. We identified 417 plant-encoded putative SSPs that were significantly regulated during this interaction, including 161 SSPs specific to P. trichocarpa andmore » 15 SSPs exhibiting expansion in Populus and closely related lineages. We demonstrate that a subset of these SSPs can enter L. bicolor hyphae, localize to the nucleus and affect hyphal growth and morphology. Finally, we conclude that plants encode proteins that appear to function as effector proteins that may regulate symbiotic associations.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Plett, Jonathan M.; Yin, Hengfu; Mewalal, Ritesh
During symbiosis, organisms use a range of metabolic and protein-based signals to communicate. Of these protein signals, one class is defined as ‘effectors’, i.e., small secreted proteins (SSPs) that cause phenotypical and physiological changes in another organism. To date, protein-based effectors have been described in aphids, nematodes, fungi and bacteria. Using RNA sequencing of Populus trichocarpa roots in mutualistic symbiosis with the ectomycorrhizal fungus Laccaria bicolor, we sought to determine if host plants also contain genes encoding effector-like proteins. We identified 417 plant-encoded putative SSPs that were significantly regulated during this interaction, including 161 SSPs specific to P. trichocarpa andmore » 15 SSPs exhibiting expansion in Populus and closely related lineages. We demonstrate that a subset of these SSPs can enter L. bicolor hyphae, localize to the nucleus and affect hyphal growth and morphology. Finally, we conclude that plants encode proteins that appear to function as effector proteins that may regulate symbiotic associations.« less
P-proteins in Arabidopsis are heteromeric structures involved in rapid sieve tube sealing
Jekat, Stephan B.; Ernst, Antonia M.; von Bohl, Andreas; Zielonka, Sascia; Twyman, Richard M.; Noll, Gundula A.; Prüfer, Dirk
2013-01-01
Structural phloem proteins (P-proteins) are characteristic components of the sieve elements in all dicotyledonous and many monocotyledonous angiosperms. Tobacco P-proteins were recently confirmed to be encoded by the widespread sieve element occlusion (SEO) gene family, and tobacco SEO proteins were shown to be directly involved in sieve tube sealing thus preventing the loss of photosynthate. Analysis of the two Arabidopsis SEO proteins (AtSEOa and AtSEOb) indicated that the corresponding P-protein subunits do not act in a redundant manner. However, there are still pending questions regarding the interaction properties and specific functions of AtSEOa and AtSEOb as well as the general function of structural P-proteins in Arabidopsis. In this study, we characterized the Arabidopsis P-proteins in more detail. We used in planta bimolecular fluorescence complementation assays to confirm the predicted heteromeric interactions between AtSEOa and AtSEOb. Arabidopsis mutants depleted for one or both AtSEO proteins lacked the typical P-protein structures normally found in sieve elements, underlining the identity of AtSEO proteins as P-proteins and furthermore providing the means to determine the role of Arabidopsis P-proteins in sieve tube sealing. We therefore developed an assay based on phloem exudation. Mutants with reduced AtSEO expression levels lost twice as much photosynthate following injury as comparable wild-type plants, confirming that Arabidopsis P-proteins are indeed involved in sieve tube sealing. PMID:23840197
A Bacteriophage-Related Chimeric Marine Virus Infecting Abalone
Zhuang, Jun; Cai, Guiqin; Lin, Qiying; Wu, Zujian; Xie, Lianhui
2010-01-01
Marine viruses shape microbial communities with the most genetic diversity in the sea by multiple genetic exchanges and infect multiple marine organisms. Here we provide proof from experimental infection that abalone shriveling syndrome-associated virus (AbSV) can cause abalone shriveling syndrome. This malady produces histological necrosis and abnormally modified macromolecules (hemocyanin and ferritin). The AbSV genome is a 34.952-kilobase circular double-stranded DNA, containing putative genes with similarity to bacteriophages, eukaryotic viruses, bacteria and endosymbionts. Of the 28 predicted open reading frames (ORFs), eight ORF-encoded proteins have identifiable functional homologues. The 4 ORF products correspond to a predicted terminase large subunit and an endonuclease in bacteriophage, and both an integrase and an exonuclease from bacteria. The other four proteins are homologous to an endosymbiont-derived helicase, primase, single-stranded binding (SSB) protein, and thymidylate kinase, individually. Additionally, AbSV exhibits a common gene arrangement similar to the majority of bacteriophages. Unique to AbSV, the viral genome also contains genes associated with bacterial outer membrane proteins and may lack the structural protein-encoding ORFs. Genomic characterization of AbSV indicates that it may represent a transitional form of microbial evolution from viruses to bacteria. PMID:21079776
The enhanced cyan fluorescent protein: a sensitive pH sensor for fluorescence lifetime imaging.
Poëa-Guyon, Sandrine; Pasquier, Hélène; Mérola, Fabienne; Morel, Nicolas; Erard, Marie
2013-05-01
pH is an important parameter that affects many functions of live cells, from protein structure or function to several crucial steps of their metabolism. Genetically encoded pH sensors based on pH-sensitive fluorescent proteins have been developed and used to monitor the pH of intracellular compartments. The quantitative analysis of pH variations can be performed either by ratiometric or fluorescence lifetime detection. However, most available genetically encoded pH sensors are based on green and yellow fluorescent proteins and are not compatible with multicolor approaches. Taking advantage of the strong pH sensitivity of enhanced cyan fluorescent protein (ECFP), we demonstrate here its suitability as a sensitive pH sensor using fluorescence lifetime imaging. The intracellular ECFP lifetime undergoes large changes (32 %) in the pH 5 to pH 7 range, which allows accurate pH measurements to better than 0.2 pH units. By fusion of ECFP with the granular chromogranin A, we successfully measured the pH in secretory granules of PC12 cells, and we performed a kinetic analysis of intragranular pH variations in living cells exposed to ammonium chloride.
Yang, Bingye; Pu, Fei; Qin, Ji; You, Weiwei; Ke, Caihuan
2014-03-10
During a large-scale screen of the larval transcriptome library of the Portuguese oyster, Crassostrea angulata, the oyster gene RACK, which encodes a receptor of activated protein kinase C protein was isolated and characterized. The cDNA is 1,148 bp long and has a predicted open reading frame encoding 317 aa. The predicted protein shows high sequence identity to many RACK proteins of different organisms including molluscs, fish, amphibians and mammals, suggesting that it is conserved during evolution. The structural analysis of the Ca-RACK1 genomic sequence implies that the Ca-RACK1 gene has seven exons and six introns, extending approximately 6.5 kb in length. It is expressed ubiquitously in many oyster tissues as detected by RT-PCR analysis. The Ca-RACK1 mRNA expression pattern was markedly increased at larval metamorphosis; and was further increased along with Ca-RACK1 protein synthesis during epinephrine-induced metamorphosis. These results indicate that the Ca-RACK1 plays an important role in tissue differentiation and/or in cell growth during larval metamorphosis in the oyster, C. angulata. Copyright © 2013 Elsevier B.V. All rights reserved.
1990-01-01
The major histological components of the hair follicle are the hair cortex and cuticle. The hair cuticle cells encase and protect the cortex and undergo a different developmental program to that of the cortex. We report the molecular characterization of a set of evolutionarily conserved hair genes which are transcribed in the hair cuticle late in follicle development. Two genes were isolated and characterized, one expressed in the human follicle and one in the sheep follicle. Each gene encodes a small protein of 16 kD, containing greater than 50 cysteine residues, ranging from 31 to 36 mol% cysteine. Their high cysteine content and in vitro expression data identify them as ultra-high-sulfur (UHS) keratin proteins. The predicted proteins are composed almost entirely of cysteine-rich and glycine-rich repeats. Genomic blots reveal that the UHS keratin proteins are encoded by related multigene families in both the human and sheep genomes. Tissue in situ hybridization demonstrates that the expression of both genes is localized to the hair fiber cuticle and occurs at a late stage in fiber morphogenesis. PMID:1703541
Identification of functional elements and regulatory circuits by Drosophila modENCODE
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.
2010-12-22
To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- andmore » tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology. The functions of {approx}40% of the protein and nonprotein-coding genes [FlyBase 5.12 (4)] have been determined from cDNA collections (5, 6), manual curation of gene models (7), gene mutations and comprehensive genome-wide RNA interference screens (8-10), and comparative genomic analyses (11, 12). The Drosophila modENCODE project has generated more than 700 data sets that profile transcripts, histone modifications and physical nucleosome properties, general and specific transcription factors (TFs), and replication programs in cell lines, isolated tissues, and whole organisms across several developmental stages (Fig. 1). Here, we computationally integrate these data sets and report (i) improved and additional genome annotations, including full-length proteincoding genes and peptides as short as 21 amino acids; (ii) noncoding transcripts, including 132 candidate structural RNAs and 1608 nonstructural transcripts; (iii) additional Argonaute (Ago)-associated small RNA genes and pathways, including new microRNAs (miRNAs) encoded within protein-coding exons and endogenous small interfering RNAs (siRNAs) from 3-inch untranslated regions; (iv) chromatin 'states' defined by combinatorial patterns of 18 chromatin marks that are associated with distinct functions and properties; (v) regions of high TF occupancy and replication activity with likely epigenetic regulation; (vi)mixed TF and miRNA regulatory networks with hierarchical structure and enriched feed-forward loops; (vii) coexpression- and co-regulation-based functional annotations for nearly 3000 genes; (viii) stage- and tissue-specific regulators; and (ix) predictive models of gene expression levels and regulator function.« less
van der Ley, P
1988-11-01
Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Yeung, Yik Andy; Foletti, Davide; Deng, Xiaodi; Abdiche, Yasmina; Strop, Pavel; Glanville, Jacob; Pitts, Steven; Lindquist, Kevin; Sundar, Purnima D; Sirota, Marina; Hasa-Moreno, Adela; Pham, Amber; Melton Witt, Jody; Ni, Irene; Pons, Jaume; Shelton, David; Rajpal, Arvind; Chaparro-Riggers, Javier
2016-11-18
Staphylococcus aureus is both an important pathogen and a human commensal. To explore this ambivalent relationship between host and microbe, we analysed the memory humoral response against IsdB, a protein involved in iron acquisition, in four healthy donors. Here we show that in all donors a heavily biased use of two immunoglobulin heavy chain germlines generated high affinity (pM) antibodies that neutralize the two IsdB NEAT domains, IGHV4-39 for NEAT1 and IGHV1-69 for NEAT2. In contrast to the typical antibody/antigen interactions, the binding is primarily driven by the germline-encoded hydrophobic CDRH-2 motifs of IGHV1-69 and IGHV4-39, with a binding mechanism nearly identical for each antibody derived from different donors. Our results suggest that IGHV1-69 and IGHV4-39, while part of the adaptive immune system, may have evolved under selection pressure to encode a binding motif innately capable of recognizing and neutralizing a structurally conserved protein domain involved in pathogen iron acquisition.
NASA Astrophysics Data System (ADS)
Ng, Siuk-Mun; Lee, Xin-Wei; Wan, Kiew-Lian; Firdaus-Raih, Mohd
2015-09-01
Regulation of functional nucleus-encoded proteins targeting the plastidial functions was comparatively studied for a plant parasite, Rafflesia cantleyi versus a photosynthetic plant, Arabidopsis thaliana. This study involved two species of different feeding modes and different developmental stages. A total of 30 nucleus-encoded proteins were found to be differentially-regulated during two stages in the parasite; whereas 17 nucleus-encoded proteins were differentially-expressed during two developmental stages in Arabidopsis thaliana. One notable finding observed for the two plants was the identification of genes involved in the regulation of photosynthesis-related processes where these processes, as expected, seem to be present only in the autotroph.
Smith, Nicola L; Taylor, Edward J; Lindsay, Anna-Marie; Charnock, Simon J; Turkenburg, Johan P; Dodson, Eleanor J; Davies, Gideon J; Black, Gary W
2005-12-06
Streptococcus pyogenes (group A Streptococcus) causes severe invasive infections including scarlet fever, pharyngitis (streptococcal sore throat), skin infections, necrotizing fasciitis (flesh-eating disease), septicemia, erysipelas, cellulitis, acute rheumatic fever, and toxic shock. The conversion from nonpathogenic to toxigenic strains of S. pyogenes is frequently mediated by bacteriophage infection. One of the key bacteriophage-encoded virulence factors is a putative "hyaluronidase," HylP1, a phage tail-fiber protein responsible for the digestion of the S. pyogenes hyaluronan capsule during phage infection. Here we demonstrate that HylP1 is a hyaluronate lyase. The 3D structure, at 1.8-angstroms resolution, reveals an unusual triple-stranded beta-helical structure and provides insight into the structural basis for phage tail assembly and the role of phage tail proteins in virulence. Unlike the triple-stranded beta-helix assemblies of the bacteriophage T4 injection machinery and the tailspike endosialidase of the Escherichia coli K1 bacteriophage K1F, HylP1 possesses three copies of the active center on the triple-helical fiber itself without the need for an accessory catalytic domain. The triple-stranded beta-helix is not simply a structural scaffold, as previously envisaged; it is harnessed to provide a 200-angstroms-long substrate-binding groove for the optimal reduction in hyaluronan viscosity to aid phage penetration of the capsule.
Silk Materials Functionalized via Genetic Engineering for Biomedical Applications
Deptuch, Tomasz
2017-01-01
The great mechanical properties, biocompatibility and biodegradability of silk-based materials make them applicable to the biomedical field. Genetic engineering enables the construction of synthetic equivalents of natural silks. Knowledge about the relationship between the structure and function of silk proteins enables the design of bioengineered silks that can serve as the foundation of new biomaterials. Furthermore, in order to better address the needs of modern biomedicine, genetic engineering can be used to obtain silk-based materials with new functionalities. Sequences encoding new peptides or domains can be added to the sequences encoding the silk proteins. The expression of one cDNA fragment indicates that each silk molecule is related to a functional fragment. This review summarizes the proposed genetic functionalization of silk-based materials that can be potentially useful for biomedical applications. PMID:29231863
Variola virus immune evasion proteins.
Dunlop, Lance R; Oehlberg, Katherine A; Reid, Jeremy J; Avci, Dilek; Rosengard, Ariella M
2003-09-01
Variola virus, the causative agent of smallpox, encodes approximately 200 proteins. Over 80 of these proteins are located in the terminal regions of the genome, where proteins associated with host immune evasion are encoded. To date, only two variola proteins have been characterized. Both are located in the terminal regions and demonstrate immunoregulatory functions. One protein, the smallpox inhibitor of complement enzymes (SPICE), is homologous to a vaccinia virus virulence factor, the vaccinia virus complement-control protein (VCP), which has been found experimentally to be expressed early in the course of vaccinia infection. Both SPICE and VCP are similar in structure and function to the family of mammalian complement regulatory proteins, which function to prevent inadvertent injury to adjacent cells and tissues during complement activation. The second variola protein is the variola virus high-affinity secreted chemokine-binding protein type II (CKBP-II, CBP-II, vCCI), which binds CC-chemokine receptors. The vaccinia homologue of CKBP-II is secreted both early and late in infection. CKBP-II proteins are highly conserved among orthopoxviruses, sharing approximately 85% homology, but are absent in eukaryotes. This characteristic sets it apart from other known virulence factors in orthopoxviruses, which share sequence homology with known mammalian immune regulatory gene products. Future studies of additional variola proteins may help illuminate factors associated with its virulence, pathogenesis and strict human tropism. In addition, these studies may also assist in the development of targeted therapies for the treatment of both smallpox and human immune-related diseases.
Kock, K; Ahlers, C; Schmale, H
1994-05-01
The rat von Ebner's gland protein 1 (VEGP 1) is a secretory protein, which is abundantly expressed in the small acinar von Ebner's salivary glands of the tongue. Based on the primary structure of this protein we have previously suggested that it is a member of the lipocalin superfamily of lipophilic-ligand carrier proteins. Although the physiological role of VEGP 1 is not clear, it might be involved in sensory or protective functions in the taste epithelium. Here, we report the purification of VEGP 1 and of a closely related secretory polypeptide, VEGP 2, the isolation of a cDNA clone encoding VEGP 2, and the isolation and structural characterization of the genes for both proteins. Protein purification by gel-filtration and anion-exchange chromatography using Mono Q revealed the presence of two different immunoreactive VEGP species. N-terminal sequence determination of peptide fragments isolated after protease Asp-N digestion allowed the identification of a new VEGP, named VEGP 2, in addition to the previously characterized VEGP 1. The complete VEGP 2 sequence was deduced from a cDNA clone isolated from a von Ebner's gland cDNA library. The VEGP 2 cDNA encodes a protein of 177 amino acids and is 94% identical to VEGP 1. DNA sequence analysis of the rat VEGP 1 and 2 genes isolated from rat genomic libraries revealed that both span about 4.5 kb and contain seven exons. The VEGP 1 and 2 genes are non-allelic distinct genes in the rat genome and probably arose by gene duplication. The high degree of nucleotide sequence identity in introns A-C (94-100%) points to a recent gene conversion event that included the 5' part of the genes. The genomic organization of the rat VEGP genes closely resembles that found in other lipocalins such as beta-lactoglobulin, mouse urinary proteins (MUPs) and prostaglandin D synthase, and therefore provides clear evidence that VEGPs belong to this superfamily of proteins.
Vera-Otarola, Jorge; Solis, Loretto; Soto-Rifo, Ricardo; Ricci, Emiliano P; Pino, Karla; Tischler, Nicole D; Ohlmann, Théophile; Darlix, Jean-Luc; López-Lastra, Marcelo
2012-02-01
The small mRNA (SmRNA) of all Bunyaviridae encodes the nucleocapsid (N) protein. In 4 out of 5 genera in the Bunyaviridae, the smRNA encodes an additional nonstructural protein denominated NSs. In this study, we show that Andes hantavirus (ANDV) SmRNA encodes an NSs protein. Data show that the NSs protein is expressed in the context of an ANDV infection. Additionally, our results suggest that translation initiation from the NSs initiation codon is mediated by ribosomal subunits that have bypassed the upstream N protein initiation codon through a leaky scanning mechanism.
Vera-Otarola, Jorge; Solis, Loretto; Soto-Rifo, Ricardo; Ricci, Emiliano P.; Pino, Karla; Tischler, Nicole D.; Ohlmann, Théophile; Darlix, Jean-Luc
2012-01-01
The small mRNA (SmRNA) of all Bunyaviridae encodes the nucleocapsid (N) protein. In 4 out of 5 genera in the Bunyaviridae, the smRNA encodes an additional nonstructural protein denominated NSs. In this study, we show that Andes hantavirus (ANDV) SmRNA encodes an NSs protein. Data show that the NSs protein is expressed in the context of an ANDV infection. Additionally, our results suggest that translation initiation from the NSs initiation codon is mediated by ribosomal subunits that have bypassed the upstream N protein initiation codon through a leaky scanning mechanism. PMID:22156529
Fusagene vectors: a novel strategy for the expression of multiple genes from a single cistron.
Gäken, J; Jiang, J; Daniel, K; van Berkel, E; Hughes, C; Kuiper, M; Darling, D; Tavassoli, M; Galea-Lauri, J; Ford, K; Kemeny, M; Russell, S; Farzaneh, F
2000-12-01
Transduction of cells with multiple genes, allowing their stable and co-ordinated expression, is difficult with the available methodologies. A method has been developed for expression of multiple gene products, as fusion proteins, from a single cistron. The encoded proteins are post-synthetically cleaved and processed into each of their constituent proteins as individual, biologically active factors. Specifically, linkers encoding cleavage sites for the Golgi expressed endoprotease, furin, have been incorporated between in-frame cDNA sequences encoding different secreted or membrane bound proteins. With this strategy we have developed expression vectors encoding multiple proteins (IL-2 and B7.1, IL-4 and B7.1, IL-4 and IL-2, IL-12 p40 and p35, and IL-12 p40, p35 and IL-2 ). Transduction and analysis of over 100 individual clones, derived from murine and human tumour cell lines, demonstrate the efficient expression and biological activity of each of the encoded proteins. Fusagene vectors enable the co-ordinated expression of multiple gene products from a single, monocistronic, expression cassette.
Adaptive Covariation between the Coat and Movement Proteins of Prunus Necrotic Ringspot Virus
Codoñer, Francisco M.; Fares, Mario A.; Elena, Santiago F.
2006-01-01
The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions. PMID:16731922
Adaptive covariation between the coat and movement proteins of prunus necrotic ringspot virus.
Codoñer, Francisco M; Fares, Mario A; Elena, Santiago F
2006-06-01
The relative functional and/or structural importance of different amino acid sites in a protein can be assessed by evaluating the selective constraints to which they have been subjected during the course of evolution. Here we explore such constraints at the linear and three-dimensional levels for the movement protein (MP) and coat protein (CP) encoded by RNA 3 of prunus necrotic ringspot ilarvirus (PNRSV). By a maximum-parsimony approach, the nucleotide sequences from 46 isolates of PNRSV varying in symptomatology, host tree, and geographic origin have been analyzed and sites under different selective pressures have been identified in both proteins. We have also performed covariation analyses to explore whether changes in certain amino acid sites condition subsequent variation in other sites of the same protein or the other protein. These covariation analyses shed light on which particular amino acids should be involved in the physical and functional interaction between MP and CP. Finally, we discuss these findings in the light of what is already known about the implication of certain sites and domains in structure and protein-protein and RNA-protein interactions.
Johnson, David K; Karanicolas, John
2013-01-01
Despite intense interest and considerable effort via high-throughput screening, there are few examples of small molecules that directly inhibit protein-protein interactions. This suggests that many protein interaction surfaces may not be intrinsically "druggable" by small molecules, and elevates in importance the few successful examples as model systems for improving our fundamental understanding of druggability. Here we describe an approach for exploring protein fluctuations enriched in conformations containing surface pockets suitable for small molecule binding. Starting from a set of seven unbound protein structures, we find that the presence of low-energy pocket-containing conformations is indeed a signature of druggable protein interaction sites and that analogous surface pockets are not formed elsewhere on the protein. We further find that ensembles of conformations generated with this biased approach structurally resemble known inhibitor-bound structures more closely than equivalent ensembles of unbiased conformations. Collectively these results suggest that "druggability" is a property encoded on a protein surface through its propensity to form pockets, and inspire a model in which the crude features of the predisposed pocket(s) restrict the range of complementary ligands; additional smaller conformational changes then respond to details of a particular ligand. We anticipate that the insights described here will prove useful in selecting protein targets for therapeutic intervention.
RPG: the Ribosomal Protein Gene database.
Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya
2004-01-01
RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes.
RPG: the Ribosomal Protein Gene database
Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya
2004-01-01
RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes. PMID:14681386
NASA Astrophysics Data System (ADS)
Zhang, Hao; Liu, Haijun; Blankenship, Robert E.; Gross, Michael L.
2016-01-01
We report an isotope-encoding method coupled with carboxyl-group footprinting to monitor protein conformational changes. The carboxyl groups of aspartic/glutamic acids and of the C-terminus of proteins can serve as reporters for protein conformational changes when labeled with glycine ethyl ester (GEE) mediated by carbodiimide. In the new development, isotope-encoded "heavy" and "light" GEE are used to label separately the two states of the orange carotenoid protein (OCP) from cyanobacteria. Two samples are mixed (1:1 ratio) and analyzed by a single LC-MS/MS experiment. The differences in labeling extent between the two states are represented by the ratio of the "heavy" and "light" peptides, providing information about protein conformational changes. Combining isotope-encoded MS quantitative analysis and carboxyl-group footprinting reduces the time of MS analysis and improves the sensitivity of GEE and other footprinting.
Zhang, Hao; Liu, Haijun; Blankenship, Robert E.; Gross, Michael L.
2015-01-01
We report an isotope-encoding method coupled with carboxyl-group footprinting to monitor protein conformational changes. The carboxyl groups of aspartic/glutamic acids and of the C-terminus of proteins can serve as reporters for protein conformational changes when labeled with glycine ethyl ester (GEE) mediated by carbodiimide. In the new development, isotope-encoded “heavy” and “light” GEE are used to label separately the two states of the Orange Carotenoid Protein (OCP) from cyanobacteria. Two samples are mixed (1:1 ratio) and analyzed by a single LC-MS/MS experiment. The differences in labeling extent between the two states are represented by the ratio of the “heavy” and “light” peptides, providing information about protein conformational changes. Combining isotope-encoded MS quantitative analysis and carboxyl-group footprinting reduces the time of MS analysis and improves the sensitivity of GEE and other footprinting. PMID:26384685
Zhang, Hao; Liu, Haijun; Blankenship, Robert E.; ...
2015-09-18
Here, we report an isotope-encoding method coupled with carboxyl-group footprinting to monitor protein conformational changes. The carboxyl groups of aspartic/glutamic acids and of the C-terminus of proteins can serve as reporters for protein conformational changes when labeled with glycine ethyl ester (GEE) mediated by carbodiimide. In the new development, isotope-encoded “heavy” and “light” GEE are used to label separately the two states of the orange carotenoid protein (OCP) from cyanobacteria. Two samples are mixed (1:1 ratio) and analyzed by a single LC-MS/MS experiment. The differences in labeling extent between the two states are represented by the ratio of the “heavy”more » and “light” peptides, providing information about protein conformational changes. Combining isotope-encoded MS quantitative analysis and carboxyl-group footprinting reduces the time of MS analysis and improves the sensitivity of GEE and other footprinting.« less
Varmanen, P; Rantanen, T; Palva, A
1996-12-01
A proline iminopeptidase gene (pepI) of an industrial Lactobacillus helveticus strain was cloned and found to be organized in an operon-like structure of three open reading frames (ORF1, ORF2 and ORF3). ORF1 was preceded by a typical prokaryotic promoter region, and a putative transcription terminator was found downstream of ORF3, identified as the pepI gene. Using primer-extension analyses, only one transcription start site, upstream of ORF1, was identifiable in the predicted operon. Although the size of mRNA could not be judged by Northern analysis either with ORF1-, ORF2- or pepI-specific probes, reverse transcription-PCR analyses further supported the operon structure of the three genes. ORF1, ORF2 and ORF3 had coding capacities for 50.7, 24.5 and 33.8 kDa proteins, respectively. The ORF3-encoded PepI protein showed 65% identity with the PepI proteins from Lactobacillus delbrueckii subsp. bulgaricus and Lactobacillus delbrueckii subsp. lactis. The ORF1-encoded protein had significant homology with several members of the ABC transporter family but, with two distinct putative ATP-binding sites, it would represent an unusual type among the bacterial ABC transporters. ORF2 encoded a putative integral membrane protein also characteristic of the ABC transporter family. The pepI gene was overexpressed in Escherichia coli. Purified PepI hydrolysed only di and tripeptides with proline in the first position. Optimum PepI activity was observed at pH 7.5 and 40 degrees C. A gel filtration analysis indicated that PepI is a dimer of M(r) 53,000. PepI was shown to be a metal-independent serine peptidase having thiol groups at or near the active site. Kinetic studies with proline-p-nitroanilide as substrate revealed Km and Vmax values of 0.8 mM and 350 mmol min-1 mg-1, respectively, and a very high turnover number of 135,000 s-1.
Muraki, Yasushi; Washioka, Hiroshi; Sugawara, Kanetsu; Matsuzaki, Yoko; Takashita, Emi; Hongo, Seiji
2004-07-01
Influenza C virus-like particles (VLPs) have been generated from cloned cDNAs. A cDNA of the green fluorescent protein (GFP) gene in antisense orientation was flanked by the 5' and 3' non-coding regions of RNA segment 5 of the influenza C virus. The cDNA cassette was inserted between an RNA polymerase I promoter and terminator of the Pol I vector. This plasmid DNA was transfected into 293T cells together with plasmids encoding virus proteins of C/Ann Arbor/1/50 or C/Yamagata/1/88. Transfer of the supernatants of the transfected 293T cells to HMV-II cells resulted in GFP expression in the HMV-II cells. The quantification of the GFP-positive HMV-II cells indicated the presence of approximately 10(6) VLPs (ml supernatant)(-1). Cords 50-300 microm in length were observed on transfected 293T cells, although the cords were not observed when the plasmid for M1 protein of C/Ann Arbor/1/50 was replaced with that of C/Taylor/1233/47. A series of transfection experiments with plasmids encoding M1 mutants of C/Ann Arbor/1/50 or C/Taylor/1233/47 showed that an amino acid at residue 24 of the M1 protein is responsible for cord formation. This finding provides direct evidence for a previous hypothesis that M1 protein is involved in the formation of cord-like structures protruding from the C/Yamagata/1/88-infected cells. Evidence was obtained by electron microscopy that transfected cells bearing cords produced filamentous VLPs, suggesting the potential role of the M1 protein in determining the filamentous/spherical morphology of influenza C virus.
Hu, Haibin; Kortner, Trond M; Gajardo, Karina; Chikwati, Elvis; Tinsley, John; Krogdahl, Åshild
2016-01-01
In Atlantic salmon (Salmo salar L.), and also in other fish species, certain plant protein ingredients can increase fecal water content creating a diarrhea-like condition which may impair gut function and reduce fish growth. The present study aimed to strengthen understanding of the underlying mechanisms by observing effects of various alternative plant protein sources when replacing fish meal on expression of genes encoding proteins playing key roles in regulation of water transport across the mucosa of the distal intestine (DI). A 48-day feeding trial was conducted with five diets: A reference diet (FM) in which fish meal (72%) was the only protein source; Diet SBMWG with a mix of soybean meal (30%) and wheat gluten (22%); Diet SPCPM with a mix of soy protein concentrate (30%) and poultry meal (6%); Diet GMWG with guar meal (30%) and wheat gluten (14.5%); Diet PM with 58% poultry meal. Compared to fish fed the FM reference diet, fish fed the soybean meal containing diet (SBMWG) showed signs of enteritis in the DI, increased fecal water content of DI chyme and higher plasma osmolality. Altered DI expression of a battery of genes encoding aquaporins, ion transporters, tight junction and adherens junction proteins suggested reduced transcellular transport of water as well as a tightening of the junction barrier in fish fed the SBMWG diet, which may explain the observed higher fecal water content and plasma osmolality. DI structure was not altered for fish fed the other experimental diets but alterations in target gene expression and fecal water content were observed, indicating that alterations in water transport components may take place without clear effects on intestinal structure.
Tetreau, Guillaume; Dittmer, Neal T; Cao, Xiaolong; Agrawal, Sinu; Chen, Yun-Ru; Muthukrishnan, Subbaratnam; Haobo, Jiang; Blissard, Gary W; Kanost, Michael R; Wang, Ping
2015-07-01
In insects, chitin is a major structural component of the cuticle and the peritrophic membrane (PM). In nature, chitin is always associated with proteins among which chitin-binding proteins (CBPs) are the most important for forming, maintaining and regulating the functions of these extracellular structures. In this study, a genome-wide search for genes encoding proteins with ChtBD2-type (peritrophin A-type) chitin-binding domains (CBDs) was conducted. A total of 53 genes encoding 56 CBPs were identified, including 15 CPAP1s (cuticular proteins analogous to peritrophins with 1 CBD), 11 CPAP3s (CPAPs with 3 CBDs) and 17 PMPs (PM proteins) with a variable number of CBDs, which are structural components of cuticle or of the PM. CBDs were also identified in enzymes of chitin metabolism including 6 chitinases and 7 chitin deacetylases encoded by 6 and 5 genes, respectively. RNA-seq analysis confirmed that PMP and CPAP genes have differential spatial expression patterns. The expression of PMP genes is midgut-specific, while CPAP genes are widely expressed in different cuticle forming tissues. Phylogenetic analysis of CBDs of proteins in insects belonging to different orders revealed that CPAP1s from different species constitute a separate family with 16 different groups, including 6 new groups identified in this study. The CPAP3s are clustered into a separate family of 7 groups present in all insect orders. Altogether, they reveal that duplication events of CBDs in CPAP1s and CPAP3s occurred prior to the evolutionary radiation of insect species. In contrast to the CPAPs, all CBDs from individual PMPs are generally clustered and distinct from other PMPs in the same species in phylogenetic analyses, indicating that the duplication of CBDs in each of these PMPs occurred after divergence of insect species. Phylogenetic analysis of these three CBP families showed that the CBDs in CPAP1s form a clearly separate family, while those found in PMPs and CPAP3s were clustered together in the phylogenetic tree. For chitinases and chitin deacetylases, most of phylogenetic analysis performed with the CBD sequences resulted in similar clustering to the one obtained by using catalytic domain sequences alone, suggesting that CBDs were incorporated into these enzymes and evolved in tandem with the catalytic domains before the diversification of different insect orders. Based on these results, the evolution of CBDs in insect CBPs is discussed to provide a new insight into the CBD sequence structure and diversity, and their evolution and expression in insects. Copyright © 2014 Elsevier Ltd. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Nitrogen uptake and the efficient absorption and metabolism of nitrogen are essential elements in attempts to breed improved cereal cultivars for grain or silage production. One of the enzymes related to nitrogen metabolism is glutamine-2-oxoglutarate amidotransferase (GOGAT). Together with glutami...
Polycipiviridae: a proposed new family of polycistronic picorna-like RNA viruses
USDA-ARS?s Scientific Manuscript database
Solenopsis invicta virus 2 is a single-stranded positive-sense picorna-like RNA virus with an unusual genome structure. The monopartite genome of approximately 11 kb contains four short open reading frames in its 5' one third, three of which encode proteins with homology to picornavirus-like jelly-r...
Identification and characterization of long non-coding RNAs in rainbow trout eggs
USDA-ARS?s Scientific Manuscript database
Long non-coding RNAs (lncRNAs) are in general considered as a diverse class of transcripts longer than 200 nucleotides that structurally resemble mRNAs but do not encode proteins. Recent advances in RNA sequencing (RNA-Seq) and bioinformatics methods have provided an opportunity to indentify and ana...
MACF1 gene structure: a hybrid of plectin and dystrophin.
Gong, T W; Besirli, C G; Lomax, M I
2001-11-01
Mammalian MACF1 (Macrophin1; previously named ACF7) is a giant cytoskeletal linker protein with three known isoforms that arise by alternative splicing. We isolated a 19.1-kb cDNA encoding a fourth isoform (MACF1-4) with a unique N-terminus. Instead of an N-terminal actin-binding domain found in the other three isoforms, MACF1-4 has eight plectin repeats. The MACF1 gene is located on human Chr 1p32, contains at least 102 exons, spans over 270 kb, and gives rise to four major isoforms with different N-termini. The genomic organization of the actin-binding domain is highly conserved in mammalian genes for both plectin and BPAG1. All eight plectin repeats are encoded by one large exon; this feature is similar to the genomic structure of plectin. The intron positions within spectrin repeats in MACF1 are very similar to those in the dystrophin gene. This demonstrates that MACF1 has characteristic features of genes for two classes of cytoskeletal proteins, i.e., plectin and dystrophin.
Barz, W P; Walter, P
1999-04-01
Many eukaryotic cell surface proteins are anchored in the lipid bilayer through glycosylphosphatidylinositol (GPI). GPI anchors are covalently attached in the endoplasmic reticulum (ER). The modified proteins are then transported through the secretory pathway to the cell surface. We have identified two genes in Saccharomyces cerevisiae, LAG1 and a novel gene termed DGT1 (for "delayed GPI-anchored protein transport"), encoding structurally related proteins with multiple membrane-spanning domains. Both proteins are localized to the ER, as demonstrated by immunofluorescence microscopy. Deletion of either gene caused no detectable phenotype, whereas lag1Delta dgt1Delta cells displayed growth defects and a significant delay in ER-to-Golgi transport of GPI-anchored proteins, suggesting that LAG1 and DGT1 encode functionally redundant or overlapping proteins. The rate of GPI anchor attachment was not affected, nor was the transport rate of several non-GPI-anchored proteins. Consistent with a role of Lag1p and Dgt1p in GPI-anchored protein transport, lag1Delta dgt1Delta cells deposit abnormal, multilayered cell walls. Both proteins have significant sequence similarity to TRAM, a mammalian membrane protein thought to be involved in protein translocation across the ER membrane. In vivo translocation studies, however, did not detect any defects in protein translocation in lag1Delta dgt1Delta cells, suggesting that neither yeast gene plays a role in this process. Instead, we propose that Lag1p and Dgt1p facilitate efficient ER-to-Golgi transport of GPI-anchored proteins.
Barz, Wolfgang P.; Walter, Peter
1999-01-01
Many eukaryotic cell surface proteins are anchored in the lipid bilayer through glycosylphosphatidylinositol (GPI). GPI anchors are covalently attached in the endoplasmic reticulum (ER). The modified proteins are then transported through the secretory pathway to the cell surface. We have identified two genes in Saccharomyces cerevisiae, LAG1 and a novel gene termed DGT1 (for “delayed GPI-anchored protein transport”), encoding structurally related proteins with multiple membrane-spanning domains. Both proteins are localized to the ER, as demonstrated by immunofluorescence microscopy. Deletion of either gene caused no detectable phenotype, whereas lag1Δ dgt1Δ cells displayed growth defects and a significant delay in ER-to-Golgi transport of GPI-anchored proteins, suggesting that LAG1 and DGT1 encode functionally redundant or overlapping proteins. The rate of GPI anchor attachment was not affected, nor was the transport rate of several non–GPI-anchored proteins. Consistent with a role of Lag1p and Dgt1p in GPI-anchored protein transport, lag1Δ dgt1Δ cells deposit abnormal, multilayered cell walls. Both proteins have significant sequence similarity to TRAM, a mammalian membrane protein thought to be involved in protein translocation across the ER membrane. In vivo translocation studies, however, did not detect any defects in protein translocation in lag1Δ dgt1Δ cells, suggesting that neither yeast gene plays a role in this process. Instead, we propose that Lag1p and Dgt1p facilitate efficient ER-to-Golgi transport of GPI-anchored proteins. PMID:10198056
McCoy, Andrea J; Maurelli, Anthony T
2005-07-01
Recent characterization of chlamydial genes encoding functional peptidoglycan (PG)-synthesis proteins suggests that the Chlamydiaceae possess the ability to synthesize PG yet biochemical evidence for the synthesis of PG has yet to be demonstrated. The presence of D-amino acids in PG is a hallmark of bacteria. Chlamydiaceae do not appear to encode amino acid racemases however, a D-alanyl-D-alanine (D-Ala-D-Ala) ligase homologue (Ddl) is encoded in the genome. Thus, we undertook a genetics-based approach to demonstrate and characterize the D-Ala-D-Ala ligase activity of chlamydial Ddl, a protein encoded as a fusion with MurC. The full-length murC-ddl fusion gene from Chlamydia trachomatis serovar L2 was cloned and placed under the control of the arabinose-inducible ara promoter and transformed into a D-Ala-D-Ala ligase auxotroph of Escherichia coli possessing deletions of both the ddlA and ddlB genes. Viability of the E. coliDeltaddlADeltaddlB mutant in the absence of exogenous D-Ala-D-Ala dipeptide became dependent on the expression of the chlamydial murC-ddl thus demonstrating functional ligase activity. Domain mapping of the full-length fusion protein and site-directed mutagenesis of the MurC domain revealed that the structure of the full fusion protein but not MurC enzymatic activity was required for ligase activity in vivo. Recombinant MurC-Ddl exhibited substrate specificity for D-Ala. Chlamydia growth is inhibited by D-cycloserine (DCS) and in vitro analysis provided evidence for the chlamydial MurC-Ddl as the target for DCS sensitivity. In vivo sensitivity to DCS could be reversed by addition of exogenous D-Ala and D-Ala-D-Ala. Together, these findings further support our hypothesis that PG is synthesized by members of the Chlamydiaceae family and suggest that D-amino acids, specifically D-Ala, are present in chlamydial PG.
Positive selection on human gamete-recognition genes
Stover, Daryn A.; Guerra, Vanessa; Mozaffari, Sahar V.; Ober, Carole; Mugal, Carina F.; Kaj, Ingemar
2018-01-01
Coevolution of genes that encode interacting proteins expressed on the surfaces of sperm and eggs can lead to variation in reproductive compatibility between mates and reproductive isolation between members of different species. Previous studies in mice and other mammals have focused in particular on evidence for positive or diversifying selection that shapes the evolution of genes that encode sperm-binding proteins expressed in the egg coat or zona pellucida (ZP). By fitting phylogenetic models of codon evolution to data from the 1000 Genomes Project, we identified candidate sites evolving under diversifying selection in the human genes ZP3 and ZP2. We also identified one candidate site under positive selection in C4BPA, which encodes a repetitive protein similar to the mouse protein ZP3R that is expressed in the sperm head and binds to the ZP at fertilization. Results from several additional analyses that applied population genetic models to the same data were consistent with the hypothesis of selection on those candidate sites leading to coevolution of sperm- and egg-expressed genes. By contrast, we found no candidate sites under selection in a fourth gene (ZP1) that encodes an egg coat structural protein not directly involved in sperm binding. Finally, we found that two of the candidate sites (in C4BPA and ZP2) were correlated with variation in family size and birth rate among Hutterite couples, and those two candidate sites were also in linkage disequilibrium in the same Hutterite study population. All of these lines of evidence are consistent with predictions from a previously proposed hypothesis of balancing selection on epistatic interactions between C4BPA and ZP3 at fertilization that lead to the evolution of co-adapted allele pairs. Such patterns also suggest specific molecular traits that may be associated with both natural reproductive variation and clinical infertility. PMID:29340252
Pellegrino, Simone; Demeshkina, Natalia; Mancera-Martinez, Eder; Melnikov, Sergey; Simonetti, Angelita; Myasnikov, Alexander; Yusupov, Marat; Yusupova, Gulnara; Hashem, Yaser
2018-06-07
One of the most critical steps of protein biosynthesis is the coupled movement of messenger RNA (mRNA), that encodes genetic information, with transfer RNAs (tRNAs) on the ribosome. In eukaryotes this process is catalyzed by a conserved G-protein, the elongation factor 2 (eEF2), which carries a unique post-translational modification, called diphthamide, found in all eukaryotic species. Here we present near-atomic resolution cryo-EM structures of yeast 80S ribosome complexes containing mRNA, tRNA and eEF2 trapped in different GTP-hydrolysis states which provide further structural insights on the role of diphthamide in the mechanism of translation fidelity in eukaryotes. Copyright © 2018. Published by Elsevier Ltd.
Sperm-Associated Antigen–17 Gene Is Essential for Motile Cilia Function and Neonatal Survival
Teves, Maria Eugenia; Zhang, Zhibing; Costanzo, Richard M.; Henderson, Scott C.; Corwin, Frank D.; Zweit, Jamal; Sundaresan, Gobalakrishnan; Subler, Mark; Salloum, Fadi N.; Rubin, Bruce K.
2013-01-01
Primary ciliary dyskinesia (PCD), resulting from defects in cilia assembly or motility, is caused by mutations in a number of genes encoding axonemal proteins. PCD phenotypes are variable, and include recurrent respiratory tract infections, bronchiectasis, hydrocephaly, situs inversus, and male infertility. We generated knockout mice for the sperm-associated antigen–17 (Spag17) gene, which encodes a central pair (CP) protein present in the axonemes of cells with “9 + 2” motile cilia or flagella. The targeting of Spag17 resulted in a severe phenotype characterized by immotile nasal and tracheal cilia, reduced clearance of nasal mucus, profound respiratory distress associated with lung fluid accumulation and disruption of the alveolar epithelium, cerebral ventricular expansion consistent with emerging hydrocephalus, failure to suckle, and neonatal demise within 12 hours of birth. Ultrastructural analysis revealed the loss of one CP microtubule in approximately one quarter of tracheal cilia axonemes, an absence of a C1 microtubule projection, and other less frequent CP structural abnormalities. SPAG6 and SPAG16 (CP proteins that interact with SPAG17) were increased in tracheal tissue from SPAG17-deficient mice. We conclude that Spag17 plays a critical role in the function and structure of motile cilia, and that neonatal lethality is likely explained by impaired airway mucociliary clearance. PMID:23418344
NASA Astrophysics Data System (ADS)
Soundararajan, Venky; Aravamudan, Murali
2014-12-01
The efficacy and mechanisms of therapeutic action are largely described by atomic bonds and interactions local to drug binding sites. Here we introduce global connectivity analysis as a high-throughput computational assay of therapeutic action - inspired by the Google page rank algorithm that unearths most ``globally connected'' websites from the information-dense world wide web (WWW). We execute short timescale (30 ps) molecular dynamics simulations with high sampling frequency (0.01 ps), to identify amino acid residue hubs whose global connectivity dynamics are characteristic of the ligand or mutation associated with the target protein. We find that unexpected allosteric hubs - up to 20Å from the ATP binding site, but within 5Å of the phosphorylation site - encode the Gibbs free energy of inhibition (ΔGinhibition) for select protein kinase-targeted cancer therapeutics. We further find that clinically relevant somatic cancer mutations implicated in both drug resistance and personalized drug sensitivity can be predicted in a high-throughput fashion. Our results establish global connectivity analysis as a potent assay of protein functional modulation. This sets the stage for unearthing disease-causal exome mutations and motivates forecast of clinical drug response on a patient-by-patient basis. We suggest incorporation of structure-guided genetic inference assays into pharmaceutical and healthcare Oncology workflows.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zheng Min; Guangxi Center for Animal Disease Control and Prevention, Nanning 530001; College of Animal Science and Veterinary Medicine, Jilin University, Changchun 130062
Goatpox, caused by goatpox virus (GTPV), is an acute feverish and contagious disease in goats often associated with high morbidity and high mortality. To resolve potential safety risks and vaccination side effects of existing live attenuated goatpox vaccine (AV41), two Semliki forest virus (SFV) replicon-based bicistronic expression DNA vaccines (pCSm-AAL and pCSm-BAA) which encode GTPV structural proteins corresponding to the Vaccinia virus proteins A27, L1, A33, and B5, respectively, were constructed. Then, theirs ability to induce humoral and cellular response in mice and goats, and protect goats against virulent virus challenge were evaluated. The results showed that, vaccination with pCSm-AALmore » and pCSm-BAA in combination could elicit strong humoral and cellular responses in mice and goats, provide partial protection against viral challenge in goats, and reduce disease symptoms. Additionally, priming vaccination with the above-mentioned DNA vaccines could significantly reduce the goats' side reactions from boosting vaccinations with current live vaccine (AV41), which include skin lesions at the inoculation site and fevers. Data obtained in this study could not only facilitate improvement of the current goatpox vaccination strategy, but also provide valuable guidance to suitable candidates for evaluation and development of orthopoxvirus vaccines.« less
Mehedi, Masfique; Hoenen, Thomas; Robertson, Shelly; Ricklefs, Stacy; Dolan, Michael A; Taylor, Travis; Falzarano, Darryl; Ebihara, Hideki; Porcella, Stephen F; Feldmann, Heinz
2013-01-01
Ebolavirus (EBOV), the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.
Chu, Teng-Ping J; Yuann, Jeu-Ming P
2011-05-01
MPT64, a secreted protein of Mycobacterium tuberculosis (MTB), stimulates the immune reactions within cells and is a protective antigen that is lost by the bacilli Calmette-Guérin (BCG) vaccine during propagation. To minimize the toxicity caused by MTB, we used the MPT64 gene encoded by nontoxic H37Ra MTB to carry out genetic expansion via polymerase chain reaction and gene clone MPT64. The plasmid DNA encoded MPT64 was expressed at 20°C for 22 H, and a large quantity of MPT64 was obtained. In the absence of urea, MPT64 multimers with subunits being covalently connected via disulfide bonds were detected by Western blot showing strong protein-protein interactions, as evidenced by the formation of MPT64 tetramers. Finally, with urea of decreasing concentrations, we refolded MPT64 purified in the presence of urea and determined its secondary structures using circular dichroism. MPT64 was found to contain 2.2% α-helix, 50.9% β-sheet, 19.5% turn, and 27.4% random coil. The molecular weight of MPT64 was determined by a matrix-assisted laser desorption ionization-time of flight mass spectrometer and found to be 23,497 Da, very close to the theoretical molecular weight of MPT64. The results presented here provide a sound basis for future biochemical and biophysical studies of MPT64 or any other proteins encoded by nontoxic H37Ra MTB. Copyright © 2011 International Union of Biochemistry and Molecular Biology, Inc.
Nopaline-type Ti plasmid of Agrobacterium encodes a VirF-like functional F-box protein.
Lacroix, Benoît; Citovsky, Vitaly
2015-11-20
During Agrobacterium-mediated genetic transformation of plants, several bacterial virulence (Vir) proteins are translocated into the host cell to facilitate infection. One of the most important of such translocated factors is VirF, an F-box protein produced by octopine strains of Agrobacterium, which presumably facilitates proteasomal uncoating of the invading T-DNA from its associated proteins. The presence of VirF also is thought to be involved in differences in host specificity between octopine and nopaline strains of Agrobacterium, with the current dogma being that no functional VirF is encoded by nopaline strains. Here, we show that a protein with homology to octopine VirF is encoded by the Ti plasmid of the nopaline C58 strain of Agrobacterium. This protein, C58VirF, possesses the hallmarks of functional F-box proteins: it contains an active F-box domain and specifically interacts, via its F-box domain, with SKP1-like (ASK) protein components of the plant ubiquitin/proteasome system. Thus, our data suggest that nopaline strains of Agrobacterium have evolved to encode a functional F-box protein VirF.
A Plant 5S Ribosomal RNA Mimic Regulates Alternative Splicing of Transcription Factor IIIA Pre-mRNAs
Hammond, Ming C.; Wachter, Andreas; Breaker, Ronald R.
2009-01-01
Transcription factor IIIA (TFIIIA) is required for eukaryotic synthesis of 5S ribosomal RNA by RNA polymerase III. Here we report the discovery of a structured RNA element with striking resemblance to 5S rRNA that is conserved within TFIIIA precursor mRNAs (pre-mRNAs) from diverse plant lineages. TFIIIA protein expression is controlled by alternative splicing of the exon containing the plant 5S rRNA mimic (P5SM). P5SM triggers exon skipping upon binding of ribosomal protein L5, a natural partner of 5S rRNA, which demonstrates the functional adaptation of its structural mimicry. Since the exon-skipped splice product encodes full-length TFIIIA protein, these results reveal a ribosomal protein-mRNA interaction that is involved in 5S rRNA synthesis and has implications for cross-coordination of ribosomal components. This study also provides insight into the origin and function of a newfound class of structured RNA that regulates alternative splicing. PMID:19377483
Burg, John S; Ingram, Jessica R; Venkatakrishnan, A J; Jude, Kevin M; Dukkipati, Abhiram; Feinberg, Evan N; Angelini, Alessandro; Waghray, Deepa; Dror, Ron O; Ploegh, Hidde L; Garcia, K Christopher
2015-03-06
Chemokines are small proteins that function as immune modulators through activation of chemokine G protein-coupled receptors (GPCRs). Several viruses also encode chemokines and chemokine receptors to subvert the host immune response. How protein ligands activate GPCRs remains unknown. We report the crystal structure at 2.9 angstrom resolution of the human cytomegalovirus GPCR US28 in complex with the chemokine domain of human CX3CL1 (fractalkine). The globular body of CX3CL1 is perched on top of the US28 extracellular vestibule, whereas its amino terminus projects into the central core of US28. The transmembrane helices of US28 adopt an active-state-like conformation. Atomic-level simulations suggest that the agonist-independent activity of US28 may be due to an amino acid network evolved in the viral GPCR to destabilize the receptor's inactive state. Copyright © 2015, American Association for the Advancement of Science.
Structure and function of the Zika virus full-length NS5 protein
Zhao, Baoyu; Yi, Guanghui; Du, Fenglei; ...
2017-03-27
The recent outbreak of Zika virus (ZIKV) has infected over 1 million people in over 30 countries. ZIKV replicates its RNA genome using virally encoded replication proteins. Nonstructural protein 5 (NS5) contains a methyltransferase for RNA capping and a polymerase for viral RNA synthesis. Here we report the crystal structures of full-length NS5 and its polymerase domain at 3.0 Å resolution. The NS5 structure has striking similarities to the NS5 protein of the related Japanese encephalitis virus. The methyltransferase contains in-line pockets for substrate binding and the active site. Key residues in the polymerase are located in similar positions tomore » those of the initiation complex for the hepatitis C virus polymerase. The polymerase conformation is affected by the methyltransferase, which enables a more efficiently elongation of RNA synthesis in vitro. Altogether, our results will contribute to future studies on ZIKV infection and the development of inhibitors of ZIKV replication.« less
Hammond, Ming C; Wachter, Andreas; Breaker, Ronald R
2009-05-01
Transcription factor IIIA (TFIIIA) is required for eukaryotic synthesis of 5S ribosomal RNA by RNA polymerase III. Here we report the discovery of a structured RNA element with clear resemblance to 5S rRNA that is conserved within TFIIIA precursor mRNAs from diverse plant lineages. TFIIIA protein expression is controlled by alternative splicing of the exon containing the plant 5S rRNA mimic (P5SM). P5SM triggers exon skipping upon binding of ribosomal protein L5, a natural partner of 5S rRNA, which demonstrates the functional adaptation of its structural mimicry. As the exon-skipped splice product encodes full-length TFIIIA protein, these results reveal a ribosomal protein-mRNA interaction that is involved in 5S rRNA synthesis and has implications for cross-coordination of ribosomal components. This study also provides insight into the origin and function of a newfound class of structured RNA that regulates alternative splicing.
Kim, Hyun Uk; Wu, Sherry S.H.; Ratnayake, Chandra; Huang, Anthony H.C.
2001-01-01
Plastid lipid-associated protein (PAP), a predominant structural protein associated with carotenoids and other non-green neutral lipids in plastids, was shown to be encoded by a single nuclear gene in several species. Here we report three PAP genes in the diploid Brassica rapa; the three PAPs are associated with different lipids in specific tissues. Pap1 and Pap2 are more similar to each other (84% amino acid sequence identity) than to Pap3 (46% and 44%, respectively) in the encoded mature proteins. Pap1 transcript was most abundant in the maturing anthers (tapetum) and in lesser amounts in leaves, fruit coats, seeds, and sepals; Pap2 transcript was abundant only in the petals; and Pap3 transcript had a wide distribution, but at minimal levels in numerous organs. Immunoblotting after sodium dodecyl sulfate-polyacrylamide gel electrophoresis indicated that most organs had several nanograms of PAP1 or PAP2 per milligram of total protein, the highest amounts being in the anthers (10.9 μg mg−1 PAP1) and petals (6.6 μg mg−1 PAP2), and that they had much less PAP3 (<0.02 μg mg−1). In these organs PAP was localized in isolated plastid fractions. Plants were subjected to abiotic stresses; drought and ozone reduced the levels of the three Pap transcripts, whereas mechanical wounding and altering the light intensity enhanced their levels. We conclude that the PAP gene family consists of several members whose proteins are associated with different lipids and whose expressions are controlled by distinct mechanisms. Earlier reports of the expression of one Pap gene in various organs in a species need to be re-examined. PMID:11351096
Orth, Martin F.; Cazes, Alex; Butt, Elke; Grunewald, Thomas G. P.
2015-01-01
The gene encoding the LIM and SH3 domain protein (LASP1) was cloned two decades ago from a cDNA library of breast cancer metastases. As the first protein of a class comprising one N-terminal LIM and one C-terminal SH3 domain, LASP1 founded a new LIM-protein subfamily of the nebulin group. Since its discovery LASP1 proved to be an extremely versatile protein because of its exceptional structure allowing interaction with various binding partners, its ubiquitous expression in normal tissues, albeit with distinct expression patterns, and its ability to transmit signals from the cytoplasm into the nucleus. As a result, LASP1 plays key roles in cell structure, physiological processes, and cell signaling. Furthermore, LASP1 overexpression contributes to cancer aggressiveness hinting to a potential value of LASP1 as a cancer biomarker. In this review we summarize published data on structure, regulation, function, and expression pattern of LASP1, with a focus on its role in human cancer and as a biomarker protein. In addition, we provide a comprehensive transcriptome analysis of published microarrays (n=2,780) that illustrates the expression profile of LASP1 in normal tissues and its overexpression in a broad range of human cancer entities. PMID:25622104
Luo, Yonglun; Blechingberg, Jenny; Fernandes, Ana Miguel; Li, Shengting; Fryland, Tue; Børglum, Anders D; Bolund, Lars; Nielsen, Anders Lade
2015-11-14
FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins and involved in the human neurological diseases amyotrophic lateral sclerosis and fronto-temporal lobar degeneration. To determine the gene regulatory functions of FUS and EWS at the level of chromatin, we have performed chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq). Our results show that FUS and EWS bind to a subset of actively transcribed genes, that binding often is downstream the poly(A)-signal, and that binding overlaps with RNA polymerase II. Functional examinations of selected target genes identified that FUS and EWS can regulate gene expression at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.
A rapidly evolving secretome builds and patterns a sea shell
Jackson, Daniel J; McDougall, Carmel; Green, Kathryn; Simpson, Fiona; Wörheide, Gert; Degnan, Bernard M
2006-01-01
Background Instructions to fabricate mineralized structures with distinct nanoscale architectures, such as seashells and coral and vertebrate skeletons, are encoded in the genomes of a wide variety of animals. In mollusks, the mantle is responsible for the extracellular production of the shell, directing the ordered biomineralization of CaCO3 and the deposition of architectural and color patterns. The evolutionary origins of the ability to synthesize calcified structures across various metazoan taxa remain obscure, with only a small number of protein families identified from molluskan shells. The recent sequencing of a wide range of metazoan genomes coupled with the analysis of gene expression in non-model animals has allowed us to investigate the evolution and process of biomineralization in gastropod mollusks. Results Here we show that over 25% of the genes expressed in the mantle of the vetigastropod Haliotis asinina encode secreted proteins, indicating that hundreds of proteins are likely to be contributing to shell fabrication and patterning. Almost 85% of the secretome encodes novel proteins; remarkably, only 19% of these have identifiable homologues in the full genome of the patellogastropod Lottia scutum. The spatial expression profiles of mantle genes that belong to the secretome is restricted to discrete mantle zones, with each zone responsible for the fabrication of one of the structural layers of the shell. Patterned expression of a subset of genes along the length of the mantle is indicative of roles in shell ornamentation. For example, Has-sometsuke maps precisely to pigmentation patterns in the shell, providing the first case of a gene product to be involved in molluskan shell pigmentation. We also describe the expression of two novel genes involved in nacre (mother of pearl) deposition. Conclusion The unexpected complexity and evolvability of this secretome and the modular design of the molluskan mantle enables diversification of shell strength and design, and as such must contribute to the variety of adaptive architectures and colors found in mollusk shells. The composition of this novel mantle-specific secretome suggests that there are significant molecular differences in the ways in which gastropods synthesize their shells. PMID:17121673
Computational protein design: a review
NASA Astrophysics Data System (ADS)
Coluzza, Ivan
2017-04-01
Proteins are one of the most versatile modular assembling systems in nature. Experimentally, more than 110 000 protein structures have been identified and more are deposited every day in the Protein Data Bank. Such an enormous structural variety is to a first approximation controlled by the sequence of amino acids along the peptide chain of each protein. Understanding how the structural and functional properties of the target can be encoded in this sequence is the main objective of protein design. Unfortunately, rational protein design remains one of the major challenges across the disciplines of biology, physics and chemistry. The implications of solving this problem are enormous and branch into materials science, drug design, evolution and even cryptography. For instance, in the field of drug design an effective computational method to design protein-based ligands for biological targets such as viruses, bacteria or tumour cells, could give a significant boost to the development of new therapies with reduced side effects. In materials science, self-assembly is a highly desired property and soon artificial proteins could represent a new class of designable self-assembling materials. The scope of this review is to describe the state of the art in computational protein design methods and give the reader an outline of what developments could be expected in the near future.
Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L
1980-01-01
The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes is identical to that in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs; a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified version, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clone h19 and h22 are compared areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone genes. Images PMID:7443547
NASA Astrophysics Data System (ADS)
Sethaphong, Latsavongsakda
This work examines smart material properties of rational self-assembly and molecular recognition found in nano-biosystems. Exploiting the sequence and structural information encoded within nucleic acids and proteins will permit programmed synthesis of nanomaterials and help create molecular machines that may carry out new roles involving chemical catalysis and bioenergy. Responsive to different ionic environments thru self-reorgnization, nucleic acids (NA) are nature's signature smart material; organisms such as viruses and bacteria use features of NAs to react to their environment and orchestrate their lifecycle. Furthermore, nucleic acid systems (both RNA and DNA) are currently exploited as scaffolds; recent applications have been showcased to build bioelectronics and biotemplated nanostructures via directed assembly of multidimensional nanoelectronic devices 1. Since the most stable and rudimentary structure of nucleic acids is the helical duplex, these were modeled in order to examine the influence of the microenvironment, sequence, and cation-dependent perturbations of their canonical forms. Due to their negatively charged phosphate backbone, NA's rely on counterions to overcome the inherent repulsive forces that arise from the assembly of two complementary strands. As a realistic model system, we chose the HIV-TAR helix (PDB ID: 397D) to study specific sequence motifs on cation sequestration. At physiologically relevant concentrations of sodium and potassium ions, we observed sequence based effects where purine stretches were adept in retaining high residency cations. The transitional space between adenine and guanosine nucleotides (ApG step) in a sequence proved the most favorable. This work was the first to directly show these subtle interactions of sequence based cationic sequestration and may be useful for controlling metallization of nucleic acids in conductive nanowires. Extending the study further, we explored the degree to which the structure of NA duplexes alone interacted with cations distinct from a specific sequence. Under physiologically relevant conditions, a duplex of RNA polyguanine-polycitidine was highly responsive and able to sequester cations to the middle of the purine stretches. The least responsive structure was a DNA polyadenine-polythymine duplex. A random sequence DNA duplex contorted into an RNA-like helix resulted in cationic dynamics similar to RNA systems. These studies showed that cation diffusive binding events in nucleic acid duplex structures are sequence specific and heavily influenced by structural aspects helical forms to account for much of the differences observed. Although structural information in nucleic acids is encoded within their sequence, linking amino acid sequence to protein structure is murkier; the structural information within proteins is encoded by the folding process itself: a complex phenomenon driven toward the equilibrium state of the active conformation. Upwards of two thirds of a protein's sequence can be substituted with similar amino acids without significantly perturbing its function; conserved residues of about 10% seem to be vital; since evolutionary selection pressure in proteins operates 3-dimenionally, a linear sequence is partially informative. We explored this problem by folding de-novo the cytosolic portion of the membrane protein, cellulose synthase, CESA1 from upland cotton, Gossypium hirsutum (Ghcesa1). The cytoplasmic region was generated by homology modeling and refined with molecular dynamics. These mutations impair local structural flexibility which likely results in cellulose that is produced at a lower rate and is less crystalline. Additional modeling of fragments of cellulose synthases from the model plant, Arabidopsis thaliana, offered novel insights into the function of conserved cytosolic domains within plant cellulose synthases. Transport mechanisms related to the transmembrane region revealed significant differences between plants and a bacterial complex. These studies generated possible mutations that may allow for the creation of new synthases and identified other avenues of research in order to develop technologies that may alter the crystallinity and other useful properties of cellulose. 1. Karplus, K., SAM-T08, HMM-based protein structure prediction. Nucleic Acids Research, 2009. 37: p. W492-W497.
Characterization of tail sheath protein of giant bacteriophage phiKZ Pseudomonas aeruginosa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kurochkina, Lidia P., E-mail: lpk@ibch.r; Aksyuk, Anastasia A.; Sachkova, Maria Yu.
2009-12-20
The tail sheath protein of giant bacteriophage phiKZ Pseudomonas aeruginosa encoded by gene 29 was identified and its expression system was developed. Localization of the protein on the virion was confirmed by immunoelectron microscopy. Properties of gene product (gp) 29 were studied by electron microscopy, immunoblotting and limited trypsinolysis. Recombinant gp29 assembles into the regular tubular structures (polysheaths) of variable length. Trypsin digestion of gp29 within polysheaths or extended sheath of virion results in specific cleavage of the peptide bond between Arg135 and Asp136. However, this cleavage does not affect polymeric structure of polysheaths, sheaths and viral infectivity. Digestion bymore » trypsin of the C-truncated gp29 mutant, lacking the ability to self-assemble, results in formation of a stable protease-resistant fragment. Although there is no sequence homology of phiKZ proteins to proteins of other bacteriophages, some characteristic biochemical properties of gp29 revealed similarities to the tail sheath protein of bacteriophage T4.« less
De novo design of protein homo-oligomers with modular hydrogen bond network-mediated specificity
Boyken, Scott E.; Chen, Zibo; Groves, Benjamin; Langan, Robert A.; Oberdorfer, Gustav; Ford, Alex; Gilmore, Jason; Xu, Chunfu; DiMaio, Frank; Pereira, Jose Henrique; Sankaran, Banumathi; Seelig, Georg; Zwart, Peter H.; Baker, David
2017-01-01
In nature, structural specificity in DNA and proteins is encoded quite differently: in DNA, specificity arises from modular hydrogen bonds in the core of the double helix, whereas in proteins, specificity arises largely from buried hydrophobic packing complemented by irregular peripheral polar interactions. Here we describe a general approach for designing a wide range of protein homo-oligomers with specificity determined by modular arrays of central hydrogen bond networks. We use the approach to design dimers, trimers, and tetramers consisting of two concentric rings of helices, including previously not seen triangular, square, and supercoiled topologies. X-ray crystallography confirms that the structures overall, and the hydrogen bond networks in particular, are nearly identical to the design models, and the networks confer interaction specificity in vivo. The ability to design extensive hydrogen bond networks with atomic accuracy is a milestone for protein design and enables the programming of protein interaction specificity for a broad range of synthetic biology applications. PMID:27151862