variable structure motifs: Topics by Science.gov

Sample records for variable structure motifs

Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

PubMed Central

Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

2013-01-01

The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545
Occurrence probability of structured motifs in random sequences.

PubMed

Robin, S; Daudin, J-J; Richard, H; Sagot, M-F; Schbath, S

2002-01-01

The problem of extracting from a set of nucleic acid sequences motifs which may have biological function is more and more important. In this paper, we are interested in particular motifs that may be implicated in the transcription process. These motifs, called structured motifs, are composed of two ordered parts separated by a variable distance and allowing for substitutions. In order to assess their statistical significance, we propose approximations of the probability of occurrences of such a structured motif in a given sequence. An application of our method to evaluate candidate promoters in E. coli and B. subtilis is presented. Simulations show the goodness of the approximations.
Identifying DNA-binding proteins using structural motifs and the electrostatic potential

PubMed Central

Shanahan, Hugh P.; Garcia, Mario A.; Jones, Susan; Thornton, Janet M.

2004-01-01

Robust methods to detect DNA-binding proteins from structures of unknown function are important for structural biology. This paper describes a method for identifying such proteins that (i) have a solvent accessible structural motif necessary for DNA-binding and (ii) a positive electrostatic potential in the region of the binding region. We focus on three structural motifs: helix–turn-helix (HTH), helix–hairpin–helix (HhH) and helix–loop–helix (HLH). We find that the combination of these variables detect 78% of proteins with an HTH motif, which is a substantial improvement over previous work based purely on structural templates and is comparable to more complex methods of identifying DNA-binding proteins. Similar true positive fractions are achieved for the HhH and HLH motifs. We see evidence of wide evolutionary diversity for DNA-binding proteins with an HTH motif, and much smaller diversity for those with an HhH or HLH motif. PMID:15356290
Isosteric And Non-Isosteric Base Pairs In RNA Motifs: Molecular Dynamics And Bioinformatics Study Of The Sarcin-Ricin Internal Loop

PubMed Central

Havrila, Marek; Réblová, Kamila; Zirbel, Craig L.; Leontis, Neocles B.; Šponer, Jiří

2013-01-01

The Sarcin-Ricin RNA motif (SR motif) is one of the most prominent recurrent RNA building blocks that occurs in many different RNA contexts and folds autonomously, i.e., in a context-independent manner. In this study, we combined bioinformatics analysis with explicit-solvent molecular dynamics (MD) simulations to better understand the relation between the RNA sequence and the evolutionary patterns of SR motif. SHAPE probing experiment was also performed to confirm fidelity of MD simulations. We identified 57 instances of the SR motif in a non-redundant subset of the RNA X-ray structure database and analyzed their basepairing, base-phosphate, and backbone-backbone interactions. We extracted sequences aligned to these instances from large ribosomal RNA alignments to determine frequency of occurrence for different sequence variants. We then used a simple scoring scheme based on isostericity to suggest 10 sequence variants with highly variable expected degree of compatibility with the SR motif 3D structure. We carried out MD simulations of SR motifs with these base substitutions. Non isosteric base substitutions led to unstable structures, but so did isosteric substitutions which were unable to make key base-phosphate interactions. MD technique explains why some potentially isosteric SR motifs are not realized during evolution. We also found that inability to form stable cWW geometry is an important factor in case of the first base pair of the flexible region of the SR motif. Comparison of structural, bioinformatics, SHAPE probing and MD simulation data reveals that explicit solvent MD simulations neatly reflect viability of different sequence variants of the SR motif. Thus, MD simulations can efficiently complement bioinformatics tools in studies of conservation patterns of RNA motifs and provide atomistic insight into the role of their different signature interactions. PMID:24144333
Mining protein loops using a structural alphabet and statistical exceptionality

PubMed Central

2010-01-01

Background Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding site, catalytic pocket... However, the description of protein loops with conventional tools is an uneasy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. Results We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words out of 28274 observed words). These structural words have low structural variability (mean RMSd of 0.85 Å). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops. Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. Conclusions We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/. PMID:20132552
Mining protein loops using a structural alphabet and statistical exceptionality.

PubMed

Regad, Leslie; Martin, Juliette; Nuel, Gregory; Camproux, Anne-Claude

2010-02-04

Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding site, catalytic pocket... However, the description of protein loops with conventional tools is an uneasy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words out of 28274 observed words). These structural words have low structural variability (mean RMSd of 0.85 A). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops. Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/.
Modeling of DNA local parameters predicts encrypted architectural motifs in Xenopus laevis ribosomal gene promoter.

PubMed

Roux-Rouquie, M; Marilley, M

2000-09-15

We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed.
Modeling of DNA local parameters predicts encrypted architectural motifs in Xenopus laevis ribosomal gene promoter

PubMed Central

Roux-Rouquie, Magali; Marilley, Monique

2000-01-01

We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X.laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed. PMID:10982860
Unusual sugar specificity of banana lectin from Musa paradisiaca and its probable evolutionary origin. Crystallographic and modelling studies.

PubMed

Singh, D D; Saikrishnan, K; Kumar, Prashant; Surolia, A; Sekar, K; Vijayan, M

2005-10-01

The crystal structure of a complex of methyl-alpha-D-mannoside with banana lectin from Musa paradisiaca reveals two primary binding sites in the lectin, unlike in other lectins with beta-prism I fold which essentially consists of three Greek key motifs. It has been suggested that the fold evolved through successive gene duplication and fusion of an ancestral Greek key motif. In other lectins, all from dicots, the primary binding site exists on one of the three motifs in the three-fold symmetric molecule. Banana is a monocot, and the three motifs have not diverged enough to obliterate sequence similarity among them. Two Greek key motifs in it carry one primary binding site each. A common secondary binding site exists on the third Greek key. Modelling shows that both the primary sites can support 1-2, 1-3, and 1-6 linked mannosides with the second residue interacting in each case primarily with the secondary binding site. Modelling also readily leads to a bound branched mannopentose with the nonreducing ends of the two branches anchored at the two primary binding sites, providing a structural explanation for the lectin's specificity for branched alpha-mannans. A comparison of the dimeric banana lectin with other beta-prism I fold lectins, provides interesting insights into the variability in their quaternary structure.
Do motifs reflect evolved function?--No convergent evolution of genetic regulatory network subgraph topologies.

PubMed

Knabe, Johannes F; Nehaniv, Chrystopher L; Schilstra, Maria J

2008-01-01

Methods that analyse the topological structure of networks have recently become quite popular. Whether motifs (subgraph patterns that occur more often than in randomized networks) have specific functions as elementary computational circuits has been cause for debate. As the question is difficult to resolve with currently available biological data, we approach the issue using networks that abstractly model natural genetic regulatory networks (GRNs) which are evolved to show dynamical behaviors. Specifically one group of networks was evolved to be capable of exhibiting two different behaviors ("differentiation") in contrast to a group with a single target behavior. In both groups we find motif distribution differences within the groups to be larger than differences between them, indicating that evolutionary niches (target functions) do not necessarily mold network structure uniquely. These results show that variability operators can have a stronger influence on network topologies than selection pressures, especially when many topologies can create similar dynamics. Moreover, analysis of motif functional relevance by lesioning did not suggest that motifs were of greater importance to the functioning of the network than arbitrary subgraph patterns. Only when drastically restricting network size, so that one motif corresponds to a whole functionally evolved network, was preference for particular connection patterns found. This suggests that in non-restricted, bigger networks, entanglement with the rest of the network hinders topological subgraph analysis.
Mechanical properties and negative thermal expansion of a dense rare earth formate framework

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Zhanrui; Jiang, Xingxing; Feng, Guoqiang

The fundamental mechanical properties of a dense metal–organic framework material, [NH{sub 2}CHNH{sub 2}][Er(HCOO){sub 4}] (1), have been studied using nanoindentation technique. The results demonstrate that the elastic moduli, hardnesses, and yield stresses on the (021)/(02−1) facets are 29.8/30.2, 1.80/1.83 and 0.93/1.01 GPa, respectively. Moreover, variable-temperature powder and single-crystal X-ray diffraction experiments reveal that framework 1 shows significant negative thermal expansion along its b axis, which can be explained by using a hinge–strut structural motif. - Graphical abstract: The structure of framework, [NH{sub 2}CHNH{sub 2}][Er(HCOO){sub 4}], and its indicatrix of thermal expansion. - Highlights: • The elastic modulus, hardness, and yieldmore » stress properties of a rare earth metal–organic framework material were studied via nanoindentation technique. • Variable-temperature powder X-ray diffraction experiments reveal that this framework shows significant negative thermal expansion along its b axis. • Based on variable-temperature single-crystal X-ray diffraction experiments, the mechanism of negative thermal expansion can be explained by a hinge–strut structural motif.« less
A structural-alphabet-based strategy for finding structural motifs across protein families

PubMed Central

Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay

2010-01-01

Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797
Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.

PubMed

Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude

2011-06-20

One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.
Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs

PubMed Central

2011-01-01

Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins. PMID:21689388
SA-Mot: a web server for the identification of motifs of interest extracted from protein loops

PubMed Central

Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude

2011-01-01

The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr. PMID:21665924
SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.

PubMed

Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude

2011-07-01

The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.
The structure of the protein phosphatase 2A PR65/A subunit reveals the conformation of its 15 tandemly repeated HEAT motifs.

PubMed

Groves, M R; Hanlon, N; Turowski, P; Hemmings, B A; Barford, D

1999-01-08

The PR65/A subunit of protein phosphatase 2A serves as a scaffolding molecule to coordinate the assembly of the catalytic subunit and a variable regulatory B subunit, generating functionally diverse heterotrimers. Mutations of the beta isoform of PR65 are associated with lung and colon tumors. The crystal structure of the PR65/Aalpha subunit, at 2.3 A resolution, reveals the conformation of its 15 tandemly repeated HEAT sequences, degenerate motifs of approximately 39 amino acids present in a variety of proteins, including huntingtin and importin beta. Individual motifs are composed of a pair of antiparallel alpha helices that assemble in a mainly linear, repetitive fashion to form an elongated molecule characterized by a double layer of alpha helices. Left-handed rotations at three interrepeat interfaces generate a novel left-hand superhelical conformation. The protein interaction interface is formed from the intrarepeat turns that are aligned to form a continuous ridge.
RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching

NASA Astrophysics Data System (ADS)

Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.

Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isosteriCity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.
Antibody Light-Chain-Restricted Recognition of the Site of Immune Pressure in the RV144 HIV-1 Vaccine Trial Is Phylogenetically Conserved

DOE PAGES

Wiehe, Kevin; Easterhoff, David; Luo, Kan; ...

2014-11-29

In HIV-1, the ability to mount antibody responses to conserved, neutralizing epitopes is critical for protection. Here we have studied the light chain usage of human and rhesus macaque antibodies targeted to a dominant region of the HIV-1 envelope second variable (V2) region involving lysine (K) 169, the site of immune pressure in the RV144 vaccine efficacy trial. We found that humans and rhesus macaques used orthologous lambda variable gene segments encoding a glutamic acid-aspartic acid (ED) motif for K169 recognition. Structure determination of an unmutated ancestor antibody demonstrated that the V2 binding site was preconfigured for ED motif-mediated recognitionmore » prior to maturation. Thus, light chain usage for recognition of the site of immune pressure in the RV144 trial is highly conserved across species. In conclusion, these data indicate that the HIV-1 K169-recognizing ED motif has persisted over the diversification between rhesus macaques and humans, suggesting an evolutionary advantage of this antibody recognition mode.« less
Antibody Light-Chain-Restricted Recognition of the Site of Immune Pressure in the RV144 HIV-1 Vaccine Trial Is Phylogenetically Conserved

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wiehe, Kevin; Easterhoff, David; Luo, Kan

In HIV-1, the ability to mount antibody responses to conserved, neutralizing epitopes is critical for protection. Here we have studied the light chain usage of human and rhesus macaque antibodies targeted to a dominant region of the HIV-1 envelope second variable (V2) region involving lysine (K) 169, the site of immune pressure in the RV144 vaccine efficacy trial. We found that humans and rhesus macaques used orthologous lambda variable gene segments encoding a glutamic acid-aspartic acid (ED) motif for K169 recognition. Structure determination of an unmutated ancestor antibody demonstrated that the V2 binding site was preconfigured for ED motif-mediated recognitionmore » prior to maturation. Thus, light chain usage for recognition of the site of immune pressure in the RV144 trial is highly conserved across species. In conclusion, these data indicate that the HIV-1 K169-recognizing ED motif has persisted over the diversification between rhesus macaques and humans, suggesting an evolutionary advantage of this antibody recognition mode.« less

Comparative Analysis of P450 Signature Motifs EXXR and CXG in the Large and Diverse Kingdom of Fungi: Identification of Evolutionarily Conserved Amino Acid Patterns Characteristic of P450 Family

PubMed Central

Syed, Khajamohiddin; Mashele, Samson Sitheni

2014-01-01

Cytochrome P450 monooxygenases (P450s) are heme-thiolate proteins distributed across the biological kingdoms. P450s are catalytically versatile and play key roles in organisms primary and secondary metabolism. Identification of P450s across the biological kingdoms depends largely on the identification of two P450 signature motifs, EXXR and CXG, in the protein sequence. Once a putative protein has been identified as P450, it will be assigned to a family and subfamily based on the criteria that P450s within a family share more than 40% homology and members of subfamilies share more than 55% homology. However, to date, no evidence has been presented that can distinguish members of a P450 family. Here, for the first time we report the identification of EXXR- and CXG-motifs-based amino acid patterns that are characteristic of the P450 family. Analysis of P450 signature motifs in the under-explored fungal P450s from four different phyla, ascomycota, basidiomycota, zygomycota and chytridiomycota, indicated that the EXXR motif is highly variable and the CXG motif is somewhat variable. The amino acids threonine and leucine are preferred as second and third amino acids in the EXXR motif and proline and glycine are preferred as second and third amino acids in the CXG motif in fungal P450s. Analysis of 67 P450 families from biological kingdoms such as plants, animals, bacteria and fungi showed conservation of a set of amino acid patterns characteristic of a particular P450 family in EXXR and CXG motifs. This suggests that during the divergence of P450 families from a common ancestor these amino acids patterns evolve and are retained in each P450 family as a signature of that family. The role of amino acid patterns characteristic of a P450 family in the structural and/or functional aspects of members of the P450 family is a topic for future research. PMID:24743800
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data

PubMed Central

Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa

2017-01-01

Abstract RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on Github and as a Docker image. PMID:28977546
Classification and assessment tools for structural motif discovery algorithms.

PubMed

Badr, Ghada; Al-Turaiki, Isra; Mathkour, Hassan

2013-01-01

Motif discovery is the problem of finding recurring patterns in biological data. Patterns can be sequential, mainly when discovered in DNA sequences. They can also be structural (e.g. when discovering RNA motifs). Finding common structural patterns helps to gain a better understanding of the mechanism of action (e.g. post-transcriptional regulation). Unlike DNA motifs, which are sequentially conserved, RNA motifs exhibit conservation in structure, which may be common even if the sequences are different. Over the past few years, hundreds of algorithms have been developed to solve the sequential motif discovery problem, while less work has been done for the structural case. In this paper, we survey, classify, and compare different algorithms that solve the structural motif discovery problem, where the underlying sequences may be different. We highlight their strengths and weaknesses. We start by proposing a benchmark dataset and a measurement tool that can be used to evaluate different motif discovery approaches. Then, we proceed by proposing our experimental setup. Finally, results are obtained using the proposed benchmark to compare available tools. To the best of our knowledge, this is the first attempt to compare tools solely designed for structural motif discovery. Results show that the accuracy of discovered motifs is relatively low. The results also suggest a complementary behavior among tools where some tools perform well on simple structures, while other tools are better for complex structures. We have classified and evaluated the performance of available structural motif discovery tools. In addition, we have proposed a benchmark dataset with tools that can be used to evaluate newly developed tools.
Motivated Proteins: A web application for studying small three-dimensional protein motifs

PubMed Central

Leader, David P; Milner-White, E James

2009-01-01

Background Small loop-shaped motifs are common constituents of the three-dimensional structure of proteins. Typically they comprise between three and seven amino acid residues, and are defined by a combination of dihedral angles and hydrogen bonding partners. The most abundant of these are αβ-motifs, asx-motifs, asx-turns, β-bulges, β-bulge loops, β-turns, nests, niches, Schellmann loops, ST-motifs, ST-staples and ST-turns. We have constructed a database of such motifs from a range of high-quality protein structures and built a web application as a visual interface to this. Description The web application, Motivated Proteins, provides access to these 12 motifs (with 48 sub-categories) in a database of over 400 representative proteins. Queries can be made for specific categories or sub-categories of motif, motifs in the vicinity of ligands, motifs which include part of an enzyme active site, overlapping motifs, or motifs which include a particular amino acid sequence. Individual proteins can be specified, or, where appropriate, motifs for all proteins listed. The results of queries are presented in textual form as an (X)HTML table, and may be saved as parsable plain text or XML. Motifs can be viewed and manipulated either individually or in the context of the protein in the Jmol applet structural viewer. Cartoons of the motifs imposed on a linear representation of protein secondary structure are also provided. Summary information for the motifs is available, as are histograms of amino acid distribution, and graphs of dihedral angles at individual positions in the motifs. Conclusion Motivated Proteins is a publicly and freely accessible web application that enables protein scientists to study small three-dimensional motifs without requiring knowledge of either Structured Query Language or the underlying database schema. PMID:19210785
Detection of core-periphery structure in networks based on 3-tuple motifs

NASA Astrophysics Data System (ADS)

Ma, Chuang; Xiang, Bing-Bing; Chen, Han-Shuang; Small, Michael; Zhang, Hai-Feng

2018-05-01

Detecting mesoscale structure, such as community structure, is of vital importance for analyzing complex networks. Recently, a new mesoscale structure, core-periphery (CP) structure, has been identified in many real-world systems. In this paper, we propose an effective algorithm for detecting CP structure based on a 3-tuple motif. In this algorithm, we first define a 3-tuple motif in terms of the patterns of edges as well as the property of nodes, and then a motif adjacency matrix is constructed based on the 3-tuple motif. Finally, the problem is converted to find a cluster that minimizes the smallest motif conductance. Our algorithm works well in different CP structures: including single or multiple CP structure, and local or global CP structures. Results on the synthetic and the empirical networks validate the high performance of our method.
Network motif frequency vectors reveal evolving metabolic network organisation.

PubMed

Pearcy, Nicole; Crofts, Jonathan J; Chuzhanova, Nadia

2015-01-01

At the systems level many organisms of interest may be described by their patterns of interaction, and as such, are perhaps best characterised via network or graph models. Metabolic networks, in particular, are fundamental to the proper functioning of many important biological processes, and thus, have been widely studied over the past decade or so. Such investigations have revealed a number of shared topological features, such as a short characteristic path-length, large clustering coefficient and hierarchical modular structure. However, the extent to which evolutionary and functional properties of metabolism manifest via this underlying network architecture remains unclear. In this paper, we employ a novel graph embedding technique, based upon low-order network motifs, to compare metabolic network structure for 383 bacterial species categorised according to a number of biological features. In particular, we introduce a new global significance score which enables us to quantify important evolutionary relationships that exist between organisms and their physical environments. Using this new approach, we demonstrate a number of significant correlations between environmental factors, such as growth conditions and habitat variability, and network motif structure, providing evidence that organism adaptability leads to increased complexities in the resultant metabolic networks.
Effect of C(60) fullerene on the duplex formation of i-motif DNA with complementary DNA in solution.

PubMed

Jin, Kyeong Sik; Shin, Su Ryon; Ahn, Byungcheol; Jin, Sangwoo; Rho, Yecheol; Kim, Heesoo; Kim, Seon Jeong; Ree, Moonhor

2010-04-15

The structural effects of fullerene on i-motif DNA were investigated by characterizing the structures of fullerene-free and fullerene-bound i-motif DNA, in the presence of cDNA and in solutions of varying pH, using circular dichroism and synchrotron small-angle X-ray scattering. To facilitate a direct structural comparison between the i-motif and duplex structures in response to pH stimulus, we developed atomic scale structural models for the duplex and i-motif DNA structures, and for the C(60)/i-motif DNA hybrid associated with the cDNA strand, assuming that the DNA strands are present in an ideal right-handed helical conformation. We found that fullerene shifted the pH-induced conformational transition between the i-motif and the duplex structure, possibly due to the hydrophobic interactions between the terminal fullerenes and between the terminal fullerenes and an internal TAA loop in the DNA strand. The hybrid structure showed a dramatic reduction in cyclic hysteresis.
Functional structural motifs for protein-ligand, protein-protein, and protein-nucleic acid interactions and their connection to supersecondary structures.

PubMed

Kinjo, Akira R; Nakamura, Haruki

2013-01-01

Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has particularly high propensity to be found interaction interfaces in general, indicating any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.
I-motif DNA structures are formed in the nuclei of human cells

NASA Astrophysics Data System (ADS)

Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel

2018-06-01

Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.
Spike Pattern Structure Influences Synaptic Efficacy Variability under STDP and Synaptic Homeostasis. I: Spike Generating Models on Converging Motifs

PubMed Central

Bi, Zedong; Zhou, Changsong

2016-01-01

In neural systems, synaptic plasticity is usually driven by spike trains. Due to the inherent noises of neurons and synapses as well as the randomness of connection details, spike trains typically exhibit variability such as spatial randomness and temporal stochasticity, resulting in variability of synaptic changes under plasticity, which we call efficacy variability. How the variability of spike trains influences the efficacy variability of synapses remains unclear. In this paper, we try to understand this influence under pair-wise additive spike-timing dependent plasticity (STDP) when the mean strength of plastic synapses into a neuron is bounded (synaptic homeostasis). Specifically, we systematically study, analytically and numerically, how four aspects of statistical features, i.e., synchronous firing, burstiness/regularity, heterogeneity of rates and heterogeneity of cross-correlations, as well as their interactions influence the efficacy variability in converging motifs (simple networks in which one neuron receives from many other neurons). Neurons (including the post-synaptic neuron) in a converging motif generate spikes according to statistical models with tunable parameters. In this way, we can explicitly control the statistics of the spike patterns, and investigate their influence onto the efficacy variability, without worrying about the feedback from synaptic changes onto the dynamics of the post-synaptic neuron. We separate efficacy variability into two parts: the drift part (DriftV) induced by the heterogeneity of change rates of different synapses, and the diffusion part (DiffV) induced by weight diffusion caused by stochasticity of spike trains. Our main findings are: (1) synchronous firing and burstiness tend to increase DiffV, (2) heterogeneity of rates induces DriftV when potentiation and depression in STDP are not balanced, and (3) heterogeneity of cross-correlations induces DriftV together with heterogeneity of rates. We anticipate our work important for understanding functional processes of neuronal networks (such as memory) and neural development. PMID:26941634
GrammarViz 3.0: Interactive Discovery of Variable-Length Time Series Patterns

DOE PAGES

Senin, Pavel; Lin, Jessica; Wang, Xing; ...

2018-02-23

The problems of recurrent and anomalous pattern discovery in time series, e.g., motifs and discords, respectively, have received a lot of attention from researchers in the past decade. However, since the pattern search space is usually intractable, most existing detection algorithms require that the patterns have discriminative characteristics and have its length known in advance and provided as input, which is an unreasonable requirement for many real-world problems. In addition, patterns of similar structure, but of different lengths may co-exist in a time series. In order to address these issues, we have developed algorithms for variable-length time series pattern discoverymore » that are based on symbolic discretization and grammar inference—two techniques whose combination enables the structured reduction of the search space and discovery of the candidate patterns in linear time. In this work, we present GrammarViz 3.0—a software package that provides implementations of proposed algorithms and graphical user interface for interactive variable-length time series pattern discovery. The current version of the software provides an alternative grammar inference algorithm that improves the time series motif discovery workflow, and introduces an experimental procedure for automated discretization parameter selection that builds upon the minimum cardinality maximum cover principle and aids the time series recurrent and anomalous pattern discovery.« less
GrammarViz 3.0: Interactive Discovery of Variable-Length Time Series Patterns

DOE Office of Scientific and Technical Information (OSTI.GOV)

Senin, Pavel; Lin, Jessica; Wang, Xing

The problems of recurrent and anomalous pattern discovery in time series, e.g., motifs and discords, respectively, have received a lot of attention from researchers in the past decade. However, since the pattern search space is usually intractable, most existing detection algorithms require that the patterns have discriminative characteristics and have its length known in advance and provided as input, which is an unreasonable requirement for many real-world problems. In addition, patterns of similar structure, but of different lengths may co-exist in a time series. In order to address these issues, we have developed algorithms for variable-length time series pattern discoverymore » that are based on symbolic discretization and grammar inference—two techniques whose combination enables the structured reduction of the search space and discovery of the candidate patterns in linear time. In this work, we present GrammarViz 3.0—a software package that provides implementations of proposed algorithms and graphical user interface for interactive variable-length time series pattern discovery. The current version of the software provides an alternative grammar inference algorithm that improves the time series motif discovery workflow, and introduces an experimental procedure for automated discretization parameter selection that builds upon the minimum cardinality maximum cover principle and aids the time series recurrent and anomalous pattern discovery.« less
Identifying the scale-dependent motifs in atmospheric surface layer by ordinal pattern analysis

NASA Astrophysics Data System (ADS)

Li, Qinglei; Fu, Zuntao

2018-07-01

Ramp-like structures in various atmospheric surface layer time series have been long studied, but the presence of motifs with the finer scale embedded within larger scale ramp-like structures has largely been overlooked in the reported literature. Here a novel, objective and well-adapted methodology, the ordinal pattern analysis, is adopted to study the finer-scaled motifs in atmospheric boundary-layer (ABL) time series. The studies show that the motifs represented by different ordinal patterns take clustering properties and 6 dominated motifs out of the whole 24 motifs account for about 45% of the time series under particular scales, which indicates the higher contribution of motifs with the finer scale to the series. Further studies indicate that motif statistics are similar for both stable conditions and unstable conditions at larger scales, but large discrepancies are found at smaller scales, and the frequencies of motifs "1234" and/or "4321" are a bit higher under stable conditions than unstable conditions. Under stable conditions, there are great changes for the occurrence frequencies of motifs "1234" and "4321", where the occurrence frequencies of motif "1234" decrease from nearly 24% to 4.5% with the scale factor increasing, and the occurrence frequencies of motif "4321" change nonlinearly with the scale increasing. These great differences of dominated motifs change with scale can be taken as an indicator to quantify the flow structure changes under different stability conditions, and motif entropy can be defined just by only 6 dominated motifs to quantify this time-scale independent property of the motifs. All these results suggest that the defined scale of motifs with the finer scale should be carefully taken into consideration in the interpretation of turbulence coherent structures.
Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

PubMed Central

Kinjo, Akira R.; Nakamura, Haruki

2012-01-01

Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478
Ligand and coactivator identity determines the requirement of the charge clamp for coactivation of the peroxisome proliferator-activated receptor gamma.

PubMed

Wu, Yifei; Chin, William W; Wang, Yong; Burris, Thomas P

2003-03-07

The activation function 2 (AF-2)-dependent recruitment of coactivator is essential for gene activation by nuclear receptors. We show that the peroxisome proliferator-activated receptor gamma (PPARgamma) (NR1C3) coactivator-1 (PGC-1) requires both the intact AF-2 domain of PPARgamma and the LXXLL domain of PGC-1 for ligand-dependent and ligand-independent interaction and coactivation. Although the AF-2 domain of PPARgamma is absolutely required for PGC-1-mediated coactivation, this coactivator displayed a unique lack of requirement for the charge clamp of the ligand-binding domain of the receptor that is thought to be essential for LXXLL motif recognition. The mutation of a single serine residue adjacent to the core LXXLL motif of PGC-1 led to restoration of the typical charge clamp requirement. Thus, the unique structural features of the PGC-1 LXXLL motif appear to mediate an atypical mode of interaction with PPARgamma. Unexpectedly, we discovered that various ligands display variability in terms of their requirement for the charge clamp of PPARgamma for coactivation by PGC-1. This ligand-selective variable requirement for the charge clamp was coactivator-specific. Thus, distinct structural determinants, which may be unique for a particular ligand, are utilized by the receptor to recognize the coactivator. Our data suggest that even subtle differences in ligand structure are perceived by the receptor and translated into a unique display of the coactivator-binding surface of the ligand-binding domain, allowing for differential recognition of coactivators that may underlie distinct pharmacological profiles observed for ligands of a particular nuclear receptor.
Motif discovery with data mining in 3D protein structure databases: discovery, validation and prediction of the U-shape zinc binding ("Huf-Zinc") motif.

PubMed

Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank

2013-02-01

Data mining in protein databases, derivatives from more fundamental protein 3D structure and sequence databases, has considerable unearthed potential for the discovery of sequence motif--structural motif--function relationships as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, a HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Biological network motif detection and evaluation

PubMed Central

2011-01-01

Background Molecular level of biological data can be constructed into system level of data as biological networks. Network motifs are defined as over-represented small connected subgraphs in networks and they have been used for many biological applications. Since network motif discovery involves computationally challenging processes, previous algorithms have focused on computational efficiency. However, we believe that the biological quality of network motifs is also very important. Results We define biological network motifs as biologically significant subgraphs and traditional network motifs are differentiated as structural network motifs in this paper. We develop five algorithms, namely, EDGEGO-BNM, EDGEBETWEENNESS-BNM, NMF-BNM, NMFGO-BNM and VOLTAGE-BNM, for efficient detection of biological network motifs, and introduce several evaluation measures including motifs included in complex, motifs included in functional module and GO term clustering score in this paper. Experimental results show that EDGEGO-BNM and EDGEBETWEENNESS-BNM perform better than existing algorithms and all of our algorithms are applicable to find structural network motifs as well. Conclusion We provide new approaches to finding network motifs in biological networks. Our algorithms efficiently detect biological network motifs and further improve existing algorithms to find high quality structural network motifs, which would be impossible using existing algorithms. The performances of the algorithms are compared based on our new evaluation measures in biological contexts. We believe that our work gives some guidelines of network motifs research for the biological networks. PMID:22784624
High-resolution profiling of linear B-cell epitopes from mucin-associated surface proteins (MASPs) of Trypanosoma cruzi during human infections

PubMed Central

Durante, Ignacio M.; La Spina, Pablo E.; Carmona, Santiago J.; Agüero, Fernán

2017-01-01

Background The Trypanosoma cruzi genome bears a huge family of genes and pseudogenes coding for Mucin-Associated Surface Proteins (MASPs). MASP molecules display a ‘mosaic’ structure, with highly conserved flanking regions and a strikingly variable central and mature domain made up of different combinations of a large repertoire of short sequence motifs. MASP molecules are highly expressed in mammal-dwelling stages of T. cruzi and may be involved in parasite-host interactions and/or in diverting the immune response. Methods/Principle findings High-density microarrays composed of fully overlapped 15mer peptides spanning the entire sequences of 232 non-redundant MASPs (~25% of the total MASP content) were screened with chronic Chagasic sera. This strategy led to the identification of 86 antigenic motifs, each one likely representing a single linear B-cell epitope, which were mapped to 69 different MASPs. These motifs could be further grouped into 31 clusters of structurally- and likely antigenically-related sequences, and fully characterized. In contrast to previous reports, we show that MASP antigenic motifs are restricted to the central and mature region of MASP polypeptides, consistent with their intracellular processing. The antigenicity of these motifs displayed significant positive correlation with their genome dosage and their relative position within the MASP polypeptide. In addition, we verified the biased genetic co-occurrence of certain antigenic motifs within MASP polypeptides, compatible with proposed intra-family recombination events underlying the evolution of their coding genes. Sequences spanning 7 MASP antigenic motifs were further evaluated using distinct synthesis/display approaches and a large panel of serum samples. Overall, the serological recognition of MASP antigenic motifs exhibited a remarkable non normal distribution among the T. cruzi seropositive population, thus reducing their applicability in conventional serodiagnosis. As previously observed in in vitro and animal infection models, immune signatures supported the concurrent expression of several MASPs during human infection. Conclusions/Significance In spite of their conspicuous expression and potential roles in parasite biology, this study constitutes the first unbiased, high-resolution profiling of linear B-cell epitopes from T. cruzi MASPs during human infection. PMID:28961244
Identifying novel sequence variants of RNA 3D motifs

PubMed Central

Zirbel, Craig L.; Roll, James; Sweeney, Blake A.; Petrov, Anton I.; Pirrung, Meg; Leontis, Neocles B.

2015-01-01

Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download. PMID:26130723
New structures of Fe3S for rare-earth-free permanent magnets

NASA Astrophysics Data System (ADS)

Yu, Shu; Zhao, Xin; Wu, Shunqing; Nguyen, Manh Cuong; Zhu, Zi-zhong; Wang, Cai-Zhuang; Ho, Kai-Ming

2018-02-01

We applied an adaptive genetic algorithm (AGA) to search for low-energy crystal structures of Fe3S. A number of structures with energies lower than that of the experimentally reported Pnma and I-4 structures have been obtained from our AGA searches. These low-energy structures can be classified as layer-motif and column-motif structures. In the column-motif structures, Fe atoms self-assemble into rods with a bcc type of underlying lattice, which are separated by the holes terminated by S atoms. In the layer-motif structures, the bulk Fe is broken into slabs of several layers passivated by S atoms. Magnetic property calculations showed that the column-motif structures exhibit reasonably high uniaxial magnetic anisotropy. In addition, we examined the effect of Co doping to Fe3S and found that magnetic anisotropy can be enhanced through Co doping.

An Efficient Scheme for Crystal Structure Prediction Based on Structural Motifs

DOE PAGES

Zhu, Zizhong; Wu, Ping; Wu, Shunqing; ...

2017-05-15

An efficient scheme based on structural motifs is proposed for the crystal structure prediction of materials. The key advantage of the present method comes in two fold: first, the degrees of freedom of the system are greatly reduced, since each structural motif, regardless of its size, can always be described by a set of parameters (R, θ, φ) with five degrees of freedom; second, the motifs could always appear in the predicted structures when the energies of the structures are relatively low. Both features make the present scheme a very efficient method for predicting desired materials. The method has beenmore » applied to the case of LiFePO 4, an important cathode material for lithium-ion batteries. Numerous new structures of LiFePO 4 have been found, compared to those currently available, available, demonstrating the reliability of the present methodology and illustrating the promise of the concept of structural motifs.« less
An Efficient Scheme for Crystal Structure Prediction Based on Structural Motifs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhu, Zizhong; Wu, Ping; Wu, Shunqing

An efficient scheme based on structural motifs is proposed for the crystal structure prediction of materials. The key advantage of the present method comes in two fold: first, the degrees of freedom of the system are greatly reduced, since each structural motif, regardless of its size, can always be described by a set of parameters (R, θ, φ) with five degrees of freedom; second, the motifs could always appear in the predicted structures when the energies of the structures are relatively low. Both features make the present scheme a very efficient method for predicting desired materials. The method has beenmore » applied to the case of LiFePO 4, an important cathode material for lithium-ion batteries. Numerous new structures of LiFePO 4 have been found, compared to those currently available, available, demonstrating the reliability of the present methodology and illustrating the promise of the concept of structural motifs.« less
De novo discovery of structural motifs in RNA 3D structures through clustering.

PubMed

Ge, Ping; Islam, Shahidul; Zhong, Cuncong; Zhang, Shaojie

2018-05-18

As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme.
FoldMiner and LOCK 2: protein structure comparison and motif discovery on the web.

PubMed

Shapiro, Jessica; Brutlag, Douglas

2004-07-01

The FoldMiner web server (http://foldminer.stanford.edu/) provides remote access to methods for protein structure alignment and unsupervised motif discovery. FoldMiner is unique among such algorithms in that it improves both the motif definition and the sensitivity of a structural similarity search by combining the search and motif discovery methods and using information from each process to enhance the other. In a typical run, a query structure is aligned to all structures in one of several databases of single domain targets in order to identify its structural neighbors and to discover a motif that is the basis for the similarity among the query and statistically significant targets. This process is fully automated, but options for manual refinement of the results are available as well. The server uses the Chime plugin and customized controls to allow for visualization of the motif and of structural superpositions. In addition, we provide an interface to the LOCK 2 algorithm for rapid alignments of a query structure to smaller numbers of user-specified targets.
Modeling protein homopolymeric repeats: possible polyglutamine structural motifs for Huntington's disease.

PubMed

Lathrop, R H; Casale, M; Tobias, D J; Marsh, J L; Thompson, L M

1998-01-01

We describe a prototype system (Poly-X) for assisting an expert user in modeling protein repeats. Poly-X reduces the large number of degrees of freedom required to specify a protein motif in complete atomic detail. The result is a small number of parameters that are easily understood by, and under the direct control of, a domain expert. The system was applied to the polyglutamine (poly-Q) repeat in the first exon of huntingtin, the gene implicated in Huntington's disease. We present four poly-Q structural motifs: two poly-Q beta-sheet motifs (parallel and antiparallel) that constitute plausible alternatives to a similar previously published poly-Q beta-sheet motif, and two novel poly-Q helix motifs (alpha-helix and pi-helix). To our knowledge, helical forms of polyglutamine have not been proposed before. The motifs suggest that there may be several plausible aggregation structures for the intranuclear inclusion bodies which have been found in diseased neurons, and may help in the effort to understand the structural basis for Huntington's disease.
SSMART: Sequence-structure motif identification for RNA-binding proteins.

PubMed

Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe

2018-06-11

RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.
Chemical Space Mapping and Structure-Activity Analysis of the ChEMBL Antiviral Compound Set.

PubMed

Klimenko, Kyrylo; Marcou, Gilles; Horvath, Dragos; Varnek, Alexandre

2016-08-22

Curation, standardization and data fusion of the antiviral information present in the ChEMBL public database led to the definition of a robust data set, providing an association of antiviral compounds to seven broadly defined antiviral activity classes. Generative topographic mapping (GTM) subjected to evolutionary tuning was then used to produce maps of the antiviral chemical space, providing an optimal separation of compound families associated with the different antiviral classes. The ability to pinpoint the specific spots occupied (responsibility patterns) on a map by various classes of antiviral compounds opened the way for a GTM-supported search for privileged structural motifs, typical for each antiviral class. The privileged locations of antiviral classes were analyzed in order to highlight underlying privileged common structural motifs. Unlike in classical medicinal chemistry, where privileged structures are, almost always, predefined scaffolds, privileged structural motif detection based on GTM responsibility patterns has the decisive advantage of being able to automatically capture the nature ("resolution detail"-scaffold, detailed substructure, pharmacophore pattern, etc.) of the relevant structural motifs. Responsibility patterns were found to represent underlying structural motifs of various natures-from very fuzzy (groups of various "interchangeable" similar scaffolds), to the classical scenario in medicinal chemistry (underlying motif actually being the scaffold), to very precisely defined motifs (specifically substituted scaffolds).
Mutations in repeating structural motifs of tropomyosin cause gain of function in skeletal muscle myopathy patients

PubMed Central

Marston, Steven; Memo, Massimiliano; Messer, Andrew; Papadaki, Maria; Nowak, Kristen; McNamara, Elyshia; Ong, Royston; El-Mezgueldi, Mohammed; Li, Xiaochuan; Lehman, William

2013-01-01

The congenital myopathies include a wide spectrum of clinically, histologically and genetically variable neuromuscular disorders many of which are caused by mutations in genes for sarcomeric proteins. Some congenital myopathy patients have a hypercontractile phenotype. Recent functional studies demonstrated that ACTA1 K326N and TPM2 ΔK7 mutations were associated with hypercontractility that could be explained by increased myofibrillar Ca2+ sensitivity. A recent structure of the complex of actin and tropomyosin in the relaxed state showed that both these mutations are located in the actin–tropomyosin interface. Tropomyosin is an elongated molecule with a 7-fold repeated motif of around 40 amino acids corresponding to the 7 actin monomers it interacts with. Actin binds to tropomyosin electrostatically at two points, through Asp25 and through a cluster of amino acids that includes Lys326, mutated in the gain-of-function mutation. Asp25 interacts with tropomyosin K6, next to K7 that was mutated in the other gain-of-function mutation. We identified four tropomyosin motifs interacting with Asp25 (K6-K7, K48-K49, R90-R91 and R167-K168) and three E-E/D-K/R motifs interacting with Lys326 (E139, E181 and E218), and we predicted that the known skeletal myopathy mutations ΔK7, ΔK49, R91G, ΔE139, K168E and E181K would cause a gain of function. Tests by an in vitro motility assay confirmed that these mutations increased Ca2+ sensitivity, while mutations not in these motifs (R167H, R244G) decreased Ca2+ sensitivity. The work reported here explains the molecular mechanism for 6 out of 49 known disease-causing mutations in the TPM2 and TPM3 genes, derived from structural data of the actin–tropomyosin interface. PMID:23886664
Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets

PubMed Central

2012-01-01

Background To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. Results We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. Conclusions SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery. PMID:23281852
Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets.

PubMed

Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon

2012-01-01

To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.
A novel swarm intelligence algorithm for finding DNA motifs.

PubMed

Lei, Chengwei; Ruan, Jianhua

2009-01-01

Discovering DNA motifs from co-expressed or co-regulated genes is an important step towards deciphering complex gene regulatory networks and understanding gene functions. Despite significant improvement in the last decade, it still remains one of the most challenging problems in computational molecular biology. In this work, we propose a novel motif finding algorithm that finds consensus patterns using a population-based stochastic optimisation technique called Particle Swarm Optimisation (PSO), which has been shown to be effective in optimising difficult multidimensional problems in continuous domains. We propose to use a word dissimilarity graph to remap the neighborhood structure of the solution space of DNA motifs, and propose a modification of the naive PSO algorithm to accommodate discrete variables. In order to improve efficiency, we also propose several strategies for escaping from local optima and for automatically determining the termination criteria. Experimental results on simulated challenge problems show that our method is both more efficient and more accurate than several existing algorithms. Applications to several sets of real promoter sequences also show that our approach is able to detect known transcription factor binding sites, and outperforms two of the most popular existing algorithms.
Systematic comparison of the response properties of protein and RNA mediated gene regulatory motifs.

PubMed

Iyengar, Bharat Ravi; Pillai, Beena; Venkatesh, K V; Gadgil, Chetan J

2017-05-30

We present a framework enabling the dissection of the effects of motif structure (feedback or feedforward), the nature of the controller (RNA or protein), and the regulation mode (transcriptional, post-transcriptional or translational) on the response to a step change in the input. We have used a common model framework for gene expression where both motif structures have an activating input and repressing regulator, with the same set of parameters, to enable a comparison of the responses. We studied the global sensitivity of the system properties, such as steady-state gain, overshoot, peak time, and peak duration, to parameters. We find that, in all motifs, overshoot correlated negatively whereas peak duration varied concavely with peak time. Differences in the other system properties were found to be mainly dependent on the nature of the controller rather than the motif structure. Protein mediated motifs showed a higher degree of adaptation i.e. a tendency to return to baseline levels; in particular, feedforward motifs exhibited perfect adaptation. RNA mediated motifs had a mild regulatory effect; they also exhibited a lower peaking tendency and mean overshoot. Protein mediated feedforward motifs showed higher overshoot and lower peak time compared to the corresponding feedback motifs.
A novel motif in the yeast mitochondrial dynamin Dnm1 is essential for adaptor binding and membrane recruitment

PubMed Central

Bui, Huyen T.; Karren, Mary A.; Bhar, Debjani

2012-01-01

To initiate mitochondrial fission, dynamin-related proteins (DRPs) must bind specific adaptors on the outer mitochondrial membrane. The structural features underlying this interaction are poorly understood. Using yeast as a model, we show that the Insert B domain of the Dnm1 guanosine triphosphatase (a DRP) contains a novel motif required for association with the mitochondrial adaptor Mdv1. Mutation of this conserved motif specifically disrupted Dnm1–Mdv1 interactions, blocking Dnm1 recruitment and mitochondrial fission. Suppressor mutations in Mdv1 that restored Dnm1–Mdv1 interactions and fission identified potential protein-binding interfaces on the Mdv1 β-propeller domain. These results define the first known function for Insert B in DRP–adaptor interactions. Based on the variability of Insert B sequences and adaptor proteins, we propose that Insert B domains and mitochondrial adaptors have coevolved to meet the unique requirements for mitochondrial fission of different organisms. PMID:23148233
Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.

PubMed

Gade, Chandrasekhar Reddy; Sharma, Nagendra K

2017-12-15

This report describes the synthesis of C-rich sequence, cytosine pentamer, of aep-PNA and its biophysical studies for the formation of hybrid DNA:aep-PNAi-motif structure with DNA cytosine pentamer (dC 5 ) under acidic pH conditions. Herein, the CD/UV/NMR/ESI-Mass studies strongly support the formation of stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence aep-PNA C-rich sequence cytosine could be considered as potential DNA i-motif stabilizing agents in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.
RNA Bricks—a database of RNA 3D motifs and their interactions

PubMed Central

Chojnowski, Grzegorz; Waleń, Tomasz; Bujnicki, Janusz M.

2014-01-01

The RNA Bricks database (http://iimcb.genesilico.pl/rnabricks), stores information about recurrent RNA 3D motifs and their interactions, found in experimentally determined RNA structures and in RNA–protein complexes. In contrast to other similar tools (RNA 3D Motif Atlas, RNA Frabase, Rloom) RNA motifs, i.e. ‘RNA bricks’ are presented in the molecular environment, in which they were determined, including RNA, protein, metal ions, water molecules and ligands. All nucleotide residues in RNA bricks are annotated with structural quality scores that describe real-space correlation coefficients with the electron density data (if available), backbone geometry and possible steric conflicts, which can be used to identify poorly modeled residues. The database is also equipped with an algorithm for 3D motif search and comparison. The algorithm compares spatial positions of backbone atoms of the user-provided query structure and of stored RNA motifs, without relying on sequence or secondary structure information. This enables the identification of local structural similarities among evolutionarily related and unrelated RNA molecules. Besides, the search utility enables searching ‘RNA bricks’ according to sequence similarity, and makes it possible to identify motifs with modified ribonucleotide residues at specific positions. PMID:24220091
Helix-packing motifs in membrane proteins.

PubMed

Walters, R F S; DeGrado, W F

2006-09-12

The fold of a helical membrane protein is largely determined by interactions between membrane-imbedded helices. To elucidate recurring helix-helix interaction motifs, we dissected the crystallographic structures of membrane proteins into a library of interacting helical pairs. The pairs were clustered according to their three-dimensional similarity (rmsd
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data

PubMed Central

Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo

2018-01-01

RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
Insights into Structural and Mechanistic Features of Viral IRES Elements

PubMed Central

Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.

2018-01-01

Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Efficacy of function specific 3D-motifs in enzyme classification according to their EC-numbers.

PubMed

Rahimi, Amir; Madadkar-Sobhani, Armin; Touserkani, Rouzbeh; Goliaei, Bahram

2013-11-07

Due to the increasing number of protein structures with unknown function originated from structural genomics projects, protein function prediction has become an important subject in bioinformatics. Among diverse function prediction methods, exploring known 3D-motifs, which are associated with functional elements in unknown protein structures is one of the most biologically meaningful methods. Homologous enzymes inherit such motifs in their active sites from common ancestors. However, slight differences in the properties of these motifs, results in variation in the reactions and substrates of the enzymes. In this study, we examined the possibility of discriminating highly related active site patterns according to their EC-numbers by 3D-motifs. For each EC-number, the spatial arrangement of an active site, which has minimum average distance to other active sites with the same function, was selected as a representative 3D-motif. In order to characterize the motifs, various points in active site elements were tested. The results demonstrated the possibility of predicting full EC-number of enzymes by 3D-motifs. However, the discriminating power of 3D-motifs varies among different enzyme families and depends on selecting the appropriate points and features. © 2013 Elsevier Ltd. All rights reserved.
ELM: the status of the 2010 eukaryotic linear motif resource

PubMed Central

Gould, Cathryn M.; Diella, Francesca; Via, Allegra; Puntervoll, Pål; Gemünd, Christine; Chabanis-Davidson, Sophie; Michael, Sushama; Sayadi, Ahmed; Bryne, Jan Christian; Chica, Claudia; Seiler, Markus; Davey, Norman E.; Haslam, Niall; Weatheritt, Robert J.; Budd, Aidan; Hughes, Tim; Paś, Jakub; Rychlewski, Leszek; Travé, Gilles; Aasland, Rein; Helmer-Citterich, Manuela; Linding, Rune; Gibson, Toby J.

2010-01-01

Linear motifs are short segments of multidomain proteins that provide regulatory functions independently of protein tertiary structure. Much of intracellular signalling passes through protein modifications at linear motifs. Many thousands of linear motif instances, most notably phosphorylation sites, have now been reported. Although clearly very abundant, linear motifs are difficult to predict de novo in protein sequences due to the difficulty of obtaining robust statistical assessments. The ELM resource at http://elm.eu.org/ provides an expanding knowledge base, currently covering 146 known motifs, with annotation that includes >1300 experimentally reported instances. ELM is also an exploratory tool for suggesting new candidates of known linear motifs in proteins of interest. Information about protein domains, protein structure and native disorder, cellular and taxonomic contexts is used to reduce or deprecate false positive matches. Results are graphically displayed in a ‘Bar Code’ format, which also displays known instances from homologous proteins through a novel ‘Instance Mapper’ protocol based on PHI-BLAST. ELM server output provides links to the ELM annotation as well as to a number of remote resources. Using the links, researchers can explore the motifs, proteins, complex structures and associated literature to evaluate whether candidate motifs might be worth experimental investigation. PMID:19920119

Computational study of stability of an H-H-type pseudoknot motif.

PubMed

Wang, Jun; Zhao, Yunjie; Wang, Jian; Xiao, Yi

2015-12-01

Motifs in RNA tertiary structures are important to their structural organizations and biological functions. Here we consider an H-H-type pseudoknot (HHpk) motif that consists of two hairpins connected by a junction loop and with kissing interactions between the two hairpin loops. Such a tertiary structural motif is recurrently found in RNA tertiary structures, but is difficult to predict computationally. So it is important to understand the mechanism of its formation and stability. Here we investigate the stability of the HHpk tertiary structure by using an all-atom molecular dynamics simulation. The results indicate that the HHpk tertiary structure is stable. However, it is found that this stability is not due to the helix-helix packing, as is usually expected, but is maintained by the combined action of the kissing hairpin loops and junctions, although the former plays the main role. Stable HHpk motifs may form structural platforms for the molecules to realize their biological functions. These results are useful for understanding the construction principle of RNA tertiary structures and structure prediction.
A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif

PubMed Central

Elengoe, Asita; Naser, Mohammed Abu; Hamdan, Salehhuddin

2015-01-01

Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD) of heat shock 70 kDa protein (PDB: 1HJO) with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD) simulation. Human DNA binding domain of p53 motif (SCMGGMNR) retrieved from UniProt (UniProtKB: P04637) was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 Kcal/mol and −9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy. PMID:26098630
A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif.

PubMed

Elengoe, Asita; Naser, Mohammed Abu; Hamdan, Salehhuddin

2015-01-01

Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD) of heat shock 70 kDa protein (PDB: 1HJO) with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD) simulation. Human DNA binding domain of p53 motif (SCMGGMNR) retrieved from UniProt (UniProtKB: P04637) was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were -0.44 Kcal/mol and -9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy.
Topological characteristics of helical repeat proteins.

PubMed

Groves, M R; Barford, D

1999-06-01

The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem repeats of an alpha-helical structural unit, creating extended superhelical structures that are ideally suited to create a protein recognition interface.
BEAM web server: a tool for structural RNA motif discovery.

PubMed

Pietrosanto, Marco; Adinolfi, Marta; Casula, Riccardo; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

2018-03-15

RNA structural motif finding is a relevant problem that becomes computationally hard when working on high-throughput data (e.g. eCLIP, PAR-CLIP), often represented by thousands of RNA molecules. Currently, the BEAM server is the only web tool capable to handle tens of thousands of RNA in input with a motif discovery procedure that is only limited by the current secondary structure prediction accuracies. The recently developed method BEAM (BEAr Motifs finder) can analyze tens of thousands of RNA molecules and identify RNA secondary structure motifs associated to a measure of their statistical significance. BEAM is extremely fast thanks to the BEAR encoding that transforms each RNA secondary structure in a string of characters. BEAM also exploits the evolutionary knowledge contained in a substitution matrix of secondary structure elements, extracted from the RFAM database of families of homologous RNAs. The BEAM web server has been designed to streamline data pre-processing by automatically handling folding and encoding of RNA sequences, giving users a choice for the preferred folding program. The server provides an intuitive and informative results page with the list of secondary structure motifs identified, the logo of each motif, its significance, graphic representation and information about its position in the RNA molecules sharing it. The web server is freely available at http://beam.uniroma2.it/ and it is implemented in NodeJS and Python with all major browsers supported. marco.pietrosanto@uniroma2.it. Supplementary data are available at Bioinformatics online.
New Structural and Functional Contexts of the Dx[DN]xDG Linear Motif: Insights into Evolution of Calcium-Binding Proteins

PubMed Central

Rigden, Daniel J.; Woodhead, Duncan D.; Wong, Prudence W. H.; Galperin, Michael Y.

2011-01-01

Binding of calcium ions (Ca2+) to proteins can have profound effects on their structure and function. Common roles of calcium binding include structure stabilization and regulation of activity. It is known that diverse families – EF-hands being one of at least twelve – use a Dx[DN]xDG linear motif to bind calcium in near-identical fashion. Here, four novel structural contexts for the motif are described. Existing experimental data for one of them, a thermophilic archaeal subtilisin, demonstrate for the first time a role for Dx[DN]xDG-bound calcium in protein folding. An integrin-like embedding of the motif in the blade of a β-propeller fold – here named the calcium blade – is discovered in structures of bacterial and fungal proteins. Furthermore, sensitive database searches suggest a common origin for the calcium blade in β-propeller structures of different sizes and a pan-kingdom distribution of these proteins. Factors favouring the multiple convergent evolution of the motif appear to include its general Asp-richness, the regular spacing of the Asp residues and the fact that change of Asp into Gly and vice versa can occur though a single nucleotide change. Among the known structural contexts for the Dx[DN]xDG motif, only the calcium blade and the EF-hand are currently found intracellularly in large numbers, perhaps because the higher extracellular concentration of Ca2+ allows for easier fixing of newly evolved motifs that have acquired useful functions. The analysis presented here will inform ongoing efforts toward prediction of similar calcium-binding motifs from sequence information alone. PMID:21720552
Transient α-helices in the disordered RPEL motifs of the serum response factor coactivator MKL1

NASA Astrophysics Data System (ADS)

Mizuguchi, Mineyuki; Fuju, Takahiro; Obita, Takayuki; Ishikawa, Mitsuru; Tsuda, Masaaki; Tabuchi, Akiko

2014-06-01

The megakaryoblastic leukemia 1 (MKL1) protein functions as a transcriptional coactivator of the serum response factor. MKL1 has three RPEL motifs (RPEL1, RPEL2, and RPEL3) in its N-terminal region. MKL1 binds to monomeric G-actin through RPEL motifs, and the dissociation of MKL1 from G-actin promotes the translocation of MKL1 to the nucleus. Although structural data are available for RPEL motifs of MKL1 in complex with G-actin, the structural characteristics of RPEL motifs in the free state have been poorly defined. Here we characterized the structures of free RPEL motifs using NMR and CD spectroscopy. NMR and CD measurements showed that free RPEL motifs are largely unstructured in solution. However, NMR analysis identified transient α-helices in the regions where helices α1 and α2 are induced upon binding to G-actin. Proline mutagenesis showed that the transient α-helices are locally formed without helix-helix interactions. The helix content is higher in the order of RPEL1, RPEL2, and RPEL3. The amount of preformed structure may correlate with the binding affinity between the intrinsically disordered protein and its target molecule.
Sequence Analysis and Domain Motifs in the Porcine Skin Decorin Glycosaminoglycan Chain*

PubMed Central

Zhao, Xue; Yang, Bo; Solakylidirim, Kemal; Joo, Eun Ji; Toida, Toshihiko; Higashi, Kyohei; Linhardt, Robert J.; Li, Lingyun

2013-01-01

Decorin proteoglycan is comprised of a core protein containing a single O-linked dermatan sulfate/chondroitin sulfate glycosaminoglycan (GAG) chain. Although the sequence of the decorin core protein is determined by the gene encoding its structure, the structure of its GAG chain is determined in the Golgi. The recent application of modern MS to bikunin, a far simpler chondroitin sulfate proteoglycans, suggests that it has a single or small number of defined sequences. On this basis, a similar approach to sequence the decorin of porcine skin much larger and more structurally complex dermatan sulfate/chondroitin sulfate GAG chain was undertaken. This approach resulted in information on the consistency/variability of its linkage region at the reducing end of the GAG chain, its iduronic acid-rich domain, glucuronic acid-rich domain, and non-reducing end. A general motif for the porcine skin decorin GAG chain was established. A single small decorin GAG chain was sequenced using MS/MS analysis. The data obtained in the study suggest that the decorin GAG chain has a small or a limited number of sequences. PMID:23423381
New structures of Fe3S for rare-earth-free permanent magnets

DOE PAGES

Yu, Shu; Zhao, Xin; Wu, Shunqing; ...

2018-02-25

We applied adaptive genetic algorithm (AGA) to search for low-energy crystal structures of Fe 3S. A number of structures with energies lower than that of the experimentally reported Pnma and I-4 structures have been obtained from our AGA searches. These low-energy structures can be classified as layer-motif and column-motif structures. In the column-motif structures, Fe atoms self-assemble into rods with bcc type of underlying lattice, which are separated by the holes terminated by S atoms. In the layer-motif structures, the bulk Fe is broken into slabs of several layers passivated by S atoms. Magnetic properties calculations showed that the column-motifmore » structures exhibit reasonably high uniaxial magnetic anisotropy. In addition, we examined the effect of Co doping to Fe 3S and found magnetic anisotropy can be enhanced through Co doping.« less
Form and function in gene regulatory networks: the structure of network motifs determines fundamental properties of their dynamical state space.

PubMed

Ahnert, S E; Fink, T M A

2016-07-01

Network motifs have been studied extensively over the past decade, and certain motifs, such as the feed-forward loop, play an important role in regulatory networks. Recent studies have used Boolean network motifs to explore the link between form and function in gene regulatory networks and have found that the structure of a motif does not strongly determine its function, if this is defined in terms of the gene expression patterns the motif can produce. Here, we offer a different, higher-level definition of the 'function' of a motif, in terms of two fundamental properties of its dynamical state space as a Boolean network. One is the basin entropy, which is a complexity measure of the dynamics of Boolean networks. The other is the diversity of cyclic attractor lengths that a given motif can produce. Using these two measures, we examine all 104 topologically distinct three-node motifs and show that the structural properties of a motif, such as the presence of feedback loops and feed-forward loops, predict fundamental characteristics of its dynamical state space, which in turn determine aspects of its functional versatility. We also show that these higher-level properties have a direct bearing on real regulatory networks, as both basin entropy and cycle length diversity show a close correspondence with the prevalence, in neural and genetic regulatory networks, of the 13 connected motifs without self-interactions that have been studied extensively in the literature. © 2016 The Authors.
Gibbs motif sampling: detection of bacterial outer membrane protein repeats.

PubMed Central

Neuwald, A. F.; Liu, J. S.; Lawrence, C. E.

1995-01-01

The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles. PMID:8520488
Optimized mixed Markov models for motif identification

PubMed Central

Huang, Weichun; Umbach, David M; Ohler, Uwe; Li, Leping

2006-01-01

Background Identifying functional elements, such as transcriptional factor binding sites, is a fundamental step in reconstructing gene regulatory networks and remains a challenging issue, largely due to limited availability of training samples. Results We introduce a novel and flexible model, the Optimized Mixture Markov model (OMiMa), and related methods to allow adjustment of model complexity for different motifs. In comparison with other leading methods, OMiMa can incorporate more than the NNSplice's pairwise dependencies; OMiMa avoids model over-fitting better than the Permuted Variable Length Markov Model (PVLMM); and OMiMa requires smaller training samples than the Maximum Entropy Model (MEM). Testing on both simulated and actual data (regulatory cis-elements and splice sites), we found OMiMa's performance superior to the other leading methods in terms of prediction accuracy, required size of training data or computational time. Our OMiMa system, to our knowledge, is the only motif finding tool that incorporates automatic selection of the best model. OMiMa is freely available at [1]. Conclusion Our optimized mixture of Markov models represents an alternative to the existing methods for modeling dependent structures within a biological motif. Our model is conceptually simple and effective, and can improve prediction accuracy and/or computational speed over other leading methods. PMID:16749929
The Methionine-aromatic Motif Plays a Unique Role in Stabilizing Protein Structure*

PubMed Central

Valley, Christopher C.; Cembran, Alessandro; Perlmutter, Jason D.; Lewis, Andrew K.; Labello, Nicholas P.; Gao, Jiali; Sachs, Jonathan N.

2012-01-01

Of the 20 amino acids, the precise function of methionine (Met) remains among the least well understood. To establish a determining characteristic of methionine that fundamentally differentiates it from purely hydrophobic residues, we have used in vitro cellular experiments, molecular simulations, quantum calculations, and a bioinformatics screen of the Protein Data Bank. We show that approximately one-third of all known protein structures contain an energetically stabilizing Met-aromatic motif and, remarkably, that greater than 10,000 structures contain this motif more than 10 times. Critically, we show that as compared with a purely hydrophobic interaction, the Met-aromatic motif yields an additional stabilization of 1–1.5 kcal/mol. To highlight its importance and to dissect the energetic underpinnings of this motif, we have studied two clinically relevant TNF ligand-receptor complexes, namely TRAIL-DR5 and LTα-TNFR1. In both cases, we show that the motif is necessary for high affinity ligand binding as well as function. Additionally, we highlight previously overlooked instances of the motif in several disease-related Met mutations. Our results strongly suggest that the Met-aromatic motif should be exploited in the rational design of therapeutics targeting a range of proteins. PMID:22859300
The crystal structure of the regulatory domain of the human sodium-driven chloride/bicarbonate exchanger.

PubMed

Alvadia, Carolina M; Sommer, Theis; Bjerregaard-Andersen, Kaare; Damkier, Helle Hasager; Montrasio, Michele; Aalkjaer, Christian; Morth, J Preben

2017-09-21

The sodium-driven chloride/bicarbonate exchanger (NDCBE) is essential for maintaining homeostatic pH in neurons. The crystal structure at 2.8 Å resolution of the regulatory N-terminal domain of human NDCBE represents the first crystal structure of an electroneutral sodium-bicarbonate cotransporter. The crystal structure forms an equivalent dimeric interface as observed for the cytoplasmic domain of Band 3, and thus establishes that the consensus motif VTVLP is the key minimal dimerization motif. The VTVLP motif is highly conserved and likely to be the physiologically relevant interface for all other members of the SLC4 family. A novel conserved Zn 2+ -binding motif present in the N-terminal domain of NDCBE is identified and characterized in vitro. Cellular studies confirm the Zn 2+ dependent transport of two electroneutral bicarbonate transporters, NCBE and NBCn1. The Zn 2+ site is mapped to a cluster of histidines close to the conserved ETARWLKFEE motif and likely plays a role in the regulation of this important motif. The combined structural and bioinformatics analysis provides a model that predicts with additional confidence the physiologically relevant interface between the cytoplasmic domain and the transmembrane domain.
Designing synthetic RNAs to determine the relevance of structural motifs in picornavirus IRES elements

NASA Astrophysics Data System (ADS)

Fernandez-Chamorro, Javier; Lozano, Gloria; Garcia-Martin, Juan Antonio; Ramajo, Jorge; Dotu, Ivan; Clote, Peter; Martinez-Salas, Encarnacion

2016-04-01

The function of Internal Ribosome Entry Site (IRES) elements is intimately linked to their RNA structure. Viral IRES elements are organized in modular domains consisting of one or more stem-loops that harbor conserved RNA motifs critical for internal initiation of translation. A conserved motif is the pyrimidine-tract located upstream of the functional initiation codon in type I and II picornavirus IRES. By computationally designing synthetic RNAs to fold into a structure that sequesters the polypyrimidine tract in a hairpin, we establish a correlation between predicted inaccessibility of the pyrimidine tract and IRES activity, as determined in both in vitro and in vivo systems. Our data supports the hypothesis that structural sequestration of the pyrimidine-tract within a stable hairpin inactivates IRES activity, since the stronger the stability of the hairpin the higher the inhibition of protein synthesis. Destabilization of the stem-loop immediately upstream of the pyrimidine-tract also decreases IRES activity. Our work introduces a hybrid computational/experimental method to determine the importance of structural motifs for biological function. Specifically, we show the feasibility of using the software RNAiFold to design synthetic RNAs with particular sequence and structural motifs that permit subsequent experimental determination of the importance of such motifs for biological function.
An experimental test of a fundamental food web motif.

PubMed

Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia

2010-06-07

Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure-the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths, is one such motif. This motif amazingly occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and through clear, experimental manipulation, identifies the fundamental stabilizing consequences of this structure for ecological communities.
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.

PubMed

Li, Sanshu; Breaker, Ronald R

2017-10-13

With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.
Crystal structure of EML1 reveals the basis for Hsp90 dependence of oncogenic EML4-ALK by disruption of an atypical β-propeller domain

PubMed Central

Richards, Mark W.; Law, Edward W. P.; Rennalls, La’Verne P.; Busacca, Sara; O’Regan, Laura; Fry, Andrew M.; Fennell, Dean A.; Bayliss, Richard

2014-01-01

Proteins of the echinoderm microtubule-associated protein (EMAP)-like (EML) family contribute to formation of the mitotic spindle and interphase microtubule network. They contain a unique hydrophobic EML protein (HELP) motif and a variable number of WD40 repeats. Recurrent gene rearrangements in nonsmall cell lung cancer fuse EML4 to anaplastic lymphoma kinase (ALK), causing expression of several fusion oncoprotein variants. We have determined a 2.6-Å crystal structure of the representative ∼70-kDa core of EML1, revealing an intimately associated pair of β-propellers, which we term a TAPE (tandem atypical propeller in EMLs) domain. One propeller is highly atypical, having a discontinuous subdomain unrelated to a WD40 motif in place of one of its blades. This unexpected feature shows how a propeller structure can be assembled from subdomains with distinct folds. The HELP motif is not an independent domain but forms part of the hydrophobic core that joins the two β-propellers. The TAPE domain binds α/β-tubulin via its conserved, concave surface, including part of the atypical blade. Mapping the characteristic breakpoints of each EML4-ALK variant onto our structure indicates that the EML4 TAPE domain is truncated in many variants in a manner likely to make the fusion protein structurally unstable. We found that the heat shock protein 90 (Hsp90) inhibitor ganetespib induced degradation of these variants whereas others lacking a partial TAPE domain were resistant in both overexpression models and patient-derived cell lines. The Hsp90-sensitive EML4-ALK variants are exceptions to the rule that oncogenic fusion proteins involve breakpoints in disordered regions of both partners. PMID:24706829
Crystal structure of EML1 reveals the basis for Hsp90 dependence of oncogenic EML4-ALK by disruption of an atypical β-propeller domain.

PubMed

Richards, Mark W; Law, Edward W P; Rennalls, La'Verne P; Busacca, Sara; O'Regan, Laura; Fry, Andrew M; Fennell, Dean A; Bayliss, Richard

2014-04-08

Proteins of the echinoderm microtubule-associated protein (EMAP)-like (EML) family contribute to formation of the mitotic spindle and interphase microtubule network. They contain a unique hydrophobic EML protein (HELP) motif and a variable number of WD40 repeats. Recurrent gene rearrangements in nonsmall cell lung cancer fuse EML4 to anaplastic lymphoma kinase (ALK), causing expression of several fusion oncoprotein variants. We have determined a 2.6-Å crystal structure of the representative ∼70-kDa core of EML1, revealing an intimately associated pair of β-propellers, which we term a TAPE (tandem atypical propeller in EMLs) domain. One propeller is highly atypical, having a discontinuous subdomain unrelated to a WD40 motif in place of one of its blades. This unexpected feature shows how a propeller structure can be assembled from subdomains with distinct folds. The HELP motif is not an independent domain but forms part of the hydrophobic core that joins the two β-propellers. The TAPE domain binds α/β-tubulin via its conserved, concave surface, including part of the atypical blade. Mapping the characteristic breakpoints of each EML4-ALK variant onto our structure indicates that the EML4 TAPE domain is truncated in many variants in a manner likely to make the fusion protein structurally unstable. We found that the heat shock protein 90 (Hsp90) inhibitor ganetespib induced degradation of these variants whereas others lacking a partial TAPE domain were resistant in both overexpression models and patient-derived cell lines. The Hsp90-sensitive EML4-ALK variants are exceptions to the rule that oncogenic fusion proteins involve breakpoints in disordered regions of both partners.
Identification of a novel calcium binding motif based on the detection of sequence insertions in the animal peroxidase domain of bacterial proteins.

PubMed

Santamaría-Hernando, Saray; Krell, Tino; Ramos-González, María-Isabel

2012-01-01

Proteins of the animal heme peroxidase (ANP) superfamily differ greatly in size since they have either one or two catalytic domains that match profile PS50292. The orf PP_2561 of Pseudomonas putida KT2440 that we have called PepA encodes a two-domain ANP. The alignment of these domains with those of PepA homologues revealed a variable number of insertions with the consensus G-x-D-G-x-x-[GN]-[TN]-x-D-D. This motif has also been detected in the structure of pseudopilin (pdb 3G20), where it was found to be involved in Ca(2+) coordination although a sequence analysis did not reveal the presence of any known calcium binding motifs in this protein. Isothermal titration calorimetry revealed that a peptide containing this consensus motif bound specifically calcium ions with affinities ranging between 33-79 µM depending on the pH. Microcalorimetric titrations of the purified N-terminal ANP-like domain of PepA revealed Ca(2+) binding with a K(D) of 12 µM and stoichiometry of 1.25 calcium ions per protein monomer. This domain exhibited peroxidase activity after its reconstitution with heme. These data led to the definition of a novel calcium binding motif that we have termed PERCAL and which was abundantly present in animal peroxidase-like domains of bacterial proteins. Bacterial heme peroxidases thus possess two different types of calcium binding motifs, namely PERCAL and the related hemolysin type calcium binding motif, with the latter being located outside the catalytic domains and in their C-terminal end. A phylogenetic tree of ANP-like catalytic domains of bacterial proteins with PERCAL motifs, including single domain peroxidases, was divided into two major clusters, representing domains with and without PERCAL motif containing insertions. We have verified that the recently reported classification of bacterial heme peroxidases in two families (cd09819 and cd09821) is unrelated to these insertions. Sequences matching PERCAL were detected in all kingdoms of life.

Finding the target sites of RNA-binding proteins

PubMed Central

Li, Xiao; Kazan, Hilal; Lipshitz, Howard D; Morris, Quaid D

2014-01-01

RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA-binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence-specific but are sensitive to secondary structure context. A number of small- and large-scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA-binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP-binding preferences. Genome-wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context-free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence-specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201 PMID:24217996
Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

PubMed

Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

2013-12-01

AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.
Common fold in helix–hairpin–helix proteins

PubMed Central

Shao, Xuguang; Grishin, Nick V.

2000-01-01

Helix–hairpin–helix (HhH) is a widespread motif involved in non-sequence-specific DNA binding. The majority of HhH motifs function as DNA-binding modules, however, some of them are used to mediate protein–protein interactions or have acquired enzymatic activity by incorporating catalytic residues (DNA glycosylases). From sequence and structural analysis of HhH-containing proteins we conclude that most HhH motifs are integrated as a part of a five-helical domain, termed (HhH)2 domain here. It typically consists of two consecutive HhH motifs that are linked by a connector helix and displays pseudo-2-fold symmetry. (HhH)2 domains show clear structural integrity and a conserved hydrophobic core composed of seven residues, one residue from each α-helix and each hairpin, and deserves recognition as a distinct protein fold. In addition to known HhH in the structures of RuvA, RadA, MutY and DNA-polymerases, we have detected new HhH motifs in sterile alpha motif and barrier-to-autointegration factor domains, the α-subunit of Escherichia coli RNA-polymerase, DNA-helicase PcrA and DNA glycosylases. Statistically significant sequence similarity of HhH motifs and pronounced structural conservation argue for homology between (HhH)2 domains in different protein families. Our analysis helps to clarify how non-symmetric protein motifs bind to the double helix of DNA through the formation of a pseudo-2-fold symmetric (HhH)2 functional unit. PMID:10908318
A relational extension of the notion of motifs: application to the common 3D protein substructures searching problem.

PubMed

Pisanti, Nadia; Soldano, Henry; Carpentier, Mathilde; Pothier, Joel

2009-12-01

The geometrical configurations of atoms in protein structures can be viewed as approximate relations among them. Then, finding similar common substructures within a set of protein structures belongs to a new class of problems that generalizes that of finding repeated motifs. The novelty lies in the addition of constraints on the motifs in terms of relations that must hold between pairs of positions of the motifs. We will hence denote them as relational motifs. For this class of problems, we present an algorithm that is a suitable extension of the KMR paradigm and, in particular, of the KMRC as it uses a degenerate alphabet. Our algorithm contains several improvements that become especially useful when-as it is required for relational motifs-the inference is made by partially overlapping shorter motifs, rather than concatenating them. The efficiency, correctness and completeness of the algorithm is ensured by several non-trivial properties that are proven in this paper. The algorithm has been applied in the important field of protein common 3D substructure searching. The methods implemented have been tested on several examples of protein families such as serine proteases, globins and cytochromes P450 additionally. The detected motifs have been compared to those found by multiple structural alignments methods.
Identification of helix capping and β-turn motifs from NMR chemical shifts

PubMed Central

Shen, Yang; Bax, Ad

2012-01-01

We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and 13Cβ chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of β-turns: I, II, I′, II′ and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and β-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7–0.9 for the Matthews correlation coefficient of its predictions far exceed that attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures. PMID:22314702
Ser/Thr Motifs in Transmembrane Proteins: Conservation Patterns and Effects on Local Protein Structure and Dynamics

PubMed Central

del Val, Coral; White, Stephen H.

2014-01-01

We combined systematic bioinformatics analyses and molecular dynamics simulations to assess the conservation patterns of Ser and Thr motifs in membrane proteins, and the effect of such motifs on the structure and dynamics of α-helical transmembrane (TM) segments. We find that Ser/Thr motifs are often present in β-barrel TM proteins. At least one Ser/Thr motif is present in almost half of the sequences of α-helical proteins analyzed here. The extensive bioinformatics analyses and inspection of protein structures led to the identification of molecular transporters with noticeable numbers of Ser/Thr motifs within the TM region. Given the energetic penalty for burying multiple Ser/Thr groups in the membrane hydrophobic core, the observation of transporters with multiple membrane-embedded Ser/Thr is intriguing and raises the question of how the presence of multiple Ser/Thr affects protein local structure and dynamics. Molecular dynamics simulations of four different Ser-containing model TM peptides indicate that backbone hydrogen bonding of membrane-buried Ser/Thr hydroxyl groups can significantly change the local structure and dynamics of the helix. Ser groups located close to the membrane interface can hydrogen bond to solvent water instead of protein backbone, leading to an enhanced local solvation of the peptide. PMID:22836667
Solution structure and base pair opening kinetics of the i-motif dimer of d(5mCCTTTACC): a noncanonical structure with possible roles in chromosome stability.

PubMed

Nonin, S; Phan, A T; Leroy, J L

1997-09-15

Repetitive cytosine-rich DNA sequences have been identified in telomeres and centromeres of eukaryotic chromosomes. These sequences play a role in maintaining chromosome stability during replication and may be involved in chromosome pairing during meiosis. The C-rich repeats can fold into an 'i-motif' structure, in which two parallel-stranded duplexes with hemiprotonated C.C+ pairs are intercalated. Previous NMR studies of naturally occurring repeats have produced poor NMR spectra. This led us to investigate oligonucleotides, based on natural sequences, to produce higher quality spectra and thus provide further information as to the structure and possible biological function of the i-motif. NMR spectroscopy has shown that d(5mCCTTTACC) forms an i-motif dimer of symmetry-related and intercalated folded strands. The high-definition structure is computed on the basis of the build-up rates of 29 intraresidue and 35 interresidue nuclear Overhauser effect (NOE) connectivities. The i-motif core includes intercalated interstrand C.C+ pairs stacked in the order 2*.8/1.7*/1*.7/2.8* (where one strand is distinguished by an asterisk and the numbers relate to the base positions within the repeat). The TTTA sequences form two loops which span the two wide grooves on opposite sides of the i-motif core; the i-motif core is extended at both ends by the stacking of A6 onto C2.C8+. The lifetimes of pairs C2.C8+ and 5mC1.C7+ are 1 ms and 1 s, respectively, at 15 degrees C. Anomalous exchange properties of the T3 imino proton indicate hydrogen bonding to A6 N7 via a water bridge. The d(5mCCTTTTCC) deoxyoligonucleotide, in which position 6 is occupied by a thymidine instead of an adenine, also forms a symmetric i-motif dimer. However, in this structure the two TTTT loops are located on the same side of the i-motif core and the C.C+ pairs are formed by equivalent cytidines stacked in the order 8*.8/1.1*/7*.7/2.2*. Oligodeoxynucleotides containing two C-rich repeats can fold and dimerize into an i-motif. The change of folding topology resulting from the substitution of a single nucleoside emphasizes the influence of the loop residues on the i-motif structure formed by two folded strands.
The crystal structure of the Sox4 HMG domain-DNA complex suggests a mechanism for positional interdependence in DNA recognition.

PubMed

Jauch, Ralf; Ng, Calista K L; Narasimhan, Kamesh; Kolatkar, Prasanna R

2012-04-01

It has recently been proposed that the sequence preferences of DNA-binding TFs (transcription factors) can be well described by models that include the positional interdependence of the nucleotides of the target sites. Such binding models allow for multiple motifs to be invoked, such as principal and secondary motifs differing at two or more nucleotide positions. However, the structural mechanisms underlying the accommodation of such variant motifs by TFs remain elusive. In the present study we examine the crystal structure of the HMG (high-mobility group) domain of Sox4 [Sry (sex-determining region on the Y chromosome)-related HMG box 4] bound to DNA. By comparing this structure with previously solved structures of Sox17 and Sox2, we observed subtle conformational differences at the DNA-binding interface. Furthermore, using quantitative electrophoretic mobility-shift assays we validated the positional interdependence of two nucleotides and the presence of a secondary Sox motif in the affinity landscape of Sox4. These results suggest that a concerted rearrangement of two interface amino acids enables Sox4 to accommodate primary and secondary motifs. The structural adaptations lead to altered dinucleotide preferences that mutually reinforce each other. These analyses underline the complexity of the DNA recognition by TFs and provide an experimental validation for the conceptual framework of positional interdependence and secondary binding motifs.
Maximum likelihood density modification by pattern recognition of structural motifs

DOEpatents

Terwilliger, Thomas C.

2004-04-13

An electron density for a crystallographic structure having protein regions and solvent regions is improved by maximizing the log likelihood of a set of structures factors {F.sub.h } using a local log-likelihood function: (x)+p(.rho.(x).vertline.SOLV)p.sub.SOLV (x)+p(.rho.(x).vertline.H)p.sub.H (x)], where p.sub.PROT (x) is the probability that x is in the protein region, p(.rho.(x).vertline.PROT) is the conditional probability for .rho.(x) given that x is in the protein region, and p.sub.SOLV (x) and p(.rho.(x).vertline.SOLV) are the corresponding quantities for the solvent region, p.sub.H (x) refers to the probability that there is a structural motif at a known location, with a known orientation, in the vicinity of the point x; and p(.rho.(x).vertline.H) is the probability distribution for electron density at this point given that the structural motif actually is present. One appropriate structural motif is a helical structure within the crystallographic structure.
NoFold: RNA structure clustering without folding or alignment.

PubMed

Middleton, Sarah A; Kim, Junhyong

2014-11-01

Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function-for example, recruiting a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, as most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, this results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Statistical tests to compare motif count exceptionalities

PubMed Central

Robin, Stéphane; Schbath, Sophie; Vandewalle, Vincent

2007-01-01

Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use. PMID:17346349
C-terminal motifs in promyelocytic leukemia protein isoforms critically regulate PML nuclear body formation.

PubMed

Li, Chuang; Peng, Qiongfang; Wan, Xiao; Sun, Haili; Tang, Jun

2017-10-15

Promyelocytic leukemia protein (PML) nuclear bodies (NBs), which are sub-nuclear protein structures, are involved in a variety of important cellular functions. PML-NBs are assembled by PML isoforms, and contact between small ubiquitin-like modifiers (SUMOs) with the SUMO interaction motif (SIM) are critically involved in this process. PML isoforms contain a common N-terminal region and a variable C-terminus. However, the contribution of the C-terminal regions to PML-NB formation remains poorly defined. Here, using high-resolution microscopy, we show that mutation of the SIM distinctively influences the structure of NBs formed by each individual PML isoform, with that of PML-III and PML-V minimally changed, and PML-I and PML-IV dramatically impaired. We further identify several C-terminal elements that are important in regulating NB structure and provide strong evidence to suggest that the 8b element in PML-IV possesses a strong ability to interact with SUMO-1 and SUMO-2, and critically participates in NB formation. Our findings highlight the importance of PML C-termini in NB assembly and function, and provide molecular insight into the PML-NB assembly of each distinctive isoform. © 2017. Published by The Company of Biologists Ltd.
A naturally occurring, noncanonical GTP aptamer made of simple tandem repeats

PubMed Central

Curtis, Edward A; Liu, David R

2014-01-01

Recently, we used in vitro selection to identify a new class of naturally occurring GTP aptamer called the G motif. Here we report the discovery and characterization of a second class of naturally occurring GTP aptamer, the “CA motif.” The primary sequence of this aptamer is unusual in that it consists entirely of tandem repeats of CA-rich motifs as short as three nucleotides. Several active variants of the CA motif aptamer lack the ability to form consecutive Watson-Crick base pairs in any register, while others consist of repeats containing only cytidine and adenosine residues, indicating that noncanonical interactions play important roles in its structure. The circular dichroism spectrum of the CA motif aptamer is distinct from that of A-form RNA and other major classes of nucleic acid structures. Bioinformatic searches indicate that the CA motif is absent from most archaeal and bacterial genomes, but occurs in at least 70 percent of approximately 400 eukaryotic genomes examined. These searches also uncovered several phylogenetically conserved examples of the CA motif in rodent (mouse and rat) genomes. Together, these results reveal the existence of a second class of naturally occurring GTP aptamer whose sequence requirements, like that of the G motif, are not consistent with those of a canonical secondary structure. They also indicate a new and unexpected potential biochemical activity of certain naturally occurring tandem repeats. PMID:24824832
Recurring sequence-structure motifs in (βα)8-barrel proteins and experimental optimization of a chimeric protein designed based on such motifs.

PubMed

Wang, Jichao; Zhang, Tongchuan; Liu, Ruicun; Song, Meilin; Wang, Juncheng; Hong, Jiong; Chen, Quan; Liu, Haiyan

2017-02-01

An interesting way of generating novel artificial proteins is to combine sequence motifs from natural proteins, mimicking the evolutionary path suggested by natural proteins comprising recurring motifs. We analyzed the βα and αβ modules of TIM barrel proteins by structure alignment-based sequence clustering. A number of preferred motifs were identified. A chimeric TIM was designed by using recurring elements as mutually compatible interfaces. The foldability of the designed TIM protein was then significantly improved by six rounds of directed evolution. The melting temperature has been improved by more than 20°C. A variety of characteristics suggested that the resulting protein is well-folded. Our analysis provided a library of peptide motifs that is potentially useful for different protein engineering studies. The protein engineering strategy of using recurring motifs as interfaces to connect partial natural proteins may be applied to other protein folds. Copyright © 2016 Elsevier B.V. All rights reserved.
Integrin Engagement by the Helical RGD Motif of the Helicobacter pylori CagL Protein Is Regulated by pH-induced Displacement of a Neighboring Helix*

PubMed Central

Bonsor, Daniel A.; Pham, Kieu T.; Beadenkopf, Robert; Diederichs, Kay; Haas, Rainer; Beckett, Dorothy; Fischer, Wolfgang; Sundberg, Eric J.

2015-01-01

Arginine-aspartate-glycine (RGD) motifs are recognized by integrins to bridge cells to one another and the extracellular matrix. RGD motifs typically reside in exposed loop conformations. X-ray crystal structures of the Helicobacter pylori protein CagL revealed that RGD motifs can also exist in helical regions of proteins. Interactions between CagL and host gastric epithelial cell via integrins are required for the translocation of the bacterial oncoprotein CagA. Here, we have investigated the molecular basis of the CagL-host cell interactions using structural, biophysical, and functional analyses. We solved an x-ray crystal structure of CagL that revealed conformational changes induced by low pH not present in previous structures. Using analytical ultracentrifugation, we found that pH-induced conformational changes in CagL occur in solution and not just in the crystalline environment. By designing numerous CagL mutants based on all available crystal structures, we probed the functional roles of CagL conformational changes on cell surface integrin engagement. Together, our data indicate that the helical RGD motif in CagL is buried by a neighboring helix at low pH to inhibit CagL binding to integrin, whereas at neutral pH the neighboring helix is displaced to allow integrin access to the CagL RGD motif. This novel molecular mechanism of regulating integrin-RGD motif interactions by changes in the chemical environment provides new insight to H. pylori-mediated oncogenesis. PMID:25837254
Novel calcium recognition constructions in proteins: Calcium blade and EF-hand zone

DOE Office of Scientific and Technical Information (OSTI.GOV)

Denesyuk, Alexander I., E-mail: adenesyu@abo.fi; Institute for Biological Instrumentation of the Russian Academy of Sciences, Pushchino 142290; Permyakov, Sergei E.

Metal ions can regulate various cell processes being first, second or third messengers, and some of them, especially transition metal ions, take part in catalysis in many enzymes. As an intracellular ion, Ca{sup 2+} is involved in many cellular functions from fertilization and contraction, cell differentiation and proliferation, to apoptosis and cancer. Here, we have identified and described two novel calcium recognition environments in proteins: the calcium blade zone and the EF-hand zone, common to 12 and 8 different protein families, respectively. Each of the two environments contains three distinct structural elements: (a) the well-known characteristic Dx[DN]xDG motif; (b) anmore » adjacent structurally identical segment, which binds metal ion in the same way between the calcium blade zone and the EF-hand zone; and (c) the following structurally variable segment, which distinguishes the calcium blade zone from the EF-hand zone. Both zones have sequence insertions between the last residue of the zone and calcium-binding residues in positions V or VI. The long insertion often connects the active and the calcium-binding sites in proteins. Using the structurally identical segments as an anchor, we were able to construct the classical calmodulin type EF-hand calcium-binding site out of two different calcium-binding motifs from two unrelated proteins.« less
Structural and biochemical analysis of Bcl-2 interaction with the hepatitis B virus protein HBx.

PubMed

Jiang, Tianyu; Liu, Minhao; Wu, Jianping; Shi, Yigong

2016-02-23

HBx is a hepatitis B virus protein that is required for viral infectivity and replication. Anti-apoptotic Bcl-2 family members are thought to be among the important host targets of HBx. However, the structure and function of HBx are poorly understood and the molecular mechanism of HBx-induced carcinogenesis remains unknown. In this study, we report biochemical and structural characterization of HBx. The recombinant HBx protein contains metal ions, in particular iron and zinc. A BH3-like motif in HBx (residues 110-135) binds Bcl-2 with a dissociation constant of ∼193 μM, which is drastically lower than that for a canonical BH3 motif from Bim or Bad. Structural analysis reveals that, similar to other BH3 motifs, the BH3-like motif of HBx adopts an amphipathic α-helix and binds the conserved BH3-binding groove on Bcl-2. Unlike the helical Bim or Bad BH3 motif, the C-terminal portion of the bound HBx BH3-like motif has an extended conformation and makes considerably fewer interactions with Bcl-2. These observations suggest that HBx may modulate Bcl-2 function in a way that is different from that of the classical BH3-only proteins.
Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chojnowski, Grzegorz, E-mail: gchojnowski@genesilico.pl; Waleń, Tomasz; University of Warsaw, Banacha 2, 02-097 Warsaw

2015-03-01

A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure ismore » RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx.« less
Exploring the limits of sequence and structure in a variant βγ-crystallin domain of the protein absent in melanoma-1 (AIM1)

PubMed Central

Aravind, Penmatsa; Wistow, Graeme; Sharma, Yogendra; Sankaranarayanan, Rajan

2008-01-01

βγ-Crystallins belong to a superfamily of proteins in prokaryotes and eukaryotes that are based on duplications of a characteristic, highly conserved Greek Key motif. Most members of the superfamily in vertebrates are structural proteins of the eye lens that contain four motifs arranged as two structural domains. Absent in melanoma-1 (AIM1), an unusual member of the superfamily whose expression is associated with suppression of malignancy in melanoma, contains 12 βγ-crystallin motifs in six domains. Some of these motifs diverge considerably from the canonical motif sequence. AIM1g1, the first βγ-crystallin domain of AIM1, is the most variant of βγ-crystallin domains currently known. In order to understand the limits of sequence variation on the structure, we report the crystal structure of AIM1g1 at 1.9Å resolution. In spite of having changes in key residues, the domain retains the overall βγ-crystallin fold. The domain also contains an unusual extended surface loop that significantly alters the shape of the domain and its charge profile. This structure illustrates the resilience of the βγ fold to considerable sequence changes and its remarkable ability to adapt for novel functions. PMID:18582473
QuadBase2: web server for multiplexed guanine quadruplex mining and visualization

PubMed Central

Dhapola, Parashar; Chowdhury, Shantanu

2016-01-01

DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that can allow batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with Ensembl Compara database, to allow users mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way. PMID:27185890

Crystal structure of bacterial cell-surface alginate-binding protein with an M75 peptidase motif

DOE Office of Scientific and Technical Information (OSTI.GOV)

Maruyama, Yukie; Ochiai, Akihito; Mikami, Bunzo

Research highlights: {yields} Bacterial alginate-binding Algp7 is similar to component EfeO of Fe{sup 2+} transporter. {yields} We determined the crystal structure of Algp7 with a metal-binding motif. {yields} Algp7 consists of two helical bundles formed through duplication of a single bundle. {yields} A deep cleft involved in alginate binding locates around the metal-binding site. {yields} Algp7 may function as a Fe{sup 2+}-chelated alginate-binding protein. -- Abstract: A gram-negative Sphingomonas sp. A1 directly incorporates alginate polysaccharide into the cytoplasm via the cell-surface pit and ABC transporter. A cell-surface alginate-binding protein, Algp7, functions as a concentrator of the polysaccharide in the pit.more » Based on the primary structure and genetic organization in the bacterial genome, Algp7 was found to be homologous to an M75 peptidase motif-containing EfeO, a component of a ferrous ion transporter. Despite the presence of an M75 peptidase motif with high similarity, the Algp7 protein purified from recombinant Escherichia coli cells was inert on insulin B chain and N-benzoyl-Phe-Val-Arg-p-nitroanilide, both of which are substrates for a typical M75 peptidase, imelysin, from Pseudomonas aeruginosa. The X-ray crystallographic structure of Algp7 was determined at 2.10 A resolution by single-wavelength anomalous diffraction. Although a metal-binding motif, HxxE, conserved in zinc ion-dependent M75 peptidases is also found in Algp7, the crystal structure of Algp7 contains no metal even at the motif. The protein consists of two structurally similar up-and-down helical bundles as the basic scaffold. A deep cleft between the bundles is sufficiently large to accommodate macromolecules such as alginate polysaccharide. This is the first structural report on a bacterial cell-surface alginate-binding protein with an M75 peptidase motif.« less
Probing the Potential Role of Non-B DNA Structures at Yeast Meiosis-Specific DNA Double-Strand Breaks.

PubMed

Kshirsagar, Rucha; Khan, Krishnendu; Joshi, Mamata V; Hosur, Ramakrishna V; Muniyappa, K

2017-05-23

A plethora of evidence suggests that different types of DNA quadruplexes are widely present in the genome of all organisms. The existence of a growing number of proteins that selectively bind and/or process these structures underscores their biological relevance. Moreover, G-quadruplex DNA has been implicated in the alignment of four sister chromatids by forming parallel guanine quadruplexes during meiosis; however, the underlying mechanism is not well defined. Here we show that a G/C-rich motif associated with a meiosis-specific DNA double-strand break (DSB) in Saccharomyces cerevisiae folds into G-quadruplex, and the C-rich sequence complementary to the G-rich sequence forms an i-motif. The presence of G-quadruplex or i-motif structures upstream of the green fluorescent protein-coding sequence markedly reduces the levels of gfp mRNA expression in S. cerevisiae cells, with a concomitant decrease in green fluorescent protein abundance, and blocks primer extension by DNA polymerase, thereby demonstrating the functional significance of these structures. Surprisingly, although S. cerevisiae Hop1, a component of synaptonemal complex axial/lateral elements, exhibits strong affinity to G-quadruplex DNA, it displays a much weaker affinity for the i-motif structure. However, the Hop1 C-terminal but not the N-terminal domain possesses strong i-motif binding activity, implying that the C-terminal domain has a distinct substrate specificity. Additionally, we found that Hop1 promotes intermolecular pairing between G/C-rich DNA segments associated with a meiosis-specific DSB site. Our results support the idea that the G/C-rich motifs associated with meiosis-specific DSBs fold into intramolecular G-quadruplex and i-motif structures, both in vitro and in vivo, thus revealing an important link between non-B form DNA structures and Hop1 in meiotic chromosome synapsis and recombination. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Exploration of tetrahedral structures in silicate cathodes using a motif-network scheme

PubMed Central

Zhao, Xin; Wu, Shunqing; Lv, Xiaobao; Nguyen, Manh Cuong; Wang, Cai-Zhuang; Lin, Zijing; Zhu, Zi-Zhong; Ho, Kai-Ming

2015-01-01

Using a motif-network search scheme, we studied the tetrahedral structures of the dilithium/disodium transition metal orthosilicates A2MSiO4 with A = Li or Na and M = Mn, Fe or Co. In addition to finding all previously reported structures, we discovered many other different tetrahedral-network-based crystal structures which are highly degenerate in energy. These structures can be classified into structures with 1D, 2D and 3D M-Si-O frameworks. A clear trend of the structural preference in different systems was revealed and possible indicators that affect the structure stabilities were introduced. For the case of Na systems which have been much less investigated in the literature relative to the Li systems, we predicted their ground state structures and found evidence for the existence of new structural motifs. PMID:26497381
Conservation of the Human Integrin-Type Beta-Propeller Domain in Bacteria

PubMed Central

Chouhan, Bhanupratap; Denesyuk, Alexander; Heino, Jyrki; Johnson, Mark S.; Denessiouk, Konstantin

2011-01-01

Integrins are heterodimeric cell-surface receptors with key functions in cell-cell and cell-matrix adhesion. Integrin α and β subunits are present throughout the metazoans, but it is unclear whether the subunits predate the origin of multicellular organisms. Several component domains have been detected in bacteria, one of which, a specific 7-bladed β-propeller domain, is a unique feature of the integrin α subunits. Here, we describe a structure-derived motif, which incorporates key features of each blade from the X-ray structures of human αIIbβ3 and αVβ3, includes elements of the FG-GAP/Cage and Ca2+-binding motifs, and is specific only for the metazoan integrin domains. Separately, we searched for the metazoan integrin type β-propeller domains among all available sequences from bacteria and unicellular eukaryotic organisms, which must incorporate seven repeats, corresponding to the seven blades of the β-propeller domain, and so that the newly found structure-derived motif would exist in every repeat. As the result, among 47 available genomes of unicellular eukaryotes we could not find a single instance of seven repeats with the motif. Several sequences contained three repeats, a predicted transmembrane segment, and a short cytoplasmic motif associated with some integrins, but otherwise differ from the metazoan integrin α subunits. Among the available bacterial sequences, we found five examples containing seven sequential metazoan integrin-specific motifs within the seven repeats. The motifs differ in having one Ca2+-binding site per repeat, whereas metazoan integrins have three or four sites. The bacterial sequences are more conserved in terms of motif conservation and loop length, suggesting that the structure is more regular and compact than those example structures from human integrins. Although the bacterial examples are not full-length integrins, the full-length metazoan-type 7-bladed β-propeller domains are present, and sometimes two tandem copies are found. PMID:22022374
Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.

PubMed

Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B

2018-06-07

RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Theoretical studies of optics and charge transport in organic conducting oligomers and polymers: Rational design of improved transparent and conducting polymers

NASA Astrophysics Data System (ADS)

Hutchison, Geoffrey Rogers

Theoretical studies on a variety of oligo- and polyheterocycles elucidate their optical and charge transport properties, suggesting new, improved transparent conductive polymers. First-principles calculations provide accurate methodologies for predicting both optical band gaps of neutral and cationic oligomers and intrinsic charge transfer rates. Multidimensional analysis reveals important motifs in chemical tailorability of oligoheterocycle optical and charge transport properties. The results suggest new directions for design of novel materials. Using both finite oligomer and infinite polymer calculations, the optical band gaps in polyheterocycles follow a modified particle-in-a-box formalism, scaling approximately as 1/N (where N is the number of monomer units) in short chains, saturating for long chains. Calculations demonstrate that band structure changes upon heteroatom substitution, (e.g., from polythiophene to polypyrrole) derive from heteroatom electron affinity. Further investigation of chemical variability in substituted oligoheterocycles using multidimensional statistics reveals the interplay between heteroatom and substituent in correlations between structure and redox/optical properties of neutral and cationic species. A linear correlation between band gaps of neutral and cationic species upon oxidation of conjugated oligomers, shows redshifts of optical absorption for most species and blueshifts for small band gap species. Interstrand charge-transport studies focus on two contributors to hopping-style charge transfer rates: internal reorganization energy and the electronic coupling matrix element. Statistical analysis of chemical variability of reorganization energies in oligoheterocycles proves the importance of reorganization energy in determining intrinsic charge transfer rates (e.g., charge mobility in unsubstituted oligothiophenes). Computed bandwidths across several oligothiophene crystal packing motifs show similar electron and hole bandwidths, and show that well-known tilted and herringbone motifs in oligothiophenes are driven by electrostatic repulsion. Tilted stacks exhibit intrinsic charge-transfer rates smaller than cofacial stacks, but with lower packing energy. Given similar electron and hole bandwidths, a charge injection model explains substitution-modulated majority carrier changes in n- and p-type oligothiophene field-effect transistors.
Crystal Structure Predictions Using Adaptive Genetic Algorithm and Motif Search methods

NASA Astrophysics Data System (ADS)

Ho, K. M.; Wang, C. Z.; Zhao, X.; Wu, S.; Lyu, X.; Zhu, Z.; Nguyen, M. C.; Umemoto, K.; Wentzcovitch, R. M. M.

2017-12-01

Material informatics is a new initiative which has attracted a lot of attention in recent scientific research. The basic strategy is to construct comprehensive data sets and use machine learning to solve a wide variety of problems in material design and discovery. In pursuit of this goal, a key element is the quality and completeness of the databases used. Recent advance in the development of crystal structure prediction algorithms has made it a complementary and more efficient approach to explore the structure/phase space in materials using computers. In this talk, we discuss the importance of the structural motifs and motif-networks in crystal structure predictions. Correspondingly, powerful methods are developed to improve the sampling of the low-energy structure landscape.
The ARTT motif and a unified structural understanding of substraterecognition in ADP ribosylating bacterial toxins and eukaryotic ADPribosyltransferases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Han, S.; Tainer, J.A.

2001-08-01

ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing NAD-binding pocket formed by the two perpendicular b-sheet core hasmore » been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransfeases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT motif structural framework. Thus, we propose here that the ARTT motif represents an experimentally testable general recognition motif region for many ADP-ribosyltransferases and thereby potentially provides a unified structural understanding of substrate recognition in ADP-ribosylation processes.« less
2,6-Diiminopiperidin-1-ol: an overlooked motif relevant to uranyl and transition metal binding on poly(amidoxime) adsorbents

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kennedy, Zachary C.; Cardenas, Allan Jay P.; Corbey, Jordan F.

2016-01-01

Glutardiamidoxime, a structural motif on sorbents used in uranium extraction from seawater, was discovered to cyclize in situ at room temperature to 2,6-diimino-piperidin-1-ol in the presence of uranyl nitrate. The new diimino motif was also generated when exposed to competing transition metals Cu(II) and Ni(II). Multinuclear μ-O bridged U(VI), Cu(II), and Ni(II) complexes featuring bound diimino ligands were isolated. A Cu(II) complex with the historically relevant cyclic imide dioxime motif is also reported for structural comparison to the reported diimino complexes.
Parallel arrangements of positive feedback loops limit cell-to-cell variability in differentiation.

PubMed

Dey, Anupam; Barik, Debashis

2017-01-01

Cellular differentiations are often regulated by bistable switches resulting from specific arrangements of multiple positive feedback loops (PFL) fused to one another. Although bistability generates digital responses at the cellular level, stochasticity in chemical reactions causes population heterogeneity in terms of its differentiated states. We hypothesized that the specific arrangements of PFLs may have evolved to minimize the cellular heterogeneity in differentiation. In order to test this we investigated variability in cellular differentiation controlled either by parallel or serial arrangements of multiple PFLs having similar average properties under extrinsic and intrinsic noises. We find that motifs with PFLs fused in parallel to one another around a central regulator are less susceptible to noise as compared to the motifs with PFLs arranged serially. Our calculations suggest that the increased resistance to noise in parallel motifs originate from the less sensitivity of bifurcation points to the extrinsic noise. Whereas estimation of mean residence times indicate that stable branches of bifurcations are robust to intrinsic noise in parallel motifs as compared to serial motifs. Model conclusions are consistent both in AND- and OR-gate input signal configurations and also with two different modeling strategies. Our investigations provide some insight into recent findings that differentiation of preadipocyte to mature adipocyte is controlled by network of parallel PFLs.
Tertiary alphabet for the observable protein structural universe.

PubMed

Mackenzie, Craig O; Zhou, Jianfu; Grigoryan, Gevorg

2016-11-22

Here, we systematically decompose the known protein structural universe into its basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone fragment that captures the secondary, tertiary, and quaternary environments around a given residue, comprising one or more disjoint segments (three on average). We seek the set of universal TERMs that capture all structure in the Protein Data Bank (PDB), finding remarkable degeneracy. Only ∼600 TERMs are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However, more rare geometries also exist, and the overall structural coverage grows logarithmically with the number of TERMs. We go on to show that universal TERMs provide an effective mapping between sequence and structure. We demonstrate that TERM-based statistics alone are sufficient to recapitulate close-to-native sequences given either NMR or X-ray backbones. Furthermore, sequence variability predicted from TERM data agrees closely with evolutionary variation. Finally, locations of TERMs in protein chains can be predicted from sequence alone based on sequence signatures emergent from TERM instances in the PDB. For multisegment motifs, this method identifies spatially adjacent fragments that are not contiguous in sequence-a major bottleneck in structure prediction. Although all TERMs recur in diverse proteins, some appear specialized for certain functions, such as interface formation, metal coordination, or even water binding. Structural biology has benefited greatly from previously observed degeneracies in structure. The decomposition of the known structural universe into a finite set of compact TERMs offers exciting opportunities toward better understanding, design, and prediction of protein structure.
Tertiary alphabet for the observable protein structural universe

PubMed Central

Mackenzie, Craig O.; Zhou, Jianfu; Grigoryan, Gevorg

2016-01-01

Here, we systematically decompose the known protein structural universe into its basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone fragment that captures the secondary, tertiary, and quaternary environments around a given residue, comprising one or more disjoint segments (three on average). We seek the set of universal TERMs that capture all structure in the Protein Data Bank (PDB), finding remarkable degeneracy. Only ∼600 TERMs are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However, more rare geometries also exist, and the overall structural coverage grows logarithmically with the number of TERMs. We go on to show that universal TERMs provide an effective mapping between sequence and structure. We demonstrate that TERM-based statistics alone are sufficient to recapitulate close-to-native sequences given either NMR or X-ray backbones. Furthermore, sequence variability predicted from TERM data agrees closely with evolutionary variation. Finally, locations of TERMs in protein chains can be predicted from sequence alone based on sequence signatures emergent from TERM instances in the PDB. For multisegment motifs, this method identifies spatially adjacent fragments that are not contiguous in sequence—a major bottleneck in structure prediction. Although all TERMs recur in diverse proteins, some appear specialized for certain functions, such as interface formation, metal coordination, or even water binding. Structural biology has benefited greatly from previously observed degeneracies in structure. The decomposition of the known structural universe into a finite set of compact TERMs offers exciting opportunities toward better understanding, design, and prediction of protein structure. PMID:27810958
Mutually Exclusive Formation of G-Quadruplex and i-Motif Is a General Phenomenon Governed by Steric Hindrance in Duplex DNA.

PubMed

Cui, Yunxi; Kong, Deming; Ghimire, Chiran; Xu, Cuixia; Mao, Hanbin

2016-04-19

G-Quadruplex and i-motif are tetraplex structures that may form in opposite strands at the same location of a duplex DNA. Recent discoveries have indicated that the two tetraplex structures can have conflicting biological activities, which poses a challenge for cells to coordinate. Here, by performing innovative population analysis on mechanical unfolding profiles of tetraplex structures in double-stranded DNA, we found that formations of G-quadruplex and i-motif in the two complementary strands are mutually exclusive in a variety of DNA templates, which include human telomere and promoter fragments of hINS and hTERT genes. To explain this behavior, we placed G-quadruplex- and i-motif-hosting sequences in an offset fashion in the two complementary telomeric DNA strands. We found simultaneous formation of the G-quadruplex and i-motif in opposite strands, suggesting that mutual exclusivity between the two tetraplexes is controlled by steric hindrance. This conclusion was corroborated in the BCL-2 promoter sequence, in which simultaneous formation of two tetraplexes was observed due to possible offset arrangements between G-quadruplex and i-motif in opposite strands. The mutual exclusivity revealed here sets a molecular basis for cells to efficiently coordinate opposite biological activities of G-quadruplex and i-motif at the same dsDNA location.
A Three-Dimensional RNA Motif in Potato spindle tuber viroid Mediates Trafficking from Palisade Mesophyll to Spongy Mesophyll in Nicotiana benthamiana[W

PubMed Central

Takeda, Ryuta; Petrov, Anton I.; Leontis, Neocles B.; Ding, Biao

2011-01-01

Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5′-CGA-3′...5′-GAC-3′ flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes. PMID:21258006
A three-dimensional RNA motif in Potato spindle tuber viroid mediates trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana.

PubMed

Takeda, Ryuta; Petrov, Anton I; Leontis, Neocles B; Ding, Biao

2011-01-01

Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5'-CGA-3'...5'-GAC-3' flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes.
Structural and Biochemical Basis for the Binding Selectivity of Peroxisome Proliferator-activated Receptor [gamma] to PGC-1[alpha

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Yong; Kovach, Amanda; Suino-Powell, Kelly

2008-07-23

The functional interaction between the peroxisome proliferator-activated receptor {gamma} (PPAR{gamma}) and its coactivator PGC-1{alpha} is crucial for the normal physiology of PPAR{gamma} and its pharmacological response to antidiabetic treatment with rosiglitazone. Here we report the crystal structure of the PPAR{gamma} ligand-binding domain bound to rosiglitazone and to a large PGC-1{alpha} fragment that contains two LXXLL-related motifs. The structure reveals critical contacts mediated through the first LXXLL motif of PGC-1{alpha} and the PPAR{gamma} coactivator binding site. Through a combination of biochemical and structural studies, we demonstrate that the first LXXLL motif is the most potent among all nuclear receptor coactivator motifsmore » tested, and only this motif of the two LXXLL-related motifs in PGC-1{alpha} is capable of binding to PPAR{gamma}. Our studies reveal that the strong interaction of PGC-1{alpha} and PPAR{gamma} is mediated through both hydrophobic and specific polar interactions. Mutations within the context of the full-length PGC-1{alpha} indicate that the first PGC-1{alpha} motif is necessary and sufficient for PGC-1{alpha} to coactivate PPAR{gamma} in the presence or absence of rosiglitazone. These results provide a molecular basis for specific recruitment and functional interplay between PPAR{gamma} and PGC-1{alpha} in glucose homeostasis and adipocyte differentiation.« less
Analysis of secondary structural elements in human microRNA hairpin precursors.

PubMed

Liu, Biao; Childs-Disney, Jessica L; Znosko, Brent M; Wang, Dan; Fallahi, Mohammad; Gallo, Steven M; Disney, Matthew D

2016-03-01

MicroRNAs (miRNAs) regulate gene expression by targeting complementary mRNAs for destruction or translational repression. Aberrant expression of miRNAs has been associated with various diseases including cancer, thus making them interesting therapeutic targets. The composite of secondary structural elements that comprise miRNAs could aid the design of small molecules that modulate their function. We analyzed the secondary structural elements, or motifs, present in all human miRNA hairpin precursors and compared them to highly expressed human RNAs with known structures and other RNAs from various organisms. Amongst human miRNAs, there are 3808 are unique motifs, many residing in processing sites. Further, we identified motifs in miRNAs that are not present in other highly expressed human RNAs, desirable targets for small molecules. MiRNA motifs were incorporated into a searchable database that is freely available. We also analyzed the most frequently occurring bulges and internal loops for each RNA class and found that the smallest loops possible prevail. However, the distribution of loops and the preferred closing base pairs were unique to each class. Collectively, we have completed a broad survey of motifs found in human miRNA precursors, highly expressed human RNAs, and RNAs from other organisms. Interestingly, unique motifs were identified in human miRNA processing sites, binding to which could inhibit miRNA maturation and hence function.
Identification of novel RNA secondary structures within the hepatitis C virus genome reveals a cooperative involvement in genome packaging

PubMed Central

Stewart, H.; Bingham, R.J.; White, S. J.; Dykeman, E. C.; Zothner, C.; Tuplin, A. K.; Stockley, P. G.; Twarock, R.; Harris, M.

2016-01-01

The specific packaging of the hepatitis C virus (HCV) genome is hypothesised to be driven by Core-RNA interactions. To identify the regions of the viral genome involved in this process, we used SELEX (systematic evolution of ligands by exponential enrichment) to identify RNA aptamers which bind specifically to Core in vitro. Comparison of these aptamers to multiple HCV genomes revealed the presence of a conserved terminal loop motif within short RNA stem-loop structures. We postulated that interactions of these motifs, as well as sub-motifs which were present in HCV genomes at statistically significant levels, with the Core protein may drive virion assembly. We mutated 8 of these predicted motifs within the HCV infectious molecular clone JFH-1, thereby producing a range of mutant viruses predicted to possess altered RNA secondary structures. RNA replication and viral titre were unaltered in viruses possessing only one mutated structure. However, infectivity titres were decreased in viruses possessing a higher number of mutated regions. This work thus identified multiple novel RNA motifs which appear to contribute to genome packaging. We suggest that these structures act as cooperative packaging signals to drive specific RNA encapsidation during HCV assembly. PMID:26972799
Exploration of tetrahedral structures in silicate cathodes using a motif-network scheme

DOE PAGES

Zhao, Xin; Wu, Shunqing; Lv, Xiaobao; ...

2015-10-26

Using a motif-network search scheme, we studied the tetrahedral structures of the dilithium/disodium transition metal orthosilicates A 2MSiO 4 with A = Li or Na and M = Mn, Fe or Co. In addition to finding all previously reported structures, we discovered many other different tetrahedral-network-based crystal structures which are highly degenerate in energy. In addition, these structures can be classified into structures with 1D, 2D and 3D M-Si-O frameworks. A clear trend of the structural preference in different systems was revealed and possible indicators that affect the structure stabilities were introduced. For the case of Na systems which havemore » been much less investigated in the literature relative to the Li systems, we predicted their ground state structures and found evidence for the existence of new structural motifs.« less
Identification of GATC- and CCGG- recognizing Type II REases and their putative specificity-determining positions using Scan2S—a novel motif scan algorithm with optional secondary structure constraints

PubMed Central

Niv, Masha Y.; Skrabanek, Lucy; Roberts, Richard J.; Scheraga, Harold A.; Weinstein, Harel

2008-01-01

Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering. PMID:17972284

Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.

PubMed

Niv, Masha Y; Skrabanek, Lucy; Roberts, Richard J; Scheraga, Harold A; Weinstein, Harel

2008-05-01

Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.
HOXB9 induction of mesenchymal-to-epithelial transition in gastric carcinoma is negatively regulated by its hexapeptide motif

PubMed Central

He, Changyu; Zhang, Baogui; Zhang, Jun; Liu, Bingya; Zeng, Naiyan; Zhu, Zhenggang

2015-01-01

HOXB9, a transcription factor, plays an important role in development. While HOXB9 has been implicated in tumorigenesis and metastasis, its mechanisms are variable and its role in gastric carcinoma (GC) remains unclear. In the present study, we demonstrated that the expression of HOXB9 decreased in gastric carcinoma and was associated with malignancy and metastasis. Re-expression of HOXB9 in gastric cell lines resulted in the suppression of cell proliferation, migration, and invasion, which was accompanied by the induction of mesenchymal-to-epithelial transition (MET). Comparative sequence analysis and examination of a HOXB9 structural model indicated that three sites might possibly be involved in MET regulation. The in vitro study of HOXB9 mutants showed that these were unable to inhibit MET induction. However, when overexpressing a HOXB9 mutant lacking the hexapeptide motif, a more potent MET induction and tumor suppression was observed compared to that of the wild-type, indicating that the presence of the hexapeptide motif reduced HOXB9 MET induction and tumor suppression activity. Therefore, the results of the present study suggested that HOXB9 is a tumor suppressor in gastric carcinoma, and its activity was controlled by different regulatory mechanisms such as the hexapeptide motif as a “brake” in this case. The results of these regulatory effects could lead to either oncogenic or tumor suppressive roles of HOXB9, depending on the context of the particular type of cancer involved. PMID:26536658
Structural basis for the binding of tryptophan-based motifs by δ-COP

PubMed Central

Suckling, Richard J.; Poon, Pak Phi; Travis, Sophie M.; Majoul, Irina V.; Hughson, Frederick M.; Evans, Philip R.; Duden, Rainer; Owen, David J.

2015-01-01

Coatomer consists of two subcomplexes: the membrane-targeting, ADP ribosylation factor 1 (Arf1):GTP-binding βγδζ-COP F-subcomplex, which is related to the adaptor protein (AP) clathrin adaptors, and the cargo-binding αβ’ε-COP B-subcomplex. We present the structure of the C-terminal μ-homology domain of the yeast δ-COP subunit in complex with the WxW motif from its binding partner, the endoplasmic reticulum-localized Dsl1 tether. The motif binds at a site distinct from that used by the homologous AP μ subunits to bind YxxΦ cargo motifs with its two tryptophan residues sitting in compatible pockets. We also show that the Saccharomyces cerevisiae Arf GTPase-activating protein (GAP) homolog Gcs1p uses a related WxxF motif at its extreme C terminus to bind to δ-COP at the same site in the same way. Mutations designed on the basis of the structure in conjunction with isothermal titration calorimetry confirm the mode of binding and show that mammalian δ-COP binds related tryptophan-based motifs such as that from ArfGAP1 in a similar manner. We conclude that δ-COP subunits bind Wxn(1–6)[WF] motifs within unstructured regions of proteins that influence the lifecycle of COPI-coated vesicles; this conclusion is supported by the observation that, in the context of a sensitizing domain deletion in Dsl1p, mutating the tryptophan-based motif-binding site in yeast causes defects in both growth and carboxypeptidase Y trafficking/processing. PMID:26578768
Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

PubMed Central

König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

2013-01-01

G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations do destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141
BlockLogo: visualization of peptide and sequence motif conservation

PubMed Central

Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian; Sun, Jing; Schönbach, Christian; Reinherz, Ellis L.; Zhang, Guang Lan; Brusic, Vladimir

2013-01-01

BlockLogo is a web-server application for visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, selection of motif positions, type of sequence, and output format definition. The output has BlockLogo along with the sequence logo, and a table of motif frequencies. We deployed BlockLogo as an online application and have demonstrated its utility through examples that show visualization of T-cell epitopes and B-cell epitopes (both continuous and discontinuous). Our additional example shows a visualization and analysis of structural motifs that determine specificity of peptide binding to HLA-DR molecules. The BlockLogo server also employs selected experimentally validated prediction algorithms to enable on-the-fly prediction of MHC binding affinity to 15 common HLA class I and class II alleles as well as visual analysis of discontinuous epitopes from multiple sequence alignments. It enables the visualization and analysis of structural and functional motifs that are usually described as regular expressions. It provides a compact view of discontinuous motifs composed of distant positions within biological sequences. BlockLogo is available at: http://research4.dfci.harvard.edu/cvc/blocklogo/ and http://methilab.bu.edu/blocklogo/ PMID:24001880
The pH-dependent tertiary structure of a designed helix-loop-helix dimer.

PubMed

Dolphin, G T; Baltzer, L

1997-01-01

De novo designed helix-loop-helix motifs can fold into well-defined tertiary structures if residues or groups of residues are incorporated at the helix-helix boundary to form helix-recognition sites that restrict the conformational degrees of freedom of the helical segments. Understanding the relationship between structure and function of conformational constraints therefore forms the basis for the engineering of non-natural proteins. This paper describes the design of an interhelical HisH+-Asp- hydrogen-bonded ion pair and the conformational stability of the folded helix-loop-helix motif. GTD-C, a polypeptide with 43 amino acid residues, has been designed to fold into a hairpin helix-loop-helix motif that can dimerise to form a four-helix bundle. The folded motif is in slow conformational exchange on the NMR timescale and has a well-dispersed 1H NMR spectrum, a narrow temperature interval for thermal denaturation and a near-UV CD spectrum with some fine structure. The conformational stability is pH dependent with an optimum that corresponds to the pH for maximum formation of a hydrogen-bonded ion pair between HisH17+ in helix I and Asp27- in helix II. The formation of an interhelical salt bridge is strongly suggested by the pH dependence of a number of spectroscopic probes to generate a well-defined tertiary structure in a designed helix-loop-helix motif. The thermodynamic stability of the folded motif is not increased by the formation of the salt bridge, but neighbouring conformations are destabilised. The use of this novel design principle in combination with hydrophobic interactions that provide sufficient binding energy in the folded structure should be of general use in de novo design of native-like proteins.
Assessing local structure motifs using order parameters for motif recognition, interstitial identification, and diffusion path characterization

NASA Astrophysics Data System (ADS)

Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; Haranczyk, Maciej

2017-11-01

Structure-property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal closed packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.
Automated Recognition of RNA Structure Motifs by Their SHAPE Data Signatures.

PubMed

Radecki, Pierce; Ledda, Mirko; Aviran, Sharon

2018-06-14

High-throughput structure profiling (SP) experiments that provide information at nucleotide resolution are revolutionizing our ability to study RNA structures. Of particular interest are RNA elements whose underlying structures are necessary for their biological functions. We previously introduced patteRNA , an algorithm for rapidly mining SP data for patterns characteristic of such motifs. This work provided a proof-of-concept for the detection of motifs and the capability of distinguishing structures displaying pronounced conformational changes. Here, we describe several improvements and automation routines to patteRNA . We then consider more elaborate biological situations starting with the comparison or integration of results from searches for distinct motifs and across datasets. To facilitate such analyses, we characterize patteRNA ’s outputs and describe a normalization framework that regularizes results. We then demonstrate that our algorithm successfully discerns between highly similar structural variants of the human immunodeficiency virus type 1 (HIV-1) Rev response element (RRE) and readily identifies its exact location in whole-genome structure profiles of HIV-1. This work highlights the breadth of information that can be gleaned from SP data and broadens the utility of data-driven methods as tools for the detection of novel RNA elements.
Molecular basis for the wide range of affinity found in Csr/Rsm protein-RNA recognition.

PubMed

Duss, Olivier; Michel, Erich; Diarra dit Konté, Nana; Schubert, Mario; Allain, Frédéric H-T

2014-04-01

The carbon storage regulator/regulator of secondary metabolism (Csr/Rsm) type of small non-coding RNAs (sRNAs) is widespread throughout bacteria and acts by sequestering the global translation repressor protein CsrA/RsmE from the ribosome binding site of a subset of mRNAs. Although we have previously described the molecular basis of a high affinity RNA target bound to RsmE, it remains unknown how other lower affinity targets are recognized by the same protein. Here, we have determined the nuclear magnetic resonance solution structures of five separate GGA binding motifs of the sRNA RsmZ of Pseudomonas fluorescens in complex with RsmE. The structures explain how the variation of sequence and structural context of the GGA binding motifs modulate the binding affinity for RsmE by five orders of magnitude (∼10 nM to ∼3 mM, Kd). Furthermore, we see that conformational adaptation of protein side-chains and RNA enable recognition of different RNA sequences by the same protein contributing to binding affinity without conferring specificity. Overall, our findings illustrate how the variability in the Csr/Rsm protein-RNA recognition allows a fine-tuning of the competition between mRNAs and sRNAs for the CsrA/RsmE protein.
Variable setpoint as a relaxing component in physiological control.

PubMed

Risvoll, Geir B; Thorsen, Kristian; Ruoff, Peter; Drengstig, Tormod

2017-09-01

Setpoints in physiology have been a puzzle for decades, and especially the notion of fixed or variable setpoints have received much attention. In this paper, we show how previously presented homeostatic controller motifs, extended with saturable signaling kinetics, can be described as variable setpoint controllers. The benefit of a variable setpoint controller is that an observed change in the concentration of the regulated biochemical species (the controlled variable) is fully characterized, and is not considered a deviation from a fixed setpoint. The variation in this biochemical species originate from variation in the disturbances (the perturbation), and thereby in the biochemical species representing the controller (the manipulated variable). Thus, we define an operational space which is spanned out by the combined high and low levels of the variations in (1) the controlled variable, (2) the manipulated variable, and (3) the perturbation. From this operational space, we investigate whether and how it imposes constraints on the different motif parameters, in order for the motif to represent a mathematical model of the regulatory system. Further analysis of the controller's ability to compensate for disturbances reveals that a variable setpoint represents a relaxing component for the controller, in that the necessary control action is reduced compared to that of a fixed setpoint controller. Such a relaxing component might serve as an important property from an evolutionary point of view. Finally, we illustrate the principles using the renal sodium and aldosterone regulatory system, where we model the variation in plasma sodium as a function of salt intake. We show that the experimentally observed variations in plasma sodium can be interpreted as a variable setpoint regulatory system. © 2017 The Authors. Physiological Reports published by Wiley Periodicals, Inc. on behalf of The Physiological Society and the American Physiological Society.
Conservation of the glycoprotein B homologs of the Kaposi’s sarcoma-associated herpesvirus (KSHV/HHV8) and Old World primate rhadinoviruses of chimpanzees and macaques

PubMed Central

Bruce, A. Gregory; Horst, Jeremy A.; Rose, Timothy M.

2016-01-01

The envelope-associated glycoprotein B (gB) is highly conserved within the Herpesviridae and plays a critical role in viral entry. We analyzed the evolutionary conservation of sequence and structural motifs within the Kaposi’s sarcoma-associated herpesvirus (KSHV) gB and homologs of Old World primate rhadinoviruses belonging to the distinct RV1 and RV2 rhadinovirus lineages. In addition to gB homologs of rhadinoviruses infecting the pig-tailed and rhesus macaques, we cloned and sequenced gB homologs of RV1 and RV2 rhadinoviruses infecting chimpanzees. A structural model of the KSHV gB was determined, and functional motifs and sequence variants were mapped to the model structure. Conserved domains and motifs were identified, including an “RGD” motif that plays a critical role in KSHV binding and entry through the cellular integrin αVβ3. The RGD motif was only detected in RV1 rhadinoviruses suggesting an important difference in cell tropism between the two rhadinovirus lineages. PMID:27070755
Discovering Sequence Motifs with Arbitrary Insertions and Deletions

PubMed Central

Frith, Martin C.; Saunders, Neil F. W.; Kobe, Bostjan; Bailey, Timothy L.

2008-01-01

Biology is encoded in molecular sequences: deciphering this encoding remains a grand scientific challenge. Functional regions of DNA, RNA, and protein sequences often exhibit characteristic but subtle motifs; thus, computational discovery of motifs in sequences is a fundamental and much-studied problem. However, most current algorithms do not allow for insertions or deletions (indels) within motifs, and the few that do have other limitations. We present a method, GLAM2 (Gapped Local Alignment of Motifs), for discovering motifs allowing indels in a fully general manner, and a companion method GLAM2SCAN for searching sequence databases using such motifs. glam2 is a generalization of the gapless Gibbs sampling algorithm. It re-discovers variable-width protein motifs from the PROSITE database significantly more accurately than the alternative methods PRATT and SAM-T2K. Furthermore, it usefully refines protein motifs from the ELM database: in some cases, the refined motifs make orders of magnitude fewer overpredictions than the original ELM regular expressions. GLAM2 performs respectably on the BAliBASE multiple alignment benchmark, and may be superior to leading multiple alignment methods for “motif-like” alignments with N- and C-terminal extensions. Finally, we demonstrate the use of GLAM2 to discover protein kinase substrate motifs and a gapped DNA motif for the LIM-only transcriptional regulatory complex: using GLAM2SCAN, we identify promising targets for the latter. GLAM2 is especially promising for short protein motifs, and it should improve our ability to identify the protein cleavage sites, interaction sites, post-translational modification attachment sites, etc., that underlie much of biology. It may be equally useful for arbitrarily gapped motifs in DNA and RNA, although fewer examples of such motifs are known at present. GLAM2 is public domain software, available for download at http://bioinformatics.org.au/glam2. PMID:18437229
Structural and Functional Characterization of an Archaeal Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-associated Complex for Antiviral Defense (CASCADE)*

PubMed Central

Lintner, Nathanael G.; Kerou, Melina; Brumfield, Susan K.; Graham, Shirley; Liu, Huanting; Naismith, James H.; Sdano, Matthew; Peng, Nan; She, Qunxin; Copié, Valérie; Young, Mark J.; White, Malcolm F.; Lawrence, C. Martin

2011-01-01

In response to viral infection, many prokaryotes incorporate fragments of virus-derived DNA into loci called clustered regularly interspaced short palindromic repeats (CRISPRs). The loci are then transcribed, and the processed CRISPR transcripts are used to target invading viral DNA and RNA. The Escherichia coli “CRISPR-associated complex for antiviral defense” (CASCADE) is central in targeting invading DNA. Here we report the structural and functional characterization of an archaeal CASCADE (aCASCADE) from Sulfolobus solfataricus. Tagged Csa2 (Cas7) expressed in S. solfataricus co-purifies with Cas5a-, Cas6-, Csa5-, and Cas6-processed CRISPR-RNA (crRNA). Csa2, the dominant protein in aCASCADE, forms a stable complex with Cas5a. Transmission electron microscopy reveals a helical complex of variable length, perhaps due to substoichiometric amounts of other CASCADE components. A recombinant Csa2-Cas5a complex is sufficient to bind crRNA and complementary ssDNA. The structure of Csa2 reveals a crescent-shaped structure unexpectedly composed of a modified RNA-recognition motif and two additional domains present as insertions in the RNA-recognition motif. Conserved residues indicate potential crRNA- and target DNA-binding sites, and the H160A variant shows significantly reduced affinity for crRNA. We propose a general subunit architecture for CASCADE in other bacteria and Archaea. PMID:21507944
Structural and functional characterization of an archaeal clustered regularly interspaced short palindromic repeat (CRISPR)-associated complex for antiviral defense (CASCADE).

PubMed

Lintner, Nathanael G; Kerou, Melina; Brumfield, Susan K; Graham, Shirley; Liu, Huanting; Naismith, James H; Sdano, Matthew; Peng, Nan; She, Qunxin; Copié, Valérie; Young, Mark J; White, Malcolm F; Lawrence, C Martin

2011-06-17

In response to viral infection, many prokaryotes incorporate fragments of virus-derived DNA into loci called clustered regularly interspaced short palindromic repeats (CRISPRs). The loci are then transcribed, and the processed CRISPR transcripts are used to target invading viral DNA and RNA. The Escherichia coli "CRISPR-associated complex for antiviral defense" (CASCADE) is central in targeting invading DNA. Here we report the structural and functional characterization of an archaeal CASCADE (aCASCADE) from Sulfolobus solfataricus. Tagged Csa2 (Cas7) expressed in S. solfataricus co-purifies with Cas5a-, Cas6-, Csa5-, and Cas6-processed CRISPR-RNA (crRNA). Csa2, the dominant protein in aCASCADE, forms a stable complex with Cas5a. Transmission electron microscopy reveals a helical complex of variable length, perhaps due to substoichiometric amounts of other CASCADE components. A recombinant Csa2-Cas5a complex is sufficient to bind crRNA and complementary ssDNA. The structure of Csa2 reveals a crescent-shaped structure unexpectedly composed of a modified RNA-recognition motif and two additional domains present as insertions in the RNA-recognition motif. Conserved residues indicate potential crRNA- and target DNA-binding sites, and the H160A variant shows significantly reduced affinity for crRNA. We propose a general subunit architecture for CASCADE in other bacteria and Archaea.
Supramolecularly engineered perylene bisimide assemblies exhibiting thermal transition from columnar to multilamellar structures.

PubMed

Yagai, Shiki; Usui, Mari; Seki, Tomohiro; Murayama, Haruno; Kikkawa, Yoshihiro; Uemura, Shinobu; Karatsu, Takashi; Kitamura, Akihide; Asano, Atsushi; Seki, Shu

2012-05-09

Perylene 3,4:9,10-tetracarboxylic acid bisimide (PBI) was functionalized with ditopic cyanuric acid to organize it into complex columnar architectures through the formation of hydrogen-bonded supermacrocycles (rosette) by complexing with ditopic melamines possessing solubilizing alkoxyphenyl substituents. The aggregation study in solution using UV-vis and NMR spectroscopies showed the formation of extended aggregates through hydrogen-bonding and π-π stacking interactions. The cylindrical fibrillar nanostructures were visualized by microscopic techniques (AFM, TEM), and the formation of lyotropic mesophase was confirmed by polarized optical microscopy and SEM. X-ray diffraction study revealed that a well-defined hexagonal columnar (Col(h)) structure was formed by solution-casting of fibrillar assemblies. All of these results are consistent with the formation of hydrogen-bonded PBI rosettes that spontaneously organize into the Col(h) structure. Upon heating the Col(h) structure in the bulk state, a structural transition to a highly ordered lamellar (Lam) structure was observed by variable-temperature X-ray diffraction, differential scanning calorimetry, and AFM studies. IR study showed that the rearrangement of the hydrogen-bonding motifs occurs during the structural transition. These results suggest that such a striking structural transition is aided by the reorganization in the lowest level of self-organization, i.e., the rearrangement of hydrogen-bonded motifs from rosette to linear tape. A remarkable increase in the transient photoconductivity was observed by the flash-photolysis time-resolved microwave conductivity (FP-TRMC) measurements upon converting the Col(h) structure to the Lam structure. Transient absorption spectroscopy revealed that electron transfer from electron-donating alkoxyphenyl groups of melamine components to electron-deficient PBI moieties takes place, resulting in a higher probability of charge carrier generation in the Lam structure compared to the Col(h) structure.
Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise

ERIC Educational Resources Information Center

Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San

2018-01-01

Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…
Rules for the recognition of dilysine retrieval motifs by coatomer

PubMed Central

Ma, Wenfu; Goldberg, Jonathan

2013-01-01

Cytoplasmic dilysine motifs on transmembrane proteins are captured by coatomer α-COP and β′-COP subunits and packaged into COPI-coated vesicles for Golgi-to-ER retrieval. Numerous ER/Golgi proteins contain K(x)Kxx motifs, but the rules for their recognition are unclear. We present crystal structures of α-COP and β′-COP bound to a series of naturally occurring retrieval motifs—encompassing KKxx, KxKxx and non-canonical RKxx and viral KxHxx sequences. Binding experiments show that α-COP and β′-COP have generally the same specificity for KKxx and KxKxx, but only β′-COP recognizes the RKxx signal. Dilysine motif recognition involves lysine side-chain interactions with two acidic patches. Surprisingly, however, KKxx and KxKxx motifs bind differently, with their lysine residues transposed at the binding patches. We derive rules for retrieval motif recognition from key structural features: the reversed binding modes, the recognition of the C-terminal carboxylate group which enforces lysine positional context, and the tolerance of the acidic patches for non-lysine residues. PMID:23481256
Evolution subverting essentiality: Dispensability of the cell attachment Arg-Gly-Asp motif in multiply passaged foot-and-mouth disease virus

PubMed Central

Martínez, Miguel A.; Verdaguer, Nuria; Mateu, Mauricio G.; Domingo, Esteban

1997-01-01

Aphthoviruses use a conserved Arg-Gly-Asp triplet for attachment to host cells and this motif is believed to be essential for virus viability. Here we report that this triplet—which is also a widespread motif involved in cell-to-cell adhesion—can become dispensable upon short-term evolution of the virus harboring it. Foot-and-mouth disease virus (FMDV), which was multiply passaged in cell culture, showed an altered repertoire of antigenic variants resistant to a neutralizing monoclonal antibody. The altered repertoire includes variants with substitutions at the Arg-Gly-Asp motif. Mutants lacking this sequence replicated normally in cell culture and were indistinguishable from the parental virus. Studies with individual FMDV clones indicate that amino acid replacements on the capsid surface located around the loop harboring the Arg-Gly-Asp triplet may mediate in the dispensability of this motif. The results show that FMDV quasispecies evolving in a constant biological environment have the capability of rendering totally dispensable a receptor recognition motif previously invariant, and to ensure an alternative pathway for normal viral replication. Thus, variability of highly conserved motifs, even those that viruses have adapted from functional cellular motifs, can contribute to phenotypic flexibility of RNA viruses in nature. PMID:9192645
Nucleic Acid Database (NDB)

Science.gov Websites

the NDB archive or in the Non-Redundant list Advanced Search Search for structures based on structural features, chemical features, binding modes, citation and experimental information Featured Tools RNA 3D Motif Atlas, a representative collection of RNA 3D internal and hairpin loop motifs Non-redundant Lists
Gonadotropin-Releasing Hormone (GnRH) Receptor Structure and GnRH Binding

PubMed Central

Flanagan, Colleen A.; Manilall, Ashmeetha

2017-01-01

Gonadotropin-releasing hormone (GnRH) regulates reproduction. The human GnRH receptor lacks a cytoplasmic carboxy-terminal tail but has amino acid sequence motifs characteristic of rhodopsin-like, class A, G protein-coupled receptors (GPCRs). This review will consider how recent descriptions of X-ray crystallographic structures of GPCRs in inactive and active conformations may contribute to understanding GnRH receptor structure, mechanism of activation and ligand binding. The structures confirmed that ligands bind to variable extracellular surfaces, whereas the seven membrane-spanning α-helices convey the activation signal to the cytoplasmic receptor surface, which binds and activates heterotrimeric G proteins. Forty non-covalent interactions that bridge topologically equivalent residues in different transmembrane (TM) helices are conserved in class A GPCR structures, regardless of activation state. Conformation-independent interhelical contacts account for a conserved receptor protein structure and their importance in the GnRH receptor structure is supported by decreased expression of receptors with mutations of residues in the network. Many of the GnRH receptor mutations associated with congenital hypogonadotropic hypogonadism, including the Glu2.53(90) Lys mutation, involve amino acids that constitute the conserved network. Half of the ~250 intramolecular interactions in GPCRs differ between inactive and active structures. Conformation-specific interhelical contacts depend on amino acids changing partners during activation. Conserved inactive conformation-specific contacts prevent receptor activation by stabilizing proximity of TM helices 3 and 6 and a closed G protein-binding site. Mutations of GnRH receptor residues involved in these interactions, such as Arg3.50(139) of the DRY/S motif or Tyr7.53(323) of the N/DPxxY motif, increase or decrease receptor expression and efficiency of receptor coupling to G protein signaling, consistent with the native residues stabilizing the inactive GnRH receptor structure. Active conformation-specific interhelical contacts stabilize an open G protein-binding site. Progress in defining the GnRH-binding site has recently slowed, with evidence that Tyr6.58(290) contacts Tyr5 of GnRH, whereas other residues affect recognition of Trp3 and Gly10NH2. The surprisingly consistent observations that GnRH receptor mutations that disrupt GnRH binding have less effect on “conformationally constrained” GnRH peptides may now be explained by crystal structures of agonist-bound peptide receptors. Analysis of GPCR structures provides insight into GnRH receptor function. PMID:29123501

Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks.

PubMed

Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario

2018-03-01

Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli . Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution.
Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks

PubMed Central

Grierson, Claire S.

2018-01-01

Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli. Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution. PMID:29670941
Assessing Local Structure Motifs Using Order Parameters for Motif Recognition, Interstitial Identification, and Diffusion Path Characterization

DOE PAGES

Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; ...

2017-11-13

Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify thesemore » basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO 2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.« less
Assessing Local Structure Motifs Using Order Parameters for Motif Recognition, Interstitial Identification, and Diffusion Path Characterization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav

Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify thesemore » basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO 2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.« less
Unusual conformation of the SxN motif in the crystal structure of penicillin-binding protein A from Mycobacterium tuberculosis.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fedarovich, Alena; Nicholas, Robert A.; Davies, Christopher

PBPA from Mycobacterium tuberculosis is a class B-like penicillin-binding protein (PBP) that is not essential for cell growth in M. tuberculosis, but is important for proper cell division in Mycobacterium smegmatis. We have determined the crystal structure of PBPA at 2.05 {angstrom} resolution, the first published structure of a PBP from this important pathogen. Compared to other PBPs, PBPA has a relatively small N-terminal domain, and conservation of a cluster of charged residues within this domain suggests that PBPA is more related to class B PBPs than previously inferred from sequence analysis. The C-terminal domain is a typical transpeptidase foldmore » and contains the three conserved active-site motifs characterisitic of penicillin-interacting enzymes. While the arrangement of the SxxK and KTG motifs is similar to that observed in other PBPs, the SxN motif is markedly displaced away from the active site, such that its serine (Ser281) is not involved in hydrogen bonding with residues of the other two motifs. A disulfide bridge between Cys282 (the 'x' of the SxN motif) and Cys266, which resides on an adjacent loop, may be responsible for this unusual conformation. Another interesting feature of the structure is a relatively long connection between {beta}5 and {alpha}11, which restricts the space available in the active site of PBPA and suggests that conformational changes would be required to accommodate peptide substrate or {beta}-lactam antibiotics during acylation. Finally, the structure shows that one of the two threonines postulated to be targets for phosphorylation is inaccessible (Thr362), whereas the other (Thr437) is well placed on a surface loop near the active site.« less
Simultaneously learning DNA motif along with its position and sequence rank preferences through expectation maximization algorithm.

PubMed

Zhang, ZhiZhuo; Chang, Cheng Wei; Hugo, Willy; Cheung, Edwin; Sung, Wing-Kin

2013-03-01

Although de novo motifs can be discovered through mining over-represented sequence patterns, this approach misses some real motifs and generates many false positives. To improve accuracy, one solution is to consider some additional binding features (i.e., position preference and sequence rank preference). This information is usually required from the user. This article presents a de novo motif discovery algorithm called SEME (sampling with expectation maximization for motif elicitation), which uses pure probabilistic mixture model to model the motif's binding features and uses expectation maximization (EM) algorithms to simultaneously learn the sequence motif, position, and sequence rank preferences without asking for any prior knowledge from the user. SEME is both efficient and accurate thanks to two important techniques: the variable motif length extension and importance sampling. Using 75 large-scale synthetic datasets, 32 metazoan compendium benchmark datasets, and 164 chromatin immunoprecipitation sequencing (ChIP-Seq) libraries, we demonstrated the superior performance of SEME over existing programs in finding transcription factor (TF) binding sites. SEME is further applied to a more difficult problem of finding the co-regulated TF (coTF) motifs in 15 ChIP-Seq libraries. It identified significantly more correct coTF motifs and, at the same time, predicted coTF motifs with better matching to the known motifs. Finally, we show that the learned position and sequence rank preferences of each coTF reveals potential interaction mechanisms between the primary TF and the coTF within these sites. Some of these findings were further validated by the ChIP-Seq experiments of the coTFs. The application is available online.
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.

PubMed

Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique

2015-06-01

Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.
Control of complex networks requires both structure and dynamics

NASA Astrophysics Data System (ADS)

Gates, Alexander J.; Rocha, Luis M.

2016-04-01

The study of network structure has uncovered signatures of the organization of complex systems. However, there is also a need to understand how to control them; for example, identifying strategies to revert a diseased cell to a healthy state, or a mature cell to a pluripotent state. Two recent methodologies suggest that the controllability of complex systems can be predicted solely from the graph of interactions between variables, without considering their dynamics: structural controllability and minimum dominating sets. We demonstrate that such structure-only methods fail to characterize controllability when dynamics are introduced. We study Boolean network ensembles of network motifs as well as three models of biochemical regulation: the segment polarity network in Drosophila melanogaster, the cell cycle of budding yeast Saccharomyces cerevisiae, and the floral organ arrangement in Arabidopsis thaliana. We demonstrate that structure-only methods both undershoot and overshoot the number and which sets of critical variables best control the dynamics of these models, highlighting the importance of the actual system dynamics in determining control. Our analysis further shows that the logic of automata transition functions, namely how canalizing they are, plays an important role in the extent to which structure predicts dynamics.
SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.

PubMed

Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael

2018-05-25

Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.
Revisiting the structure/function relationships of H/ACA(-like) RNAs: a unified model for Euryarchaea and Crenarchaea

PubMed Central

Toffano-Nioche, Claire; Gautheret, Daniel; Leclerc, Fabrice

2015-01-01

A structural and functional classification of H/ACA and H/ACA-like motifs is obtained from the analysis of the H/ACA guide RNAs which have been identified previously in the genomes of Euryarchaea (Pyrococcus) and Crenarchaea (Pyrobaculum). A unified structure/function model is proposed based on the common structural determinants shared by H/ACA and H/ACA-like motifs in both Euryarchaea and Crenarchaea. Using a computational approach, structural and energetic rules for the guide:target RNA-RNA interactions are derived from structural and functional data on the H/ACA RNP particles. H/ACA(-like) motifs found in Pyrococcus are evaluated through the classification and their biological relevance is discussed. Extra-ribosomal targets found in both Pyrococcus and Pyrobaculum might support the hypothesis of a gene regulation mediated by H/ACA(-like) guide RNAs in archaea. PMID:26240384
Top surface blade residues and the central channel water molecules are conserved in every repeat of the integrin-like β-propeller structures.

PubMed

Denesyuk, Alexander; Denessiouk, Konstantin; Johnson, Mark S

2018-02-01

An integrin-like β-propeller domain contains seven repeats of a four-stranded antiparallel β-sheet motif (blades). Previously we described a 3D structural motif within each blade of the integrin-type β-propeller. Here, we show unique structural links that join different blades of the β-propeller structure, which together with the structural motif for a single blade are repeated in a β-propeller to provide the functional top face of the barrel, found to be involved in protein-protein interactions and substrate recognition. We compare functional top face diagrams of the integrin-type β-propeller domain and two non-integrin type β-propeller domains of virginiamycin B lyase and WD Repeat-Containing Protein 5. Copyright © 2017 Elsevier Inc. All rights reserved.
Multiple Binding Modes between HNF4[alpha] and the LXXLL Motifs of PGC-1[alpha] Lead to Full Activation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rha, Geun Bae; Wu, Guangteng; Shoelson, Steven E.

2010-04-15

Hepatocyte nuclear factor 4{alpha} (HNF4{alpha}) is a novel nuclear receptor that participates in a hierarchical network of transcription factors regulating the development and physiology of such vital organs as the liver, pancreas, and kidney. Among the various transcriptional coregulators with which HNF4{alpha} interacts, peroxisome proliferation-activated receptor {gamma} (PPAR{gamma}) coactivator 1{alpha} (PGC-1{alpha}) represents a novel coactivator whose activation is unusually robust and whose binding mode appears to be distinct from that of canonical coactivators such as NCoA/SRC/p160 family members. To elucidate the potentially unique molecular mechanism of PGC-1{alpha} recruitment, we have determined the crystal structure of HNF4{alpha} in complex with amore » fragment of PGC-1{alpha} containing all three of its LXXLL motifs. Despite the presence of all three LXXLL motifs available for interactions, only one is bound at the canonical binding site, with no additional contacts observed between the two proteins. However, a close inspection of the electron density map indicates that the bound LXXLL motif is not a selected one but an averaged structure of more than one LXXLL motif. Further biochemical and functional studies show that the individual LXXLL motifs can bind but drive only minimal transactivation. Only when more than one LXXLL motif is involved can significant transcriptional activity be measured, and full activation requires all three LXXLL motifs. These findings led us to propose a model wherein each LXXLL motif has an additive effect, and the multiple binding modes by HNF4{alpha} toward the LXXLL motifs of PGC-1{alpha} could account for the apparent robust activation by providing a flexible mechanism for combinatorial recruitment of additional coactivators and mediators.« less
Analysis of the Isolated SecA DEAD Motor Suggests a Mechanism for Chemical-Mechanical Coupling

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nithianantham, Stanley; Shilton, Brian H

The preprotein cross-linking domain and C-terminal domains of Escherichia coli SecA were removed to create a minimal DEAD motor, SecA-DM. SecA-DM hydrolyzes ATP and has the same affinity for ADP as full-length SecA. The crystal structure of SecA-DM in complex with ADP was solved and shows the DEAD motor in a closed conformation. Comparison with the structure of the E. coli DEAD motor in an open conformation (Protein Data Bank ID 2FSI) indicates main-chain conformational changes in two critical sequences corresponding to Motif III and Motif V of the DEAD helicase family. The structures that the Motif III and Motifmore » V sequences adopt in the DEAD motor open conformation are incompatible with the closed conformation. Therefore, when the DEAD motor makes the transition from open to closed, Motif III and Motif V are forced to change their conformations, which likely functions to regulate passage through the transition state for ATP hydrolysis. The transition state for ATP hydrolysis for the SecA DEAD motor was modeled based on the conformation of the Vasa helicase in complex with adenylyl imidodiphosphate and RNA (Protein Data Bank ID 2DB3). A mechanism for chemical-mechanical coupling emerges, where passage through the transition state for ATP hydrolysis is hindered by the conformational changes required in Motif III and Motif V, and may be promoted by binding interactions with the preprotein substrate and/or other translocase domains and subunits.« less
Analysis of the Isolated SecA DEAD Motor Suggests a Mechanism for Chemical-Mechanical Coupling

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nithianantham, Stanley; Shilton, Brian H

2011-09-28

The preprotein cross-linking domain and C-terminal domains of Escherichia coli SecA were removed to create a minimal DEAD motor, SecA-DM. SecA-DM hydrolyzes ATP and has the same affinity for ADP as full-length SecA. The crystal structure of SecA-DM in complex with ADP was solved and shows the DEAD motor in a closed conformation. Comparison with the structure of the E. coli DEAD motor in an open conformation (Protein Data Bank ID 2FSI) indicates main-chain conformational changes in two critical sequences corresponding to Motif III and Motif V of the DEAD helicase family. The structures that the Motif III and Motifmore » V sequences adopt in the DEAD motor open conformation are incompatible with the closed conformation. Therefore, when the DEAD motor makes the transition from open to closed, Motif III and Motif V are forced to change their conformations, which likely functions to regulate passage through the transition state for ATP hydrolysis. The transition state for ATP hydrolysis for the SecA DEAD motor was modeled based on the conformation of the Vasa helicase in complex with adenylyl imidodiphosphate and RNA (Protein Data Bank ID 2DB3). A mechanism for chemical-mechanical coupling emerges, where passage through the transition state for ATP hydrolysis is hindered by the conformational changes required in Motif III and Motif V, and may be promoted by binding interactions with the preprotein substrate and/or other translocase domains and subunits.« less
Building a stable RNA U-turn with a protonated cytidine

PubMed Central

Gottstein-Schmidtke, Sina R.; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E.; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens

2014-01-01

The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5′-UNR-3′ (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3′ phosphate group of the R residue as well as a hydrogen bond between the 2′-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3′ from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. PMID:24951555
Combinatorics of feedback in cellular uptake and metabolism of small molecules.

PubMed

Krishna, Sandeep; Semsey, Szabolcs; Sneppen, Kim

2007-12-26

We analyze the connection between structure and function for regulatory motifs associated with cellular uptake and usage of small molecules. Based on the boolean logic of the feedback we suggest four classes: the socialist, consumer, fashion, and collector motifs. We find that the socialist motif is good for homeostasis of a useful but potentially poisonous molecule, whereas the consumer motif is optimal for nutrition molecules. Accordingly, examples of these motifs are found in, respectively, the iron homeostasis system in various organisms and in the uptake of sugar molecules in bacteria. The remaining two motifs have no obvious analogs in small molecule regulation, but we illustrate their behavior using analogies to fashion and obesity. These extreme motifs could inspire construction of synthetic systems that exhibit bistable, history-dependent states, and homeostasis of flux (rather than concentration).
Protein–DNA Interactions: The Story so Far and a New Method for Prediction

DOE PAGES

Jones, Susan; Thornton, Janet M.

2003-01-01

This review describes methods for the prediction of DNA binding function, and specifically summarizes a new method using 3D structural templates. The new method features the HTH motif that is found in approximately one-third of DNAbinding protein families. A library of 3D structural templates of HTH motifs was derived from proteins in the PDB. Templates were scanned against complete protein structures and the optimal superposition of a template on a structure calculated. Significance thresholds in terms of a minimum root mean squared deviation (rmsd) of an optimal superposition, and a minimum motif accessible surface area (ASA), have been calculated. Inmore » this way, it is possible to scan the template library against proteins of unknown function to make predictions about DNA-binding functionality.« less
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures

DOE PAGES

Stewart, Jaimie Marie; Subramanian, Hari K. K.; Franco, Elisa

2017-02-16

Rational design of nucleic acidmolecules yields selfassembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. We demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 m in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowlymore » annealed, and a one-pot transcription and anneal procedure. We then identify the tile nick position as a structural requirement for lattice formation. These results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components.« less
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stewart, Jaimie Marie; Subramanian, Hari K. K.; Franco, Elisa

Rational design of nucleic acidmolecules yields selfassembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. We demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 m in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowlymore » annealed, and a one-pot transcription and anneal procedure. We then identify the tile nick position as a structural requirement for lattice formation. These results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components.« less
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures

PubMed Central

Stewart, Jaimie Marie; Subramanian, Hari K. K.

2017-01-01

Abstract Rational design of nucleic acid molecules yields self-assembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. Here we demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 μm in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowly annealed, and a one-pot transcription and anneal procedure. We identify the tile nick position as a structural requirement for lattice formation. Our results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components. PMID:28204562

The Transcriptional Complex Between the BCL2 i-Motif and hnRNP LL Is a Molecular Switch for Control of Gene Expression That Can Be Modulated by Small Molecules

PubMed Central

2015-01-01

In a companion paper (DOI: 10.021/ja410934b) we demonstrate that the C-rich strand of the cis-regulatory element in the BCL2 promoter element is highly dynamic in nature and can form either an i-motif or a flexible hairpin. Under physiological conditions these two secondary DNA structures are found in an equilibrium mixture, which can be shifted by the addition of small molecules that trap out either the i-motif (IMC-48) or the flexible hairpin (IMC-76). In cellular experiments we demonstrate that the addition of these molecules has opposite effects on BCL2 gene expression and furthermore that these effects are antagonistic. In this contribution we have identified a transcriptional factor that recognizes and binds to the BCL2 i-motif to activate transcription. The molecular basis for the recognition of the i-motif by hnRNP LL is determined, and we demonstrate that the protein unfolds the i-motif structure to form a stable single-stranded complex. In subsequent experiments we show that IMC-48 and IMC-76 have opposite, antagonistic effects on the formation of the hnRNP LL–i-motif complex as well as on the transcription factor occupancy at the BCL2 promoter. For the first time we propose that the i-motif acts as a molecular switch that controls gene expression and that small molecules that target the dynamic equilibrium of the i-motif and the flexible hairpin can differentially modulate gene expression. PMID:24559432
Structural complexity of Dengue virus untranslated regions: cis-acting RNA motifs and pseudoknot interactions modulating functionality of the viral genome

PubMed Central

Sztuba-Solinska, Joanna; Teramoto, Tadahisa; Rausch, Jason W.; Shapiro, Bruce A.; Padmanabhan, Radhakrishnan; Le Grice, Stuart F. J.

2013-01-01

The Dengue virus (DENV) genome contains multiple cis-acting elements required for translation and replication. Previous studies indicated that a 719-nt subgenomic minigenome (DENV-MINI) is an efficient template for translation and (−) strand RNA synthesis in vitro. We performed a detailed structural analysis of DENV-MINI RNA, combining chemical acylation techniques, Pb2+ ion-induced hydrolysis and site-directed mutagenesis. Our results highlight protein-independent 5′–3′ terminal interactions involving hybridization between recognized cis-acting motifs. Probing analyses identified tandem dumbbell structures (DBs) within the 3′ terminus spaced by single-stranded regions, internal loops and hairpins with embedded GNRA-like motifs. Analysis of conserved motifs and top loops (TLs) of these dumbbells, and their proposed interactions with downstream pseudoknot (PK) regions, predicted an H-type pseudoknot involving TL1 of the 5′ DB and the complementary region, PK2. As disrupting the TL1/PK2 interaction, via ‘flipping’ mutations of PK2, previously attenuated DENV replication, this pseudoknot may participate in regulation of RNA synthesis. Computer modeling implied that this motif might function as autonomous structural/regulatory element. In addition, our studies targeting elements of the 3′ DB and its complementary region PK1 indicated that communication between 5′–3′ terminal regions strongly depends on structure and sequence composition of the 5′ cyclization region. PMID:23531545
Hairpin structures with conserved sequence motifs determine the 3' ends of non-polyadenylated invertebrate iridovirus transcripts.

PubMed

İnce, İkbal Agah; Pijlman, Gorben P; Vlak, Just M; van Oers, Monique M

2017-11-01

Previously, we observed that the transcripts of Invertebrate iridescent virus 6 (IIV6) are not polyadenylated, in line with the absence of canonical poly(A) motifs (AATAAA) downstream of the open reading frames (ORFs) in the genome. Here, we determined the 3' ends of the transcripts of fifty-four IIV6 virion protein genes in infected Drosophila Schneider 2 (S2) cells. By using ligation-based amplification of cDNA ends (LACE) it was shown that the IIV6 mRNAs often ended with a CAUUA motif. In silico analysis showed that the 3'-untranslated regions of IIV6 genes have the ability to form hairpin structures (22-56 nt in length) and that for about half of all IIV6 genes these 3' sequences contained complementary TAATG and CATTA motifs. We also show that a hairpin in the 3' flanking region with conserved sequence motifs is a conserved feature in invertebrate-infecting iridoviruses (genus Iridovirus and Chloriridovirus). Copyright © 2017 Elsevier Inc. All rights reserved.
Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA.

PubMed

Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W

2016-02-02

The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.
Identity and functions of CxxC-derived motifs.

PubMed

Fomenko, Dmitri E; Gladyshev, Vadim N

2003-09-30

Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.
Structures of minimal catalytic fragments of topoisomerase V reveals conformational changes relevant for DNA binding

PubMed Central

Rajan, Rakhi; Taneja, Bhupesh; Mondragón, Alfonso

2010-01-01

Summary Topoisomerase V is an archaeal type I topoisomerase that is unique among topoisomerases due to presence of both topoisomerase and DNA repair activities in the same protein. It is organized as an N-terminal topoisomerase domain followed by 24 tandem helix hairpin helix (HhH) motifs. Structural studies have shown that the active site is buried by the (HhH) motifs. Here we show that the N-terminal domain can relax DNA in the absence of any HhH motifs and that the HhH motifs are required for stable protein-DNA complex formation. Crystal structures of various topoisomerase V fragments show changes in the relative orientation of the domains mediated by a long bent linker helix, and these movements are essential for the DNA to enter the active site. Phosphate ions bound to the protein near the active site helped model DNA in the topoisomerase domain and shows how topoisomerase V may interact with DNA. PMID:20637419
Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes.

PubMed

Cer, Regina Z; Bruce, Kevin H; Mudunuri, Uma S; Yi, Ming; Volfovsky, Natalia; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M

2011-01-01

Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov.
Structural Integrity of the Greek Key Motif in βγ-Crystallins Is Vital for Central Eye Lens Transparency

PubMed Central

Vendra, Venkata Pulla Rao; Agarwal, Garima; Chandani, Sushil; Talla, Venu; Srinivasan, Narayanaswamy; Balasubramanian, Dorairajan

2013-01-01

Background We highlight an unrecognized physiological role for the Greek key motif, an evolutionarily conserved super-secondary structural topology of the βγ-crystallins. These proteins constitute the bulk of the human eye lens, packed at very high concentrations in a compact, globular, short-range order, generating transparency. Congenital cataract (affecting 400,000 newborns yearly worldwide), associated with 54 mutations in βγ-crystallins, occurs in two major phenotypes nuclear cataract, which blocks the central visual axis, hampering the development of the growing eye and demanding earliest intervention, and the milder peripheral progressive cataract where surgery can wait. In order to understand this phenotypic dichotomy at the molecular level, we have studied the structural and aggregation features of representative mutations. Methods Wild type and several representative mutant proteins were cloned, expressed and purified and their secondary and tertiary structural details, as well as structural stability, were compared in solution, using spectroscopy. Their tendencies to aggregate in vitro and in cellulo were also compared. In addition, we analyzed their structural differences by molecular modeling in silico. Results Based on their properties, mutants are seen to fall into two classes. Mutants A36P, L45PL54P, R140X, and G165fs display lowered solubility and structural stability, expose several buried residues to the surface, aggregate in vitro and in cellulo, and disturb/distort the Greek key motif. And they are associated with nuclear cataract. In contrast, mutants P24T and R77S, associated with peripheral cataract, behave quite similar to the wild type molecule, and do not affect the Greek key topology. Conclusion When a mutation distorts even one of the four Greek key motifs, the protein readily self-aggregates and precipitates, consistent with the phenotype of nuclear cataract, while mutations not affecting the motif display ‘native state aggregation’, leading to peripheral cataract, thus offering a protein structural rationale for the cataract phenotypic dichotomy “distort motif, lose central vision”. PMID:23936409
A study of pH-dependence of shrink and stretch of tetrahedral DNA nanostructures.

PubMed

Wang, Ping; Xia, Zhiwei; Yan, Juan; Liu, Xunwei; Yao, Guangbao; Pei, Hao; Zuo, Xiaolei; Sun, Gang; He, Dannong

2015-04-21

We monitored the shrink and stretch of the tetrahedral DNA nanostructure (TDN) and the i-motif connected TDN structure at pH 8.5 and pH 4.5, and we found that not only the i-motif can change its structure when the pH changes, but also the TDN and the DNA double helix change their structures when the pH changes.
Novel functions of CCM1 delimit the relationship of PTB/PH domains.

PubMed

Zhang, Jun; Dubey, Pallavi; Padarti, Akhil; Zhang, Aileen; Patel, Rinkal; Patel, Vipulkumar; Cistola, David; Badr, Ahmed

2017-10-01

Three NPXY motifs and one FERM domain in CCM1 makes it a versatile scaffold protein for tethering the signaling components together within the CCM signaling complex (CSC). The cellular role of CCM1 protein remains inadequately expounded. Both phosphotyrosine binding (PTB) and pleckstrin homology (PH) domains were recognized as structurally related but functionally distinct domains. By utilizing molecular cloning, protein binding assays and RT-qPCR to identify novel cellular partners of CCM1 and its cellular expression patterns; by screening candidate PTB/PH proteins and subsequently structurally simulation in combining with current X-ray crystallography and NMR data to defined the essential structure of PTB/PH domain for NPXY-binding and the relationship among PTB, PH and FERM domain(s). We identified a group of 28 novel cellular partners of CCM1, all of which contain either PTB or PH domain(s), and developed a novel classification system for these PTB/PH proteins based on their relationship with different NPXY motifs of CCM1. Our results demonstrated that CCM1 has a wide spectrum of binding to different PTB/PH proteins and perpetuates their specificity to interact with certain PTB/PH domains through selective combination of three NPXY motifs. We also demonstrated that CCM1 can be assembled into oligomers through intermolecular interaction between its F3 lobe in FERM domain and one of the three NPXY motifs. Despite being embedded in FERM domain as F3 lobe, F3 module acts as a fully functional PH domain to interact with NPXY motif. The most salient feature of the study was that both PTB and PH domains are structurally and functionally comparable, suggesting that PTB domain is likely evolved from PH domain with polymorphic structural additions at its N-terminus. A new β1A-strand of the PTB domain was discovered and new minimum structural requirement of PTB/PH domain for NPXY motif-binding was determined. Based on our data, a novel theory of structure, function and relationship of PTB, PH and FERM domains has been proposed, which extends the importance of the NPXY-PTB/PH interaction on the CSC signaling and/or other cell receptors with great potential pointing to new therapeutic strategies. The study provides new insight into the structural characteristics of PTB/PH domains, essential structural elements of PTB/PH domain required for NPXY motif-binding, and function and relationship among PTB, PH and FERM domains. Copyright © 2017 Elsevier B.V. All rights reserved.
The Crystal Structure of a Cardiovirus RNA-Dependent RNA Polymerase Reveals an Unusual Conformation of the Polymerase Active Site

PubMed Central

Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J. M.

2014-01-01

ABSTRACT Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. IMPORTANCE The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses. PMID:24600002
The crystal structure of a cardiovirus RNA-dependent RNA polymerase reveals an unusual conformation of the polymerase active site.

PubMed

Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J M; Ferrer-Orta, Cristina; Verdaguer, Núria

2014-05-01

Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses.
Self-Organization of Microcircuits in Networks of Spiking Neurons with Plastic Synapses.

PubMed

Ocker, Gabriel Koch; Litwin-Kumar, Ashok; Doiron, Brent

2015-08-01

The synaptic connectivity of cortical networks features an overrepresentation of certain wiring motifs compared to simple random-network models. This structure is shaped, in part, by synaptic plasticity that promotes or suppresses connections between neurons depending on their joint spiking activity. Frequently, theoretical studies focus on how feedforward inputs drive plasticity to create this network structure. We study the complementary scenario of self-organized structure in a recurrent network, with spike timing-dependent plasticity driven by spontaneous dynamics. We develop a self-consistent theory for the evolution of network structure by combining fast spiking covariance with a slow evolution of synaptic weights. Through a finite-size expansion of network dynamics we obtain a low-dimensional set of nonlinear differential equations for the evolution of two-synapse connectivity motifs. With this theory in hand, we explore how the form of the plasticity rule drives the evolution of microcircuits in cortical networks. When potentiation and depression are in approximate balance, synaptic dynamics depend on weighted divergent, convergent, and chain motifs. For additive, Hebbian STDP these motif interactions create instabilities in synaptic dynamics that either promote or suppress the initial network structure. Our work provides a consistent theoretical framework for studying how spiking activity in recurrent networks interacts with synaptic plasticity to determine network structure.
Self-Organization of Microcircuits in Networks of Spiking Neurons with Plastic Synapses

PubMed Central

Ocker, Gabriel Koch; Litwin-Kumar, Ashok; Doiron, Brent

2015-01-01

The synaptic connectivity of cortical networks features an overrepresentation of certain wiring motifs compared to simple random-network models. This structure is shaped, in part, by synaptic plasticity that promotes or suppresses connections between neurons depending on their joint spiking activity. Frequently, theoretical studies focus on how feedforward inputs drive plasticity to create this network structure. We study the complementary scenario of self-organized structure in a recurrent network, with spike timing-dependent plasticity driven by spontaneous dynamics. We develop a self-consistent theory for the evolution of network structure by combining fast spiking covariance with a slow evolution of synaptic weights. Through a finite-size expansion of network dynamics we obtain a low-dimensional set of nonlinear differential equations for the evolution of two-synapse connectivity motifs. With this theory in hand, we explore how the form of the plasticity rule drives the evolution of microcircuits in cortical networks. When potentiation and depression are in approximate balance, synaptic dynamics depend on weighted divergent, convergent, and chain motifs. For additive, Hebbian STDP these motif interactions create instabilities in synaptic dynamics that either promote or suppress the initial network structure. Our work provides a consistent theoretical framework for studying how spiking activity in recurrent networks interacts with synaptic plasticity to determine network structure. PMID:26291697
The Leu-Arg-Glu (LRE) adhesion motif in proteins of the neuromuscular junction with special reference to proteins of the carboxylesterase/cholinesterase family.

PubMed

Johnson, Glynis; Moore, Samuel W

2013-09-01

Short linear motifs confer evolutionary flexibility on proteins as they can be added with relative ease allowing the acquisition of new functions. Such motifs may mediate a variety of signalling functions. The adhesion-mediating Leu-Arg-Glu (LRE) motif is enriched in laminin beta 2, and has been observed in other proteins, including members of the carboxylesterase/cholinesterase family. It acts as a stop signal for growing axons in the developing neuromuscular junction, binding to the voltage-gated calcium channel. In this bioinformatic analysis, we have investigated the presence of the motif in proteins of the neuromuscular junction, and have also examined its structural position and potential for ligand interaction, as well as phylogenetic conservation, in the carboxylesterase/cholinesterase family. The motif was observed to occur with a significantly higher frequency than expected in the UniProt/Swiss-Prot database, as well as in four individual species (human, mouse, Caenorhabditis elegans and Drosophila melanogaster). Examination of its presence in neuromuscular junction proteins showed it to be enriched in certain proteins of the synaptic basement membrane, including laminin, agrin, acetylcholinesterase and tenascin. A highly significant enrichment was observed in cytoskeletal proteins, particularly intermediate filament proteins and members of the spectrin family. In the carboxylesterase/cholinesterase family, the motif was observed in four conserved positions in the protein structure. It is present in the majority of mammalian acetylcholinesterases, as well as acetylcholinesterases from electric fish and a number of invertebrates. In insects, it is present in the ace-2, rather than in the synaptic ace-1, enzyme. It is also observed in the cholinesterase-like adhesion molecules (neuroligins, neurotactin and glutactin). It is never seen in butyrylcholinesterases, which do not mediate cell adhesion. In conclusion, the significant enrichment of the motif in certain classes of protein, as well as its conserved presence and structural positioning in one protein family, suggests that it has specific functions both in cell adhesion in the neuromuscular junction and in maintaining the structural integrity of the cytoskeleton. Copyright © 2013 Elsevier Inc. All rights reserved.
Slipped-strand mispairing at noncontiguous repeats in Poecilia reticulata: a model for minisatellite birth.

PubMed Central

Taylor, J S; Breden, F

2000-01-01

The standard slipped-strand mispairing (SSM) model for the formation of variable number tandem repeats (VNTRs) proposes that a few tandem repeats, produced by chance mutations, provide the "raw material" for VNTR expansion. However, this model is unlikely to explain the formation of VNTRs with long motifs (e.g., minisatellites), because the likelihood of a tandem repeat forming by chance decreases rapidly as the length of the repeat motif increases. Phylogenetic reconstruction of the birth of a mitochondrial (mt) DNA minisatellite in guppies suggests that VNTRs with long motifs can form as a consequence of SSM at noncontiguous repeats. VNTRs formed in this manner have motifs longer than the noncontiguous repeat originally formed by chance and are flanked by one unit of the original, noncontiguous repeat. SSM at noncontiguous repeats can therefore explain the birth of VNTRs with long motifs and the "imperfect" or "short direct" repeats frequently observed adjacent to both mtDNA and nuclear VNTRs. PMID:10880490
Structure of a putative acetyltransferase (PA1377) from Pseudomonas aeruginosa

DOE Office of Scientific and Technical Information (OSTI.GOV)

Davies, Anna M.; Tata, Renée; Chauviac, François-Xavier

2008-05-01

The crystal structure of an acetyltransferase encoded by the gene PA1377 from Pseudomonas aeruginosa has been determined at 2.25 Å resolution. Comparison with a related acetyltransferase revealed a structural difference in the active site that was taken to reflect a difference in substrate binding and/or specificity between the two enzymes. Gene PA1377 from Pseudomonas aeruginosa encodes a 177-amino-acid conserved hypothetical protein of unknown function. The structure of this protein (termed pitax) has been solved in space group I222 to 2.25 Å resolution. Pitax belongs to the GCN5-related N-acetyltransferase family and contains all four sequence motifs conserved among family members. Themore » β-strand structure in one of these motifs (motif A) is disrupted, which is believed to affect binding of the substrate that accepts the acetyl group from acetyl-CoA.« less
Crystal structure of yeast allantoicase reveals a repeated jelly roll motif.

PubMed

Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Sorel, Isabelle; Graille, Marc; Meyer, Philippe; Liger, Dominique; Blondeau, Karine; Janin, Joël; van Tilbeurgh, Herman

2004-05-28

Allantoicase (EC 3.5.3.4) catalyzes the conversion of allantoate into ureidoglycolate and urea, one of the final steps in the degradation of purines to urea. The mechanism of most enzymes involved in this pathway, which has been known for a long time, is unknown. In this paper we describe the three-dimensional crystal structure of the yeast allantoicase determined at a resolution of 2.6 A by single anomalous diffraction. This constitutes the first structure for an enzyme of this pathway. The structure reveals a repeated jelly roll beta-sheet motif, also present in proteins of unrelated biochemical function. Allantoicase has a hexameric arrangement in the crystal (dimer of trimers). Analysis of the protein sequence against the structural data reveals the presence of two totally conserved surface patches, one on each jelly roll motif. The hexameric packing concentrates these patches into conserved pockets that probably constitute the active site.
Weak interactions involving organic fluorine: analysis of structural motifs in Flunazirine and Haloperidol

NASA Astrophysics Data System (ADS)

Prasanna, M. D.; Row, T. N. Guru

2001-05-01

The crystal structure of Flunazirine, an anticonvulsant drug, is analyzed in terms of intermolecular interactions involving fluorine. The structure displays motifs formed by only weak interactions C-H⋯F and C-H⋯π. The motifs thus generated show cavities, which could serve as hosts for complexation. The structure of Flunazirine displays cavities formed by C-H⋯F and C-H⋯π interactions. Haloperidol, an antipsychotic drug, shows F⋯F interactions in the crystalline lattice in lieu of Cl⋯Cl interactions. However, strong O-H⋯N interactions dominate packing. The salient features of the two structures in terms of intermolecular interactions reveal, even though organic fluorine has lower tendency to engage in hydrogen bonding and F⋯F interactions, these interactions could play a significant role in the design of molecular assemblies via crystal engineering.
High-Pressure NMR and SAXS Reveals How Capping Modulates Folding Cooperativity of the pp32 Leucine-rich Repeat Protein.

PubMed

Zhang, Yi; Berghaus, Melanie; Klein, Sean; Jenkins, Kelly; Zhang, Siwen; McCallum, Scott A; Morgan, Joel E; Winter, Roland; Barrick, Doug; Royer, Catherine A

2018-04-27

Many repeat proteins contain capping motifs, which serve to shield the hydrophobic core from solvent and maintain structural integrity. While the role of capping motifs in enhancing the stability and structural integrity of repeat proteins is well documented, their contribution to folding cooperativity is not. Here we examined the role of capping motifs in defining the folding cooperativity of the leucine-rich repeat protein, pp32, by monitoring the pressure- and urea-induced unfolding of an N-terminal capping motif (N-cap) deletion mutant, pp32-∆N-cap, and a C-terminal capping motif destabilization mutant pp32-Y131F/D146L, using residue-specific NMR and small-angle X-ray scattering. Destabilization of the C-terminal capping motif resulted in higher cooperativity for the unfolding transition compared to wild-type pp32, as these mutations render the stability of the C-terminus similar to that of the rest of the protein. In contrast, deletion of the N-cap led to strong deviation from two-state unfolding. In both urea- and pressure-induced unfolding, residues in repeats 1-3 of pp32-ΔN-cap lost their native structure first, while the C-terminal half was more stable. The residue-specific free energy changes in all regions of pp32-ΔN-cap were larger in urea compared to high pressure, indicating a less cooperative destabilization by pressure. Moreover, in contrast to complete structural disruption of pp32-ΔN-cap at high urea concentration, its pressure unfolded state remained compact. The contrasting effects of the capping motifs on folding cooperativity arise from the differential local stabilities of pp32, whereas the contrasting effects of pressure and urea on the pp32-ΔN-cap variant arise from their distinct mechanisms of action. Copyright © 2018 Elsevier Ltd. All rights reserved.

Monitoring i-motif transitions through the exciplex emission of a fluorescent probe incorporating two (Py)A units.

PubMed

Lee, Il Joon; Kim, Byeang Hyean

2012-02-18

Pairs of pyrene-modified deoxyadenosine ((Py)A) units induce a stable interstrand i-motif structure, which can be characterized by a change in the fluorescence λ(max), with an exciplex emission that is not observable in its single-strand structure. This journal is © The Royal Society of Chemistry 2012
Classification of proteins with shared motifs and internal repeats in the ECOD database

PubMed Central

Kinch, Lisa N.; Liao, Yuxing

2016-01-01

Abstract Proteins and their domains evolve by a set of events commonly including the duplication and divergence of small motifs. The presence of short repetitive regions in domains has generally constituted a difficult case for structural domain classifications and their hierarchies. We developed the Evolutionary Classification Of protein Domains (ECOD) in part to implement a new schema for the classification of these types of proteins. Here we document the ways in which ECOD classifies proteins with small internal repeats, widespread functional motifs, and assemblies of small domain‐like fragments in its evolutionary schema. We illustrate the ways in which the structural genomics project impacted the classification and characterization of new structural domains and sequence families over the decade. PMID:26833690
Structural insight into the interaction of proteins containing NPF, DPF, and GPF motifs with the C-terminal EH-domain of EHD1

PubMed Central

Kieken, Fabien; Jović, Marko; Tonelli, Marco; Naslavsky, Naava; Caplan, Steve; Sorgen, Paul L

2009-01-01

Eps15 homology (EH)-domain containing proteins are regulators of endocytic membrane trafficking. EH-domain binding to proteins containing the tripeptide NPF has been well characterized, but recent studies have shown that EH-domains are also able to interact with ligands containing DPF or GPF motifs. We demonstrate that the three motifs interact in a similar way with the EH-domain of EHD1, with the NPF motif having the highest affinity due to the presence of an intermolecular hydrogen bond. The weaker affinity for the DPF and GPF motifs suggests that if complex formation occurs in vivo, they may require high ligand concentrations, the presence of successive motifs and/or specific flanking residues. PMID:19798736
Structural conservation, variability, and immunogenicity of the T6 backbone pilin of serotype M6 Streptococcus pyogenes.

PubMed

Young, Paul G; Moreland, Nicole J; Loh, Jacelyn M; Bell, Anita; Atatoa Carr, Polly; Proft, Thomas; Baker, Edward N

2014-07-01

Group A streptococcus (GAS; Streptococcus pyogenes) is a Gram-positive human pathogen that causes a broad range of diseases ranging from acute pharyngitis to the poststreptococcal sequelae of acute rheumatic fever. GAS pili are highly diverse, long protein polymers that extend from the cell surface. They have multiple roles in infection and are promising candidates for vaccine development. This study describes the structure of the T6 backbone pilin (BP; Lancefield T-antigen) from the important M6 serotype. The structure reveals a modular arrangement of three tandem immunoglobulin-like domains, two with internal isopeptide bonds. The T6 pilin lysine, essential for polymerization, is located in a novel VAKS motif that is structurally homologous to the canonical YPKN pilin lysine in other three- and four-domain Gram-positive pilins. The T6 structure also highlights a conserved pilin core whose surface is decorated with highly variable loops and extensions. Comparison to other Gram-positive BPs shows that many of the largest variable extensions are found in conserved locations. Studies with sera from patients diagnosed with GAS-associated acute rheumatic fever showed that each of the three T6 domains, and the largest of the variable extensions (V8), are targeted by IgG during infection in vivo. Although the GAS BP show large variations in size and sequence, the modular nature of the pilus proteins revealed by the T6 structure may aid the future design of a pilus-based vaccine. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Molecular dynamics analysis of stabilities of the telomeric Watson-Crick duplex and the associated i-motif as a function of pH and temperature.

PubMed

Panczyk, Tomasz; Wolski, Pawel

2018-06-01

This work deals with a molecular dynamics analysis of the protonated and deprotonated states of the natural sequence d[(CCCTAA) 3 CCCT] of the telomeric DNA forming the intercalated i-motif or paired with the sequence d[(CCCTAA) 3 CCCT] and forming the Watson-Crick (WC) duplex. By utilizing the amber force field for nucleic acids we built the i-motif and the WC duplex either with native cytosines or using their protonated forms. We studied, by applying molecular dynamics simulations, the role of hydrogen bonds between cytosines or in cytosine-guanine pairs in the stabilization of both structures in the physiological fluid. We found that hydrogen bonds exist in the case of protonated i-motif and in the standard form of the WC duplex. They, however, vanish in the case of the deprotonated i-motif and protonated form of the WC duplex. By determining potentials of mean force in the enforced unwrapping of these structures we found that the protonated i-motif is thermodynamically the most stable. Its deprotonation leads to spontaneous and observed directly in the unbiased calculations unfolding of the i-motif to the hairpin structure at normal temperature. The WC duplex is stable in its standard form and its slight destabilization is observed at the acidic pH. However, the protonated WC duplex unwraps very slowly at 310 K and its decomposition was not observed in the unbiased calculations. At higher temperatures (ca. 400 K or more) the WC duplex unwraps spontaneously. Copyright © 2018. Published by Elsevier B.V.
The solely motif-doped Au36-xAgx(SPh-tBu)24 (x = 1-8) nanoclusters: X-ray crystal structure and optical properties.

PubMed

Fan, Jiqiang; Song, Yongbo; Chai, Jinsong; Yang, Sha; Chen, Tao; Rao, Bo; Yu, Haizhu; Zhu, Manzhou

2016-08-18

We report the observation of new doping behavior in Au36-xAgx(SR)24 nanoclusters (NCs) with x = 1 to 8. The atomic arrangements of Au and Ag atoms are determined by X-ray crystallography. The new gold-silver bimetallic NCs share the same framework as that of the homogold counterpart, i.e. possessing an fcc-type Au28 kernel, four dimeric AuAg(SR)3 staple motifs and twelve simple bridging SR ligands. Interestingly, all the Ag dopants in the Au36-xAgx(SR)24 NCs are selectively incorporated into the surface motifs, which is in contrast to the previously reported Au-Ag alloy structures with the Ag dopants preferentially displacing the core gold atoms. This distinct doping behavior implies that the previous assignments of an fcc Au28 core with four dimers and 12 bridging thiolates for Au36(SR)24 are more justified than other assignments of core vs. surface motifs. The UV-Vis adsorption spectrum of Au36-xAgx(SR)24 is almost the same as that of Au36(SR)24, indicating that the Ag dopants in the motifs do not change the optical properties. The similar UV-Vis spectra are further confirmed by TD-DFT calculations. DFT also reveals that the energies of the HOMO and LUMO of the motif-doped AuAg alloy NC are comparable to those of the homogold Au36 NC, indicating that the electronic structure is not disturbed by the motif Ag dopants. Overall, this study reveals a new silver-doping mode in alloy NCs.
Syntactic structures in languages and biology.

PubMed

Horn, David

2008-08-01

Both natural languages and cell biology make use of one-dimensional encryption. Their investigation calls for syntactic deciphering of the text and semantic understanding of the resulting structures. Here we discuss recently published algorithms that allow for such searches: automatic distillation of structure (ADIOS) that is successful in discovering syntactic structures in linguistic texts and its motif extraction (MEX) component that can be used for uncovering motifs in DNA and protein sequences. The underlying principles of these syntactic algorithms and some of their results will be described.
Mechanical features of various silkworm crystalline considering hydration effect via molecular dynamics simulations.

PubMed

Kim, Yoonjung; Lee, Myeongsang; Choi, Hyunsung; Baek, Inchul; Kim, Jae In; Na, Sungsoo

2018-04-01

Silk materials are receiving significant attention as base materials for various functional nanomaterials and nanodevices, due to its exceptionally high mechanical properties, biocompatibility, and degradable characteristics. Although crystalline silk regions are composed of various repetitive motifs with differing amino acid sequences, how the effect of humidity works differently on each of the motifs and their structural characteristics remains unclear. We report molecular dynamics (MD) simulations on various silkworm fibroins composed of major motifs (i.e. (GAGAGS) n , (GAGAGA) n , and (GAGAGY) n ) at varying degrees of hydration, and reveal how each major motifs of silk fibroins change at each degrees of hydration using MD simulations and their structural properties in mechanical perspective via steered molecular dynamics simulations. Our results explain what effects humidity can have on nanoscale materials and devices consisting of crystalline silk materials.
The crystal structure of TrxA(CACA): Insights into the formation of a [2Fe-2S] iron–sulfur cluster in an Escherichia coli thioredoxin mutant

PubMed Central

Collet, Jean-Francois; Peisach, Daniel; Bardwell, James C.A.; Xu, Zhaohui

2005-01-01

Escherichia coli thioredoxin is a small monomeric protein that reduces disulfide bonds in cytoplasmic proteins. Two cysteine residues present in a conserved CGPC motif are essential for this activity. Recently, we identified mutations of this motif that changed thioredoxin into a homodimer bridged by a [2Fe-2S] iron–sulfur cluster. When exported to the periplasm, these thioredoxin mutants could restore disulfide bond formation in strains lacking the entire periplasmic oxidative pathway. Essential for the assembly of the iron–sulfur was an additional cysteine that replaced the proline at position three of the CGPC motif. We solved the crystalline structure at 2.3 Å for one of these variants, TrxA(CACA). The mutant protein crystallized as a dimer in which the iron–sulfur cluster is replaced by two intermolecular disulfide bonds. The catalytic site, which forms the dimer interface, crystallized in two different conformations. In one of them, the replacement of the CGPC motif by CACA has a dramatic effect on the structure and causes the unraveling of an extended α-helix. In both conformations, the second cysteine residue of the CACA motif is surface-exposed, which contrasts with wildtype thioredoxin where the second cysteine of the CXXC motif is buried. This exposure of a pair of vicinal cysteine residues apparently allows thioredoxin to acquire an iron–sulfur cofactor at its active site, and thus a new activity and mechanism of action. PMID:15987909
The crystal structure of TrxA(CACA): Insights into the formation of a [2Fe-2S] iron-sulfur cluster in an Escherichia coli thioredoxin mutant.

PubMed

Collet, Jean-Francois; Peisach, Daniel; Bardwell, James C A; Xu, Zhaohui

2005-07-01

Escherichia coli thioredoxin is a small monomeric protein that reduces disulfide bonds in cytoplasmic proteins. Two cysteine residues present in a conserved CGPC motif are essential for this activity. Recently, we identified mutations of this motif that changed thioredoxin into a homodimer bridged by a [2Fe-2S] iron-sulfur cluster. When exported to the periplasm, these thioredoxin mutants could restore disulfide bond formation in strains lacking the entire periplasmic oxidative pathway. Essential for the assembly of the iron-sulfur was an additional cysteine that replaced the proline at position three of the CGPC motif. We solved the crystalline structure at 2.3 Angstroms for one of these variants, TrxA(CACA). The mutant protein crystallized as a dimer in which the iron-sulfur cluster is replaced by two intermolecular disulfide bonds. The catalytic site, which forms the dimer interface, crystallized in two different conformations. In one of them, the replacement of the CGPC motif by CACA has a dramatic effect on the structure and causes the unraveling of an extended alpha-helix. In both conformations, the second cysteine residue of the CACA motif is surface-exposed, which contrasts with wildtype thioredoxin where the second cysteine of the CXXC motif is buried. This exposure of a pair of vicinal cysteine residues apparently allows thioredoxin to acquire an iron-sulfur cofactor at its active site, and thus a new activity and mechanism of action.
An automated parallel crystallisation search for predicted crystal structures and packing motifs of carbamazepine.

PubMed

Florence, Alastair J; Johnston, Andrea; Price, Sarah L; Nowell, Harriott; Kennedy, Alan R; Shankland, Norman

2006-09-01

An automated parallel crystallisation search for physical forms of carbamazepine, covering 66 solvents and five crystallisation protocols, identified three anhydrous polymorphs (forms I-III), one hydrate and eight organic solvates, including the single-crystal structures of three previously unreported solvates (N,N-dimethylformamide (1:1); hemi-furfural; hemi-1,4-dioxane). Correlation of physical form outcome with the crystallisation conditions demonstrated that the solvent adopts a relatively nonspecific role in determining which polymorph is obtained, and that the previously reported effect of a polymer template facilitating the formation of form IV could not be reproduced by solvent crystallisation alone. In the accompanying computational search, approximately half of the energetically feasible predicted crystal structures exhibit the C=O...H--N R2(2)(8)dimer motif that is observed in the known polymorphs, with the most stable correctly corresponding to form III. Most of the other energetically feasible structures, including the global minimum, have a C=O...H--N C(4) chain hydrogen bond motif. No such chain structures were observed in this or any other previously published work, suggesting that kinetic, rather than thermodynamic, factors determine which of the energetically feasible crystal structures are observed experimentally, with the kinetics apparently favouring nucleation of crystal structures based on the CBZ-CBZ R2(2)(8) motif. (c) 2006 Wiley-Liss, Inc. and the American Pharmacists Association.
Rationalizing the role of structural motif and underlying electronic structure in the finite temperature behavior of atomic clusters

NASA Astrophysics Data System (ADS)

Susan, Anju; Joshi, Kavita

2014-04-01

Melting in finite size systems is an interesting but complex phenomenon. Many factors affect melting and owing to their interdependencies it is a challenging task to rationalize their roles in the phase transition. In this work, we demonstrate how structural motif of the ground state influences melting transition in small clusters. Here, we report a case with clusters of aluminum and gallium having same number of atoms, valence electrons, and similar structural motif of the ground state but drastically different melting temperatures. We have employed Born-Oppenheimer molecular dynamics to simulate the solid-like to liquid-like transition in these clusters. Our simulations have reproduced the experimental trends fairly well. Further, the detailed analysis of isomers has brought out the role of the ground state structure and underlying electronic structure in the finite temperature behavior of these clusters. For both clusters, isomers accessible before cluster melts have striking similarities and does have strong influence of the structural motif of the ground state. Further, the shape of the heat capacity curve is similar in both the cases but the transition is more spread over for Al36 which is consistent with the observed isomerization pattern. Our simulations also suggest a way to characterize transition region on the basis of accessibility of the ground state at a specific temperature.
Chaotic Motifs in Gene Regulatory Networks

PubMed Central

Zhang, Zhaoyang; Ye, Weiming; Qian, Yu; Zheng, Zhigang; Huang, Xuhui; Hu, Gang

2012-01-01

Chaos should occur often in gene regulatory networks (GRNs) which have been widely described by nonlinear coupled ordinary differential equations, if their dimensions are no less than 3. It is therefore puzzling that chaos has never been reported in GRNs in nature and is also extremely rare in models of GRNs. On the other hand, the topic of motifs has attracted great attention in studying biological networks, and network motifs are suggested to be elementary building blocks that carry out some key functions in the network. In this paper, chaotic motifs (subnetworks with chaos) in GRNs are systematically investigated. The conclusion is that: (i) chaos can only appear through competitions between different oscillatory modes with rivaling intensities. Conditions required for chaotic GRNs are found to be very strict, which make chaotic GRNs extremely rare. (ii) Chaotic motifs are explored as the simplest few-node structures capable of producing chaos, and serve as the intrinsic source of chaos of random few-node GRNs. Several optimal motifs causing chaos with atypically high probability are figured out. (iii) Moreover, we discovered that a number of special oscillators can never produce chaos. These structures bring some advantages on rhythmic functions and may help us understand the robustness of diverse biological rhythms. (iv) The methods of dominant phase-advanced driving (DPAD) and DPAD time fraction are proposed to quantitatively identify chaotic motifs and to explain the origin of chaotic behaviors in GRNs. PMID:22792171
Building a stable RNA U-turn with a protonated cytidine.

PubMed

Gottstein-Schmidtke, Sina R; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens

2014-08-01

The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5'-UNR-3' (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3' phosphate group of the R residue as well as a hydrogen bond between the 2'-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3' from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. © 2014 Gottstein-Schmidtke et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Tuning structural motifs and alloying of bulk immiscible Mo-Cu bimetallic nanoparticles by gas-phase synthesis

NASA Astrophysics Data System (ADS)

Krishnan, Gopi; Verheijen, Marcel A.; Ten Brink, Gert H.; Palasantzas, George; Kooi, Bart J.

2013-05-01

Nowadays bimetallic nanoparticles (NPs) have emerged as key materials for important modern applications in nanoplasmonics, catalysis, biodiagnostics, and nanomagnetics. Consequently the control of bimetallic structural motifs with specific shapes provides increasing functionality and selectivity for related applications. However, producing bimetallic NPs with well controlled structural motifs still remains a formidable challenge. Hence, we present here a general methodology for gas phase synthesis of bimetallic NPs with distinctively different structural motifs ranging at a single particle level from a fully mixed alloy to core-shell, to onion (multi-shell), and finally to a Janus/dumbbell, with the same overall particle composition. These concepts are illustrated for Mo-Cu NPs, where the precise control of the bimetallic NPs with various degrees of chemical ordering, including different shapes from spherical to cube, is achieved by tailoring the energy and thermal environment that the NPs experience during their production. The initial state of NP growth, either in the liquid or in the solid state phase, has important implications for the different structural motifs and shapes of synthesized NPs. Finally we demonstrate that we are able to tune the alloying regime, for the otherwise bulk immiscible Mo-Cu, by achieving an increase of the critical size, below which alloying occurs, closely up to an order of magnitude. It is discovered that the critical size of the NP alloy is not only affected by controlled tuning of the alloying temperature but also by the particle shape.Nowadays bimetallic nanoparticles (NPs) have emerged as key materials for important modern applications in nanoplasmonics, catalysis, biodiagnostics, and nanomagnetics. Consequently the control of bimetallic structural motifs with specific shapes provides increasing functionality and selectivity for related applications. However, producing bimetallic NPs with well controlled structural motifs still remains a formidable challenge. Hence, we present here a general methodology for gas phase synthesis of bimetallic NPs with distinctively different structural motifs ranging at a single particle level from a fully mixed alloy to core-shell, to onion (multi-shell), and finally to a Janus/dumbbell, with the same overall particle composition. These concepts are illustrated for Mo-Cu NPs, where the precise control of the bimetallic NPs with various degrees of chemical ordering, including different shapes from spherical to cube, is achieved by tailoring the energy and thermal environment that the NPs experience during their production. The initial state of NP growth, either in the liquid or in the solid state phase, has important implications for the different structural motifs and shapes of synthesized NPs. Finally we demonstrate that we are able to tune the alloying regime, for the otherwise bulk immiscible Mo-Cu, by achieving an increase of the critical size, below which alloying occurs, closely up to an order of magnitude. It is discovered that the critical size of the NP alloy is not only affected by controlled tuning of the alloying temperature but also by the particle shape. Electronic supplementary information (ESI) available: Experimental details including schematics of the gas phase synthesis set up, target arrangement, synthesis condition for various structures, and TEM images of alloy, core-shell and Mo-Cu-Mo onion nanoparticles. See DOI: 10.1039/c3nr00565h
SiteBinder: an improved approach for comparing multiple protein structural motifs.

PubMed

Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav

2012-02-27

There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites.

PubMed

Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W; Gordân, Raluca; Rohs, Remo

2014-01-01

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites

PubMed Central

Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo

2014-01-01

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
Identification of sequence-structure RNA binding motifs for SELEX-derived aptamers.

PubMed

Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E; Przytycka, Teresa M

2012-06-15

Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. To close this gap we developed, Aptamotif, a computational method for the identification of sequence-structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the author's findings to a high degree, thus proving the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process.
Structural diversity of domain superfamilies in the CATH database.

PubMed

Reeves, Gabrielle A; Dallman, Timothy J; Redfern, Oliver C; Akpor, Adrian; Orengo, Christine A

2006-07-14

The CATH database of domain structures has been used to explore the structural variation of homologous domains in 294 well populated domain structure superfamilies, each containing at least three sequence diverse relatives. Our analyses confirm some previously detected trends relating sequence divergence to structural variation but for a much larger dataset and in some superfamilies the new data reveal exceptional structural variation. Use of a new algorithm (2DSEC) to analyse variability in secondary structure compositions across a superfamily sheds new light on how structures evolve. 2DSEC detects inserted secondary structures that embellish the core of conserved secondary structures found throughout the superfamily. Analysis showed that for 56% of highly populated superfamilies (>9 sequence diverse relatives), there are twofold or more increases in the numbers of secondary structures in some relatives. In some families fivefold increases occur, sometimes modifying the fold of the domain. Manual inspection of secondary structure insertions or embellishments in 48 particularly variable superfamilies revealed that although these insertions were usually discontiguous in the sequence they were often co-located in 3D resulting in a larger structural motif that often modified the geometry of the active site or the surface conformation promoting diverse domain partnerships and protein interactions. These observations, supported by automatic analysis of all well populated CATH families, suggest that accretion of small secondary structure insertions may provide a simple mechanism for evolving new functions in diverse relatives. Some layered domain architectures (e.g. mainly-beta and alpha-beta sandwiches) that recur highly in the genomes more frequently exploit these types of embellishments to modify function. In these architectures, aggregation occurs most often at the edges, top or bottom of the beta-sheets. Information on structural variability across domain superfamilies has been made available through the CATH Dictionary of Homologous Structures (DHS).

Characterizing the Secondary Protein Structure of Black Widow Dragline Silk Using Solid-State NMR & X-ray Diffraction

PubMed Central

Jenkins, Janelle E.; Sampath, Sujatha; Butler, Emily; Kim, Jihyun; Henning, Robert W.; Holland, Gregory P.; Yarger, Jeffery L.

2013-01-01

This study provides a detailed secondary structural characterization of major ampullate dragline silk from Latrodectus hesperus (black widow) spiders. X-ray diffraction results show that the structure of black widow major ampullate silk fibers is comprised of stacked β-sheet nanocrystallites oriented parallel to the fiber axis and an amorphous region with oriented (anisotropic) and isotropic components. The combination of two-dimensional (2D) 13C-13C through-space and through-bond solid-state NMR experiments provide chemical shifts that are used to determine detailed information about amino acid motif secondary structure in black widow spider dragline silk. Individual amino acids are incorporated into different repetitive motifs that make up the majority of this protein-based biopolymer. From the solid-state NMR measurements, we assign distinct secondary conformations to each repetitive amino acid motif and hence to the amino acids that make up the motifs. Specifically, alanine is incorporated in β-sheet (poly(Alan) and poly(Gly-Ala)), 31-helix (poly(Gly-Gly-Xaa), and α-helix (poly(Gln-Gln-Ala-Tyr)) components. Glycine is determined to be in β-sheet (poly(Gly-Ala)) and 31-helical (poly(Gly-Gly-Xaa)) regions, while serine is present in β-sheet (poly(Gly-Ala-Ser)), 31-helix (poly(Gly-Gly-Ser)), and β-turn (poly(Gly-Pro-Ser)) structures. These various motif-specific secondary structural elements are quantitatively correlated to the primary amino acid sequence of major ampullate spidroin 1 and 2 (MaSp1 and MaSp2) and are shown to form a self-consistent model for black widow dragline silk. PMID:24024617
Analysis of zinc binding sites in protein crystal structures.

PubMed

Alberts, I L; Nadassy, K; Wodak, S J

1998-08-01

The geometrical properties of zinc binding sites in a dataset of high quality protein crystal structures deposited in the Protein Data Bank have been examined to identify important differences between zinc sites that are directly involved in catalysis and those that play a structural role. Coordination angles in the zinc primary coordination sphere are compared with ideal values for each coordination geometry, and zinc coordination distances are compared with those in small zinc complexes from the Cambridge Structural Database as a guide of expected trends. We find that distances and angles in the primary coordination sphere are in general close to the expected (or ideal) values. Deviations occur primarily for oxygen coordinating atoms and are found to be mainly due to H-bonding of the oxygen coordinating ligand to protein residues, bidentate binding arrangements, and multi-zinc sites. We find that H-bonding of oxygen containing residues (or water) to zinc bound histidines is almost universal in our dataset and defines the elec-His-Zn motif. Analysis of the stereochemistry shows that carboxyl elec-His-Zn motifs are geometrically rigid, while water elec-His-Zn motifs show the most geometrical variation. As catalytic motifs have a higher proportion of carboxyl elec atoms than structural motifs, they provide a more rigid framework for zinc binding. This is understood biologically, as a small distortion in the zinc position in an enzyme can have serious consequences on the enzymatic reaction. We also analyze the sequence pattern of the zinc ligands and residues that provide elecs, and identify conserved hydrophobic residues in the endopeptidases that also appear to contribute to stabilizing the catalytic zinc site. A zinc binding template in protein crystal structures is derived from these observations.
Extensive T-Cell Epitope Repertoire Sharing among Human Proteome, Gastrointestinal Microbiome, and Pathogenic Bacteria: Implications for the Definition of Self

PubMed Central

Bremel, Robert D.; Homan, E. Jane

2015-01-01

T-cell receptor binding to MHC-bound peptides plays a key role in discrimination between self and non-self. Only a subset, typically a pentamer, of amino acids in a MHC-bound peptide form the motif exposed to the T-cell receptor. We categorize and compare the T-cell exposed amino acid motif repertoire of the total proteomes of two groups of bacteria, comprising pathogens and gastrointestinal microbiome organisms, with the human proteome and immunoglobulins. Given the maximum 205, or 3.2 million of such motifs that bind T-cell receptors, there is considerable overlap in motif usage. We show that the human proteome, exclusive of immunoglobulins, only comprises three quarters of the possible motifs, of which 65.3% are also present in both composite bacterial proteomes. Very few motifs are unique to the human proteome. Immunoglobulin variable regions carry a broad diversity of T-cell exposed motifs (TCEMs) that provides a stratified random sample of the motifs found in pathogens, microbiome, and the human proteome. Individual bacterial genera and species vary in the content of immunoglobulin and human proteome matched motifs that they carry. Mycobacteria and Burkholderia spp carry a particularly high content of such matched motifs. Some bacteria retain a unique motif signature and motif sharing pattern with the human proteome. The implication is that distinguishing self from non-self does not depend on individual TCEMs, but on a complex and dynamic overlay of signals wherein the same TCEM may play different roles in different organisms, and the frequency with which a particular TCEM appears influences its effect. The patterns observed provide clues to bacterial immune evasion and to strategies for intervention, including vaccine design. The breadth and distinct frequency patterns of the immunoglobulin-derived peptides suggest a role of immunoglobulins in maintaining a broadly responsive T-cell repertoire. PMID:26557118
A single thiazole orange molecule forms an exciplex in a DNA i-motif.

PubMed

Xu, Baochang; Wu, Xiangyang; Yeow, Edwin K L; Shao, Fangwei

2014-06-18

A fluorescent exciplex of thiazole orange (TO) is formed in a single-dye conjugated DNA i-motif. The exciplex fluorescence exhibits a large Stokes shift, high quantum yield, robust response to pH oscillation and little structural disturbance to the DNA quadruplex, which can be used to monitor the folding of high-order DNA structures.
Magnesium-binding architectures in RNA crystal structures: validation, binding preferences, classification and motif detection

PubMed Central

Zheng, Heping; Shabalin, Ivan G.; Handing, Katarzyna B.; Bujnicki, Janusz M.; Minor, Wladek

2015-01-01

The ubiquitous presence of magnesium ions in RNA has long been recognized as a key factor governing RNA folding, and is crucial for many diverse functions of RNA molecules. In this work, Mg2+-binding architectures in RNA were systematically studied using a database of RNA crystal structures from the Protein Data Bank (PDB). Due to the abundance of poorly modeled or incorrectly identified Mg2+ ions, the set of all sites was comprehensively validated and filtered to identify a benchmark dataset of 15 334 ‘reliable’ RNA-bound Mg2+ sites. The normalized frequencies by which specific RNA atoms coordinate Mg2+ were derived for both the inner and outer coordination spheres. A hierarchical classification system of Mg2+ sites in RNA structures was designed and applied to the benchmark dataset, yielding a set of 41 types of inner-sphere and 95 types of outer-sphere coordinating patterns. This classification system has also been applied to describe six previously reported Mg2+-binding motifs and detect them in new RNA structures. Investigation of the most populous site types resulted in the identification of seven novel Mg2+-binding motifs, and all RNA structures in the PDB were screened for the presence of these motifs. PMID:25800744
Structural details (kinks and non-alpha conformations) in transmembrane helices are intrahelically determined and can be predicted by sequence pattern descriptors.

PubMed

Rigoutsos, Isidore; Riek, Peter; Graham, Robert M; Novotny, Jiri

2003-08-01

One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular alpha-helical character (i.e. pi-helices, 3(10)-helices and kinks). A 'search engine' derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above 'non-canonical' helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from alpha-helicity are encoded locally in sequence patterns only about 7-9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure-function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html.
Characteristic motifs for families of allergenic proteins

PubMed Central

Ivanciuc, Ovidiu; Garcia, Tzintzuni; Torres, Miguel; Schein, Catherine H.; Braun, Werner

2008-01-01

The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver Motif-Mate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins. PMID:18951633
Identification of a "glycine-loop"-like coiled structure in the 34 AA Pro,Gly,Met repeat domain of the biomineral-associated protein, PM27.

PubMed

Wustman, Brandon A; Santos, Rudolpho; Zhang, Bo; Evans, John Spencer

2002-12-05

Fracture resistance in biomineralized structures has been linked to the presence of proteins, some of which possess sequences that are associated with elastic behavior. One such protein superfamily, the Pro,Gly-rich sea urchin intracrystalline spicule matrix proteins, form protein-protein supramolecular assemblies that modify the microstructure and fracture-resistant properties of the calcium carbonate mineral phase within embryonic sea urchin spicules and adult sea urchin spines. In this report, we detail the identification of a repetitive keratin-like "glycine-loop"- or coil-like structure within the 34-AA (AA: amino acid) N-terminal domain, (PGMG)(8)PG, of the spicule matrix protein, PM27. The identification of this repetitive structural motif was accomplished using two capped model peptides: a 9-AA sequence, GPGMGPGMG, and a 34-AA peptide representing the entire motif. Using CD, NMR spectrometry, and molecular dynamics simulated annealing/minimization simulations, we have determined that the 9-AA model peptide adopts a loop-like structure at pH 7.4. The structure of the 34-AA polypeptide resembles a coil structure consisting of repeating loop motifs that do not exhibit long-range ordering. Given that loop structures have been associated with protein elastic behavior and protein motion, it is plausible that the 34-AA Pro,Gly,Met repeat sequence motif in PM27 represents a putative elastic or mobile domain. Copyright 2002 Wiley Periodicals, Inc.
Blind prediction of noncanonical RNA structure at atomic accuracy.

PubMed

Watkins, Andrew M; Geniesse, Caleb; Kladwang, Wipapat; Zakrevsky, Paul; Jaeger, Luc; Das, Rhiju

2018-05-01

Prediction of RNA structure from nucleotide sequence remains an unsolved grand challenge of biochemistry and requires distinct concepts from protein structure prediction. Despite extensive algorithmic development in recent years, modeling of noncanonical base pairs of new RNA structural motifs has not been achieved in blind challenges. We report a stepwise Monte Carlo (SWM) method with a unique add-and-delete move set that enables predictions of noncanonical base pairs of complex RNA structures. A benchmark of 82 diverse motifs establishes the method's general ability to recover noncanonical pairs ab initio, including multistrand motifs that have been refractory to prior approaches. In a blind challenge, SWM models predicted nucleotide-resolution chemical mapping and compensatory mutagenesis experiments for three in vitro selected tetraloop/receptors with previously unsolved structures (C7.2, C7.10, and R1). As a final test, SWM blindly and correctly predicted all noncanonical pairs of a Zika virus double pseudoknot during a recent community-wide RNA-Puzzle. Stepwise structure formation, as encoded in the SWM method, enables modeling of noncanonical RNA structure in a variety of previously intractable problems.
Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA

PubMed Central

Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W

2016-01-01

The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus. DOI: http://dx.doi.org/10.7554/eLife.13571.001 PMID:26836305
Composition-dependent stability of the medium-range order responsible for metallic glass formation

DOE PAGES

Zhang, Feng; Ji, Min; Fang, Xiao-Wei; ...

2014-09-18

The competition between the characteristic medium-range order corresponding to amorphous alloys and that in ordered crystalline phases is central to phase selection and morphology evolution under various processing conditions. We examine the stability of a model glass system, Cu–Zr, by comparing the energetics of various medium-range structural motifs over a wide range of compositions using first-principles calculations. Furthermore, we focus specifically on motifs that represent possible building blocks for competing glassy and crystalline phases, and we employ a genetic algorithm to efficiently identify the energetically favored decorations of each motif for specific compositions. These results show that a Bergman-type motifmore » with crystallization-resisting icosahedral symmetry is energetically most favorable in the composition range 0.63 < xCu < 0.68, and is the underlying motif for one of the three optimal glass-forming ranges observed experimentally for this binary system (Li et al., 2008). This work establishes an energy-based methodology to evaluate specific medium-range structural motifs which compete with stable crystalline nuclei in deeply undercooled liquids.« less
Ni2+-binding RNA motifs with an asymmetric purine-rich internal loop and a G-A base pair.

PubMed Central

Hofmann, H P; Limmer, S; Hornung, V; Sprinzl, M

1997-01-01

RNA molecules with high affinity for immobilized Ni2+ were isolated from an RNA pool with 50 randomized positions by in vitro selection-amplification. The selected RNAs preferentially bind Ni2+ and Co2+ over other cations from first series transition metals. Conserved structure motifs, comprising about 15 nt, were identified that are likely to represent the Ni2+ binding sites. Two conserved motifs contain an asymmetric purine-rich internal loop and probably a mismatch G-A base pair. The structure of one of these motifs was studied with proton NMR spectroscopy and formation of the G-A pair at the junction of helix and internal loop was demonstrated. Using Ni2+ as a paramagnetic probe, a divalent metal ion binding site near this G-A base pair was identified. Ni2+ ions bound to this motif exert a specific stabilization effect. We propose that small asymmetric purine-rich loops that contain a G-A interaction may represent a divalent metal ion binding site in RNA. PMID:9409620
Tandem hnRNP A1 RNA recognition motifs act in concert to repress the splicing of survival motor neuron exon 7

PubMed Central

Beusch, Irene; Barraud, Pierre; Moursy, Ahmed; Cléry, Antoine; Allain, Frédéric Hai-Trieu

2017-01-01

HnRNP A1 regulates many alternative splicing events by the recognition of splicing silencer elements. Here, we provide the solution structures of its two RNA recognition motifs (RRMs) in complex with short RNA. In addition, we show by NMR that both RRMs of hnRNP A1 can bind simultaneously to a single bipartite motif of the human intronic splicing silencer ISS-N1, which controls survival of motor neuron exon 7 splicing. RRM2 binds to the upstream motif and RRM1 to the downstream motif. Combining the insights from the structure with in cell splicing assays we show that the architecture and organization of the two RRMs is essential to hnRNP A1 function. The disruption of the inter-RRM interaction or the loss of RNA binding capacity of either RRM impairs splicing repression by hnRNP A1. Furthermore, both binding sites within the ISS-N1 are important for splicing repression and their contributions are cumulative rather than synergistic. DOI: http://dx.doi.org/10.7554/eLife.25736.001 PMID:28650318
Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA

DOE PAGES

Mitrea, Diana M.; Cika, Jaclyn A.; Guy, Clifford S.; ...

2016-02-02

In this study, the nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identifymore » multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.« less
[Structure-functional organization of eukaryotic high-affinity copper importer CTR1 determines its ability to transport copper, silver and cisplatin].

PubMed

Skvortsov, A N; Zatulovskiĭ, E A; Puchkova, L V

2012-01-01

It was shown recently, that high affinity Cu(I) importer eukaryotic protein CTR1 can also transport in vitro abiogenic Ag(I) ions and anticancer drug cisplatin. At present there is no rational explanation how CTR1 can transfer platinum group, which is different by coordination properties from highly similar Cu(I) and Ag(I). To understand this phenomenon we analyzed 25 sequences of chordate CTR1 proteins, and found out conserved patterns of organization of N-terminal extracellular part of CTR1 which correspond to initial metal binding. Extracellular copper-binding motifs were qualified by their coordination properties. It was shown that relative position of Met- and His-rich copper-binding motifs in CTR1 predisposes the extracellular CTR1 part to binding of copper, silver and cisplatin. Relation between tissue-specific expression of CTR1 gene, steady-state copper concentration, and silver and platinum accumulation in organs of mice in vivo was analyzed. Significant positive but incomplete correlation exists between these variables. Basing on structural and functional peculiarities of N-terminal part of CTR1 a hypothesis of coupled transport of copper and cisplatin has been suggested, which avoids the disagreement between CTR1-mediated cisplatin transport in vitro, and irreversible binding of platinum to Met-rich peptides.
The K-turn motif in riboswitches and other RNA species☆

PubMed Central

Lilley, David M.J.

2014-01-01

The kink turn is a widespread structure motif that introduces a tight bend into the axis of duplex RNA. This generally functions to mediate tertiary interactions, and to serve as a specific protein binding site. K-turns or closely related structures are found in at least seven different riboswitch structures, where they function as key architectural elements that help generate the ligand binding pocket. This article is part of a Special Issue entitled: Riboswitches. PMID:24798078
Identification of N-Terminal Lobe Motifs that Determine the Kinase Activity of the Catalytic Domains and Regulatory Strategies of Src and Csk Protein Tyrosine Kinases†

PubMed Central

Huang, Kezhen; Wang, Yue-Hao; Brown, Alex; Sun, Gongqin

2009-01-01

Csk and Src protein tyrosine kinases are structurally homologous, but use opposite regulatory strategies. The isolated catalytic domain of Csk is intrinsically inactive and is activated by interactions with the regulatory SH3 and SH2 domains, while the isolated catalytic domain of Src is intrinsically active and is suppressed by interactions with the regulatory SH3 and SH2 domains. The structural basis for why one isolated catalytic domain is intrinsically active while the other is inactive is not clear. In this current study, we identify the structural elements in the N-terminal lobe of the catalytic domain that render the Src catalytic domain active. These structural elements include the α-helix C region, a β-turn between the β-4 and β-5 strands, and an Arg residue at the beginning of the catalytic domain. These three motifs interact with each other to activate the Src catalytic domain, but the equivalent motifs in Csk directly interact with the regulatory domains that are important for Csk activation. The Src motifs can be grafted to the Csk catalytic domain to obtain an active Csk catalytic domain. These results, together with available Src and Csk tertiary structures, reveal an important structural switch that determines the kinase activity of a catalytic domain and dictates the regulatory strategy of a kinase. PMID:19244618
Structural basis for the facilitative diffusion mechanism by SemiSWEET transporter

NASA Astrophysics Data System (ADS)

Lee, Yongchan; Nishizawa, Tomohiro; Yamashita, Keitaro; Ishitani, Ryuichiro; Nureki, Osamu

2015-01-01

SWEET family proteins mediate sugar transport across biological membranes and play crucial roles in plants and animals. The SWEETs and their bacterial homologues, the SemiSWEETs, are related to the PQ-loop family, which is characterized by highly conserved proline and glutamine residues (PQ-loop motif). Although the structures of the bacterial SemiSWEETs were recently reported, the conformational transition and the significance of the conserved motif in the transport cycle have remained elusive. Here we report crystal structures of SemiSWEET from Escherichia coli, in the both inward-open and outward-open states. A structural comparison revealed that SemiSWEET undergoes an intramolecular conformational change in each protomer. The conserved PQ-loop motif serves as a molecular hinge that enables the ‘binder clip-like’ motion of SemiSWEET. The present work provides the framework for understanding the overall transport cycles of SWEET and PQ-loop family proteins.
Physicochemically Tunable Polyfunctionalized RNA Square Architecture with Fluorogenic and Ribozymatic Properties

PubMed Central

2015-01-01

Recent advances in RNA nanotechnology allow the rational design of various nanoarchitectures. Previous methods utilized conserved angles from natural RNA motifs to form geometries with specific sizes. However, the feasibility of producing RNA architecture with variable sizes using native motifs featuring fixed sizes and angles is limited. It would be advantageous to display RNA nanoparticles of diverse shape and size derived from a given primary sequence. Here, we report an approach to construct RNA nanoparticles with tunable size and stability. Multifunctional RNA squares with a 90° angle were constructed by tuning the 60° angle of the three-way junction (3WJ) motif from the packaging RNA (pRNA) of the bacteriophage phi29 DNA packaging motor. The physicochemical properties and size of the RNA square were also easily tuned by modulating the “core” strand and adjusting the length of the sides of the square via predictable design. Squares of 5, 10, and 20 nm were constructed, each showing diverse thermodynamic and chemical stabilities. Four “arms” extending from the corners of the square were used to incorporate siRNA, ribozyme, and fluorogenic RNA motifs. Unique intramolecular contact using the pre-existing intricacy of the 3WJ avoids relatively weaker intermolecular interactions via kissing loops or sticky ends. Utilizing the 3WJ motif, we have employed a modular design technique to construct variable-size RNA squares with controllable properties and functionalities for diverse and versatile applications with engineering, pharmaceutical, and medical potential. This technique for simple design to finely tune physicochemical properties adds a new angle to RNA nanotechnology. PMID:24971772
DOE Office of Scientific and Technical Information (OSTI.GOV)

Rajan, Rakhi; Taneja, Bhupesh; Mondragón, Alfonso

Topoisomerase V is an archaeal type I topoisomerase that is unique among topoisomerases due to presence of both topoisomerase and DNA repair activities in the same protein. It is organized as an N-terminal topoisomerase domain followed by 24 tandem helix-hairpin-helix (HhH) motifs. Structural studies have shown that the active site is buried by the (HhH) motifs. Here we show that the N-terminal domain can relax DNA in the absence of any HhH motifs and that the HhH motifs are required for stable protein-DNA complex formation. Crystal structures of various topoisomerase V fragments show changes in the relative orientation of themore » domains mediated by a long bent linker helix, and these movements are essential for the DNA to enter the active site. Phosphate ions bound to the protein near the active site helped model DNA in the topoisomerase domain and show how topoisomerase V may interact with DNA.« less

RNA motif search with data-driven element ordering.

PubMed

Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

2016-05-18

In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .
CircularLogo: A lightweight web application to visualize intra-motif dependencies.

PubMed

Ye, Zhenqing; Ma, Tao; Kalmbach, Michael T; Dasari, Surendra; Kocher, Jean-Pierre A; Wang, Liguo

2017-05-22

The sequence logo has been widely used to represent DNA or RNA motifs for more than three decades. Despite its intelligibility and intuitiveness, the traditional sequence logo is unable to display the intra-motif dependencies and therefore is insufficient to fully characterize nucleotide motifs. Many methods have been developed to quantify the intra-motif dependencies, but fewer tools are available for visualization. We developed CircularLogo, a web-based interactive application, which is able to not only visualize the position-specific nucleotide consensus and diversity but also display the intra-motif dependencies. Applying CircularLogo to HNF6 binding sites and tRNA sequences demonstrated its ability to show intra-motif dependencies and intuitively reveal biomolecular structure. CircularLogo is implemented in JavaScript and Python based on the Django web framework. The program's source code and user's manual are freely available at http://circularlogo.sourceforge.net . CircularLogo web server can be accessed from http://bioinformaticstools.mayo.edu/circularlogo/index.html . CircularLogo is an innovative web application that is specifically designed to visualize and interactively explore intra-motif dependencies.
A flexible motif search technique based on generalized profiles.

PubMed

Bucher, P; Karplus, K; Moeri, N; Hofmann, K

1996-03-01

A flexible motif search technique is presented which has two major components: (1) a generalized profile syntax serving as a motif definition language; and (2) a motif search method specifically adapted to the problem of finding multiple instances of a motif in the same sequence. The new profile structure, which is the core of the generalized profile syntax, combines the functions of a variety of motif descriptors implemented in other methods, including regular expression-like patterns, weight matrices, previously used profiles, and certain types of hidden Markov models (HMMs). The relationship between generalized profiles and other biomolecular motif descriptors is analyzed in detail, with special attention to HMMs. Generalized profiles are shown to be equivalent to a particular class of HMMs, and conversion procedures in both directions are given. The conversion procedures provide an interpretation for local alignment in the framework of stochastic models, allowing for clear, simple significance tests. A mathematical statement of the motif search problem defines the new method exactly without linking it to a specific algorithmic solution. Part of the definition includes a new definition of disjointness of alignments.
Structural Characterization of Proline-rich Tyrosine Kinase 2 (PYK2) Reveals a Unique (DFG-out) Conformation and Enables Inhibitor Design

DOE Office of Scientific and Technical Information (OSTI.GOV)

Han, Seungil; Mistry, Anil; Chang, Jeanne S.

Proline-rich tyrosine kinase 2 (PYK2) is a cytoplasmic, non-receptor tyrosine kinase implicated in multiple signaling pathways. It is a negative regulator of osteogenesis and considered a viable drug target for osteoporosis treatment. The high-resolution structures of the human PYK2 kinase domain with different inhibitor complexes establish the conventional bilobal kinase architecture and show the conformational variability of the DFG loop. The basis for the lack of selectivity for the classical kinase inhibitor, PF-431396, within the FAK family is explained by our structural analyses. Importantly, the novel DFG-out conformation with two diarylurea inhibitors (BIRB796, PF-4618433) reveals a distinct subclass of non-receptormore » tyrosine kinases identifiable by the gatekeeper Met-502 and the unique hinge loop conformation of Leu-504. This is the first example of a leucine residue in the hinge loop that blocks the ATP binding site in the DFG-out conformation. Our structural, biophysical, and pharmacological studies suggest that the unique features of the DFG motif, including Leu-504 hinge-loop variability, can be exploited for the development of selective protein kinase inhibitors.« less
Structural basis for genome wide recognition of 5-bp GC motifs by SMAD transcription factors.

PubMed

Martin-Malpartida, Pau; Batet, Marta; Kaczmarska, Zuzanna; Freier, Regina; Gomes, Tiago; Aragón, Eric; Zou, Yilong; Wang, Qiong; Xi, Qiaoran; Ruiz, Lidia; Vea, Angela; Márquez, José A; Massagué, Joan; Macias, Maria J

2017-12-12

Smad transcription factors activated by TGF-β or by BMP receptors form trimeric complexes with Smad4 to target specific genes for cell fate regulation. The CAGAC motif has been considered as the main binding element for Smad2/3/4, whereas Smad1/5/8 have been thought to preferentially bind GC-rich elements. However, chromatin immunoprecipitation analysis in embryonic stem cells showed extensive binding of Smad2/3/4 to GC-rich cis-regulatory elements. Here, we present the structural basis for specific binding of Smad3 and Smad4 to GC-rich motifs in the goosecoid promoter, a nodal-regulated differentiation gene. The structures revealed a 5-bp consensus sequence GGC(GC)|(CG) as the binding site for both TGF-β and BMP-activated Smads and for Smad4. These 5GC motifs are highly represented as clusters in Smad-bound regions genome-wide. Our results provide a basis for understanding the functional adaptability of Smads in different cellular contexts, and their dependence on lineage-determining transcription factors to target specific genes in TGF-β and BMP pathways.
QuateXelero: An Accelerated Exact Network Motif Detection Algorithm

PubMed Central

Khakabimamaghani, Sahand; Sharafuddin, Iman; Dichter, Norbert; Koch, Ina; Masoudi-Nejad, Ali

2013-01-01

Finding motifs in biological, social, technological, and other types of networks has become a widespread method to gain more knowledge about these networks’ structure and function. However, this task is very computationally demanding, because it is highly associated with the graph isomorphism which is an NP problem (not known to belong to P or NP-complete subsets yet). Accordingly, this research is endeavoring to decrease the need to call NAUTY isomorphism detection method, which is the most time-consuming step in many existing algorithms. The work provides an extremely fast motif detection algorithm called QuateXelero, which has a Quaternary Tree data structure in the heart. The proposed algorithm is based on the well-known ESU (FANMOD) motif detection algorithm. The results of experiments on some standard model networks approve the overal superiority of the proposed algorithm, namely QuateXelero, compared with two of the fastest existing algorithms, G-Tries and Kavosh. QuateXelero is especially fastest in constructing the central data structure of the algorithm from scratch based on the input network. PMID:23874498
A motif detection and classification method for peptide sequences using genetic programming.

PubMed

Tomita, Yasuyuki; Kato, Ryuji; Okochi, Mina; Honda, Hiroyuki

2008-08-01

An exploration of common rules (property motifs) in amino acid sequences has been required for the design of novel sequences and elucidation of the interactions between molecules controlled by the structural or physical environment. In the present study, we developed a new method to search property motifs that are common in peptide sequence data. Our method comprises the following two characteristics: (i) the automatic determination of the position and length of common property motifs by calculating the physicochemical similarity of amino acids, and (ii) the quick and effective exploration of motif candidates that discriminates the positives and negatives by the introduction of genetic programming (GP). Our method was evaluated by two types of model data sets. First, the intentionally buried property motifs were searched in the artificially derived peptide data containing intentionally buried property motifs. As a result, the expected property motifs were correctly extracted by our algorithm. Second, the peptide data that interact with MHC class II molecules were analyzed as one of the models of biologically active peptides with buried motifs in various lengths. Twofold MHC class II binding peptides were identified with the rule using our method, compared to the existing scoring matrix method. In conclusion, our GP based motif searching approach enabled to obtain knowledge of functional aspects of the peptides without any prior knowledge.
Discriminative motif discovery via simulated evolution and random under-sampling.

PubMed

Song, Tao; Gu, Hong

2014-01-01

Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.
Identifying the preferred RNA motifs and chemotypes that interact by probing millions of combinations.

PubMed

Tran, Tuan; Disney, Matthew D

2012-01-01

RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here, we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (among a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole and pyridinium chemotypes allow for specific recognition of RNA motifs. As targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses.
Identifying the Preferred RNA Motifs and Chemotypes that Interact by Probing Millions of Combinations

PubMed Central

Tran, Tuan; Disney, Matthew D.

2012-01-01

RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (amongst a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole, and pyridinium chemotypes allow for specific recognition of RNA motifs. Since targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses. PMID:23047683
Deciphering functional glycosaminoglycan motifs in development.

PubMed

Townley, Robert A; Bülow, Hannes E

2018-03-23

Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby repsonsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.
Ca2+-binding Motif of βγ-Crystallins*

PubMed Central

Srivastava, Shanti Swaroop; Mishra, Amita; Krishnan, Bal; Sharma, Yogendra

2014-01-01

βγ-Crystallin-type double clamp (N/D)(N/D)XX(S/T)S motif is an established but sparsely investigated motif for Ca2+ binding. A βγ-crystallin domain is formed of two Greek key motifs, accommodating two Ca2+-binding sites. βγ-Crystallins make a separate class of Ca2+-binding proteins (CaBP), apparently a major group of CaBP in bacteria. Paralleling the diversity in βγ-crystallin domains, these motifs also show great diversity, both in structure and in function. Although the expression of some of them has been associated with stress, virulence, and adhesion, the functional implications of Ca2+ binding to βγ-crystallins in mediating biological processes are yet to be elucidated. PMID:24567326
Web server to identify similarity of amino acid motifs to compounds (SAAMCO).

PubMed

Casey, Fergal P; Davey, Norman E; Baran, Ivan; Varekova, Radka Svobodova; Shields, Denis C

2008-07-01

Protein-protein interactions are fundamental in mediating biological processes including metabolism, cell growth, and signaling. To be able to selectively inhibit or induce protein activity or complex formation is a key feature in controlling disease. For those situations in which protein-protein interactions derive substantial affinity from short linear peptide sequences, or motifs, we can develop search algorithms for peptidomimetic compounds that resemble the short peptide's structure but are not compromised by poor pharmacological properties. SAAMCO is a Web service ( http://bioware.ucd.ie/ approximately saamco) that facilitates the screening of motifs with known structures against bioactive compound databases. It is built on an algorithm that defines compound similarity based on the presence of appropriate amino acid side chain fragments and a favorable Root Mean Squared Deviation (RMSD) between compound and motif structure. The methodology is efficient as the available compound databases are preprocessed and fast regular expression searches filter potential matches before time-intensive 3D superposition is performed. The required input information is minimal, and the compound databases have been selected to maximize the availability of information on biological activity. "Hits" are accompanied with a visualization window and links to source database entries. Motif matching can be defined on partial or full similarity which will increase or reduce respectively the number of potential mimetic compounds. The Web server provides the functionality for rapid screening of known or putative interaction motifs against prepared compound libraries using a novel search algorithm. The tabulated results can be analyzed by linking to appropriate databases and by visualization.
The crystal structure of TrxA(CACA): Insights into the formation of a [2Fe-2S] iron-sulfur cluster in an Escherichia coli thioredoxin mutant

DOE Office of Scientific and Technical Information (OSTI.GOV)

Collet, Jean-Francois; Peisach, Daniel; Bardwell, James C.A.

2010-07-13

Escherichia coli thioredoxin is a small monomeric protein that reduces disulfide bonds in cytoplasmic proteins. Two cysteine residues present in a conserved CGPC motif are essential for this activity. Recently, we identified mutations of this motif that changed thioredoxin into a homodimer bridged by a [2Fe-2S] iron-sulfur cluster. When exported to the periplasm, these thioredoxin mutants could restore disulfide bond formation in strains lacking the entire periplasmic oxidative pathway. Essential for the assembly of the iron-sulfur was an additional cysteine that replaced the proline at position three of the CGPC motif. We solved the crystalline structure at 2.3 {angstrom} formore » one of these variants, TrxA(CACA). The mutant protein crystallized as a dimer in which the iron-sulfur cluster is replaced by two intermolecular disulfide bonds. The catalytic site, which forms the dimer interface, crystallized in two different conformations. In one of them, the replacement of the CGPC motif by CACA has a dramatic effect on the structure and causes the unraveling of an extended {alpha}-helix. In both conformations, the second cysteine residue of the CACA motif is surface-exposed, which contrasts with wildtype thioredoxin where the second cysteine of the CXXC motif is buried. This exposure of a pair of vicinal cysteine residues apparently allows thioredoxin to acquire an iron-sulfur cofactor at its active site, and thus a new activity and mechanism of action.« less
Molecular modeling of the elastomeric properties of repeating units and building blocks of resilin, a disordered elastic protein.

PubMed

Khandaker, Md Shahriar K; Dudek, Daniel M; Beers, Eric P; Dillard, David A; Bevan, David R

2016-08-01

The mechanisms responsible for the properties of disordered elastomeric proteins are not well known. To better understand the relationship between elastomeric behavior and amino acid sequence, we investigated resilin, a disordered rubber-like protein, found in specialized regions of the cuticle of insects. Resilin of Drosophila melanogaster contains Gly-rich repetitive motifs comprised of the amino acids, PSSSYGAPGGGNGGR, which confer elastic properties to resilin. The repetitive motifs of insect resilin can be divided into smaller partially conserved building blocks: PSS, SYGAP, GGGN and GGR. Using molecular dynamics (MD) simulations, we studied the relative roles of SYGAP, and its less common variants SYSAP and TYGAP, on the elastomeric properties of resilin. Results showed that SYGAP adopts a bent structure that is one-half to one-third the end-to-end length of the other motifs having an equal number of amino acids but containing SYSAP or TYGAP substituted for SYGAP. The bent structure of SYGAP forms due to conformational freedom of glycine, and hydrogen bonding within the motif apparently plays a role in maintaining this conformation. These structural features of SYGAP result in higher extensibility compared to other motifs, which may contribute to elastic properties at the macroscopic level. Overall, the results are consistent with a role for the SYGAP building block in the elastomeric properties of these disordered proteins. What we learned from simulating the repetitive motifs of resilin may be applicable to the biology and mechanics of other elastomeric biomaterials, and may provide us the deeper understanding of their unique properties. Copyright © 2016 Elsevier Ltd. All rights reserved.
Crystallographic and Computational Studies of a Class II MHC Complex with a Nonconforming Peptide: HLA-DRA/DRB3*0101

NASA Astrophysics Data System (ADS)

Parry, Christian S.; Gorski, Jack; Stern, Lawrence J.

2003-03-01

The stable binding of processed foreign peptide to a class II major histocompatibility (MHC) molecule and subsequent presentation to a T cell receptor is a central event in immune recognition and regulation. Polymorphic residues on the floor of the peptide binding site form pockets that anchor peptide side chains. These and other residues in the helical wall of the groove determine the specificity of each allele and define a motif. Allele specific motifs allow the prediction of epitopes from the sequence of pathogens. There are, however, known epitopes that do not satisfy these motifs: anchor motifs are not adequate for predicting epitopes as there are apparently major and minor motifs. We present crystallographic studies into the nature of the interactions that govern the binding of these so called nonconforming peptides. We would like to understand the role of the P10 pocket and find out whether the peptides that do not obey the consensus anchor motif bind in the canonical conformation observed in in prior structures of class II MHC-peptide complexes. HLA-DRB3*0101 complexed with peptide crystallized in unit cell 92.10 x 92.10 x 248.30 (90, 90, 90), P41212, and the diffraction data is reliable to 2.2ÅWe are complementing our studies with dynamical long time simulations to answer these questions, particularly the interplay of the anchor motifs in peptide binding, the range of protein and ligand conformations, and water hydration structures.
Genome-wide colonization of gene regulatory elements by G4 DNA motifs

PubMed Central

Du, Zhuo; Zhao, Yiqiang; Li, Ning

2009-01-01

G-quadruplex (or G4 DNA), a stable four-stranded structure found in guanine-rich regions, is implicated in the transcriptional regulation of genes involved in growth and development. Previous studies on the role of G4 DNA in gene regulation mostly focused on genomic regions proximal to transcription start sites (TSSs). To gain a more comprehensive understanding of the regulatory role of G4 DNA, we examined the landscape of potential G4 DNA (PG4Ms) motifs in the human genome and found that G4 motifs, not restricted to those found in the TSS-proximal regions, are bias toward gene-associated regions. Significantly, analyses of G4 motifs in seven types of well-known gene regulatory elements revealed a constitutive enrichment pattern and the clusters of G4 motifs tend to be colocalized with regulatory elements. Considering our analysis from a genome evolutionary perspective, we found evidence that the occurrence and accumulation of certain progenitors and canonical G4 DNA motifs within regulatory regions were progressively favored by natural selection. Our results suggest that G4 DNA motifs are ‘colonized’ in regulatory regions, supporting a likely genome-wide role of G4 DNA in gene regulation. We hypothesize that G4 DNA is a regulatory apparatus situated in regulatory elements, acting as a molecular switch that can modulate the role of the host functional regions, by transition in DNA structure. PMID:19759215
An analysis of multi-type relational interactions in FMA using graph motifs with disjointness constraints.

PubMed

Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

2012-01-01

The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation.
An Analysis of Multi-type Relational Interactions in FMA Using Graph Motifs with Disjointness Constraints

PubMed Central

Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

2012-01-01

The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation. PMID:23304382
Automated extraction and classification of RNA tertiary structure cyclic motifs

PubMed Central

Lemieux, Sébastien; Major, François

2006-01-01

A minimum cycle basis of the tertiary structure of a large ribosomal subunit (LSU) X-ray crystal structure was analyzed. Most cycles are small, as they are composed of 3- to 5 nt, and repeated across the LSU tertiary structure. We used hierarchical clustering to quantify and classify the 4 nt cycles. One class is defined by the GNRA tetraloop motif. The inspection of the GNRA class revealed peculiar instances in sequence. First is the presence of UA, CA, UC and CC base pairs that substitute the usual sheared GA base pair. Second is the revelation of GNR(Xn)A tetraloops, where Xn is bulged out of the classical GNRA structure, and of GN/RA formed by the two strands of interior-loops. We were able to unambiguously characterize the cycle classes using base stacking and base pairing annotations. The cycles identified correspond to small and cyclic motifs that compose most of the LSU RNA tertiary structure and contribute to its thermodynamic stability. Consequently, the RNA minimum cycles could well be used as the basic elements of RNA tertiary structure prediction methods. PMID:16679452

Giant Reverse Transcriptase-Encoding Transposable Elements at Telomeres.

PubMed

Arkhipova, Irina R; Yushenova, Irina A; Rodriguez, Fernando

2017-09-01

Transposable elements are omnipresent in eukaryotic genomes and have a profound impact on chromosome structure, function and evolution. Their structural and functional diversity is thought to be reasonably well-understood, especially in retroelements, which transpose via an RNA intermediate copied into cDNA by the element-encoded reverse transcriptase, and are characterized by a compact structure. Here, we report a novel type of expandable eukaryotic retroelements, which we call Terminons. These elements can attach to G-rich telomeric repeat overhangs at the chromosome ends, in a process apparently facilitated by complementary C-rich repeats at the 3'-end of the RNA template immediately adjacent to a hammerhead ribozyme motif. Terminon units, which can exceed 40 kb in length, display an unusually complex and diverse structure, and can form very long chains, with host genes often captured between units. As the principal polymerizing component, Terminons contain Athena reverse transcriptases previously described in bdelloid rotifers and belonging to the enigmatic group of Penelope-like elements, but can additionally accumulate multiple cooriented ORFs, including DEDDy 3'-exonucleases, GDSL esterases/lipases, GIY-YIG-like endonucleases, rolling-circle replication initiator (Rep) proteins, and putatively structural ORFs with coiled-coil motifs and transmembrane domains. The extraordinary length and complexity of Terminons and the high degree of interfamily variability in their ORF content challenge the current views on the structural organization of eukaryotic retroelements, and highlight their possible connections with the viral world and the implications for the elevated frequency of gene transfer. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Topological impact of noncanonical DNA structures on Klenow fragment of DNA polymerase.

PubMed

Takahashi, Shuntaro; Brazier, John A; Sugimoto, Naoki

2017-09-05

Noncanonical DNA structures that stall DNA replication can cause errors in genomic DNA. Here, we investigated how the noncanonical structures formed by sequences in genes associated with a number of diseases impacted DNA polymerization by the Klenow fragment of DNA polymerase. Replication of a DNA sequence forming an i-motif from a telomere, hypoxia-induced transcription factor, and an insulin-linked polymorphic region was effectively inhibited. On the other hand, replication of a mixed-type G-quadruplex (G4) from a telomere was less inhibited than that of the antiparallel type or parallel type. Interestingly, the i-motif was a better inhibitor of replication than were mixed-type G4s or hairpin structures, even though all had similar thermodynamic stabilities. These results indicate that both the stability and topology of structures formed in DNA templates impact the processivity of a DNA polymerase. This suggests that i-motif formation may trigger genomic instability by stalling the replication of DNA, causing intractable diseases.
Topological impact of noncanonical DNA structures on Klenow fragment of DNA polymerase

PubMed Central

Takahashi, Shuntaro; Brazier, John A.; Sugimoto, Naoki

2017-01-01

Noncanonical DNA structures that stall DNA replication can cause errors in genomic DNA. Here, we investigated how the noncanonical structures formed by sequences in genes associated with a number of diseases impacted DNA polymerization by the Klenow fragment of DNA polymerase. Replication of a DNA sequence forming an i-motif from a telomere, hypoxia-induced transcription factor, and an insulin-linked polymorphic region was effectively inhibited. On the other hand, replication of a mixed-type G-quadruplex (G4) from a telomere was less inhibited than that of the antiparallel type or parallel type. Interestingly, the i-motif was a better inhibitor of replication than were mixed-type G4s or hairpin structures, even though all had similar thermodynamic stabilities. These results indicate that both the stability and topology of structures formed in DNA templates impact the processivity of a DNA polymerase. This suggests that i-motif formation may trigger genomic instability by stalling the replication of DNA, causing intractable diseases. PMID:28827350
Methods for Identifying Ligands that Target Nucleic Acid Molecules and Nucleic Acid Structural Motifs

NASA Technical Reports Server (NTRS)

Childs-Disney, Jessica L. (Inventor); Disney, Matthew D. (Inventor)

2017-01-01

Disclosed are methods for identifying a nucleic acid (e.g., RNA, DNA, etc.) motif which interacts with a ligand. The method includes providing a plurality of ligands immobilized on a support, wherein each particular ligand is immobilized at a discrete location on the support; contacting the plurality of immobilized ligands with a nucleic acid motif library under conditions effective for one or more members of the nucleic acid motif library to bind with the immobilized ligands; and identifying members of the nucleic acid motif library that are bound to a particular immobilized ligand. Also disclosed are methods for selecting, from a plurality of candidate ligands, one or more ligands that have increased likelihood of binding to a nucleic acid molecule comprising a particular nucleic acid motif, as well as methods for identifying a nucleic acid which interacts with a ligand.
The Proliferating Cell Nuclear Antigen (PCNA)-interacting Protein (PIP) Motif of DNA Polymerase η Mediates Its Interaction with the C-terminal Domain of Rev1*

PubMed Central

Boehm, Elizabeth M.; Powers, Kyle T.; Kondratick, Christine M.; Spies, Maria; Houtman, Jon C. D.; Washington, M. Todd

2016-01-01

Y-family DNA polymerases, such as polymerase η, polymerase ι, and polymerase κ, catalyze the bypass of DNA damage during translesion synthesis. These enzymes are recruited to sites of DNA damage by interacting with the essential replication accessory protein proliferating cell nuclear antigen (PCNA) and the scaffold protein Rev1. In most Y-family polymerases, these interactions are mediated by one or more conserved PCNA-interacting protein (PIP) motifs that bind in a hydrophobic pocket on the front side of PCNA as well as by conserved Rev1-interacting region (RIR) motifs that bind in a hydrophobic pocket on the C-terminal domain of Rev1. Yeast polymerase η, a prototypical translesion synthesis polymerase, binds both PCNA and Rev1. It possesses a single PIP motif but not an RIR motif. Here we show that the PIP motif of yeast polymerase η mediates its interactions both with PCNA and with Rev1. Moreover, the PIP motif of polymerase η binds in the hydrophobic pocket on the Rev1 C-terminal domain. We also show that the RIR motif of human polymerase κ and the PIP motif of yeast Msh6 bind both PCNA and Rev1. Overall, these findings demonstrate that PIP motifs and RIR motifs have overlapping specificities and can interact with both PCNA and Rev1 in structurally similar ways. These findings also suggest that PIP motifs are a more versatile protein interaction motif than previously believed. PMID:26903512
Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d.

PubMed

Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J

2010-08-03

Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.
Identification of sequence–structure RNA binding motifs for SELEX-derived aptamers

PubMed Central

Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E.; Przytycka, Teresa M.

2012-01-01

Motivation: Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. Results: To close this gap we developed, Aptamotif, a computational method for the identification of sequence–structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the author's findings to a high degree, thus proving the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process. Contact: przytyck@ncbi.nlm.nih.gov, Zuben.Sauna@fda.hhs.gov PMID:22689764
Signal-dependent export of GABA transporter 1 from the ER-Golgi intermediate compartment is specified by a C-terminal motif

PubMed Central

Farhan, Hesso; Reiterer, Veronika; Kriz, Alexander; Hauri, Hans-Peter; Pavelka, Margit; Sitte, Harald H.; Freissmuth, Michael

2015-01-01

Summary The C-terminus of GABA transporter 1 (GAT1, SLC6A1) is required for trafficking of the protein through the secretory pathway to reach its final destination, i.e. the rim of the synaptic specialization. We identified a motif of three hydrophobic residues (569VMI571) that was required for export of GAT1 from the ER-Golgi intermediate compartment (ERGIC). This conclusion was based on the following observations: (i) GAT1-SSS, the mutant in which 569VMI571 was replaced by serine residues, was exported from the ER in a COPII-dependent manner but accumulated in punctate structures and failed to reach the Golgi; (ii) under appropriate conditions (imposing a block at 15°C, disruption of COPI), these structures also contained ERGIC53; (iii) the punctae were part of a dynamic compartment, because it was accessible to a second anterograde cargo [the temperature-sensitive variant of vesicular stomatitis virus G protein (VSV-G)] and because GAT1-SSS could be retrieved from the punctate structures by addition of a KKxx-based retrieval motif, which supported retrograde transport to the ER. To the best of our knowledge, the VMI-motif of GAT1 provides the first example of a cargo-based motif that specifies export from the ERGIC. PMID:18285449
Statistics of optimal information flow in ensembles of regulatory motifs

NASA Astrophysics Data System (ADS)

Crisanti, Andrea; De Martino, Andrea; Fiorentino, Jonathan

2018-02-01

Genetic regulatory circuits universally cope with different sources of noise that limit their ability to coordinate input and output signals. In many cases, optimal regulatory performance can be thought to correspond to configurations of variables and parameters that maximize the mutual information between inputs and outputs. Since the mid-2000s, such optima have been well characterized in several biologically relevant cases. Here we use methods of statistical field theory to calculate the statistics of the maximal mutual information (the "capacity") achievable by tuning the input variable only in an ensemble of regulatory motifs, such that a single controller regulates N targets. Assuming (i) sufficiently large N , (ii) quenched random kinetic parameters, and (iii) small noise affecting the input-output channels, we can accurately reproduce numerical simulations both for the mean capacity and for the whole distribution. Our results provide insight into the inherent variability in effectiveness occurring in regulatory systems with heterogeneous kinetic parameters.
Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development.

PubMed

Bandyopadhyay, Deepak; Huan, Jun; Prins, Jan; Snoeyink, Jack; Wang, Wei; Tropsha, Alexander

2009-11-01

Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name used as a vertex label) are connected by geometrical proximity edges. The approach employs two steps. First, it uses a fast subgraph mining algorithm to find all occurrences of family-specific labeled subgraphs for all well characterized protein structural and functional families. Second, it queries a new structure for occurrences of a set of motifs characteristic of a known family, using a graph index to speed up Ullman's subgraph isomorphism algorithm. The confidence of function inference from structure depends on the number of family-specific motifs found in the query structure compared with their distribution in a large non-redundant database of proteins. This method can assign a new structure to a specific functional family in cases where sequence alignments, sequence patterns, structural superposition and active site templates fail to provide accurate annotation.
A comprehensive analysis of three Asiatic black bear mitochondrial genomes (subspecies ussuricus, formosanus and mupinensis), with emphasis on the complete mtDNA sequence of Ursus thibetanus ussuricus (Ursidae).

PubMed

Hwang, Dae-Sik; Ki, Jang-Seu; Jeong, Dong-Hyuk; Kim, Bo-Hyun; Lee, Bae-Keun; Han, Sang-Hoon; Lee, Jae-Seong

2008-08-01

In the present paper, we describe the mitochondrial genome sequence of the Asiatic black bear (Ursus thibetanus ussuricus) with particular emphasis on the control region (CR), and compared with mitochondrial genomes on molecular relationships among the bears. The mitochondrial genome sequence of U. thibetanus ussuricus was 16,700 bp in size with mostly conserved structures (e.g. 13 protein-coding, two rRNA genes, 22 tRNA genes). The CR consisted of several typical conserved domains such as F, E, D, and C boxes, and a conserved sequence block. Nucleotide sequences and the repeated motifs in the CR were different among the bear species, and their copy numbers were also variable according to populations, even within F1 generations of U. thibetanus ussuricus. Comparative analyses showed that the CR D1 region was highly informative for the discrimination of the bear family. These findings suggest that nucleotide sequences of both repeated motifs and CR D1 in the bear family are good markers for species discriminations.
DNA nanotechnology based on i-motif structures.

PubMed

Dong, Yuanchen; Yang, Zhongqiang; Liu, Dongsheng

2014-06-17

CONSPECTUS: Most biological processes happen at the nanometer scale, and understanding the energy transformations and material transportation mechanisms within living organisms has proved challenging. To better understand the secrets of life, researchers have investigated artificial molecular motors and devices over the past decade because such systems can mimic certain biological processes. DNA nanotechnology based on i-motif structures is one system that has played an important role in these investigations. In this Account, we summarize recent advances in functional DNA nanotechnology based on i-motif structures. The i-motif is a DNA quadruplex that occurs as four stretches of cytosine repeat sequences form C·CH(+) base pairs, and their stabilization requires slightly acidic conditions. This unique property has produced the first DNA molecular motor driven by pH changes. The motor is reliable, and studies show that it is capable of millisecond running speeds, comparable to the speed of natural protein motors. With careful design, the output of these types of motors was combined to drive micrometer-sized cantilevers bend. Using established DNA nanostructure assembly and functionalization methods, researchers can easily integrate the motor within other DNA assembled structures and functional units, producing DNA molecular devices with new functions such as suprahydrophobic/suprahydrophilic smart surfaces that switch, intelligent nanopores triggered by pH changes, molecular logic gates, and DNA nanosprings. Recently, researchers have produced motors driven by light and electricity, which have allowed DNA motors to be integrated within silicon-based nanodevices. Moreover, some devices based on i-motif structures have proven useful for investigating processes within living cells. The pH-responsiveness of the i-motif structure also provides a way to control the stepwise assembly of DNA nanostructures. In addition, because of the stability of the i-motif, this structure can serve as the stem of one-dimensional nanowires, and a four-strand stem can provide a new basis for three-dimensional DNA structures such as pillars. By sacrificing some accuracy in assembly, we used these properties to prepare the first fast-responding pure DNA supramolecular hydrogel. This hydrogel does not swell and cannot encapsulate small molecules. These unique properties could lead to new developments in smart materials based on DNA assembly and support important applications in fields such as tissue engineering. We expect that DNA nanotechnology will continue to develop rapidly. At a fundamental level, further studies should lead to greater understanding of the energy transformation and material transportation mechanisms at the nanometer scale. In terms of applications, we expect that many of these elegant molecular devices will soon be used in vivo. These further studies could demonstrate the power of DNA nanotechnology in biology, material science, chemistry, and physics.
Revisiting the TALE repeat.

PubMed

Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

2014-04-01

Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.
Kinetic, Thermodynamic, and Structural Characterizations of the Association between Nrf2-DLGex Degron and Keap1

PubMed Central

Fukutomi, Toshiaki; Takagi, Kenji; Mizushima, Tsunehiro; Ohuchi, Noriaki

2014-01-01

Transcription factor Nrf2 (NF-E2-related factor 2) coordinately regulates cytoprotective gene expression, but under unstressed conditions, Nrf2 is degraded rapidly through Keap1 (Kelch-like ECH-associated protein 1)-mediated ubiquitination. Nrf2 harbors two Keap1-binding motifs, DLG and ETGE. Interactions between these two motifs and Keap1 constitute a key regulatory nexus for cellular Nrf2 activity through the formation of a two-site binding hinge-and-latch mechanism. In this study, we determined the minimum Keap1-binding sequence of the DLG motif, the low-affinity latch site, and defined a new DLGex motif that covers a sequence much longer than that previously defined. We have successfully clarified the crystal structure of the Keap1-DC-DLGex complex at 1.6 Å. DLGex possesses a complicated helix structure, which interprets well the human-cancer-derived loss-of-function mutations in DLGex. In thermodynamic analyses, Keap1-DLGex binding is characterized as enthalpy and entropy driven, while Keap1-ETGE binding is characterized as purely enthalpy driven. In kinetic analyses, Keap1-DLGex binding follows a fast-association and fast-dissociation model, while Keap1-ETGE binding contains a slow-reaction step that leads to a stable conformation. These results demonstrate that the mode of DLGex binding to Keap1 is distinct from that of ETGE structurally, thermodynamically, and kinetically and support our contention that the DLGex motif serves as a converter transmitting environmental stress to Nrf2 induction as the latch site. PMID:24366543
Extra! Extra! Read All about It!: Structuring the U.S. History Survey around the Motif of the Newspaper

ERIC Educational Resources Information Center

Morin, Erica A.

2013-01-01

As a graduate instructor for HIST 152: United States Since 1877, the author structures the entire course around the motif of the newspaper. She models her curriculum after the newspaper both visually and symbolically and uses it as a theme throughout the class. The newspaper is not a gimmick or cliche, but rather a recurring stylistic theme, an…
Sites of instability in the human TCF3 (E2A) gene adopt G-quadruplex DNA structures in vitro

PubMed Central

Williams, Jonathan D.; Fleetwood, Sara; Berroyer, Alexandra; Kim, Nayun; Larson, Erik D.

2015-01-01

The formation of highly stable four-stranded DNA, called G-quadruplex (G4), promotes site-specific genome instability. G4 DNA structures fold from repetitive guanine sequences, and increasing experimental evidence connects G4 sequence motifs with specific gene rearrangements. The human transcription factor 3 (TCF3) gene (also termed E2A) is subject to genetic instability associated with severe disease, most notably a common translocation event t(1;19) associated with acute lymphoblastic leukemia. The sites of instability in TCF3 are not randomly distributed, but focused to certain sequences. We asked if G4 DNA formation could explain why TCF3 is prone to recombination and mutagenesis. Here we demonstrate that sequences surrounding the major t(1;19) break site and a region associated with copy number variations both contain G4 sequence motifs. The motifs identified readily adopt G4 DNA structures that are stable enough to interfere with DNA synthesis in physiological salt conditions in vitro. When introduced into the yeast genome, TCF3 G4 motifs promoted gross chromosomal rearrangements in a transcription-dependent manner. Our results provide a molecular rationale for the site-specific instability of human TCF3, suggesting that G4 DNA structures contribute to oncogenic DNA breaks and recombination. PMID:26029241
Structural details (kinks and non-α conformations) in transmembrane helices are intrahelically determined and can be predicted by sequence pattern descriptors

PubMed Central

Rigoutsos, Isidore; Riek, Peter; Graham, Robert M.; Novotny, Jiri

2003-01-01

One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular α-helical character (i.e. π-helices, 310-helices and kinks). A ‘search engine’ derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above ‘non-canonical’ helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from α-helicity are encoded locally in sequence patterns only about 7–9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure–function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html. PMID:12888523
Molecular Signaling Network Motifs Provide a Mechanistic Basis for Cellular Threshold Responses

PubMed Central

Bhattacharya, Sudin; Conolly, Rory B.; Clewell, Harvey J.; Kaminski, Norbert E.; Andersen, Melvin E.

2014-01-01

Background: Increasingly, there is a move toward using in vitro toxicity testing to assess human health risk due to chemical exposure. As with in vivo toxicity testing, an important question for in vitro results is whether there are thresholds for adverse cellular responses. Empirical evaluations may show consistency with thresholds, but the main evidence has to come from mechanistic considerations. Objectives: Cellular response behaviors depend on the molecular pathway and circuitry in the cell and the manner in which chemicals perturb these circuits. Understanding circuit structures that are inherently capable of resisting small perturbations and producing threshold responses is an important step towards mechanistically interpreting in vitro testing data. Methods: Here we have examined dose–response characteristics for several biochemical network motifs. These network motifs are basic building blocks of molecular circuits underpinning a variety of cellular functions, including adaptation, homeostasis, proliferation, differentiation, and apoptosis. For each motif, we present biological examples and models to illustrate how thresholds arise from specific network structures. Discussion and Conclusion: Integral feedback, feedforward, and transcritical bifurcation motifs can generate thresholds. Other motifs (e.g., proportional feedback and ultrasensitivity)produce responses where the slope in the low-dose region is small and stays close to the baseline. Feedforward control may lead to nonmonotonic or hormetic responses. We conclude that network motifs provide a basis for understanding thresholds for cellular responses. Computational pathway modeling of these motifs and their combinations occurring in molecular signaling networks will be a key element in new risk assessment approaches based on in vitro cellular assays. Citation: Zhang Q, Bhattacharya S, Conolly RB, Clewell HJ III, Kaminski NE, Andersen ME. 2014. Molecular signaling network motifs provide a mechanistic basis for cellular threshold responses. Environ Health Perspect 122:1261–1270; http://dx.doi.org/10.1289/ehp.1408244 PMID:25117432
Modulation of the multistate folding of designed TPR proteins through intrinsic and extrinsic factors

PubMed Central

Phillips, J J; Javadi, Y; Millership, C; Main, E R G

2012-01-01

Tetratricopeptide repeats (TPRs) are a class of all alpha-helical repeat proteins that are comprised of 34-aa helix-turn-helix motifs. These stack together to form nonglobular structures that are stabilized by short-range interactions from residues close in primary sequence. Unlike globular proteins, they have few, if any, long-range nonlocal stabilizing interactions. Several studies on designed TPR proteins have shown that this modular structure is reflected in their folding, that is, modular multistate folding is observed as opposed to two-state folding. Here we show that TPR multistate folding can be suppressed to approximate two-state folding through modulation of intrinsic stability or extrinsic environmental variables. This modulation was investigated by comparing the thermodynamic unfolding under differing buffer regimes of two distinct series of consensus-designed TPR proteins, which possess different intrinsic stabilities. A total of nine proteins of differing sizes and differing consensus TPR motifs were each thermally and chemically denatured and their unfolding monitored using differential scanning calorimetry (DSC) and CD/fluorescence, respectively. Analyses of both the DSC and chemical denaturation data show that reducing the total stability of each protein and repeat units leads to observable two-state unfolding. These data highlight the intimate link between global and intrinsic repeat stability that governs whether folding proceeds by an observably two-state mechanism, or whether partial unfolding yields stable intermediate structures which retain sufficient stability to be populated at equilibrium. PMID:22170589
HRD Motif as the Central Hub of the Signaling Network for Activation Loop Autophosphorylation in Abl Kinase.

PubMed

La Sala, Giuseppina; Riccardi, Laura; Gaspari, Roberto; Cavalli, Andrea; Hantschel, Oliver; De Vivo, Marco

2016-11-08

A number of structural factors modulate the activity of Abelson (Abl) tyrosine kinase, whose deregulation is often related to oncogenic processes. First, only the open conformation of the Abl kinase domain's activation loop (A-loop) favors ATP binding to the catalytic cleft. In this regard, the trans-autophosphorylation of the Y412 residue, which is located along the A-loop, favors the stability of the open conformation, in turn enhancing Abl activity. Another key factor for full Abl activity is the formation of active conformations of the catalytic DFG motif in the Abl kinase domain. Furthermore, binding of the SH2 domain to the N-lobe of the Abl kinase was recently demonstrated to have a long-range allosteric effect on the stabilization of the A-loop open state. Intriguingly, these distinct structural factors imply a complex signal transmission network for controlling the A-loop's flexibility and conformational preference for optimal Abl function. However, the exact dynamical features of this signal transmission network structure remain unclear. Here, we report on microsecond-long molecular dynamics coupled with enhanced sampling simulations of multiple Abl model systems, in the presence or absence of the SH2 domain and with the DFG motif flipped in two ways (in or out conformation). Through comparative analysis, our simulations augment the interpretation of the existing Abl experimental data, revealing a dynamical network of interactions that interconnect SH2 domain binding with A-loop plasticity and Y412 autophosphorylation in Abl. This signaling network engages the DFG motif and, importantly, other conserved structural elements of the kinase domain, namely, the EPK-ELK H-bond network and the HRD motif. Our results show that the signal propagation for modulating the A-loop spatial localization is highly dependent on the HRD motif conformation, which thus acts as the central hub of this (allosteric) signaling network controlling Abl activation and function.

Defining RNA motif-aminoglycoside interactions via two-dimensional combinatorial screening and structure-activity relationships through sequencing.

PubMed

Velagapudi, Sai Pradeep; Disney, Matthew D

2013-10-15

RNA is an extremely important target for the development of chemical probes of function or small molecule therapeutics. Aminoglycosides are the most well studied class of small molecules to target RNA. However, the RNA motifs outside of the bacterial rRNA A-site that are likely to be bound by these compounds in biological systems is largely unknown. If such information were known, it could allow for aminoglycosides to be exploited to target other RNAs and, in addition, could provide invaluable insights into potential bystander targets of these clinically used drugs. We utilized two-dimensional combinatorial screening (2DCS), a library-versus-library screening approach, to select the motifs displayed in a 3×3 nucleotide internal loop library and in a 6-nucleotide hairpin library that bind with high affinity and selectivity to six aminoglycoside derivatives. The selected RNA motifs were then analyzed using structure-activity relationships through sequencing (StARTS), a statistical approach that defines the privileged RNA motif space that binds a small molecule. StARTS allowed for the facile annotation of the selected RNA motif-aminoglycoside interactions in terms of affinity and selectivity. The interactions selected by 2DCS generally have nanomolar affinities, which is higher affinity than the binding of aminoglycosides to a mimic of their therapeutic target, the bacterial rRNA A-site. Copyright © 2013 Elsevier Ltd. All rights reserved.
Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.

PubMed

Disney, Matthew D; Winkelsas, Audrey M; Velagapudi, Sai Pradeep; Southern, Mark; Fallahi, Mohammad; Childs-Disney, Jessica L

2016-06-17

The development of small molecules that target RNA is challenging yet, if successful, could advance the development of chemical probes to study RNA function or precision therapeutics to treat RNA-mediated disease. Previously, we described Inforna, an approach that can mine motifs (secondary structures) within target RNAs, which is deduced from the RNA sequence, and compare them to a database of known RNA motif-small molecule binding partners. Output generated by Inforna includes the motif found in both the database and the desired RNA target, lead small molecules for that target, and other related meta-data. Lead small molecules can then be tested for binding and affecting cellular (dys)function. Herein, we describe Inforna 2.0, which incorporates all known RNA motif-small molecule binding partners reported in the scientific literature, a chemical similarity searching feature, and an improved user interface and is freely available via an online web server. By incorporation of interactions identified by other laboratories, the database has been doubled, containing 1936 RNA motif-small molecule interactions, including 244 unique small molecules and 1331 motifs. Interestingly, chemotype analysis of the compounds that bind RNA in the database reveals features in small molecule chemotypes that are privileged for binding. Further, this updated database expanded the number of cellular RNAs to which lead compounds can be identified.
Intrastrand triplex DNA repeats in bacteria: a source of genomic instability

PubMed Central

Holder, Isabelle T.; Wagner, Stefanie; Xiong, Peiwen; Sinn, Malte; Frickey, Tancred; Meyer, Axel; Hartig, Jörg S.

2015-01-01

Repetitive nucleic acid sequences are often prone to form secondary structures distinct from B-DNA. Prominent examples of such structures are DNA triplexes. We observed that certain intrastrand triplex motifs are highly conserved and abundant in prokaryotic genomes. A systematic search of 5246 different prokaryotic plasmids and genomes for intrastrand triplex motifs was conducted and the results summarized in the ITxF database available online at http://bioinformatics.uni-konstanz.de/utils/ITxF/. Next we investigated biophysical and biochemical properties of a particular G/C-rich triplex motif (TM) that occurs in many copies in more than 260 bacterial genomes by CD and nuclear magnetic resonance spectroscopy as well as in vivo footprinting techniques. A characterization of putative properties and functions of these unusually frequent nucleic acid motifs demonstrated that the occurrence of the TM is associated with a high degree of genomic instability. TM-containing genomic loci are significantly more rearranged among closely related Escherichia coli strains compared to control sites. In addition, we found very high frequencies of TM motifs in certain Enterobacteria and Cyanobacteria that were previously described as genetically highly diverse. In conclusion we link intrastrand triplex motifs with the induction of genomic instability. We speculate that the observed instability might be an adaptive feature of these genomes that creates variation for natural selection to act upon. PMID:26450966
Ca2+-Induced Rigidity Change of the Myosin VIIa IQ Motif-Single α Helix Lever Arm Extension.

PubMed

Li, Jianchao; Chen, Yiyun; Deng, Yisong; Unarta, Ilona Christy; Lu, Qing; Huang, Xuhui; Zhang, Mingjie

2017-04-04

Several unconventional myosins contain a highly charged single α helix (SAH) immediately following the calmodulin (CaM) binding IQ motifs, functioning to extend lever arms of these myosins. How such SAH is connected to the IQ motifs and whether the conformation of the IQ motifs-SAH segments are regulated by Ca 2+ fluctuations are not known. Here, we demonstrate by solving its crystal structure that the predicted SAH of myosin VIIa (Myo7a) forms a stable SAH. The structure of Myo7a IQ5-SAH segment in complex with apo-CaM reveals that the SAH sequence can extend the length of the Myo7a lever arm. Although Ca 2+ -CaM remains bound to IQ5-SAH, the Ca 2+ -induced CaM binding mode change softens the conformation of the IQ5-SAH junction, revealing a Ca 2+ -induced lever arm flexibility change for Myo7a. We further demonstrate that the last IQ motif of several other myosins also binds to both apo- and Ca 2+ -CaM, suggesting a common Ca 2+ -induced conformational regulation mechanism. Copyright © 2017 Elsevier Ltd. All rights reserved.
Nucleic Acid i-Motif Structures in Analytical Chemistry.

PubMed

Alba, Joan Josep; Sadurní, Anna; Gargallo, Raimundo

2016-09-02

Under the appropriate experimental conditions of pH and temperature, cytosine-rich segments in DNA or RNA sequences may produce a characteristic folded structure known as an i-motif. Besides its potential role in vivo, which is still under investigation, this structure has attracted increasing interest in other fields due to its sharp, fast and reversible pH-driven conformational changes. This "on/off" switch at molecular level is being used in nanotechnology and analytical chemistry to develop nanomachines and sensors, respectively. This paper presents a review of the latest applications of this structure in the field of chemical analysis.
Supramolecular architectures constructed through self-assembly of a chalcone and substituted diazo-β-diketones

NASA Astrophysics Data System (ADS)

Prajapati, R.; Mishra, L.; Grabowski, S. J.; Govil, G.; Dubey, S. K.

2008-05-01

Organic compounds namely pyridyl chalcone viz. 3-[4-(3-oxo-3-pyridin-2-yl-propenyl)-phenyl]-1-pyridin-2-yl-propenone (L 1), p-cholorophenyldiazopentane-2,4-dione (L 2) and p-methyl phenyldiazopentane-2,4-dione (L 3) have been characterized by their single-crystal X-ray crystallographic studies. Several structural motifs resulting upon their self-association through probable non-covalent interactions have been discussed. The studies of related motifs found in Cambridge Structural Database are performed and the results are related to the structural data obtained for crystal structures reported here in.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Guo Qiang; Luo, Lingyun; Ogbuji, Chime

The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions. MOCH represents patterns of multitype interaction as small labeled sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology (OWL, RDF and SPARQL) andmore » Virtuoso, we performed exhaustive analyses of three 2-node motifs, resulting in 638 matching FMA configurations; twelve 3-node motifs, resulting in 202,960 configurations. Using the Principal Ideal Explorer (PIE) methodology as an extension of MOCH, we were able to identify 755 root nodes with 4,100 respective descendants with opposing antonyms in their class names for arbitrary-length motifs. With possible disjointness implied by antonyms, we performed manual inspection of a subset of the resulting FMA fragments and tracked down a source of abnormal inferred conclusions (captured by the motifs), coming from a gender-neutral class being modeled as a part of gender-specific class, such as “Urinary system” is a part of “Female human body.” Our results demonstrate that MOCH and PIE provide a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation.« less
Structural Basis for Recognition and Sequestration of UUUOH 3 ' Temini of Nascent RNA Polymerase III Transcripts by La, a Rheumatic Disease Autoantigen

DOE Office of Scientific and Technical Information (OSTI.GOV)

Teplova,M.; Yuan, Y.; Phan, A.

2006-01-01

The nuclear phosphoprotein La was identified as an autoantigen in patients with systemic lupus erythematosus and Sjogren's syndrome. La binds to and protects the UUUOH 3' terminii of nascent RNA polymerase III transcripts from exonuclease digestion. We report the 1.85 Angstroms crystal structure of the N-terminal domain of human La, consisting of La and RRM1 motifs, bound to r(U1-G2-C3-U4-G5-U6-U7-U8-U9OH). The U7-U8-U9OH 3' end, in a splayed-apart orientation, is sequestered within a basic and aromatic amino acid-lined cleft between the La and RRM1 motifs. The specificity-determining U8 residue bridges both motifs, in part through unprecedented targeting of the {beta} sheet edge,more » rather than the anticipated face, of the RRM1 motif. Our structural observations, supported by mutation studies of both La and RNA components, illustrate the principles behind RNA sequestration by a rheumatic disease autoantigen, whereby the UUUOH 3' ends of nascent RNA transcripts are protected during downstream processing and maturation events.« less
Structural basis for recognition and sequestration of UUU(OH) 3' temini of nascent RNA polymerase III transcripts by La, a rheumatic disease autoantigen.

PubMed

Teplova, Marianna; Yuan, Yu-Ren; Phan, Anh Tuân; Malinina, Lucy; Ilin, Serge; Teplov, Alexei; Patel, Dinshaw J

2006-01-06

The nuclear phosphoprotein La was identified as an autoantigen in patients with systemic lupus erythematosus and Sjogren's syndrome. La binds to and protects the UUU(OH) 3' terminii of nascent RNA polymerase III transcripts from exonuclease digestion. We report the 1.85 angstroms crystal structure of the N-terminal domain of human La, consisting of La and RRM1 motifs, bound to r(U1-G2-C3-U4-G5-U6-U7-U8-U9OH). The U7-U8-U9OH 3' end, in a splayed-apart orientation, is sequestered within a basic and aromatic amino acid-lined cleft between the La and RRM1 motifs. The specificity-determining U8 residue bridges both motifs, in part through unprecedented targeting of the beta sheet edge, rather than the anticipated face, of the RRM1 motif. Our structural observations, supported by mutation studies of both La and RNA components, illustrate the principles behind RNA sequestration by a rheumatic disease autoantigen, whereby the UUU(OH) 3' ends of nascent RNA transcripts are protected during downstream processing and maturation events.
A cell-surface-anchored ratiometric i-motif sensor for extracellular pH detection.

PubMed

Ying, Le; Xie, Nuli; Yang, Yanjing; Yang, Xiaohai; Zhou, Qifeng; Yin, Bincheng; Huang, Jin; Wang, Kemin

2016-06-14

A FRET-based sensor is anchored on the cell surface through streptavidin-biotin interactions. Due to the excellent properties of the pH-sensitive i-motif structure, the sensor can detect extracellular pH with high sensitivity and excellent reversibility.
Ring-shaped architecture of RecR: implications for its role in homologous recombinational DNA repair

PubMed Central

Lee, Byung Il; Kim, Kyoung Hoon; Park, Soo Jeong; Eom, Soo Hyun; Song, Hyun Kyu; Suh, Se Won

2004-01-01

RecR, together with RecF and RecO, facilitates RecA loading in the RecF pathway of homologous recombinational DNA repair in procaryotes . The human Rad52 protein is a functional counterpart of RecFOR. We present here the crystal structure of RecR from Deinococcus radiodurans (DR RecR). A monomer of DR RecR has a two-domain structure: the N-terminal domain with a helix–hairpin–helix (HhH) motif and the C-terminal domain with a Cys4 zinc-finger motif, a Toprim domain and a Walker B motif. Four such monomers form a ring-shaped tetramer of 222 symmetry with a central hole of 30−35 Å diameter. In the crystal, two tetramers are concatenated, implying that the RecR tetramer is capable of opening and closing. We also show that DR RecR binds to both dsDNA and ssDNA, and that its HhH motif is essential for DNA binding. PMID:15116069
Identification of structural motifs as tunneling two-level systems in amorphous alumina at low temperatures

NASA Astrophysics Data System (ADS)

Paz, Alejandro Pérez; Lebedeva, Irina V.; Tokatly, Ilya V.; Rubio, Angel

2014-12-01

One of the most accepted models that describe the anomalous thermal behavior of amorphous materials at temperatures below 1 K relies on the quantum mechanical tunneling of atoms between two nearly equivalent potential energy wells forming a two-level system (TLS). Indirect evidence for TLSs is widely available. However, the atomistic structure of these TLSs remains an unsolved topic in the physics of amorphous materials. Here, using classical molecular dynamics, we found several hitherto unknown bistable structural motifs that may be key to understanding the anomalous thermal properties of amorphous alumina at low temperatures. We show through free energy profiles that the complex potential energy surface can be reduced to canonical TLSs. The tunnel splitting predicted from instanton theory, the number density, dipole moment, and coupling to external strain of the discovered motifs are consistent with experiments.
The role of symmetry in the regulation of brain dynamics

NASA Astrophysics Data System (ADS)

Tang, Evelyn; Giusti, Chad; Cieslak, Matthew; Grafton, Scott; Bassett, Danielle

Synchronous neural processes regulate a wide range of behaviors from attention to learning. Yet structural constraints on these processes are far from understood. We draw on new theoretical links between structural symmetries and the control of synchronous function, to offer a reconceptualization of the relationships between brain structure and function in human and non-human primates. By classifying 3-node motifs in macaque connectivity data, we find the most prevalent motifs can theoretically ensure a diversity of function including strict synchrony as well as control to arbitrary states. The least prevalent motifs are theoretically controllable to arbitrary states, which may not be desirable in a biological system. In humans, regions with high topological similarity of connections (a continuous notion related to symmetry) are most commonly found in fronto-parietal systems, which may account for their critical role in cognitive control. Collectively, our work underscores the role of symmetry and topological similarity in regulating dynamics of brain function.
Amyloid fibril formation from sequences of a natural beta-structured fibrous protein, the adenovirus fiber.

PubMed

Papanikolopoulou, Katerina; Schoehn, Guy; Forge, Vincent; Forsyth, V Trevor; Riekel, Christian; Hernandez, Jean-François; Ruigrok, Rob W H; Mitraki, Anna

2005-01-28

Amyloid fibrils are fibrous beta-structures that derive from abnormal folding and assembly of peptides and proteins. Despite a wealth of structural studies on amyloids, the nature of the amyloid structure remains elusive; possible connections to natural, beta-structured fibrous motifs have been suggested. In this work we focus on understanding amyloid structure and formation from sequences of a natural, beta-structured fibrous protein. We show that short peptides (25 to 6 amino acids) corresponding to repetitive sequences from the adenovirus fiber shaft have an intrinsic capacity to form amyloid fibrils as judged by electron microscopy, Congo Red binding, infrared spectroscopy, and x-ray fiber diffraction. In the presence of the globular C-terminal domain of the protein that acts as a trimerization motif, the shaft sequences adopt a triple-stranded, beta-fibrous motif. We discuss the possible structure and arrangement of these sequences within the amyloid fibril, as compared with the one adopted within the native structure. A 6-amino acid peptide, corresponding to the last beta-strand of the shaft, was found to be sufficient to form amyloid fibrils. Structural analysis of these amyloid fibrils suggests that perpendicular stacking of beta-strand repeat units is an underlying common feature of amyloid formation.
Peptide-directed self-assembly of hydrogels

PubMed Central

Kopeček, Jindřich; Yang, Jiyuan

2009-01-01

This review focuses on the self-assembly of macromolecules mediated by the biorecognition of peptide/protein domains. Structures forming α-helices and β-sheets have been used to mediate self-assembly into hydrogels of peptides, reactive copolymers and peptide motifs, block copolymers, and graft copolymers. Structural factors governing the self-assembly of these molecules into precisely defined three-dimensional structures (hydrogels) are reviewed. The incorporation of peptide motifs into hybrid systems, composed of synthetic and natural macromolecules, enhances design opportunities for new biomaterials when compared to individual components. PMID:18952513
Canonical Bcl-2 motifs of the Na+/K+ pump revealed by the BH3 mimetic chelerythrine: early signal transducers of apoptosis?

PubMed

Lauf, Peter K; Heiny, Judith; Meller, Jarek; Lepera, Michael A; Koikov, Leonid; Alter, Gerald M; Brown, Thomas L; Adragna, Norma C

2013-01-01

Chelerythrine [CET], a protein kinase C [PKC] inhibitor, is a prop-apoptotic BH3-mimetic binding to BH1-like motifs of Bcl-2 proteins. CET action was examined on PKC phosphorylation-dependent membrane transporters (Na+/K+ pump/ATPase [NKP, NKA], Na+-K+-2Cl+ [NKCC] and K+-Cl- [KCC] cotransporters, and channel-supported K+ loss) in human lens epithelial cells [LECs]. K+ loss and K+ uptake, using Rb+ as congener, were measured by atomic absorption/emission spectrophotometry with NKP and NKCC inhibitors, and Cl- replacement by NO3ˉ to determine KCC. 3H-Ouabain binding was performed on a pig renal NKA in the presence and absence of CET. Bcl-2 protein and NKA sequences were aligned and motifs identified and mapped using PROSITE in conjunction with BLAST alignments and analysis of conservation and structural similarity based on prediction of secondary and crystal structures. CET inhibited NKP and NKCC by >90% (IC50 values ~35 and ~15 μM, respectively) without significant KCC activity change, and stimulated K+ loss by ~35% at 10-30 μM. Neither ATP levels nor phosphorylation of the NKA α1 subunit changed. 3H-ouabain was displaced from pig renal NKA only at 100 fold higher CET concentrations than the ligand. Sequence alignments of NKA with BH1- and BH3-like motifs containing pro-survival Bcl-2 and BclXl proteins showed more than one BH1-like motif within NKA for interaction with CET or with BH3 motifs. One NKA BH1-like motif (ARAAEILARDGPN) was also found in all P-type ATPases. Also, NKA possessed a second motif similar to that near the BH3 region of Bcl-2. Findings support the hypothesis that CET inhibits NKP by binding to BH1-like motifs and disrupting the α1 subunit catalytic activity through conformational changes. By interacting with Bcl-2 proteins through their complementary BH1- or BH3-like-motifs, NKP proteins may be sensors of normal and pathological cell functions, becoming important yet unrecognized signal transducers in the initial phases of apoptosis. CET action on NKCC1 and K+ channels may involve PKC-regulated mechanisms; however, limited sequence homologies to BH1-like motifs cannot exclude direct effects.
Structural Analysis of the Complex between Penta-EF-Hand ALG-2 Protein and Sec31A Peptide Reveals a Novel Target Recognition Mechanism of ALG-2

PubMed Central

Takahashi, Takeshi; Kojima, Kyosuke; Zhang, Wei; Sasaki, Kanae; Ito, Masaru; Suzuki, Hironori; Kawasaki, Masato; Wakatsuki, Soichi; Takahara, Terunao; Shibata, Hideki; Maki, Masatoshi

2015-01-01

ALG-2, a 22-kDa penta-EF-hand protein, is involved in cell death, signal transduction, membrane trafficking, etc., by interacting with various proteins in mammalian cells in a Ca2+-dependent manner. Most known ALG-2-interacting proteins contain proline-rich regions in which either PPYPXnYP (type 1 motif) or PXPGF (type 2 motif) is commonly found. Previous X-ray crystal structural analysis of the complex between ALG-2 and an ALIX peptide revealed that the peptide binds to the two hydrophobic pockets. In the present study, we resolved the crystal structure of the complex between ALG-2 and a peptide of Sec31A (outer shell component of coat complex II, COPII; containing the type 2 motif) and found that the peptide binds to the third hydrophobic pocket (Pocket 3). While amino acid substitution of Phe85, a Pocket 3 residue, with Ala abrogated the interaction with Sec31A, it did not affect the interaction with ALIX. On the other hand, amino acid substitution of Tyr180, a Pocket 1 residue, with Ala caused loss of binding to ALIX, but maintained binding to Sec31A. We conclude that ALG-2 recognizes two types of motifs at different hydrophobic surfaces. Furthermore, based on the results of serial mutational analysis of the ALG-2-binding sites in Sec31A, the type 2 motif was newly defined. PMID:25667979
In Silico Molecular Modeling and Docking Studies on Novel Mutants (E229V, H225P and D230C) of the Nucleotide-Binding Domain of Homo sapiens Hsp70.

PubMed

Elengoe, Asita; Hamdan, Salehhuddin

2017-12-01

In this study, we explored the possibility of determining the synergistic interactions between nucleotide-binding domain (NBD) of Homo sapiens heat-shock 70 kDa protein (Hsp70) and E1A 32 kDa of adenovirus serotype 5 motif (PNLVP) in the efficiency of killing of tumor cells in cancer treatment. At present, the protein interaction between NBD and PNLVP motif is still unknown, but believed to enhance the rate of virus replication in tumor cells. Three mutant models (E229V, H225P and D230C) were built and simulated, and their interactions with PNLVP motif were studied. The PNLVP motif showed the binding energy and intermolecular energy values with the novel E229V mutant at -7.32 and -11.2 kcal/mol. The E229V mutant had the highest number of hydrogen bonds (7). Based on the root mean square deviation, root mean square fluctuation, hydrogen bonds, salt bridge, secondary structure, surface-accessible solvent area, potential energy and distance matrices analyses, it was proved that the E229V had the strongest and most stable interaction with the PNLVP motif among all the four protein-ligand complex structures. The knowledge of this protein-ligand complex model would help in designing Hsp70 structure-based drug for cancer therapy.
The Thiamine-Pyrophosphate-Motif

NASA Technical Reports Server (NTRS)

Ciszak, Ewa; Dominiak, Paulina

2004-01-01

Thiamin pyrophosphate (TPP), a derivative of vitamin B1, is a cofactor for enzymes performing catalysis in pathways of energy production including the well known decarboxylation of a-keto acid dehydrogenases followed by transketolation. TPP-dependent enzymes constitute a structurally and functionally diverse group exhibiting multimeric subunit organization, multiple domains and two chemically equivalent catalytic centers. Annotation of functional TPP-dependcnt enzymes, therefore, has not been trivial due to low sequence similarity related to this complex organization. Our approach to analysis of structures of known TPP-dependent enzymes reveals for the first time features common to this group, which we have termed the TPP-motif. The TPP-motif consists of specific spatial arrangements of structural elements and their specific contacts to provide for a flip-flop, or alternate site, enzymatic mechanism of action. Analysis of structural elements entrained in the flip-flop action displayed by TPP-dependent enzymes reveals a novel definition of the common amino acid sequences. These sequences allow for annotation of TPP-dependent enzymes, thus advancing functional proteomics. Further details of three-dimensional structures of TPP-dependent enzymes will be discussed.
Crystal Structure of FadA Adhesin from Fusobacterium nucleatum Reveals a Novel Oligomerization Motif, the Leucine Chain

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nithianantham, Stanley; Xu, Minghua; Yamada, Mitsunori

2009-04-07

Many bacterial appendages have filamentous structures, often composed of repeating monomers assembled in a head-to-tail manner. The mechanisms of such linkages vary. We report here a novel protein oligomerization motif identified in the FadA adhesin from the Gram-negative bacterium Fusobacterium nucleatum. The 2.0 {angstrom} crystal structure of the secreted form of FadA (mFadA) reveals two antiparallel {alpha}-helices connected by an intervening 8-residue hairpin loop. Leucine-leucine contacts play a prominent dual intra- and intermolecular role in the structure and function of FadA. First, they comprise the main association between the two helical arms of the monomer; second, they mediate the head-to-tailmore » association of monomers to form the elongated polymers. This leucine-mediated filamentous assembly of FadA molecules constitutes a novel structural motif termed the 'leucine chain.' The essential role of these residues in FadA is corroborated by mutagenesis of selected leucine residues, which leads to the abrogation of oligomerization, filament formation, and binding to host cells.« less

Structural basis of RNA folding and recognition in an AMP-RNA aptamer complex.

PubMed

Jiang, F; Kumar, R A; Jones, R A; Patel, D J

1996-07-11

The catalytic properties of RNA and its well known role in gene expression and regulation are the consequence of its unique solution structures. Identification of the structural determinants of ligand recognition by RNA molecules is of fundamental importance for understanding the biological functions of RNA, as well as for the rational design of RNA Sequences with specific catalytic activities. Towards this latter end, Szostak et al. used in vitro selection techniques to isolate RNA sequences ('aptamers') containing a high-affinity binding site for ATP, the universal currency of cellular energy, and then used this motif to engineer ribozymes with polynucleotide kinase activity. Here we present the solution structure, as determined by multidimensional NMR spectroscopy and molecular dynamics calculations, of both uniformly and specifically 13C-, 15N-labelled 40-mer RNA containing the ATP-binding motif complexed with AMP. The aptamer adopts an L-shaped structure with two nearly orthogonal stems, each capped proximally by a G x G mismatch pair, binding the AMP ligand at their junction in a GNRA-like motif.
An Amino Acid Packing Code for α-helical Structure and Protein Design

PubMed Central

Joo, Hyun; Chavan, Archana G.; Phan, Jamie; Day, Ryan; Tsai, Jerry

2012-01-01

This work demonstrates that all packing in α-helices can be simplified to repetitive patterns of a single motif: the knob-socket. Using the precision of Voronoi Polyhedra/Deluaney Tessellations to identify contacts, the knob-socket is a 4 residue tetrahedral motif: a knob residue on one α-helix packs into the 3 residue socket on another α-helix. The principle of the knob-socket model relates the packing between levels of protein structure: the intra-helical packing arrangements within secondary structure that permit inter-helix tertiary packing interactions. Within an α-helix, the 3 residue sockets arrange residues into a uniform packing lattice. Inter-helix packing results from a definable pattern of interdigitated knob-socket motifs between 2 α-helices. Furthermore, the knob-socket model classifies 3 types of sockets: 1) free: favoring only intra-helical packing, 2) filled: favoring inter-helical interactions and 3) non: disfavoring α-helical structure. The amino acid propensities in these 3 socket classes essentially represent an amino acid code for structure in α-helical packing. Using this code, a novel yet straightforward approach for the design of α-helical structure was used to validate the knob-socket model. Unique sequences for 3 peptides were created to produce a predicted amount of α-helical structure: mostly helical, some helical, and no-helix. These 3 peptides were synthesized and helical content assessed using CD spectroscopy. The measured α-helicity of each peptide was consistent with the expected predictions. These results and analysis demonstrate that the knob-socket motif functions as the basic unit of packing and presents an intuitive tool to decipher the rules governing packing in protein structure. PMID:22426125
Non-B DB v2.0: a database of predicted non-B DNA-forming motifs and its associated tools.

PubMed

Cer, Regina Z; Donohue, Duncan E; Mudunuri, Uma S; Temiz, Nuri A; Loss, Michael A; Starner, Nathan J; Halusa, Goran N; Volfovsky, Natalia; Yi, Ming; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M

2013-01-01

The non-B DB, available at http://nonb.abcc.ncifcrf.gov, catalogs predicted non-B DNA-forming sequence motifs, including Z-DNA, G-quadruplex, A-phased repeats, inverted repeats, mirror repeats, direct repeats and their corresponding subsets: cruciforms, triplexes and slipped structures, in several genomes. Version 2.0 of the database revises and re-implements the motif discovery algorithms to better align with accepted definitions and thresholds for motifs, expands the non-B DNA-forming motifs coverage by including short tandem repeats and adds key visualization tools to compare motif locations relative to other genomic annotations. Non-B DB v2.0 extends the ability for comparative genomics by including re-annotation of the five organisms reported in non-B DB v1.0, human, chimpanzee, dog, macaque and mouse, and adds seven additional organisms: orangutan, rat, cow, pig, horse, platypus and Arabidopsis thaliana. Additionally, the non-B DB v2.0 provides an overall improved graphical user interface and faster query performance.
Rapid search for tertiary fragments reveals protein sequence–structure relationships

PubMed Central

Zhou, Jianfu; Grigoryan, Gevorg

2015-01-01

Finding backbone substructures from the Protein Data Bank that match an arbitrary query structural motif, composed of multiple disjoint segments, is a problem of growing relevance in structure prediction and protein design. Although numerous protein structure search approaches have been proposed, methods that address this specific task without additional restrictions and on practical time scales are generally lacking. Here, we propose a solution, dubbed MASTER, that is both rapid, enabling searches over the Protein Data Bank in a matter of seconds, and provably correct, finding all matches below a user-specified root-mean-square deviation cutoff. We show that despite the potentially exponential time complexity of the problem, running times in practice are modest even for queries with many segments. The ability to explore naturally plausible structural and sequence variations around a given motif has the potential to synthesize its design principles in an automated manner; so we go on to illustrate the utility of MASTER to protein structural biology. We demonstrate its capacity to rapidly establish structure–sequence relationships, uncover the native designability landscapes of tertiary structural motifs, identify structural signatures of binding, and automatically rewire protein topologies. Given the broad utility of protein tertiary fragment searches, we hope that providing MASTER in an open-source format will enable novel advances in understanding, predicting, and designing protein structure. PMID:25420575
Molecular Investigations of the Structure and Function of the Protein Phosphatase 1:Spinophilin:Inhibitor-2 Heterotrimeric Complex

PubMed Central

Dancheck, Barbara; Ragusa, Michael J.; Allaire, Marc; Nairn, Angus C.; Page, Rebecca; Peti, Wolfgang

2011-01-01

Regulation of the major ser/thr phosphatase Protein Phosphatase 1 (PP1) is controlled by a diverse array of targeting and inhibitor proteins. Though many PP1 regulatory proteins share at least one PP1 binding motif, usually the RVxF motif, it was recently discovered that certain pairs of targeting and inhibitor proteins bind PP1 simultaneously to form PP1 heterotrimeric complexes. To date, structural information for these heterotrimeric complexes, and, in turn, how they direct PP1 activity is entirely lacking. Using a combination of NMR spectroscopy, biochemistry and small angle X-ray scattering (SAXS), we show that major structural rearrangements in both spinophilin (targeting) and Inhibitor-2 (I-2, inhibitor) are essential for the formation of the heterotrimeric PP1:spinophilin:I-2 (PSI) complex. The RVxF motif of I-2 is released from PP1 during the formation of PSI, making the less prevalent SILK motif of I-2 essential for complex stability. The release of the I-2 RVxF motif allows for enhanced flexibility of both I-2 and spinophilin in the heterotrimeric complex. In addition, we used inductively coupled plasma atomic emission spectroscopy to show that PP1 contains two metals in both heterodimeric complexes (PP1:spinophilin and PP1:I2) and PSI, demonstrating that PSI retains the biochemical characteristics of the PP1:I2 holoenzyme. Finally, we combined the NMR and biochemical data with SAXS and molecular dynamics simulations to generate a structural model of the full heterotrimeric PSI complex. Collectively, these data reveal the molecular events that enable PP1 heterotrimeric complexes to exploit both the targeting and inhibitory features of the PP1-regulatory proteins to form multi-functional PP1 holoenzymes. PMID:21218781
Transcriptional regulation of Saccharomyces cerevisiaeCYS3 encoding cystathionine γ-lyase

PubMed Central

Hiraishi, Hiroyuki; Miyake, Tsuyoshi

2008-01-01

In studying the regulation of GSH11, the structural gene of the high-affinity glutathione transporter (GSH-P1) in Saccharomyces cerevisiae, a cis-acting cysteine responsive element, CCGCCACAC (CCG motif), was detected. Like GSH-P1, the cystathionine γ-lyase encoded by CYS3 is induced by sulfur starvation and repressed by addition of cysteine to the growth medium. We detected a CCG motif (−311 to −303) and a CGC motif (CGCCACAC; −193 to −186), which is one base shorter than the CCG motif, in the 5′-upstream region of CYS3. One copy of the centromere determining element 1, CDE1 (TCACGTGA; −217 to −210), being responsible for regulation of the sulfate assimilation pathway genes, was also detected. We tested the roles of these three elements in the regulation of CYS3. Using a lacZ-reporter assay system, we found that the CCG/CGC motif is required for activation of CYS3, as well as for its repression by cysteine. In contrast, the CDE1 motif was responsible for only activation of CYS3. We also found that two transcription factors, Met4 and VDE, are responsible for activation of CYS3 through the CCG/CGC and CDE1 motifs. These observations suggest a dual regulation of CYS3 by factors that interact with the CDE1 motif and the CCG/CGC motifs. PMID:18317767
Ensemble characterization of an intrinsically disordered FG-Nup peptide and its F>A mutant in DMSO-d6.

PubMed

Reid, Korey M; Sunanda, Punnepalli; Raghothama, S; Krishnan, V V

2017-11-01

Intrinsically disordered proteins (IDP) lack a well-defined 3D-structure under physiological conditions, yet, the inherent disorder represented by an ensemble of conformation plays a critical role in many cellular and regulatory processes. Nucleoporins, or Nups, are the proteins found in the nuclear pore complex (NPC). The central pore of the NPC is occupied by Nups, which have phenylalanine-glycine domain repeats and are intrinsically disordered, and therefore are termed FG-Nups. These FG-domain repeats exhibit differing cohesiveness character and differ from least (FG) to most (GLFG) cohesive. The designed FG-Nup is a 25 AA model peptide containing a noncohesive FG-motif flanked by two cohesive GLFG-motifs (WT peptide). Complete NMR-based ensemble characterization of this peptide along with a control peptide with an F>A substitution (MU peptide) are discussed. Ensemble characterization of the NMR-determined models suggests that both the peptides do not have consistent secondary structures and continue to be disordered. Nonetheless, the role of cohesive elements mediated by the GLFG motifs is evident in the WT ensemble of structures that are more compact than the MU peptide. The approach presented here allows an alternate way to investigate the specific roles of distinct amino acid motifs that translate into the long-range organization of the ensemble of structures and in general on the nature of IDPs. © 2017 Wiley Periodicals, Inc.
Motif mismatches in microsatellites: insights from genome-wide investigation among 20 insect species.

PubMed

Behura, Susanta K; Severson, David W

2015-02-01

We present a detailed genome-wide comparative study of motif mismatches of microsatellites among 20 insect species representing five taxonomic orders. The results show that varying proportions (∼15-46%) of microsatellites identified in these species are imperfect in motif structure, and that they also vary in chromosomal distribution within genomes. It was observed that the genomic abundance of imperfect repeats is significantly associated with the length and number of motif mismatches of microsatellites. Furthermore, microsatellites with a higher number of mismatches tend to have lower abundance in the genome, suggesting that sequence heterogeneity of repeat motifs is a key determinant of genomic abundance of microsatellites. This relationship seems to be a general feature of microsatellites even in unrelated species such as yeast, roundworm, mouse and human. We provide a mechanistic explanation of the evolutionary link between motif heterogeneity and genomic abundance of microsatellites by examining the patterns of motif mismatches and allele sequences of single-nucleotide polymorphisms identified within microsatellite loci. Using Drosophila Reference Genetic Panel data, we further show that pattern of allelic variation modulates motif heterogeneity of microsatellites, and provide estimates of allele age of specific imperfect microsatellites found within protein-coding genes. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family

PubMed Central

Soufari, Heddy

2017-01-01

Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515
Crystal-Structure-Guided Design of Self-Assembling RNA Nanotriangles.

PubMed

Boerneke, Mark A; Dibrov, Sergey M; Hermann, Thomas

2016-03-14

RNA nanotechnology uses RNA structural motifs to build nanosized architectures that assemble through selective base-pair interactions. Herein, we report the crystal-structure-guided design of highly stable RNA nanotriangles that self-assemble cooperatively from short oligonucleotides. The crystal structure of an 81 nucleotide nanotriangle determined at 2.6 Å resolution reveals the so-far smallest circularly closed nanoobject made entirely of double-stranded RNA. The assembly of the nanotriangle architecture involved RNA corner motifs that were derived from ligand-responsive RNA switches, which offer the opportunity to control self-assembly and dissociation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Tripathi, S.; Zhang, D.; Paukstelis, P. J.

DNA has proved to be an excellent material for nanoscale construction because complementary DNA duplexes are programmable and structurally predictable. However, in the absence of Watson–Crick pairings, DNA can be structurally more diverse. Here, we describe the crystal structures of d(ACTCGGATGAT) and the brominated derivative, d(AC BrUCGGA BrUGAT). These oligonucleotides form parallel-stranded duplexes with a crystallographically equivalent strand, resulting in the first examples of DNA crystal structures that contains four different symmetric homo base pairs. Two of the parallel-stranded duplexes are coaxially stacked in opposite directions and locked together to form a tetraplex through intercalation of the 5'-most A–A basemore » pairs between adjacent G–G pairs in the partner duplex. The intercalation region is a new type of DNA tertiary structural motif with similarities to the i-motif. 1H– 1H nuclear magnetic resonance and native gel electrophoresis confirmed the formation of a parallel-stranded duplex in solution. Finally, we modified specific nucleotide positions and added d(GAY) motifs to oligonucleotides and were readily able to obtain similar crystals. This suggests that this parallel-stranded DNA structure may be useful in the rational design of DNA crystals and nanostructures.« less
An intercalation-locked parallel-stranded DNA tetraplex

DOE PAGES

Tripathi, S.; Zhang, D.; Paukstelis, P. J.

2015-01-27

DNA has proved to be an excellent material for nanoscale construction because complementary DNA duplexes are programmable and structurally predictable. However, in the absence of Watson–Crick pairings, DNA can be structurally more diverse. Here, we describe the crystal structures of d(ACTCGGATGAT) and the brominated derivative, d(AC BrUCGGA BrUGAT). These oligonucleotides form parallel-stranded duplexes with a crystallographically equivalent strand, resulting in the first examples of DNA crystal structures that contains four different symmetric homo base pairs. Two of the parallel-stranded duplexes are coaxially stacked in opposite directions and locked together to form a tetraplex through intercalation of the 5'-most A–A basemore » pairs between adjacent G–G pairs in the partner duplex. The intercalation region is a new type of DNA tertiary structural motif with similarities to the i-motif. 1H– 1H nuclear magnetic resonance and native gel electrophoresis confirmed the formation of a parallel-stranded duplex in solution. Finally, we modified specific nucleotide positions and added d(GAY) motifs to oligonucleotides and were readily able to obtain similar crystals. This suggests that this parallel-stranded DNA structure may be useful in the rational design of DNA crystals and nanostructures.« less
Crystal genes in a marginal glass-forming system of Ni 50Zr 50

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wen, T. Q.; Tang, L.; Sun, Y.

Glass-forming motifs with B2 traits are found. A perfect Ni-centered B33 motif deteriorates the glass-forming ability of Ni 50Zr 50. The marginal glass-forming ability (GFA) of binary Ni-Zr system is an issue to be explained considering the numerous bulk metallic glasses (BMGs) found in the Cu-Zr system. Using molecular dynamics, the structures and dynamics of Ni 50Zr 50 metallic liquid and glass are investigated at the atomistic level. To achieve a well-relaxed glassy sample, sub-T g annealing method is applied and the final sample is closer to the experiments than the models prepared by continuous cooling. With the state-of-the-art structuralmore » analysis tools such as cluster alignment and pair-wise alignment methods, two glass-forming motifs with some mixed traits of the metastable B2 crystalline phase and the crystalline Ni-centered B33 motif are found to be dominant in the undercooled liquid and glass samples. A new chemical order characterization on each short-range order (SRO) structure is accomplished based on the cluster alignment method. The significant amount of the crystalline motif and the few icosahedra in the glassy sample deteriorate the GFA.« less
Crystal genes in a marginal glass-forming system of Ni 50Zr 50

DOE PAGES

Wen, T. Q.; Tang, L.; Sun, Y.; ...

2017-10-17

Glass-forming motifs with B2 traits are found. A perfect Ni-centered B33 motif deteriorates the glass-forming ability of Ni 50Zr 50. The marginal glass-forming ability (GFA) of binary Ni-Zr system is an issue to be explained considering the numerous bulk metallic glasses (BMGs) found in the Cu-Zr system. Using molecular dynamics, the structures and dynamics of Ni 50Zr 50 metallic liquid and glass are investigated at the atomistic level. To achieve a well-relaxed glassy sample, sub-T g annealing method is applied and the final sample is closer to the experiments than the models prepared by continuous cooling. With the state-of-the-art structuralmore » analysis tools such as cluster alignment and pair-wise alignment methods, two glass-forming motifs with some mixed traits of the metastable B2 crystalline phase and the crystalline Ni-centered B33 motif are found to be dominant in the undercooled liquid and glass samples. A new chemical order characterization on each short-range order (SRO) structure is accomplished based on the cluster alignment method. The significant amount of the crystalline motif and the few icosahedra in the glassy sample deteriorate the GFA.« less
Evidence for the Concerted Evolution between Short Linear Protein Motifs and Their Flanking Regions

PubMed Central

Chica, Claudia; Diella, Francesca; Gibson, Toby J.

2009-01-01

Background Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids. Results The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions. Conclusion The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise. PMID:19584925
Computational mining for hypothetical patterns of amino acid side chains in protein data bank (PDB)

NASA Astrophysics Data System (ADS)

Ghani, Nur Syatila Ab; Firdaus-Raih, Mohd

2018-04-01

The three-dimensional structure of a protein can provide insights regarding its function. Functional relationship between proteins can be inferred from fold and sequence similarities. In certain cases, sequence or fold comparison fails to conclude homology between proteins with similar mechanism. Since the structure is more conserved than the sequence, a constellation of functional residues can be similarly arranged among proteins of similar mechanism. Local structural similarity searches are able to detect such constellation of amino acids among distinct proteins, which can be useful to annotate proteins of unknown function. Detection of such patterns of amino acids on a large scale can increase the repertoire of important 3D motifs since available known 3D motifs currently, could not compensate the ever-increasing numbers of uncharacterized proteins to be annotated. Here, a computational platform for an automated detection of 3D motifs is described. A fuzzy-pattern searching algorithm derived from IMagine an Amino Acid 3D Arrangement search EnGINE (IMAAAGINE) was implemented to develop an automated method for searching of hypothetical patterns of amino acid side chains in Protein Data Bank (PDB), without the need for prior knowledge on related sequence or structure of pattern of interest. We present an example of the searches, which is the detection of a hypothetical pattern derived from known structural motif of C2H2 structural pattern from zinc fingers. The conservation of particular patterns of amino acid side chains in unrelated proteins is highlighted. This approach can act as a complementary method for available structure- and sequence-based platforms and may contribute in improving functional association between proteins.
Cryo-EM near-atomic structure of a dsRNA fungal virus shows ancient structural motifs preserved in the dsRNA viral lineage

PubMed Central

Luque, Daniel; Gómez-Blanco, Josué; Garriga, Damiá; Brilot, Axel F.; González, José M.; Havens, Wendy M.; Carrascosa, José L.; Trus, Benes L.; Verdaguer, Nuria; Ghabrial, Said A.; Castón, José R.

2014-01-01

Viruses evolve so rapidly that sequence-based comparison is not suitable for detecting relatedness among distant viruses. Structure-based comparisons suggest that evolution led to a small number of viral classes or lineages that can be grouped by capsid protein (CP) folds. Here, we report that the CP structure of the fungal dsRNA Penicillium chrysogenum virus (PcV) shows the progenitor fold of the dsRNA virus lineage and suggests a relationship between lineages. Cryo-EM structure at near-atomic resolution showed that the 982-aa PcV CP is formed by a repeated α-helical core, indicative of gene duplication despite lack of sequence similarity between the two halves. Superimposition of secondary structure elements identified a single “hotspot” at which variation is introduced by insertion of peptide segments. Structural comparison of PcV and other distantly related dsRNA viruses detected preferential insertion sites at which the complexity of the conserved α-helical core, made up of ancestral structural motifs that have acted as a skeleton, might have increased, leading to evolution of the highly varied current structures. Analyses of structural motifs only apparent after systematic structural comparisons indicated that the hallmark fold preserved in the dsRNA virus lineage shares a long (spinal) α-helix tangential to the capsid surface with the head-tailed phage and herpesvirus viral lineage. PMID:24821769
Triazine-based sequence-defined polymers with side-chain diversity and backbone-backbone interaction motifs

DOE PAGES

Grate, Jay W.; Mo, Kai -For; Daily, Michael D.

2016-02-10

Sequence control in polymers, well-known in nature, encodes structure and functionality. Here we introduce a new architecture, based on the nucleophilic aromatic substitution chemistry of cyanuric chloride, that creates a new class of sequence-defined polymers dubbed TZPs. Proof of concept is demonstrated with two synthesized hexamers, having neutral and ionizable side chains. Molecular dynamics simulations show backbone–backbone interactions, including H-bonding motifs and pi–pi interactions. This architecture is arguably biomimetic while differing from sequence-defined polymers having peptide bonds. In conclusion, the synthetic methodology supports the structural diversity of side chains known in peptides, as well as backbone–backbone hydrogen-bonding motifs, and willmore » thus enable new macromolecules and materials with useful functions.« less
Triazine-Based Sequence-Defined Polymers with Side-Chain Diversity and Backbone-Backbone Interaction Motifs.

PubMed

Grate, Jay W; Mo, Kai-For; Daily, Michael D

2016-03-14

Sequence control in polymers, well-known in nature, encodes structure and functionality. Here we introduce a new architecture, based on the nucleophilic aromatic substitution chemistry of cyanuric chloride, that creates a new class of sequence-defined polymers dubbed TZPs. Proof of concept is demonstrated with two synthesized hexamers, having neutral and ionizable side chains. Molecular dynamics simulations show backbone-backbone interactions, including H-bonding motifs and pi-pi interactions. This architecture is arguably biomimetic while differing from sequence-defined polymers having peptide bonds. The synthetic methodology supports the structural diversity of side chains known in peptides, as well as backbone-backbone hydrogen-bonding motifs, and will thus enable new macromolecules and materials with useful functions. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Triazine-based sequence-defined polymers with side-chain diversity and backbone-backbone interaction motifs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grate, Jay W.; Mo, Kai -For; Daily, Michael D.

Sequence control in polymers, well-known in nature, encodes structure and functionality. Here we introduce a new architecture, based on the nucleophilic aromatic substitution chemistry of cyanuric chloride, that creates a new class of sequence-defined polymers dubbed TZPs. Proof of concept is demonstrated with two synthesized hexamers, having neutral and ionizable side chains. Molecular dynamics simulations show backbone–backbone interactions, including H-bonding motifs and pi–pi interactions. This architecture is arguably biomimetic while differing from sequence-defined polymers having peptide bonds. In conclusion, the synthetic methodology supports the structural diversity of side chains known in peptides, as well as backbone–backbone hydrogen-bonding motifs, and willmore » thus enable new macromolecules and materials with useful functions.« less

Biophysical characterization of the basic cluster in the transcription repression domain of human MeCP2 with AT-rich DNA.

PubMed

Mushtaq, Ameeq Ul; Lee, Yejin; Hwang, Eunha; Bang, Jeong Kyu; Hong, Eunmi; Byun, Youngjoo; Song, Ji-Joon; Jeon, Young Ho

2018-01-01

MeCP2 is a chromatin associated protein which is highly expressed in brain and relevant with Rett syndrome (RTT). There are AT-hook motifs in MeCP2 which can bind with AT-rich DNA, suggesting a role in chromatin binding. Here, we report the identification and characterization of another AT-rich DNA binding motif (residues 295 to 313) from the C-terminal transcription repression domain of MeCP2 by nuclear magnetic resonance (NMR) and isothermal calorimetry (ITC). This motif shows a micromolar affinity to AT-rich DNA, and it binds to the minor groove of DNA like AT-hook motifs. Together with the previous studies, our results provide an insight into a critical role of this motif in chromatin structure and function. Copyright © 2017 Elsevier Inc. All rights reserved.
Multi-scale modularity and motif distributional effect in metabolic networks.

PubMed

Gao, Shang; Chen, Alan; Rahmani, Ali; Zeng, Jia; Tan, Mehmet; Alhajj, Reda; Rokne, Jon; Demetrick, Douglas; Wei, Xiaohui

2016-01-01

Metabolism is a set of fundamental processes that play important roles in a plethora of biological and medical contexts. It is understood that the topological information of reconstructed metabolic networks, such as modular organization, has crucial implications on biological functions. Recent interpretations of modularity in network settings provide a view of multiple network partitions induced by different resolution parameters. Here we ask the question: How do multiple network partitions affect the organization of metabolic networks? Since network motifs are often interpreted as the super families of evolved units, we further investigate their impact under multiple network partitions and investigate how the distribution of network motifs influences the organization of metabolic networks. We studied Homo sapiens, Saccharomyces cerevisiae and Escherichia coli metabolic networks; we analyzed the relationship between different community structures and motif distribution patterns. Further, we quantified the degree to which motifs participate in the modular organization of metabolic networks.
Distinct cagA EPIYA motifs are associated with ethnic diversity in Malaysia and Singapore.

PubMed

Schmidt, Heather-Marie A; Goh, Khean-Lee; Fock, Kwong Ming; Hilmi, Ida; Dhamodaran, Subbiah; Forman, David; Mitchell, Hazel

2009-08-01

In vitro studies have shown that the biologic activity of CagA is influenced by the number and class of EPIYA motifs present in its variable region as these motifs correspond to the CagA phosphorylation sites. It has been hypothesized that strains possessing specific combinations of these motifs may be responsible for gastric cancer development. This study investigated the prevalence of cagA and the EPIYA motifs with regard to number, class, and patterns in strains from the three major ethnic groups within the Malaysian and Singaporean populations in relation to disease development. Helicobacter pylori isolates from 49 Chinese, 43 Indian, and 14 Malay patients with functional dyspepsia (FD) and 21 gastric cancer (GC) cases were analyzed using polymerase chain reaction for the presence of cagA and the number, type, and pattern of EPIYA motifs. Additionally, the EPIYA motifs of 47 isolates were sequenced. All 126 isolates possessed cagA, with the majority encoding EPIYA-A (97.6%) and all encoding EPIYA-B. However, while the cagA of 93.0% of Indian FD isolates encoded EPIYA-C as the third motif, 91.8% of Chinese FD isolates and 81.7% of Chinese GC isolates encoded EPIYA-D (p < .001). Of Malay FD isolates, 61.5% and 38.5% possessed EPIYA-C and EPIYA-D, respectively. The majority of isolates possessed three EPIYA motifs; however, Indian isolates were significantly more likely to have four or more (p < .05). Although, H. pylori strains with distinct cagA-types are circulating within the primary ethnic groups resident in Malaysia and Singapore, these genotypes appear unassociated with the development of GC in the ethnic Chinese population. The phenomenon of distinct strains circulating within different ethnic groups, in combination with host and certain environmental factors, may help to explain the rates of GC development in Malaysia.
Cellular automata simulation of topological effects on the dynamics of feed-forward motifs

PubMed Central

Apte, Advait A; Cain, John W; Bonchev, Danail G; Fong, Stephen S

2008-01-01

Background Feed-forward motifs are important functional modules in biological and other complex networks. The functionality of feed-forward motifs and other network motifs is largely dictated by the connectivity of the individual network components. While studies on the dynamics of motifs and networks are usually devoted to the temporal or spatial description of processes, this study focuses on the relationship between the specific architecture and the overall rate of the processes of the feed-forward family of motifs, including double and triple feed-forward loops. The search for the most efficient network architecture could be of particular interest for regulatory or signaling pathways in biology, as well as in computational and communication systems. Results Feed-forward motif dynamics were studied using cellular automata and compared with differential equation modeling. The number of cellular automata iterations needed for a 100% conversion of a substrate into a target product was used as an inverse measure of the transformation rate. Several basic topological patterns were identified that order the specific feed-forward constructions according to the rate of dynamics they enable. At the same number of network nodes and constant other parameters, the bi-parallel and tri-parallel motifs provide higher network efficacy than single feed-forward motifs. Additionally, a topological property of isodynamicity was identified for feed-forward motifs where different network architectures resulted in the same overall rate of the target production. Conclusion It was shown for classes of structural motifs with feed-forward architecture that network topology affects the overall rate of a process in a quantitatively predictable manner. These fundamental results can be used as a basis for simulating larger networks as combinations of smaller network modules with implications on studying synthetic gene circuits, small regulatory systems, and eventually dynamic whole-cell models. PMID:18304325
Informative priors based on transcription factor structural class improve de novo motif discovery.

PubMed

Narlikar, Leelavati; Gordân, Raluca; Ohler, Uwe; Hartemink, Alexander J

2006-07-15

An important problem in molecular biology is to identify the locations at which a transcription factor (TF) binds to DNA, given a set of DNA sequences believed to be bound by that TF. In previous work, we showed that information in the DNA sequence of a binding site is sufficient to predict the structural class of the TF that binds it. In particular, this suggests that we can predict which locations in any DNA sequence are more likely to be bound by certain classes of TFs than others. Here, we argue that traditional methods for de novo motif finding can be significantly improved by adopting an informative prior probability that a TF binding site occurs at each sequence location. To demonstrate the utility of such an approach, we present priority, a powerful new de novo motif finding algorithm. Using data from TRANSFAC, we train three classifiers to recognize binding sites of basic leucine zipper, forkhead, and basic helix loop helix TFs. These classifiers are used to equip priority with three class-specific priors, in addition to a default prior to handle TFs of other classes. We apply priority and a number of popular motif finding programs to sets of yeast intergenic regions that are reported by ChIP-chip to be bound by particular TFs. priority identifies motifs the other methods fail to identify, and correctly predicts the structural class of the TF recognizing the identified binding sites. Supplementary material and code can be found at http://www.cs.duke.edu/~amink/.
Coevolved Mutations Reveal Distinct Architectures for Two Core Proteins in the Bacterial Flagellar Motor

PubMed Central

Pandini, Alessandro; Kleinjung, Jens; Rasool, Shafqat; Khan, Shahid

2015-01-01

Switching of bacterial flagellar rotation is caused by large domain movements of the FliG protein triggered by binding of the signal protein CheY to FliM. FliG and FliM form adjacent multi-subunit arrays within the basal body C-ring. The movements alter the interaction of the FliG C-terminal (FliGC) “torque” helix with the stator complexes. Atomic models based on the Salmonella entrovar C-ring electron microscopy reconstruction have implications for switching, but lack consensus on the relative locations of the FliG armadillo (ARM) domains (amino-terminal (FliGN), middle (FliGM) and FliGC) as well as changes during chemotaxis. The generality of the Salmonella model is challenged by the variation in motor morphology and response between species. We studied coevolved residue mutations to determine the unifying elements of switch architecture. Residue interactions, measured by their coevolution, were formalized as a network, guided by structural data. Our measurements reveal a common design with dedicated switch and motor modules. The FliM middle domain (FliMM) has extensive connectivity most simply explained by conserved intra and inter-subunit contacts. In contrast, FliG has patchy, complex architecture. Conserved structural motifs form interacting nodes in the coevolution network that wire FliMM to the FliGC C-terminal, four-helix motor module (C3-6). FliG C3-6 coevolution is organized around the torque helix, differently from other ARM domains. The nodes form separated, surface-proximal patches that are targeted by deleterious mutations as in other allosteric systems. The dominant node is formed by the EHPQ motif at the FliMMFliGM contact interface and adjacent helix residues at a central location within FliGM. The node interacts with nodes in the N-terminal FliGc α-helix triad (ARM-C) and FliGN. ARM-C, separated from C3-6 by the MFVF motif, has poor intra-network connectivity consistent with its variable orientation revealed by structural data. ARM-C could be the convertor element that provides mechanistic and species diversity. PMID:26561852
Conserved structure and inferred evolutionary history of long terminal repeats (LTRs)

PubMed Central

2013-01-01

Background Long terminal repeats (LTRs, consisting of U3-R-U5 portions) are important elements of retroviruses and related retrotransposons. They are difficult to analyse due to their variability. The aim was to obtain a more comprehensive view of structure, diversity and phylogeny of LTRs than hitherto possible. Results Hidden Markov models (HMM) were created for 11 clades of LTRs belonging to Retroviridae (class III retroviruses), animal Metaviridae (Gypsy/Ty3) elements and plant Pseudoviridae (Copia/Ty1) elements, complementing our work with Orthoretrovirus HMMs. The great variation in LTR length of plant Metaviridae and the few divergent animal Pseudoviridae prevented building HMMs from both of these groups. Animal Metaviridae LTRs had the same conserved motifs as retroviral LTRs, confirming that the two groups are closely related. The conserved motifs were the short inverted repeats (SIRs), integrase recognition signals (5´TGTTRNR…YNYAACA 3´); the polyadenylation signal or AATAAA motif; a GT-rich stretch downstream of the polyadenylation signal; and a less conserved AT-rich stretch corresponding to the core promoter element, the TATA box. Plant Pseudoviridae LTRs differed slightly in having a conserved TATA-box, TATATA, but no conserved polyadenylation signal, plus a much shorter R region. The sensitivity of the HMMs for detection in genomic sequences was around 50% for most models, at a relatively high specificity, suitable for genome screening. The HMMs yielded consensus sequences, which were aligned by creating an HMM model (a ‘Superviterbi’ alignment). This yielded a phylogenetic tree that was compared with a Pol-based tree. Both LTR and Pol trees supported monophyly of retroviruses. In both, Pseudoviridae was ancestral to all other LTR retrotransposons. However, the LTR trees showed the chromovirus portion of Metaviridae clustering together with Pseudoviridae, dividing Metaviridae into two portions with distinct phylogeny. Conclusion The HMMs clearly demonstrated a unitary conserved structure of LTRs, supporting that they arose once during evolution. We attempted to follow the evolution of LTRs by tracing their functional foundations, that is, acquisition of RNAse H, a combined promoter/ polyadenylation site, integrase, hairpin priming and the primer binding site (PBS). Available information did not support a simple evolutionary chain of events. PMID:23369192
Specific interaction of mutant p53 with regions of matrix attachment region DNA elements (MARs) with a high potential for base-unpairing

PubMed Central

Will, Katrin; Warnecke, Gabriele; Wiesmüller, Lisa; Deppert, Wolfgang

1998-01-01

Mutant, but not wild-type p53 binds with high affinity to a variety of MAR-DNA elements (MARs), suggesting that MAR-binding of mutant p53 relates to the dominant-oncogenic activities proposed for mutant p53. MARs recognized by mutant p53 share AT richness and contain variations of an AATATATTT “DNA-unwinding motif,” which enhances the structural dynamics of chromatin and promotes regional DNA base-unpairing. Mutant p53 specifically interacted with MAR-derived oligonucleotides carrying such unwinding motifs, catalyzing DNA strand separation when this motif was located within a structurally labile sequence environment. Addition of GC-clamps to the respective MAR-oligonucleotides or introducing mutations into the unwinding motif strongly reduced DNA strand separation, but supported the formation of tight complexes between mutant p53 and such oligonucleotides. We conclude that the specific interaction of mutant p53 with regions of MAR-DNA with a high potential for base-unpairing provides the basis for the high-affinity binding of mutant p53 to MAR-DNA. PMID:9811860
The effects of motif net charge and amphiphilicity on the self-assembly of functionally designer RADA16-I peptides.

PubMed

Wu, Dongni; Zhang, Shuangying; Zhao, Yuyuan; Ao, Ningjian; Ramakrishna, Seeram; He, Liumin

2018-03-16

RADA16-I (Ac-(RADA) 4 -CONH 2 ) is a widely investigated self-assembling peptide (SAP) in the biomedical field. It can undergo ordered self-assembly to form stable secondary structures, thereby further forming a nanofiber hydrogel. The modification of RADA16-I with functional peptide motifs has become a popular research topic. Researchers aim to exhibit particular biomedical signaling, and subsequently, further expand its applications. However, only a few fundamental reports are available on the influences of the peptide motifs on self-assembly mechanisms of designer functional RADA16-I SAPs. In this study, we designed RGD-modified RADA16-I SAPs with a series of net charges and amphiphilicities. The assembly/reassembly of these functionally designer SAPs was thoroughly studied using Raman spectroscopy, CD spectroscopy, and AFM. The nanofiber morphology and the secondary structure largely depended on the balance between the hydrophobic effects versus like-charge repulsions of the motifs, which should be to the focus in order to achieve a tailored nanostructure. Our study would contribute insight into considerations for sophisticated design of SAPs for biomedical applications.
Basic Tilted Helix Bundle - a new protein fold in human FKBP25/FKBP3 and HectD1.

PubMed

Helander, Sara; Montecchio, Meri; Lemak, Alexander; Farès, Christophe; Almlöf, Jonas; Yi, Yanjun; Yee, Adelinda; Arrowsmith, Cheryl; DhePaganon, Sirano; Sunnerhagen, Maria

2014-04-25

In this paper, we describe the structure of a N-terminal domain motif in nuclear-localized FKBP251-73, a member of the FKBP family, together with the structure of a sequence-related subdomain of the E3 ubiquitin ligase HectD1 that we show belongs to the same fold. This motif adopts a compact 5-helix bundle which we name the Basic Tilted Helix Bundle (BTHB) domain. A positively charged surface patch, structurally centered around the tilted helix H4, is present in both FKBP25 and HectD1 and is conserved in both proteins, suggesting a conserved functional role. We provide detailed comparative analysis of the structures of the two proteins and their sequence similarities, and analysis of the interaction of the proposed FKBP25 binding protein YY1. We suggest that the basic motif in BTHB is involved in the observed DNA binding of FKBP25, and that the function of this domain can be affected by regulatory YY1 binding and/or interactions with adjacent domains. Copyright © 2014 Elsevier Inc. All rights reserved.
Conserved thioredoxin fold is present in Pisum sativum L. sieve element occlusion-1 protein

PubMed Central

Umate, Pavan; Tuteja, Renu

2010-01-01

Homology-based three-dimensional model for Pisum sativum sieve element occlusion 1 (Ps.SEO1) (forisomes) protein was constructed. A stretch of amino acids (residues 320 to 456) which is well conserved in all known members of forisomes proteins was used to model the 3D structure of Ps.SEO1. The structural prediction was done using Protein Homology/analogY Recognition Engine (PHYRE) web server. Based on studies of local sequence alignment, the thioredoxin-fold containing protein [Structural Classification of Proteins (SCOP) code d1o73a_], a member of the glutathione peroxidase family was selected as a template for modeling the spatial structure of Ps.SEO1. Selection was based on comparison of primary sequence, higher match quality and alignment accuracy. Motif 1 (EVF) is conserved in Ps.SEO1, Vicia faba (Vf.For1) and Medicago truncatula (MT.SEO3); motif 2 (KKED) is well conserved across all forisomes proteins and motif 3 (IGYIGNP) is conserved in Ps.SEO1 and Vf.For1. PMID:20404566
Hyperactive antifreeze proteins from longhorn beetles: some structural insights.

PubMed

Kristiansen, Erlend; Wilkens, Casper; Vincents, Bjarne; Friis, Dennis; Lorentzen, Anders Blomkild; Jenssen, Håvard; Løbner-Olesen, Anders; Ramløv, Hans

2012-11-01

This study reports on structural characteristics of hyperactive antifreeze proteins (AFPs) from two species of longhorn beetles. In Rhagium mordax, eight unique mRNAs coding for five different mature AFPs were identified from cold-hardy individuals. These AFPs are apparently homologues to a previously characterized AFP from the closely related species Rhagium inquisitor, and consist of six identifiable repeats of a putative ice binding motif TxTxTxT spaced irregularly apart by segments varying in length from 13 to 20 residues. Circular dichroism spectra show that the AFPs from both species have a high content of β-sheet and low levels of α-helix and random coil. Theoretical predictions of residue-specific secondary structure locate these β-sheets within the putative ice-binding motifs and the central parts of the segments separating them, consistent with an overall β-helical structure with the ice-binding motifs stacked in a β-sheet on one side of the coil. Molecular dynamics models based on these findings show that these AFPs would be energetically stable in a β-helical conformation. Copyright © 2012 Elsevier Ltd. All rights reserved.
Structural and energetic study of cation-π-cation interactions in proteins.

PubMed

Pinheiro, Silvana; Soteras, Ignacio; Gelpí, Josep Lluis; Dehez, François; Chipot, Christophe; Luque, F Javier; Curutchet, Carles

2017-04-12

Cation-π interactions of aromatic rings and positively charged groups are among the most important interactions in structural biology. The role and energetic characteristics of these interactions are well established. However, the occurrence of cation-π-cation interactions is an unexpected motif, which raises intriguing questions about its functional role in proteins. We present a statistical analysis of the occurrence, composition and geometrical preferences of cation-π-cation interactions identified in a set of non-redundant protein structures taken from the Protein Data Bank. Our results demonstrate that this structural motif is observed at a small, albeit non-negligible frequency in proteins, and suggest a preference to establish cation-π-cation motifs with Trp, followed by Tyr and Phe. Furthermore, we have found that cation-π-cation interactions tend to be highly conserved, which supports their structural or functional role. Finally, we have performed an energetic analysis of a representative subset of cation-π-cation complexes combining quantum-chemical and continuum solvation calculations. Our results point out that the protein environment can strongly screen the cation-cation repulsion, leading to an attractive interaction in 64% of the complexes analyzed. Together with the high degree of conservation observed, these results suggest a potential stabilizing role in the protein fold, as demonstrated recently for a miniature protein (Craven et al., J. Am. Chem. Soc. 2016, 138, 1543). From a computational point of view, the significant contribution of non-additive three-body terms challenges the suitability of standard additive force fields for describing cation-π-cation motifs in molecular simulations.
Structural and Functional Basis of the Fidelity of Nucleotide Selection by Flavivirus RNA-Dependent RNA Polymerases

PubMed Central

Canard, Bruno

2018-01-01

Viral RNA-dependent RNA polymerases (RdRps) play a central role not only in viral replication, but also in the genetic evolution of viral RNAs. After binding to an RNA template and selecting 5′-triphosphate ribonucleosides, viral RdRps synthesize an RNA copy according to Watson-Crick base-pairing rules. The copy process sometimes deviates from both the base-pairing rules specified by the template and the natural ribose selectivity and, thus, the process is error-prone due to the intrinsic (in)fidelity of viral RdRps. These enzymes share a number of conserved amino-acid sequence strings, called motifs A–G, which can be defined from a structural and functional point-of-view. A co-relation is gradually emerging between mutations in these motifs and viral genome evolution or observed mutation rates. Here, we review our current knowledge on these motifs and their role on the structural and mechanistic basis of the fidelity of nucleotide selection and RNA synthesis by Flavivirus RdRps. PMID:29385764
Molecular cloning and characterization of sea bass (Dicentrarchus labrax, L.) calreticulin.

PubMed

Pinto, Rute D; Moreira, Ana R; Pereira, Pedro J B; dos Santos, Nuno M S

2013-06-01

Mammalian calreticulin (CRT) is a key molecular chaperone and regulator of Ca(2+) homeostasis in endoplasmic reticulum (ER), also being implicated in a variety of physiological/pathological processes outside the ER. Importantly, it is involved in assembly of MHC class I molecules. In this work, sea bass (Dicentrarchus labrax) CRT (Dila-CRT) gene and cDNA have been isolated and characterized. The mature protein retains two conserved motifs, three structural/functional domains (N, P and C), three type 1 and 2 motifs repeated in tandem, a conserved pair of cysteines and ER-retention motif. It is a single-copy gene composed of 9 exons. Dila-CRT three-dimensional homology models are consistent with the structural features described for mammalian molecules. Together, these results are supportive of a highly conserved structure of CRT through evolution. Moreover, the present data provides information that will allow further studies on sea bass CRT involvement in immunity and in particular class I antigen presentation. Copyright © 2013 Elsevier Ltd. All rights reserved.
Unique scorpion toxin with a putative ancestral fold provides insight into evolution of the inhibitor cystine knot motif.

PubMed

Smith, Jennifer J; Hill, Justine M; Little, Michelle J; Nicholson, Graham M; King, Glenn F; Alewood, Paul F

2011-06-28

The three-disulfide inhibitor cystine knot (ICK) motif is a fold common to venom peptides from spiders, scorpions, and aquatic cone snails. Over a decade ago it was proposed that the ICK motif is an elaboration of an ancestral two-disulfide fold coined the disulfide-directed β-hairpin (DDH). Here we report the isolation, characterization, and structure of a novel toxin [U(1)-liotoxin-Lw1a (U(1)-LITX-Lw1a)] from the venom of the scorpion Liocheles waigiensis that is the first example of a native peptide that adopts the DDH fold. U(1)-LITX-Lw1a not only represents the discovery of a missing link in venom protein evolution, it is the first member of a fourth structural fold to be adopted by scorpion-venom peptides. Additionally, we show that U(1)-LITX-Lw1a has potent insecticidal activity across a broad range of insect pest species, thereby providing a unique structural scaffold for bioinsecticide development.
Methylation of class I translation termination factors: structural and functional aspects.

PubMed

Graille, Marc; Figaro, Sabine; Kervestin, Stéphanie; Buckingham, Richard H; Liger, Dominique; Heurgué-Hamard, Valérie

2012-07-01

During protein synthesis, release of polypeptide from the ribosome occurs when an in frame termination codon is encountered. Contrary to sense codons, which are decoded by tRNAs, stop codons present in the A-site are recognized by proteins named class I release factors, leading to the release of newly synthesized proteins. Structures of these factors bound to termination ribosomal complexes have recently been obtained, and lead to a better understanding of stop codon recognition and its coordination with peptidyl-tRNA hydrolysis in bacteria. Release factors contain a universally conserved GGQ motif which interacts with the peptidyl-transferase centre to allow peptide release. The Gln side chain from this motif is methylated, a feature conserved from bacteria to man, suggesting an important biological role. However, methylation is catalysed by completely unrelated enzymes. The function of this motif and its post-translational modification will be discussed in the context of recent structural and functional studies. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
THGS: a web-based database of Transmembrane Helices in Genome Sequences

PubMed Central

Fernando, S. A.; Selvarani, P.; Das, Soma; Kumar, Ch. Kiran; Mondal, Sukanta; Ramakumar, S.; Sekar, K.

2004-01-01

Transmembrane Helices in Genome Sequences (THGS) is an interactive web-based database, developed to search the transmembrane helices in the user-interested gene sequences available in the Genome Database (GDB). The proposed database has provision to search sequence motifs in transmembrane and globular proteins. In addition, the motif can be searched in the other sequence databases (Swiss-Prot and PIR) or in the macromolecular structure database, Protein Data Bank (PDB). Further, the 3D structure of the corresponding queried motif, if it is available in the solved protein structures deposited in the Protein Data Bank, can also be visualized using the widely used graphics package RASMOL. All the sequence databases used in the present work are updated frequently and hence the results produced are up to date. The database THGS is freely available via the world wide web and can be accessed at http://pranag.physics.iisc.ernet.in/thgs/ or http://144.16.71.10/thgs/. PMID:14681375
Structural characterization of Helicobacter pylori dethiobiotin synthetase reveals differences between family members

DOE Office of Scientific and Technical Information (OSTI.GOV)

Porebski, Przemyslaw J.; Klimecka, Maria; Chruszcz, Maksymilian

2012-07-11

Dethiobiotin synthetase (DTBS) is involved in the biosynthesis of biotin in bacteria, fungi, and plants. As humans lack this pathway, DTBS is a promising antimicrobial drug target. We determined structures of DTBS from Helicobacter pylori (hpDTBS) bound with cofactors and a substrate analog, and described its unique characteristics relative to other DTBS proteins. Comparison with bacterial DTBS orthologs revealed considerable structural differences in nucleotide recognition. The C-terminal region of DTBS proteins, which contains two nucleotide-recognition motifs, differs greatly among DTBS proteins from different species. The structure of hpDTBS revealed that this protein is unique and does not contain a C-terminalmore » region containing one of the motifs. The single nucleotide-binding motif in hpDTBS is similar to its counterpart in GTPases; however, isothermal titration calorimetry binding studies showed that hpDTBS has a strong preference for ATP. The structural determinants of ATP specificity were assessed with X-ray crystallographic studies of hpDTBS-ATP and hpDTBS-GTP complexes. The unique mode of nucleotide recognition in hpDTBS makes this protein a good target for H. pylori-specific inhibitors of the biotin synthesis pathway.« less
Crystal structures reveal metal-binding plasticity at the metallo-β-lactamase active site of PqqB from Pseudomonas putida

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tu, Xiongying; Latham, John A.; Klema, Valerie J.

PqqB is an enzyme involved in the biosynthesis of pyrroloquinoline quinone and a distal member of the metallo-β-lactamase (MBL) superfamily. PqqB lacks two residues in the conserved signature motif HxHxDH that makes up the key metal-chelating elements that can bind up to two metal ions at the active site of MBLs and other members of its superfamily. Here, we report crystal structures of PqqB bound to Mn2+, Mg2+, Cu2+, and Zn2+. These structures demonstrate that PqqB can still bind metal ions at the canonical MBL active site. The fact that PqqB can adapt its side chains to chelate a widemore » spectrum of metal ions with different coordination features on a uniform main chain scaffold demonstrates its metal-binding plasticity. This plasticity may provide insights into the structural basis of promiscuous activities found in ensembles of metal complexes within this superfamily. Furthermore, PqqB belongs to a small subclass of MBLs that contain an additional CxCxxC motif that binds a structural Zn2+. Our data support a key role for this motif in dimerization.« less

Unique Structural Features and Sequence Motifs of Proline Utilization A (PutA)

PubMed Central

Singh, Ranjan K.; Tanner, John J.

2013-01-01

Proline utilization A proteins (PutAs) are bifunctional enzymes that catalyze the oxidation of proline to glutamate using spatially separated proline dehydrogenase and pyrroline-5-carboxylate dehydrogenase active sites. Here we use the crystal structure of the minimalist PutA from Bradyrhizobium japonicum (BjPutA) along with sequence analysis to identify unique structural features of PutAs. This analysis shows that PutAs have secondary structural elements and domains not found in the related monofunctional enzymes. Some of these extra features are predicted to be important for substrate channeling in BjPutA. Multiple sequence alignment analysis shows that some PutAs have a 17-residue conserved motif in the C-terminal 20–30 residues of the polypeptide chain. The BjPutA structure shows that this motif helps seal the internal substrate-channeling cavity from the bulk medium. Finally, it is shown that some PutAs have a 100–200 residue domain of unknown function in the C-terminus that is not found in minimalist PutAs. Remote homology detection suggests that this domain is homologous to the oligomerization beta-hairpin and Rossmann fold domain of BjPutA. PMID:22201760
(φ,ψ)2-motifs: a purely conformation-based, fine-grained enumeration of protein parts at the two-residue level

PubMed Central

Hollingsworth, Scott A.; Lewis, Matthew C.; Berkholz, Donald S.; Wong, Weng-Keen; Karplus, P. Andrew

2011-01-01

A deep understanding of protein structure benefits from the use of a variety of classification strategies that enhance our ability to effectively describe local patterns of conformation. Here, we use a clustering algorithm to analyze 76,533 all-trans segments from protein structures solved at 1.2 Å resolution or better to create a purely φ,ψ-based comprehensive empirical categorization of common conformations adopted by two adjacent φ,ψ-pairs (i.e. (φ,ψ)2-motifs). The clustering algorithm works in an origin-shifted 4-dimensional space based on the two φ,ψ-pairs to yield a parameter-dependent list of (φ,ψ)2-motifs – in order of their prominence. The results are remarkably distinct from and complementary to the standard hydrogen-bond centered view of secondary structure. New insights include an unprecedented level of precision in describing the φ,ψ-angles of both previously known and novel motifs, an ordering of these motifs by their population density, a data-driven recommendation that the standard Cαi…Cαi+3 < 7 Å criteria for defining turns be changed to 6.5 Å, an identification of β-strand and turn capping motifs, and of conformational capping by residues in the polypeptide-II (PII) conformation. We further document that the conformational preferences of a residue are substantially influenced by the conformation of its neighbors, and suggest that accounting for these dependencies will improve protein modeling accuracy. Although the CUEVAS-4D(r10є14) “parts list” presented here is only an initial exploration of the complex (φ,ψ)2-landscape of proteins, it shows there is value to be had from this approach and opens the door to more in-depth characterizations at the (φ,ψ)2-level and at higher dimensions. PMID:22198294
(φ,ψ)₂ motifs: a purely conformation-based fine-grained enumeration of protein parts at the two-residue level.

PubMed

Hollingsworth, Scott A; Lewis, Matthew C; Berkholz, Donald S; Wong, Weng-Keen; Karplus, P Andrew

2012-02-10

A deep understanding of protein structure benefits from the use of a variety of classification strategies that enhance our ability to effectively describe local patterns of conformation. Here, we use a clustering algorithm to analyze 76,533 all-trans segments from protein structures solved at 1.2 Å resolution or better to create a purely φ,ψ-based comprehensive empirical categorization of common conformations adopted by two adjacent φ,ψ pairs (i.e., (φ,ψ)(2) motifs). The clustering algorithm works in an origin-shifted four-dimensional space based on the two φ,ψ pairs to yield a parameter-dependent list of (φ,ψ)(2) motifs, in order of their prominence. The results are remarkably distinct from and complementary to the standard hydrogen-bond-centered view of secondary structure. New insights include an unprecedented level of precision in describing the φ,ψ angles of both previously known and novel motifs, ordering of these motifs by their population density, a data-driven recommendation that the standard C(α(i))…C(α(i+3))<7 Å criteria for defining turns be changed to 6.5 Å, identification of β-strand and turn capping motifs, and identification of conformational capping by residues in polypeptide II conformation. We further document that the conformational preferences of a residue are substantially influenced by the conformation of its neighbors, and we suggest that accounting for these dependencies will improve protein modeling accuracy. Although the CUEVAS-4D(r(10)є(14)) 'parts list' presented here is only an initial exploration of the complex (φ,ψ)(2) landscape of proteins, it shows that there is value to be had from this approach, and it opens the door to more in-depth characterizations at the (φ,ψ)(2) level and at higher dimensions. Copyright © 2011 Elsevier Ltd. All rights reserved.
Identification of the WW domain-interaction sites in the unstructured N-terminal domain of EBV LMP 2A.

PubMed

Seo, Min-Duk; Park, Sung Jean; Kim, Hyun-Jung; Lee, Bong Jin

2007-01-09

Epstein-Barr virus latency is maintained by the latent membrane protein (LMP) 2A, which mimics the B-cell receptor (BCR) and perturbs BCR signaling. The cytoplasmic N-terminal domain of LMP2A is composed of 119 amino acids. The N-terminal domain of LMP2A (LMP2A NTD) contains two PY motifs (PPPPY) that interact with the WW domains of Nedd4 family ubiquitin-protein ligases. Based on our analysis of NMR data, we found that the LMP2A NTD adopts an overall random-coil structure in its native state. However, the region between residues 60 and 90 was relatively ordered, and seemed to form the hydrophobic core of the LMP2A NTD. This region resides between two PY motifs and is important for WW domain binding. Mapping of the residues involved in the interaction between the LMP2A NTD and WW domains was achieved by chemical shift perturbation, by the addition of WW2 and WW3 peptides. Interestingly, the binding of the WW domains mainly occurred in the hydrophobic core of the LMP2A NTD. In addition, we detected a difference in the binding modes of the two PY motifs against the two WW peptides. The binding of the WW3 peptide caused the resonances of five residues (Tyr(60), Glu(61), Asp(62), Trp(65), and Gly(66)) just behind the N-terminal PY motif of the LMP2A NTD to disappear. A similar result was obtained with WW2 binding. However, near the C-terminal PY motif, the chemical shift perturbation caused by WW2 binding was different from that due to WW3 binding, indicating that the residues near the PY motifs are involved in selective binding of WW domains. The present work represents the first structural study of the LMP2A NTD and provides fundamental structural information about its interaction with ubiquitin-protein ligase.
Diversity surveys and evolutionary relationships of aoxB genes in aerobic arsenite-oxidizing bacteria.

PubMed

Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N; Garrido, Francis; Joulian, Catherine

2008-07-01

A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers.
Structural evolution of nrDNA ITS in Pinaceae and its phylogenetic implications.

PubMed

Kan, Xian-Zhao; Wang, Shan-Shan; Ding, Xin; Wang, Xiao-Quan

2007-08-01

Nuclear ribosomal DNA (nrDNA) has been considered as an important tool for inferring phylogenetic relationships at many taxonomic levels. In comparison with its fast concerted evolution in angiosperms, nrDNA is symbolized by slow concerted evolution and substantial ITS region length variation in gymnosperms, particularly in Pinaceae. Here we studied structure characteristics, including subrepeat composition, size, GC content and secondary structure, of nrDNA ITS regions of all Pinaceae genera. The results showed that the ITS regions of all taxa studied contained subrepeat units, ranging from 2 to 9 in number, and these units could be divided into two types, longer subrepeat (LSR) without the motif (5'-GGCCACCCTAGTC) and shorter subrepeat (SSR) with the motif. Phylogenetic analyses indicate that the homology of some SSRs still can be recognized, providing important informations for the evolutionary history of nrDNA ITS and phylogeny of Pinaceae. In particular, the adjacent tandem SSRs are not more closely related to one another than they are to remote SSRs in some genera, which may imply that multiple structure variations such as recombination have occurred in the ITS1 region of these groups. This study also found that GC content in the ITS1 region is relevant to its sequence length and subrepeat number, and could provide some phylogenetic information, especially supporting the close relationships among Picea, Pinus, and Cathaya. Moreover, several characteristics of the secondary structure of Pinaceae ITS1 were found as follows: (1) the structure is dominated by several extended hairpins; (2) the configuration complexity is positively correlated with subrepeat number; (3) paired subrepeats often partially overlap at the conserved motif (5'-GGCCACCCTAGTC), and form a long stem, while other subrepeats fold onto itself, leaving part of the conserved motif exposed in hairpin loops.
Structural basis for the substrate selectivity of a HAD phosphatase from Thermococcus onnurineus NA1.

PubMed

Ngo, Tri Duc; Van Le, Binh; Subramani, Vinod Kumar; Thi Nguyen, Chi My; Lee, Hyun Sook; Cho, Yona; Kim, Kyeong Kyu; Hwang, Hye-Yeon

2015-05-22

Proteins in the haloalkaloic acid dehalogenase (HAD) superfamily, which is one of the largest enzyme families, is generally composed of a catalytic core domain and a cap domain. Although proteins in this family show broad substrate specificities, the mechanisms of their substrate recognition are not well understood. In this study, we identified a new substrate binding motif of HAD proteins from structural and functional analyses, and propose that this motif might be crucial for interacting with hydrophobic rings of substrates. The crystal structure of TON_0338, one of the 17 putative HAD proteins identified in a hyperthermophilic archaeon, Thermococcus onnurineus NA1, was determined as an apo-form at 2.0 Å resolution. In addition, we determined the crystal structure TON_0338 in complex with Mg(2+) or N-cyclohexyl-2-aminoethanesulfonic acid (CHES) at 1.7 Å resolution. Examination of the apo-form and CHES-bound structures revealed that CHES is sandwiched between Trp58 and Trp61, suggesting that this Trp sandwich might function as a substrate recognition motif. In the phosphatase assay, TON_0338 was shown to have high activity for flavin mononucleotide (FMN), and the docking analysis suggested that the flavin of FMN may interact with Trp58 and Trp61 in a way similar to that observed in the crystal structure. Moreover, the replacement of these tryptophan residues significantly reduced the phosphatase activity for FMN. Our results suggest that WxxW may function as a substrate binding motif in HAD proteins, and expand the diversity of their substrate recognition mode. Copyright © 2015 Elsevier Inc. All rights reserved.
Crystal structure of AFV1-102, a protein from the acidianus filamentous virus 1

PubMed Central

Keller, Jenny; Leulliot, Nicolas; Collinet, Bruno; Campanacci, Valerie; Cambillau, Christian; Pranghisvilli, David; van Tilbeurgh, Herman

2009-01-01

Viruses infecting hyperthermophilic archaea have intriguing morphologies and genomic properties. The vast majority of their genes do not have homologs other than in other hyperthermophilic viruses, and the biology of these viruses is poorly understood. As part of a structural genomics project on the proteins of these viruses, we present here the structure of a 102 amino acid protein from acidianus filamentous virus 1 (AFV1-102). The structure shows that it is made of two identical motifs that have poor sequence similarity. Although no function can be proposed from structural analysis, tight binding of the gateway tag peptide in a groove between the two motifs suggests AFV1-102 is involved in protein protein interactions. PMID:19319936
A Conserved GPG-Motif in the HIV-1 Nef Core Is Required for Principal Nef-Activities

PubMed Central

Martínez-Bonet, Marta; Palladino, Claudia; Briz, Veronica; Rudolph, Jochen M.; Fackler, Oliver T.; Relloso, Miguel; Muñoz-Fernandez, Maria Angeles; Madrid, Ricardo

2015-01-01

To find out new determinants required for Nef activity we performed a functional alanine scanning analysis along a discrete but highly conserved region at the core of HIV-1 Nef. We identified the GPG-motif, located at the 121–137 region of HIV-1 NL4.3 Nef, as a novel protein signature strictly required for the p56Lck dependent Nef-induced CD4-downregulation in T-cells. Since the Nef-GPG motif was dispensable for CD4-downregulation in HeLa-CD4 cells, Nef/AP-1 interaction and Nef-dependent effects on Tf-R trafficking, the observed effects on CD4 downregulation cannot be attributed to structure constraints or to alterations on general protein trafficking. Besides, we found that the GPG-motif was also required for Nef-dependent inhibition of ring actin re-organization upon TCR triggering and MHCI downregulation, suggesting that the GPG-motif could actively cooperate with the Nef PxxP motif for these HIV-1 Nef-related effects. Finally, we observed that the Nef-GPG motif was required for optimal infectivity of those viruses produced in T-cells. According to these findings, we propose the conserved GPG-motif in HIV-1 Nef as functional region required for HIV-1 infectivity and therefore with a potential interest for the interference of Nef activity during HIV-1 infection. PMID:26700863
Sequence information gain based motif analysis.

PubMed

Maynou, Joan; Pairó, Erola; Marco, Santiago; Perera, Alexandre

2015-11-09

The detection of regulatory regions in candidate sequences is essential for the understanding of the regulation of a particular gene and the mechanisms involved. This paper proposes a novel methodology based on information theoretic metrics for finding regulatory sequences in promoter regions. This methodology (SIGMA) has been tested on genomic sequence data for Homo sapiens and Mus musculus. SIGMA has been compared with different publicly available alternatives for motif detection, such as MEME/MAST, Biostrings (Bioconductor package), MotifRegressor, and previous work such Qresiduals projections or information theoretic based detectors. Comparative results, in the form of Receiver Operating Characteristic curves, show how, in 70% of the studied Transcription Factor Binding Sites, the SIGMA detector has a better performance and behaves more robustly than the methods compared, while having a similar computational time. The performance of SIGMA can be explained by its parametric simplicity in the modelling of the non-linear co-variability in the binding motif positions. Sequence Information Gain based Motif Analysis is a generalisation of a non-linear model of the cis-regulatory sequences detection based on Information Theory. This generalisation allows us to detect transcription factor binding sites with maximum performance disregarding the covariability observed in the positions of the training set of sequences. SIGMA is freely available to the public at http://b2slab.upc.edu.
Grafting of functional motifs onto protein scaffolds identified by PDB screening--an efficient route to design optimizable protein binders.

PubMed

Tlatli, Rym; Nozach, Hervé; Collet, Guillaume; Beau, Fabrice; Vera, Laura; Stura, Enrico; Dive, Vincent; Cuniasse, Philippe

2013-01-01

Artificial miniproteins that are able to target catalytic sites of matrix metalloproteinases (MMPs) were designed using a functional motif-grafting approach. The motif corresponded to the four N-terminal residues of TIMP-2, a broad-spectrum protein inhibitor of MMPs. Scaffolds that are able to reproduce the functional topology of this motif were obtained by exhaustive screening of the Protein Data Bank (PDB) using STAMPS software (search for three-dimensional atom motifs in protein structures). Ten artificial protein binders were produced. The designed proteins bind catalytic sites of MMPs with affinities ranging from 450 nm to 450 μm prior to optimization. The crystal structure of one artificial binder in complex with the catalytic domain of MMP-12 showed that the inter-molecular interactions established by the functional motif in the artificial binder corresponded to those found in the MMP-14-TIMP-2 complex, albeit with some differences in geometry. Molecular dynamics simulations of the ten binders in complex with MMP-14 suggested that these scaffolds may allow partial reproduction of native inter-molecular interactions, but differences in geometry and stability may contribute to the lower affinity of the artificial protein binders compared to the natural protein binder. Nevertheless, these results show that the in silico design method used provides sets of protein binders that target a specific binding site with a good rate of success. This approach may constitute the first step of an efficient hybrid computational/experimental approach to protein binder design. © 2012 The Authors Journal compilation © 2012 FEBS.
Divergent Synthesis of Chondroitin Sulfate Disaccharides and Identification of Sulfate Motifs that Inhibit Triple Negative Breast Cancer

NASA Astrophysics Data System (ADS)

Wei Poh, Zhong; Heng Gan, Chin; Lee, Eric J.; Guo, Suxian; Yip, George W.; Lam, Yulin

2015-09-01

Glycosaminoglycans (GAGs) regulate many important physiological processes. A pertinent issue to address is whether GAGs encode important functional information via introduction of position specific sulfate groups in the GAG structure. However, procurement of pure, homogenous GAG motifs to probe the “sulfation code” is a challenging task due to isolation difficulty and structural complexity. To this end, we devised a versatile synthetic strategy to obtain all the 16 theoretically possible sulfation patterns in the chondroitin sulfate (CS) repeating unit; these include rare but potentially important sulfated motifs which have not been isolated earlier. Biological evaluation indicated that CS sulfation patterns had differing effects for different breast cancer cell types, and the greatest inhibitory effect was observed for the most aggressive, triple negative breast cancer cell line MDA-MB-231.
Structural insights into species-specific features of the ribosome from the pathogen Staphylococcus aureus

PubMed Central

Eyal, Zohar; Matzov, Donna; Krupkin, Miri; Wekselman, Itai; Paukner, Susanne; Zimmerman, Ella; Rozenberg, Haim; Bashan, Anat; Yonath, Ada

2015-01-01

The emergence of bacterial multidrug resistance to antibiotics threatens to cause regression to the preantibiotic era. Here we present the crystal structure of the large ribosomal subunit from Staphylococcus aureus, a versatile Gram-positive aggressive pathogen, and its complexes with the known antibiotics linezolid and telithromycin, as well as with a new, highly potent pleuromutilin derivative, BC-3205. These crystal structures shed light on specific structural motifs of the S. aureus ribosome and the binding modes of the aforementioned antibiotics. Moreover, by analyzing the ribosome structure and comparing it with those of nonpathogenic bacterial models, we identified some unique internal and peripheral structural motifs that may be potential candidates for improving known antibiotics and for use in the design of selective antibiotic drugs against S. aureus. PMID:26464510
Structure and anticoagulant activity of a sulfated galactan from the red alga, Gelidium crinale. Is there a specific structural requirement for the anticoagulant action?

PubMed

Pereira, Maria G; Benevides, Norma M B; Melo, Marcia R S; Valente, Ana Paula; Melo, Fábio R; Mourão, Paulo A S

2005-09-05

Marine red algae are an abundant source of sulfated galactans with potent anticoagulant activity. However, the specific structural motifs that confer biological activity remain to be elucidated. We have now isolated and purified a sulfated galactan from the marine red alga, Gellidium crinale. The structure of this polysaccharide was determined using NMR spectroscopy. It is composed of the repeating structure -4-alpha-Galp-(1-->3)-beta-Galp1--> but with a variable sulfation pattern. Clearly 15% of the total alpha-units are 2,3-di-sulfated and another 55% are 2-sulfated. No evidence for the occurrence of 3,6-anhydro alpha-galactose units was observed in the NMR spectra. We also compared the anticoagulant activity of this sulfated galactan with a polysaccharide from the species, Botryocladia occidentalis, with a similar saccharide chain but with higher amounts of 2,3-di-sulfated alpha-units. The sulfated galactan from G. crinale has a lower anticoagulant activity on a clotting assay when compared with the polysaccharide from B. occidentalis. When tested in assays using specific proteases and coagulation inhibitors, these two galactans showed significant differences in their activity. They do not differ in thrombin inhibition mediated by antithrombin, but in assays where heparin cofactor II replaces antithrombin, the sulfated galactan from G. crinale requires a significantly higher concentration to achieve the same inhibitory effect as the polysaccharide from B. occidentalis. In contrast, when factor Xa instead of thrombin is used as the target protease, the sulfated galactan from G. crinale is a more potent anticoagulant. These observations suggest that the proportion and/or the distribution of 2,3-di-sulfated alpha-units along the galactan chain may be a critical structural motif to promote the interaction of the protease with specific protease and coagulation inhibitors.
Structural basis for concerted recruitment and activation of IRF-3 by innate immune adaptor proteins

DOE PAGES

Zhao, Baoyu; Shu, Chang; Gao, Xinsheng; ...

2016-06-02

Type I IFNs are key cytokines mediating innate antiviral immunity. cGMP-AMP synthase, ritinoic acid-inducible protein 1 (RIG-I)–like receptors, and Toll-like receptors recognize microbial double-stranded (ds)DNA, dsRNA, and LPS to induce the expression of type I IFNs. These signaling pathways converge at the recruitment and activation of the transcription factor IRF-3 (IFN regulatory factor 3). The adaptor proteins STING (stimulator of IFN genes), MAVS (mitochondrial antiviral signaling), and TRIF (TIR domain-containing adaptor inducing IFN-β) mediate the recruitment of IRF-3 through a conserved pLxIS motif. Here in this paper, we show that the pLxIS motif of phosphorylated STING, MAVS, and TRIF bindsmore » to IRF-3 in a similar manner, whereas residues upstream of the motif confer specificity. The structure of the IRF-3 phosphomimetic mutant S386/396E bound to the cAMP response element binding protein (CREB)-binding protein reveals that the pLxIS motif also mediates IRF-3 dimerization and activation. Moreover, rotavirus NSP1 (nonstructural protein 1) employs a pLxIS motif to target IRF-3 for degradation, but phosphorylation of NSP1 is not required for its activity. These results suggest a concerted mechanism for the recruitment and activation of IRF-3 that can be subverted by viral proteins to evade innate immune responses.« less
Roles of conserved proline and glycosyltransferase motifs of EmbC in biosynthesis of lipoarabinomannan.

PubMed

Berg, Stefan; Starbuck, James; Torrelles, Jordi B; Vissa, Varalakshmi D; Crick, Dean C; Chatterjee, Delphi; Brennan, Patrick J

2005-02-18

D-Arabinans, composed of D-arabinofuranose (D-Araf), dominate the structure of mycobacterial cell walls in two settings, as part of lipoarabinomannan (LAM) and arabinogalactan, each with markedly different structures and functions. Little is known of the complexity of their biosynthesis. beta-D-Arabinofuranosyl-1-monophosphoryldecaprenol is the only known sugar donor. EmbA, EmbB, and EmbC, products of the paralogous genes embA, embB, and embC, the sites of resistance to the anti-tuberculosis drug ethambutol (EMB), are the only known implicated enzymes. EmbA and -B apparently contribute to the synthesis of arabinogalactan, whereas EmbC is reserved for the synthesis of LAM. The Emb proteins show no overall similarity to any known proteins beyond Mycobacterium and related genera. However, functional motifs, equivalent to a proline-rich motif of several bacterial polysaccharide co-polymerases and a superfamily of glycosyltransferases, were found. Site-directed mutagenesis in glycosyltransferase superfamily C resulted in complete ablation of LAM synthesis. Point mutations in three amino acids of the proline motif of EmbC resulted in marked reduction of LAM-arabinan synthesis and accumulation of an unknown intermediate and of the known precursor lipomannan. Yet the pattern of the differently linked d-Araf units observed in wild type LAM-arabinan was largely retained in the proline motif mutants. The results allow for the presentation of a unique model of arabinan synthesis.
Structural and functional analysis of the GABARAP interaction motif (GIM)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rogov, Vladimir V.; Stolz, Alexandra; Ravichandran, Arvind C.

Through the canonical LC3 interaction motif (LIR), [W/F/Y]–X 1–X 2[I/L/V], protein complexes are recruited to autophagosomes to perform their functions as either autophagy adaptors or receptors. How these adaptors/receptors selectively interact with either LC3 or GABARAP families remains unclear. Herein, we determine the range of selectivity of 30 known core LIR motifs towards individual LC3s and GABARAPs. From these, we define a GABARAP Interaction Motif (GIM) sequence ([W/F]–[V/I]–X 2–V) that the adaptor protein PLEKHM1 tightly conforms to. Using biophysical and structural approaches, we show that the PLEKHM1–LIR is indeed 11–fold more specific for GABARAP than LC3B. Selective mutation of themore » X 1 and X 2 positions either completely abolished the interaction with all LC3 and GABARAPs or increased PLEKHM1–GIM selectivity 20–fold towards LC3B. Finally, we show that conversion of p62/SQSTM1, FUNDC1 and FIP200 LIRs into our newly defined GIM, by introducing two valine residues, enhances their interaction with endogenous GABARAP over LC3B. In conclusion, the identification of a GABARAP–specific interaction motif will aid the identification and characterization of the expanding array of autophagy receptor and adaptor proteins and their in vivo functions.« less
Structural basis for concerted recruitment and activation of IRF-3 by innate immune adaptor proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhao, Baoyu; Shu, Chang; Gao, Xinsheng

Type I IFNs are key cytokines mediating innate antiviral immunity. cGMP-AMP synthase, ritinoic acid-inducible protein 1 (RIG-I)–like receptors, and Toll-like receptors recognize microbial double-stranded (ds)DNA, dsRNA, and LPS to induce the expression of type I IFNs. These signaling pathways converge at the recruitment and activation of the transcription factor IRF-3 (IFN regulatory factor 3). The adaptor proteins STING (stimulator of IFN genes), MAVS (mitochondrial antiviral signaling), and TRIF (TIR domain-containing adaptor inducing IFN-β) mediate the recruitment of IRF-3 through a conserved pLxIS motif. Here in this paper, we show that the pLxIS motif of phosphorylated STING, MAVS, and TRIF bindsmore » to IRF-3 in a similar manner, whereas residues upstream of the motif confer specificity. The structure of the IRF-3 phosphomimetic mutant S386/396E bound to the cAMP response element binding protein (CREB)-binding protein reveals that the pLxIS motif also mediates IRF-3 dimerization and activation. Moreover, rotavirus NSP1 (nonstructural protein 1) employs a pLxIS motif to target IRF-3 for degradation, but phosphorylation of NSP1 is not required for its activity. These results suggest a concerted mechanism for the recruitment and activation of IRF-3 that can be subverted by viral proteins to evade innate immune responses.« less
Structural and functional analysis of the GABARAP interaction motif (GIM)

DOE PAGES

Rogov, Vladimir V.; Stolz, Alexandra; Ravichandran, Arvind C.; ...

2017-06-27

Through the canonical LC3 interaction motif (LIR), [W/F/Y]–X 1–X 2[I/L/V], protein complexes are recruited to autophagosomes to perform their functions as either autophagy adaptors or receptors. How these adaptors/receptors selectively interact with either LC3 or GABARAP families remains unclear. Herein, we determine the range of selectivity of 30 known core LIR motifs towards individual LC3s and GABARAPs. From these, we define a GABARAP Interaction Motif (GIM) sequence ([W/F]–[V/I]–X 2–V) that the adaptor protein PLEKHM1 tightly conforms to. Using biophysical and structural approaches, we show that the PLEKHM1–LIR is indeed 11–fold more specific for GABARAP than LC3B. Selective mutation of themore » X 1 and X 2 positions either completely abolished the interaction with all LC3 and GABARAPs or increased PLEKHM1–GIM selectivity 20–fold towards LC3B. Finally, we show that conversion of p62/SQSTM1, FUNDC1 and FIP200 LIRs into our newly defined GIM, by introducing two valine residues, enhances their interaction with endogenous GABARAP over LC3B. In conclusion, the identification of a GABARAP–specific interaction motif will aid the identification and characterization of the expanding array of autophagy receptor and adaptor proteins and their in vivo functions.« less
A new test of computational protein design: predicting posttranslational modification specificity for the enzyme SMYD2.

PubMed

Reynolds, Kimberly A

2015-01-06

In this issue of Structure, Lanouette and colleagues use a combination of computation and experiment to define a specificity motif for the lysine methyltransferase SMYD2. Using this motif, they predict and experimentally verify four new SMYD2 substrates. Copyright © 2015 Elsevier Ltd. All rights reserved.

Hypersusceptibility to substrate analogs conferred by mutations in human immunodeficiency virus type 1 reverse transcriptase.

PubMed

Smith, Robert A; Anderson, Donovan J; Preston, Bradley D

2006-07-01

Human immunodeficiency virus type 1 (HIV-1) reverse transcriptase (RT) contains four structural motifs (A, B, C, and D) that are conserved in polymerases from diverse organisms. Motif B interacts with the incoming nucleotide, the template strand, and key active-site residues from other motifs, suggesting that motif B is an important determinant of substrate specificity. To examine the functional role of this region, we performed "random scanning mutagenesis" of 11 motif B residues and screened replication-competent mutants for altered substrate analog sensitivity in culture. Single amino acid replacements throughout the targeted region conferred resistance to lamivudine and/or hypersusceptibility to zidovudine (AZT). Substitutions at residue Q151 increased the sensitivity of HIV-1 to multiple nucleoside analogs, and a subset of these Q151 variants was also hypersusceptible to the pyrophosphate analog phosphonoformic acid (PFA). Other AZT-hypersusceptible mutants were resistant to PFA and are therefore phenotypically similar to PFA-resistant variants selected in vitro and in infected patients. Collectively, these data show that specific amino acid replacements in motif B confer broad-spectrum hypersusceptibility to substrate analog inhibitors. Our results suggest that motif B influences RT-deoxynucleoside triphosphate interactions at multiple steps in the catalytic cycle of polymerization.
Two-level tunneling systems in amorphous alumina

NASA Astrophysics Data System (ADS)

Lebedeva, Irina V.; Paz, Alejandro P.; Tokatly, Ilya V.; Rubio, Angel

2014-03-01

The decades of research on thermal properties of amorphous solids at temperatures below 1 K suggest that their anomalous behaviour can be related to quantum mechanical tunneling of atoms between two nearly equivalent states that can be described as a two-level system (TLS). This theory is also supported by recent studies on microwave spectroscopy of superconducting qubits. However, the microscopic nature of the TLS remains unknown. To identify structural motifs for TLSs in amorphous alumina we have performed extensive classical molecular dynamics simulations. Several bistable motifs with only one or two atoms jumping by considerable distance ~ 0.5 Å were found at T=25 K. Accounting for the surrounding environment relaxation was shown to be important up to distances ~ 7 Å. The energy asymmetry and barrier for the detected motifs lied in the ranges 0.5 - 2 meV and 4 - 15 meV, respectively, while their density was about 1 motif per 10 000 atoms. Tuning of motif asymmetry by strain was demonstrated with the coupling coefficient below 1 eV. The tunnel splitting for the symmetrized motifs was estimated on the order of 0.1 meV. The discovered motifs are in good agreement with the available experimental data. The financial support from the Marie Curie Fellowship PIIF-GA-2012-326435 (RespSpatDisp) is gratefully acknowledged.
Pathogen recognition of a novel C-type lectin from Marsupenaeus japonicus reveals the divergent sugar-binding specificity of QAP motif.

PubMed

Alenton, Rod Russel R; Koiwai, Keiichiro; Miyaguchi, Kohei; Kondo, Hidehiro; Hirono, Ikuo

2017-04-04

C-type lectins (CTLs) are calcium-dependent carbohydrate-binding proteins known to assist the innate immune system as pattern recognition receptors (PRRs). The binding specificity of CTLs lies in the motif of their carbohydrate recognition domain (CRD), the tripeptide motifs EPN and QPD bind to mannose and galactose, respectively. However, variants of these motifs were discovered including a QAP sequence reported in shrimp believed to have the same carbohydrate specificity as QPD. Here, we characterized a novel C-type lectin (MjGCTL) possessing a CRD with a QAP motif. The recombinant MjGCTL has a calcium-dependent agglutinating capability against both Gram-negative and Gram-positive bacteria, and its sugar specificity did not involve either mannose or galactose. In an encapsulation assay, agarose beads coated with rMjGCTL were immediately encapsulated from 0 h followed by melanization at 4 h post-incubation with hemocytes. These results confirm that MjGCTL functions as a classical CTL. The structure of QAP motif and carbohydrate-specificity of rMjGCTL was found to be different to both EPN and QPD, suggesting that QAP is a new motif. Furthermore, MjGCTL acts as a PRR binding to hemocytes to activate their adherent state and initiate encapsulation.
Pathogen recognition of a novel C-type lectin from Marsupenaeus japonicus reveals the divergent sugar-binding specificity of QAP motif

PubMed Central

Alenton, Rod Russel R.; Koiwai, Keiichiro; Miyaguchi, Kohei; Kondo, Hidehiro; Hirono, Ikuo

2017-01-01

C-type lectins (CTLs) are calcium-dependent carbohydrate-binding proteins known to assist the innate immune system as pattern recognition receptors (PRRs). The binding specificity of CTLs lies in the motif of their carbohydrate recognition domain (CRD), the tripeptide motifs EPN and QPD bind to mannose and galactose, respectively. However, variants of these motifs were discovered including a QAP sequence reported in shrimp believed to have the same carbohydrate specificity as QPD. Here, we characterized a novel C-type lectin (MjGCTL) possessing a CRD with a QAP motif. The recombinant MjGCTL has a calcium-dependent agglutinating capability against both Gram-negative and Gram-positive bacteria, and its sugar specificity did not involve either mannose or galactose. In an encapsulation assay, agarose beads coated with rMjGCTL were immediately encapsulated from 0 h followed by melanization at 4 h post-incubation with hemocytes. These results confirm that MjGCTL functions as a classical CTL. The structure of QAP motif and carbohydrate-specificity of rMjGCTL was found to be different to both EPN and QPD, suggesting that QAP is a new motif. Furthermore, MjGCTL acts as a PRR binding to hemocytes to activate their adherent state and initiate encapsulation. PMID:28374848
Conformational Dissection of a Viral Intrinsically Disordered Domain Involved in Cellular Transformation

PubMed Central

Perrone, Sebastián; Salvay, Andres G.; Chemes, Lucía B.; de Prat-Gay, Gonzalo

2013-01-01

Intrinsic disorder is abundant in viral genomes and provides conformational plasticity to its protein products. In order to gain insight into its structure-function relationships, we carried out a comprehensive analysis of structural propensities within the intrinsically disordered N-terminal domain from the human papillomavirus type-16 E7 oncoprotein (E7N). Two E7N segments located within the conserved CR1 and CR2 regions present transient α-helix structure. The helix in the CR1 region spans residues L8 to L13 and overlaps with the E2F mimic linear motif. The second helix, located within the highly acidic CR2 region, presents a pH-dependent structural transition. At neutral pH the helix spans residues P17 to N29, which include the retinoblastoma tumor suppressor LxCxE binding motif (residues 21–29), while the acidic CKII-PEST region spanning residues E33 to I38 populates polyproline type II (PII) structure. At pH 5.0, the CR2 helix propagates up to residue I38 at the expense of loss of PII due to charge neutralization of acidic residues. Using truncated forms of HPV-16 E7, we confirmed that pH-induced changes in α-helix content are governed by the intrinsically disordered E7N domain. Interestingly, while at both pH the region encompassing the LxCxE motif adopts α-helical structure, the isolated 21–29 fragment including this stretch is unable to populate an α-helix even at high TFE concentrations. Thus, the E7N domain can populate dynamic but discrete structural ensembles by sampling α-helix-coil-PII-ß-sheet structures. This high plasticity may modulate the exposure of linear binding motifs responsible for its multi-target binding properties, leading to interference with key cell signaling pathways and eventually to cellular transformation by the virus. PMID:24086265
Trithiocarbonates: exploration of a new head group for HDAC inhibitors.

PubMed

Dehmel, Florian; Ciossek, Thomas; Maier, Thomas; Weinbrenner, Steffen; Schmidt, Beate; Zoche, Martin; Beckers, Thomas

2007-09-01

Inhibition of histone deacetylases class I/II enzymes is a new, promising approach for cancer therapy. In the present study, we disclose a new structural class of HDAC inhibitors with the trithiocarbonate motif. A clear structure-activity-relationship was obtained for the cap-linker motif and the putative Zn(2+) complexing head group. Selected analogs display potent inhibition of HDAC enzymatic activity and a cellular potency comparable to that of suberoylanilide hydroxamic acid (SAHA), recently approved for treatment of patients with advanced cutaneous T-cell lymphoma.
Reversible conformational switching of i-motif DNA studied by fluorescence spectroscopy.

PubMed

Choi, Jungkweon; Majima, Tetsuro

2013-01-01

Non-B DNAs, which can form unique structures other than double helix of B-DNA, have attracted considerable attention from scientists in various fields including biology, chemistry and physics etc. Among them, i-motif DNA, which is formed from cytosine (C)-rich sequences found in telomeric DNA and the promoter region of oncogenes, has been extensively investigated as a signpost and controller for the oncogene expression at the transcription level and as a promising material in nanotechnology. Fluorescence techniques such as fluorescence resonance energy transfer (FRET) and the fluorescence quenching are important for studying DNA and in particular for the visualization of reversible conformational switching of i-motif DNA that is triggered by the protonation. Here, we review the latest studies on the conformational dynamics of i-motif DNA as well as the application of FRET and fluorescence quenching techniques to the visualization of reversible conformational switching of i-motif DNA in nano-biotechnology. © 2013 Wiley Periodicals, Inc. Photochemistry and Photobiology © 2013 The American Society of Photobiology.
Searching RNA motifs and their intermolecular contacts with constraint networks.

PubMed

Thébault, P; de Givry, S; Schiex, T; Gaspin, C

2006-09-01

Searching RNA gene occurrences in genomic sequences is a task whose importance has been renewed by the recent discovery of numerous functional RNA, often interacting with other ligands. Even if several programs exist for RNA motif search, none exists that can represent and solve the problem of searching for occurrences of RNA motifs in interaction with other molecules. We present a constraint network formulation of this problem. RNA are represented as structured motifs that can occur on more than one sequence and which are related together by possible hybridization. The implemented tool MilPat is used to search for several sRNA families in genomic sequences. Results show that MilPat allows to efficiently search for interacting motifs in large genomic sequences and offers a simple and extensible framework to solve such problems. New and known sRNA are identified as H/ACA candidates in Methanocaldococcus jannaschii. http://carlit.toulouse.inra.fr/MilPaT/MilPat.pl.
A frequent, GxxxG-mediated, transmembrane association motif is optimized for the formation of interhelical Cα–H hydrogen bonds

PubMed Central

Mueller, Benjamin K.; Subramaniam, Sabareesh; Senes, Alessandro

2014-01-01

Carbon hydrogen bonds between Cα–H donors and carbonyl acceptors are frequently observed between transmembrane helices (Cα–H···O=C). Networks of these interactions occur often at helix−helix interfaces mediated by GxxxG and similar patterns. Cα–H hydrogen bonds have been hypothesized to be important in membrane protein folding and association, but evidence that they are major determinants of helix association is still lacking. Here we present a comprehensive geometric analysis of homodimeric helices that demonstrates the existence of a single region in conformational space with high propensity for Cα–H···O=C hydrogen bond formation. This region corresponds to the most frequent motif for parallel dimers, GASright, whose best-known example is glycophorin A. The finding suggests a causal link between the high frequency of occurrence of GASright and its propensity for carbon hydrogen bond formation. Investigation of the sequence dependency of the motif determined that Gly residues are required at specific positions where only Gly can act as a donor with its “side chain” Hα. Gly also reduces the steric barrier for non-Gly amino acids at other positions to act as Cα donors, promoting the formation of cooperative hydrogen bonding networks. These findings offer a structural rationale for the occurrence of GxxxG patterns at the GASright interface. The analysis identified the conformational space and the sequence requirement of Cα–H···O=C mediated motifs; we took advantage of these results to develop a structural prediction method. The resulting program, CATM, predicts ab initio the known high-resolution structures of homodimeric GASright motifs at near-atomic level. PMID:24569864
Signature motif-guided identification of receptors for peptide hormones essential for root meristem growth.

PubMed

Song, Wen; Liu, Li; Wang, Jizong; Wu, Zhen; Zhang, Heqiao; Tang, Jiao; Lin, Guangzhong; Wang, Yichuan; Wen, Xing; Li, Wenyang; Han, Zhifu; Guo, Hongwei; Chai, Jijie

2016-06-01

Peptide-mediated cell-to-cell signaling has crucial roles in coordination and definition of cellular functions in plants. Peptide-receptor matching is important for understanding the mechanisms underlying peptide-mediated signaling. Here we report the structure-guided identification of root meristem growth factor (RGF) receptors important for plant development. An assay based on a signature ligand recognition motif (Arg-x-Arg) conserved in a subfamily of leucine-rich repeat receptor kinases (LRR-RKs) identified the functionally uncharacterized LRR-RK At4g26540 as a receptor of RGF1 (RGFR1). We further solved the crystal structure of RGF1 in complex with the LRR domain of RGFR1 at a resolution of 2.6 Å, which reveals that the Arg-x-Gly-Gly (RxGG) motif is responsible for specific recognition of the sulfate group of RGF1 by RGFR1. Based on the RxGG motif, we identified additional four RGFRs. Participation of the five RGFRs in RGF-induced signaling is supported by biochemical and genetic data. We also offer evidence showing that SERKs function as co-receptors for RGFs. Taken together, our study identifies RGF receptors and co-receptors that can link RGF signals with their downstream components and provides a proof of principle for structure-based matching of LRR-RKs with their peptide ligands.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Schürpf, Thomas; Chen, Qiang; Liu, Jin-huan

Developmental endothelial cell locus-1 (Del-1) glycoprotein is secreted by endothelial cells and a subset of macrophages. Del-1 plays a regulatory role in vascular remodeling and functions in innate immunity through interaction with integrin {alpha}{sub V}{beta}{sub 3}. Del-1 contains 3 epidermal growth factor (EGF)-like repeats and 2 discoidin-like domains. An Arg-Gly-Asp (RGD) motif in the second EGF domain (EGF2) mediates adhesion by endothelial cells and phagocytes. We report the crystal structure of its 3 EGF domains. The RGD motif of EGF2 forms a type II' {beta} turn at the tip of a long protruding loop, dubbed the RGD finger. Whereas EGF2more » and EGF3 constitute a rigid rod via an interdomain calcium ion binding site, the long linker between EGF1 and EGF2 lends considerable flexibility to EGF1. Two unique O-linked glycans and 1 N-linked glycan locate to the opposite side of EGF2 from the RGD motif. These structural features favor integrin binding of the RGD finger. Mutagenesis data confirm the importance of having the RGD motif at the tip of the RGD finger. A database search for EGF domain sequences shows that this RGD finger is likely an evolutionary insertion and unique to the EGF domain of Del-1 and its homologue milk fat globule-EGF 8. The RGD finger of Del-1 is a unique structural feature critical for integrin binding.« less
Structural and functional studies of a phosphatidic acid-binding antifungal plant defensin MtDef4: Identification of an RGFRRR motif governing fungal cell entry

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sagaram, Uma S.; El-Mounadi, Kaoutar; Buchko, Garry W.

A highly conserved plant defensin MtDef4 potently inhibits the growth of a filamentous fungus Fusarium graminearum. MtDef4 is internalized by cells of F. graminearum. To determine its mechanism of fungal cell entry and antifungal action, NMR solution structure of MtDef4 has been determined. The analysis of its structure has revealed a positively charged patch on the surface of the protein consisting of arginine residues in its γ-core signature, a major determinant of the antifungal activity of MtDef4. Here, we report functional analysis of the RGFRRR motif of the γ-core signature of MtDef4. The replacement of RGFRRR to AAAARR or tomore » RGFRAA not only abolishes fungal cell entry but also results in loss of the antifungal activity of MtDef4. MtDef4 binds strongly to phosphatidic acid (PA), a precursor for the biosynthesis of membrane phospholipids and a signaling lipid known to recruit cytosolic proteins to membranes. Mutations of RGFRRR which abolish fungal cell entry of MtDef4 also impair its binding to PA. Our results suggest that RGFRRR motif is a translocation signal for entry of MtDef4 into fungal cells and that this positively charged motif likely mediates interaction of this defensin with PA as part of its antifungal action.« less
Regions of extreme synonymous codon selection in mammalian genes

PubMed Central

Schattner, Peter; Diekhans, Mark

2006-01-01

Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
Motif structure and cooperation in real-world complex networks

NASA Astrophysics Data System (ADS)

Salehi, Mostafa; Rabiee, Hamid R.; Jalili, Mahdi

2010-12-01

Networks of dynamical nodes serve as generic models for real-world systems in many branches of science ranging from mathematics to physics, technology, sociology and biology. Collective behavior of agents interacting over complex networks is important in many applications. The cooperation between selfish individuals is one of the most interesting collective phenomena. In this paper we address the interplay between the motifs’ cooperation properties and their abundance in a number of real-world networks including yeast protein-protein interaction, human brain, protein structure, email communication, dolphins’ social interaction, Zachary karate club and Net-science coauthorship networks. First, the amount of cooperativity for all possible undirected subgraphs with three to six nodes is calculated. To this end, the evolutionary dynamics of the Prisoner’s Dilemma game is considered and the cooperativity of each subgraph is calculated as the percentage of cooperating agents at the end of the simulation time. Then, the three- to six-node motifs are extracted for each network. The significance of the abundance of a motif, represented by a Z-value, is obtained by comparing them with some properly randomized versions of the original network. We found that there is always a group of motifs showing a significant inverse correlation between their cooperativity amount and Z-value, i.e. the more the Z-value the less the amount of cooperativity. This suggests that networks composed of well-structured units do not have good cooperativity properties.
Evolution of genes and repeats in the Nimrod superfamily.

PubMed

Somogyi, Kálmán; Sipos, Botond; Pénzes, Zsolt; Kurucz, Eva; Zsámboki, János; Hultmark, Dan; Andó, István

2008-11-01

The recently identified Nimrod superfamily is characterized by the presence of a special type of EGF repeat, the NIM repeat, located right after a typical CCXGY/W amino acid motif. On the basis of structural features, nimrod genes can be divided into three types. The proteins encoded by Draper-type genes have an EMI domain at the N-terminal part and only one copy of the NIM motif, followed by a variable number of EGF-like repeats. The products of Nimrod B-type and Nimrod C-type genes (including the eater gene) have different kinds of N-terminal domains, and lack EGF-like repeats but contain a variable number of NIM repeats. Draper and Nimrod C-type (but not Nimrod B-type) proteins carry a transmembrane domain. Several members of the superfamily were claimed to function as receptors in phagocytosis and/or binding of bacteria, which indicates an important role in the cellular immunity and the elimination of apoptotic cells. In this paper, the evolution of the Nimrod superfamily is studied with various methods on the level of genes and repeats. A hypothesis is presented in which the NIM repeat, along with the EMI domain, emerged by structural reorganizations at the end of an EGF-like repeat chain, suggesting a mechanism for the formation of novel types of repeats. The analyses revealed diverse evolutionary patterns in the sequences containing multiple NIM repeats. Although in the Nimrod B and Nimrod C proteins show characteristics of independent evolution, many internal NIM repeats in Eater sequences seem to have undergone concerted evolution. An analysis of the nimrod genes has been performed using phylogenetic and other methods and an evolutionary scenario of the origin and diversification of the Nimrod superfamily is proposed. Our study presents an intriguing example how the evolution of multigene families may contribute to the complexity of the innate immune response.
A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

PubMed Central

2010-01-01

Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches. PMID:20626840
DNA motifs associated with aberrant CpG island methylation.

PubMed

Feltus, F Alex; Lee, Eva K; Costello, Joseph F; Plass, Christoph; Vertino, Paula M

2006-05-01

Epigenetic silencing involving the aberrant methylation of promoter region CpG islands is widely recognized as a tumor suppressor silencing mechanism in cancer. However, the molecular pathways underlying aberrant DNA methylation remain elusive. Recently we showed that, on a genome-wide level, CpG island loci differ in their intrinsic susceptibility to aberrant methylation and that this susceptibility can be predicted based on underlying sequence context. These data suggest that there are sequence/structural features that contribute to the protection from or susceptibility to aberrant methylation. Here we use motif elicitation coupled with classification techniques to identify DNA sequence motifs that selectively define methylation-prone or methylation-resistant CpG islands. Motifs common to 28 methylation-prone or 47 methylation-resistant CpG island-containing genomic fragments were determined using the MEME and MAST algorithms (). The five most discriminatory motifs derived from methylation-prone sequences were found to be associated with CpG islands in general and were nonrandomly distributed throughout the genome. In contrast, the eight most discriminatory motifs derived from the methylation-resistant CpG islands were randomly distributed throughout the genome. Interestingly, this latter group tended to associate with Alu and other repetitive sequences. Used together, the frequency of occurrence of these motifs successfully discriminated methylation-prone and methylation-resistant CpG island groups with an accuracy of 87% after 10-fold cross-validation. The motifs identified here are candidate methylation-targeting or methylation-protection DNA sequences.
Structural Basis of Transcriptional Regulation of the Proline Utilization Regulon by Multifunctional PutA

PubMed Central

Zhou, Yuzhen; Larson, John D.; Bottoms, Christopher A.; Arturo, Emilia C.; Henzl, Michael T.; Jenkins, Jermaine L.; Nix, Jay C.; Becker, Donald F.; Tanner, John J.

2009-01-01

Summary The multifunctional Escherichia coli PutA flavoprotein functions as both a membrane-associated proline catabolic enzyme and transcriptional repressor of the proline utilization genes putA and putP. To better understand the mechanism of transcriptional regulation by PutA, we have mapped the put regulatory region, determined a crystal structure of the PutA ribbon-helix-helix domain (PutA52) complexed with DNA and examined the thermodynamics of DNA binding to PutA52. Five operator sites, each containing the sequence motif 5′-GTTGCA-3′, were identified using gel-shift analysis. Three of the sites are shown to be critical for repression of putA, whereas the two other sites are important for repression of putP. The 2.25 Å resolution crystal structure of PutA52 bound to one of the operators (operator 2, 21-bp) shows that the protein contacts a 9-bp fragment, corresponding to the GTTGCA consensus motif plus three flanking base pairs. Since the operator sequences differ in flanking bases, the structure implies that PutA may have different affinities for the five operators. This hypothesis was explored using isothermal titration calorimetry. The binding of PutA52 to operator 2 is exothermic with an enthalpy of −1.8 kcal/mol and a dissociation constant of 210 nM. Substitution of the flanking bases of operator 4 into operator 2 results in an unfavorable enthalpy of 0.2 kcal/mol and 15-fold lower affinity, which shows that base pairs outside of the consensus motif impact binding. The structural and thermodynamic data suggest that hydrogen bonds between Lys9 and bases adjacent to the GTTGCA motif contribute to transcriptional regulation by fine-tuning the affinity of PutA for put control operators. PMID:18586269
Structural and Histone Binding Ability Characterizations of Human PWWP Domains

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Hong; Zeng, Hong; Lam, Robert

2013-09-25

The PWWP domain was first identified as a structural motif of 100-130 amino acids in the WHSC1 protein and predicted to be a protein-protein interaction domain. It belongs to the Tudor domain 'Royal Family', which consists of Tudor, chromodomain, MBT and PWWP domains. While Tudor, chromodomain and MBT domains have long been known to bind methylated histones, PWWP was shown to exhibit histone binding ability only until recently. The PWWP domain has been shown to be a DNA binding domain, but sequence analysis and previous structural studies show that the PWWP domain exhibits significant similarity to other 'Royal Family' members,more » implying that the PWWP domain has the potential to bind histones. In order to further explore the function of the PWWP domain, we used the protein family approach to determine the crystal structures of the PWWP domains from seven different human proteins. Our fluorescence polarization binding studies show that PWWP domains have weak histone binding ability, which is also confirmed by our NMR titration experiments. Furthermore, we determined the crystal structures of the BRPF1 PWWP domain in complex with H3K36me3, and HDGF2 PWWP domain in complex with H3K79me3 and H4K20me3. PWWP proteins constitute a new family of methyl lysine histone binders. The PWWP domain consists of three motifs: a canonical {beta}-barrel core, an insertion motif between the second and third {beta}-strands and a C-terminal {alpha}-helix bundle. Both the canonical {beta}-barrel core and the insertion motif are directly involved in histone binding. The PWWP domain has been previously shown to be a DNA binding domain. Therefore, the PWWP domain exhibits dual functions: binding both DNA and methyllysine histones.« less
Deciphering common recognition principles of nucleoside mono/di and tri-phosphates binding in diverse proteins via structural matching of their binding sites.

PubMed

Bhagavat, Raghu; Srinivasan, Narayanaswamy; Chandra, Nagasuma

2017-09-01

Nucleoside triphosphate (NTP) ligands are of high biological importance and are essential for all life forms. A pre-requisite for them to participate in diverse biochemical processes is their recognition by diverse proteins. It is thus of great interest to understand the basis for such recognition in different proteins. Towards this, we have used a structural bioinformatics approach and analyze structures of 4677 NTP complexes available in Protein Data Bank (PDB). Binding sites were extracted and compared exhaustively using PocketMatch, a sensitive in-house site comparison algorithm, which resulted in grouping the entire dataset into 27 site-types. Each of these site-types represent a structural motif comprised of two or more residue conservations, derived using another in-house tool for superposing binding sites, PocketAlign. The 27 site-types could be grouped further into 9 super-types by considering partial similarities in the sites, which indicated that the individual site-types comprise different combinations of one or more site features. A scan across PDB using the 27 structural motifs determined the motifs to be specific to NTP binding sites, and a computational alanine mutagenesis indicated that residues identified to be highly conserved in the motifs are also most contributing to binding. Alternate orientations of the ligand in several site-types were observed and rationalized, indicating the possibility of some residues serving as anchors for NTP recognition. The presence of multiple site-types and the grouping of multiple folds into each site-type is strongly suggestive of convergent evolution. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. Proteins 2017; 85:1699-1712. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

Large scale structural optimization of trimetallic Cu-Au-Pt clusters up to 147 atoms

NASA Astrophysics Data System (ADS)

Wu, Genhua; Sun, Yan; Wu, Xia; Chen, Run; Wang, Yan

2017-10-01

The stable structures of Cu-Au-Pt clusters up to 147 atoms are optimized by using an improved adaptive immune optimization algorithm (AIOA-IC method), in which several motifs, such as decahedron, icosahedron, face centered cubic, sixfold pancake, and Leary tetrahedron, are randomly selected as the inner cores of the starting structures. The structures of Cu8AunPt30-n (n = 1-29), Cu8AunPt47-n (n = 1-46), and partial 75-, 79-, 100-, and 147-atom clusters are analyzed. Cu12Au93Pt42 cluster has onion-like Mackay icosahedral motif. The segregation phenomena of Cu, Au and Pt in clusters are explained by the atomic radius, surface energy, and cohesive energy.
Sr{sub 7}Ge{sub 6}, Ba{sub 7}Ge{sub 6} and Ba{sub 3}Sn{sub 2} -Three new binary compounds containing dumbbells and four-membered chains of tetrel atoms with considerable Ge-Ge {pi}-bonding character

DOE Office of Scientific and Technical Information (OSTI.GOV)

Siggelkow, Lisa; Hlukhyy, Viktor; Faessler, Thomas F., E-mail: thomas.faessler@lrz.tum.de

2012-07-15

The germanides Sr{sub 7}Ge{sub 6} and Ba{sub 7}Ge{sub 6} as well as the stannide Ba{sub 3}Sn{sub 2} were prepared by arc melting and annealing in welded tantalum ampoules using induction as well as resistance furnaces. The compounds were investigated by powder and single crystal X-ray diffraction. Sr{sub 7}Ge{sub 6} and Ba{sub 7}Ge{sub 6} crystallize in the Ca{sub 7}Sn{sub 6} structure type (space group Pmna, Z=4: a=7.777(2) A, b=23.595(4) A, c=8.563(2) A, wR{sub 2}=0.081 (all data), 2175 independent reflections, 64 variable parameters for Sr{sub 7}Ge{sub 6} and a=8.0853(6) A, b=24.545(2) A, c=8.9782(8) A, wR{sub 2}=0.085 (all data), 2307 independent reflections, 64more » variable parameters for Ba{sub 7}Ge{sub 6}). Ba{sub 3}Sn{sub 2} crystallizes in an own structure type with the space group P4{sub 3}2{sub 1}2, Z=4, a=6.6854(2) A, c=17.842(2) A, wR{sub 2}=0.037 (all data), 1163 independent reflections, 25 variable parameters. In Sr{sub 7}Ge{sub 6} and Ba{sub 7}Ge{sub 6} the Ge atoms are arranged as Ge{sub 2} dumbbells and Ge{sub 4} four-membered atom chains. Their crystal structures cannot be rationalized according to the (8-N) rule. In contrast, Ba{sub 3}Sn{sub 2} presents Sn{sub 2} dumbbells as a main structural motif and thereby can be described as an electron precise Zintl phase. The chemical bonding situation in these structures is discussed on the basis of partial and total Density Of States (DOS) curves, band structures including fatbands, topological analysis of the Electron Localization Function (ELF) as well as Bader analysis of the bond critical points using the programs TB-LMTO-ASA and WIEN2K. While Ba{sub 3}Sn{sub 2} reveals semiconducting behaviour, all germanides Ae{sub 7}Ge{sub 6} (Ae=Ca, Sr, and Ba) show metallic properties and a considerable {pi}-bonding character between the Ge atoms of the four-membered chains and the dumbbells. The {pi}-bonding character of the germanides is best reflected by the resonance hybrid structures {l_brace}[Ge-Ge]{sup 6-}/[Ge-{sup ....}Ge-{sup ....}Ge-{sup ....}Ge]{sup 8-}{r_brace}{r_reversible}{l_brace}[Ge=Ge]{sup 4-}/[Ge-Ge-Ge-Ge]{sup 10-}{r_brace}. - Graphical abstract: The structure of Ba{sub 3}Sn{sub 2} contains Sn{sub 2} dumbbells as a main structural motif and thereby can be described as an electron precise Zintl phase. Ge{sub 2} dumbbells and Ge{sub 4} four-membered atom chains are the predominant features in Sr{sub 7}Ge{sub 6} and Ba{sub 7}Ge{sub 6}. Their crystal structures cannot be rationalized according to the (8-N) rule. While Ba{sub 3}Sn{sub 2} reveals semiconducting behaviour, the germanides Ae{sub 7}Ge{sub 6} (Ae=Ca, Sr, and Ba) show metallic properties and a considerable {pi}-bonding character between the Ge atoms of the four-membered chains and the dumbbells. Highlights: Black-Right-Pointing-Pointer The germanides Sr{sub 7}Ge{sub 6} and Ba{sub 7}Ge{sub 6} as well as the stannide Ba{sub 3}Sn{sub 2} have been synthesized. Black-Right-Pointing-Pointer In Sr{sub 7}Ge{sub 6} and Ba{sub 7}Ge{sub 6} the Ge atoms are arranged as dumbbells and four-membered atom chains. Black-Right-Pointing-Pointer Ba{sub 3}Sn{sub 2} presents Sn{sub 2} dumbbells as a main structural motif. Black-Right-Pointing-Pointer The chemical bonding situation within these structures is discussed.« less
Correlated Mutation in the Evolution of Catalysis in Uracil DNA Glycosylase Superfamily

NASA Astrophysics Data System (ADS)

Xia, Bo; Liu, Yinling; Guevara, Jose; Li, Jing; Jilich, Celeste; Yang, Ye; Wang, Liangjiang; Dominy, Brian N.; Cao, Weiguo

2017-04-01

Enzymes in Uracil DNA glycosylase (UDG) superfamily are essential for the removal of uracil. Family 4 UDGa is a robust uracil DNA glycosylase that only acts on double-stranded and single-stranded uracil-containing DNA. Based on mutational, kinetic and modeling analyses, a catalytic mechanism involving leaving group stabilization by H155 in motif 2 and water coordination by N89 in motif 3 is proposed. Mutual Information analysis identifies a complexed correlated mutation network including a strong correlation in the EG doublet in motif 1 of family 4 UDGa and in the QD doublet in motif 1 of family 1 UNG. Conversion of EG doublet in family 4 Thermus thermophilus UDGa to QD doublet increases the catalytic efficiency by over one hundred-fold and seventeen-fold over the E41Q and G42D single mutation, respectively, rectifying the strong correlation in the doublet. Molecular dynamics simulations suggest that the correlated mutations in the doublet in motif 1 position the catalytic H155 in motif 2 to stabilize the leaving uracilate anion. The integrated approach has important implications in studying enzyme evolution and protein structure and function.
Analysis of the Effects of Polymorphism on Pollen Profilin Structural Functionality and the Generation of Conformational, T- and B-Cell Epitopes

PubMed Central

Jimenez-Lopez, Jose C.; Rodríguez-García, María I.; Alché, Juan D.

2013-01-01

An extensive polymorphism analysis of pollen profilin, a fundamental regulator of the actin cytoskeleton dynamics, has been performed with a major focus in 3D-folding maintenance, changes in the 2-D structural elements, surface residues involved in ligands-profilin interactions and functionality, and the generation of conformational and lineal B- and T-cell epitopes variability. Our results revealed that while the general fold is conserved among profilins, substantial structural differences were found, particularly affecting the special distribution and length of different 2-D structural elements (i.e. cysteine residues), characteristic loops and coils, and numerous micro-heterogeneities present in fundamental residues directly involved in the interacting motifs, and to some extension these residues nearby to the ligand-interacting areas. Differential changes as result of polymorphism might contribute to generate functional variability among the plethora of profilin isoforms present in the olive pollen from different genetic background (olive cultivars), and between plant species, since biochemical interacting properties and binding affinities to natural ligands may be affected, particularly the interactions with different actin isoforms and phosphoinositides lipids species. Furthermore, conspicuous variability in lineal and conformational epitopes was found between profilins belonging to the same olive cultivar, and among different cultivars as direct implication of sequences polymorphism. The variability of the residues taking part of IgE-binding epitopes might be the final responsible of the differences in cross-reactivity among olive pollen cultivars, among pollen and plant-derived food allergens, as well as between distantly related pollen species, leading to a variable range of allergy reactions among atopic patients. Identification and analysis of commonly shared and specific epitopes in profilin isoforms is essential to gain knowledge about the interacting surface of these epitopes, and for a better understanding of immune responses, helping design and development of rational and effective immunotherapy strategies for the treatment of allergy diseases. PMID:24146818
Structural and functional aspects of winged-helix domains at the core of transcription initiation complexes.

PubMed

Teichmann, Martin; Dumay-Odelot, Hélène; Fribourg, Sébastien

2012-01-01

The winged helix (WH) domain is found in core components of transcription systems in eukaryotes and prokaryotes. It represents a sub-class of the helix-turn-helix motif. The WH domain participates in establishing protein-DNA and protein-protein-interactions. Here, we discuss possible explanations for the enrichment of this motif in transcription systems.
Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs

PubMed Central

2011-01-01

Background Mapping protein primary sequences to their three dimensional folds referred to as the 'second genetic code' remains an unsolved scientific problem. A crucial part of the problem concerns the geometrical specificity in side chain association leading to densely packed protein cores, a hallmark of correctly folded native structures. Thus, any model of packing within proteins should constitute an indispensable component of protein folding and design. Results In this study an attempt has been made to find, characterize and classify recurring patterns in the packing of side chain atoms within a protein which sustains its native fold. The interaction of side chain atoms within the protein core has been represented as a contact network based on the surface complementarity and overlap between associating side chain surfaces. Some network topologies definitely appear to be preferred and they have been termed 'packing motifs', analogous to super secondary structures in proteins. Study of the distribution of these motifs reveals the ubiquitous presence of typical smaller graphs, which appear to get linked or coalesce to give larger graphs, reminiscent of the nucleation-condensation model in protein folding. One such frequently occurring motif, also envisaged as the unit of clustering, the three residue clique was invariably found in regions of dense packing. Finally, topological measures based on surface contact networks appeared to be effective in discriminating sequences native to a specific fold amongst a set of decoys. Conclusions Out of innumerable topological possibilities, only a finite number of specific packing motifs are actually realized in proteins. This small number of motifs could serve as a basis set in the construction of larger networks. Of these, the triplet clique exhibits distinct preference both in terms of composition and geometry. PMID:21605466
Comparative genomics of pyridoxal 5′-phosphate-dependent transcription factor regulons in Bacteria

PubMed Central

Suvorova, Inna A.

2016-01-01

The MocR-subfamily transcription factors (MocR-TFs) characterized by the GntR-family DNA-binding domain and aminotransferase-like sensory domain are broadly distributed among certain lineages of Bacteria. Characterized MocR-TFs bind pyridoxal 5′-phosphate (PLP) and control transcription of genes involved in PLP, gamma aminobutyric acid (GABA) and taurine metabolism via binding specific DNA operator sites. To identify putative target genes and DNA binding motifs of MocR-TFs, we performed comparative genomics analysis of over 250 bacterial genomes. The reconstructed regulons for 825 MocR-TFs comprise structural genes from over 200 protein families involved in diverse biological processes. Using the genome context and metabolic subsystem analysis we tentatively assigned functional roles for 38 out of 86 orthologous groups of studied regulators. Most of these MocR-TF regulons are involved in PLP metabolism, as well as utilization of GABA, taurine and ectoine. The remaining studied MocR-TF regulators presumably control genes encoding enzymes involved in reduction/oxidation processes, various transporters and PLP-dependent enzymes, for example aminotransferases. Predicted DNA binding motifs of MocR-TFs are generally similar in each orthologous group and are characterized by two to four repeated sequences. Identified motifs were classified according to their structures. Motifs with direct and/or inverted repeat symmetry constitute the majority of inferred DNA motifs, suggesting preferable TF dimerization in head-to-tail or head-to-head configuration. The obtained genomic collection of in silico reconstructed MocR-TF motifs and regulons in Bacteria provides a basis for future experimental characterization of molecular mechanisms for various regulators in this family. PMID:28348826
Synchronous high-frequency oscillations in inhibitory-dominant network motifs consisting of three dentate gyrus-CA3 systems

NASA Astrophysics Data System (ADS)

Zhang, Liyuan; Fan, Denggui; Wang, Qingyun

2018-06-01

Studies on the structural-functional connectomes of the human brain have demonstrated the existence of synchronous firings in a specific brain network motif. In particular, synchronization of high-frequency oscillations (HFOs) has been observed in the experimental data sets of temporal lobe epilepsy (TLE). In addition, both clinical and experimental evidences have accumulated to demonstrate the effect of electrical stimulation on TLE, which, however, remains largely unexplored. In this work, we first employ our previously proposed dentate gyrus (DG)-CA3 network model to investigate the influence of an external electrical stimulus on the HFO transitions. The results indicate that the reinforcing stimulus can induce the HFO transitions of the DG-CA3 system from the gamma band to the fast ripples band. Along with that, the consistent oscillations of neurons within DG-CA3 can also be enhanced with the increasing of stimulus. Then, we expand into a simple motif of three coupled DG-CA3 systems in both the feedforward inhibition and feedback inhibition connections, to investigate the synchronous evolutions of HFOs by regulating both the stimulation strength and inhibitory function. It is shown that the comprehensive effects, which lead to band transition, are independent of the motif configurations. The enhanced external electrical stimulus weakens the synchronism and correlation of connected motifs. In contrast, we demonstrate that the increased inhibitory coupling could facilitate correlation to some extent. Overall, our work highlights the possible origin of synchronous HFOs of hippocampal motifs governed by external inputs and inhibitory connection, which might contribute to a better understanding of the interplay between synchronization dynamics and epileptic structure in the human brain.
Evolutionary relationships in the ilarviruses: nucleotide sequence of prunus necrotic ringspot virus RNA 3.

PubMed

Sánchez-Navarro, J A; Pallás, V

1997-01-01

The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of A1MV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, A1MV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
Mannose-recognition mutant of the galactose/N-acetylgalactosamine-specific C-type lectin CEL-I engineered by site-directed mutagenesis.

PubMed

Moriuchi, Hiromi; Unno, Hideaki; Goda, Shuichiro; Tateno, Hiroaki; Hirabayashi, Jun; Hatakeyama, Tomomitsu

2015-07-01

CEL-I is a galactose/N-acetylgalactosamine-specific C-type lectin isolated from the sea cucumber Cucumaria echinata. Its carbohydrate-binding site contains a QPD (Gln-Pro-Asp) motif, which is generally recognized as the galactose specificity-determining motif in the C-type lectins. In our previous study, replacement of the QPD motif by an EPN (Glu-Pro-Asn) motif led to a weak binding affinity for mannose. Therefore, we examined the effects of an additional mutation in the carbohydrate-binding site on the specificity of the lectin. Trp105 of EPN-CEL-I was replaced by a histidine residue using site-directed mutagenesis, and the binding affinity of the resulting mutant, EPNH-CEL-I, was examined by sugar-polyamidoamine dendrimer assay, isothermal titration calorimetry, and glycoconjugate microarray analysis. Tertiary structure of the EPNH-CEL-I/mannose complex was determined by X-ray crystallographic analysis. Sugar-polyamidoamine dendrimer assay and glycoconjugate microarray analysis revealed a drastic change in the specificity of EPNH-CEL-I from galactose/N-acetylgalactosamine to mannose. The association constant of EPNH-CEL-I for mannose was determined to be 3.17×10(3) M(-1) at 25°C. Mannose specificity of EPNH-CEL-I was achieved by stabilization of the binding of mannose in a correct orientation, in which the EPN motif can form proper hydrogen bonds with 3- and 4-hydroxy groups of the bound mannose. Specificity of CEL-I can be engineered by mutating a limited number of amino acid residues in addition to the QPD/EPN motifs. Versatility of the C-type carbohydrate-recognition domain structure in the recognition of various carbohydrate chains could become a promising platform to develop novel molecular recognition proteins. Copyright © 2015 Elsevier B.V. All rights reserved.
Structural motifs of pre-nucleation clusters.

PubMed

Zhang, Y; Türkmen, I R; Wassermann, B; Erko, A; Rühl, E

2013-10-07

Structural motifs of pre-nucleation clusters prepared in single, optically levitated supersaturated aqueous aerosol microparticles containing CaBr2 as a model system are reported. Cluster formation is identified by means of X-ray absorption in the Br K-edge regime. The salt concentration beyond the saturation point is varied by controlling the humidity in the ambient atmosphere surrounding the 15-30 μm microdroplets. This leads to the formation of metastable supersaturated liquid particles. Distinct spectral shifts in near-edge spectra as a function of salt concentration are observed, in which the energy position of the Br K-edge is red-shifted by up to 7.1 ± 0.4 eV if the dilute solution is compared to the solid. The K-edge positions of supersaturated solutions are found between these limits. The changes in electronic structure are rationalized in terms of the formation of pre-nucleation clusters. This assumption is verified by spectral simulations using first-principle density functional theory and molecular dynamics calculations, in which structural motifs are considered, explaining the experimental results. These consist of solvated CaBr2 moieties, rather than building blocks forming calcium bromide hexahydrates, the crystal system that is formed by drying aqueous CaBr2 solutions.
Molecular dynamics simulations on the Tre1 G protein-coupled receptor: exploring the role of the arginine of the NRY motif in Tre1 structure

PubMed Central

2013-01-01

Background The arginine of the D/E/NRY motif in Rhodopsin family G protein-coupled receptors (GPCRs) is conserved in 96% of these proteins. In some GPCRs, this arginine in transmembrane 3 can form a salt bridge with an aspartic acid or glutamic acid in transmembrane 6. The Drosophila melanogaster GPCR Trapped in endoderm-1 (Tre1) is required for normal primordial germ cell migration. In a mutant form of the protein, Tre1sctt, eight amino acids RYILIACH are missing, resulting in a severe disruption of primordial germ cell development. The impact of the loss of these amino acids on Tre1 structure is unknown. Since the missing amino acids in Tre1sctt include the arginine that is part of the D/E/NRY motif in Tre1, molecular dynamics simulations were performed to explore the hypothesis that these amino acids are involved in salt bridge formation and help maintain Tre1 structure. Results Structural predictions of wild type Tre1 (Tre1+) and Tre1sctt were subjected to over 250 ns of molecular dynamics simulations. The ability of the model systems to form a salt bridge between the arginine of the D/E/NRY motif and an aspartic acid residue in transmembrane 6 was analyzed. The results indicate that a stable salt bridge can form in the Tre1+ systems and a weak salt bridge or no salt bridge, using an alternative arginine, is likely in the Tre1sctt systems. Conclusions The weak salt bridge or lack of a salt bridge in the Tre1sctt systems could be one possible explanation for the disrupted function of Tre1sctt in primordial germ cell migration. These results provide a framework for studying the importance of the arginine of the D/E/NRY motif in the structure and function of other GPCRs that are involved in cell migration, such as CXCR4 in the mouse, zebrafish, and chicken. PMID:24044607
Molecular dynamics simulations on the Tre1 G protein-coupled receptor: exploring the role of the arginine of the NRY motif in Tre1 structure.

PubMed

Pruitt, Margaret M; Lamm, Monica H; Coffman, Clark R

2013-09-18

The arginine of the D/E/NRY motif in Rhodopsin family G protein-coupled receptors (GPCRs) is conserved in 96% of these proteins. In some GPCRs, this arginine in transmembrane 3 can form a salt bridge with an aspartic acid or glutamic acid in transmembrane 6. The Drosophila melanogaster GPCR Trapped in endoderm-1 (Tre1) is required for normal primordial germ cell migration. In a mutant form of the protein, Tre1sctt, eight amino acids RYILIACH are missing, resulting in a severe disruption of primordial germ cell development. The impact of the loss of these amino acids on Tre1 structure is unknown. Since the missing amino acids in Tre1sctt include the arginine that is part of the D/E/NRY motif in Tre1, molecular dynamics simulations were performed to explore the hypothesis that these amino acids are involved in salt bridge formation and help maintain Tre1 structure. Structural predictions of wild type Tre1 (Tre1+) and Tre1sctt were subjected to over 250 ns of molecular dynamics simulations. The ability of the model systems to form a salt bridge between the arginine of the D/E/NRY motif and an aspartic acid residue in transmembrane 6 was analyzed. The results indicate that a stable salt bridge can form in the Tre1+ systems and a weak salt bridge or no salt bridge, using an alternative arginine, is likely in the Tre1sctt systems. The weak salt bridge or lack of a salt bridge in the Tre1sctt systems could be one possible explanation for the disrupted function of Tre1sctt in primordial germ cell migration. These results provide a framework for studying the importance of the arginine of the D/E/NRY motif in the structure and function of other GPCRs that are involved in cell migration, such as CXCR4 in the mouse, zebrafish, and chicken.
Structural model of dodecameric heat-shock protein Hsp21: Flexible N-terminal arms interact with client proteins while C-terminal tails maintain the dodecamer and chaperone activity.

PubMed

Rutsdottir, Gudrun; Härmark, Johan; Weide, Yoran; Hebert, Hans; Rasmussen, Morten I; Wernersson, Sven; Respondek, Michal; Akke, Mikael; Højrup, Peter; Koeck, Philip J B; Söderberg, Christopher A G; Emanuelsson, Cecilia

2017-05-12

Small heat-shock proteins (sHsps) prevent aggregation of thermosensitive client proteins in a first line of defense against cellular stress. The mechanisms by which they perform this function have been hard to define due to limited structural information; currently, there is only one high-resolution structure of a plant sHsp published, that of the cytosolic Hsp16.9. We took interest in Hsp21, a chloroplast-localized sHsp crucial for plant stress resistance, which has even longer N-terminal arms than Hsp16.9, with a functionally important and conserved methionine-rich motif. To provide a framework for investigating structure-function relationships of Hsp21 and understanding these sequence variations, we developed a structural model of Hsp21 based on homology modeling, cryo-EM, cross-linking mass spectrometry, NMR, and small-angle X-ray scattering. Our data suggest a dodecameric arrangement of two trimer-of-dimer discs stabilized by the C-terminal tails, possibly through tail-to-tail interactions between the discs, mediated through extended I X V X I motifs. Our model further suggests that six N-terminal arms are located on the outside of the dodecamer, accessible for interaction with client proteins, and distinct from previous undefined or inwardly facing arms. To test the importance of the I X V X I motif, we created the point mutant V181A, which, as expected, disrupts the Hsp21 dodecamer and decreases chaperone activity. Finally, our data emphasize that sHsp chaperone efficiency depends on oligomerization and that client interactions can occur both with and without oligomer dissociation. These results provide a generalizable workflow to explore sHsps, expand our understanding of sHsp structural motifs, and provide a testable Hsp21 structure model to inform future investigations. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Proteolytic dissection of Zab, the Z-DNA-binding domain of human ADAR1

NASA Technical Reports Server (NTRS)

Schwartz, T.; Lowenhaupt, K.; Kim, Y. G.; Li, L.; Brown, B. A. 2nd; Herbert, A.; Rich, A.

1999-01-01

Zalpha is a peptide motif that binds to Z-DNA with high affinity. This motif binds to alternating dC-dG sequences stabilized in the Z-conformation by means of bromination or supercoiling, but not to B-DNA. Zalpha is part of the N-terminal region of double-stranded RNA adenosine deaminase (ADAR1), a candidate enzyme for nuclear pre-mRNA editing in mammals. Zalpha is conserved in ADAR1 from many species; in each case, there is a second similar motif, Zbeta, separated from Zalpha by a more divergent linker. To investigate the structure-function relationship of Zalpha, its domain structure was studied by limited proteolysis. Proteolytic profiles indicated that Zalpha is part of a domain, Zab, of 229 amino acids (residues 133-361 in human ADAR1). This domain contains both Zalpha and Zbeta as well as a tandem repeat of a 49-amino acid linker module. Prolonged proteolysis revealed a minimal core domain of 77 amino acids (positions 133-209), containing only Zalpha, which is sufficient to bind left-handed Z-DNA; however, the substrate binding is strikingly different from that of Zab. The second motif, Zbeta, retains its structural integrity only in the context of Zab and does not bind Z-DNA as a separate entity. These results suggest that Zalpha and Zbeta act as a single bipartite domain. In the presence of substrate DNA, Zab becomes more resistant to proteases, suggesting that it adopts a more rigid structure when bound to its substrate, possibly with conformational changes in parts of the protein.
Multilayer motif analysis of brain networks

NASA Astrophysics Data System (ADS)

Battiston, Federico; Nicosia, Vincenzo; Chavez, Mario; Latora, Vito

2017-04-01

In the last decade, network science has shed new light both on the structural (anatomical) and on the functional (correlations in the activity) connectivity among the different areas of the human brain. The analysis of brain networks has made possible to detect the central areas of a neural system and to identify its building blocks by looking at overabundant small subgraphs, known as motifs. However, network analysis of the brain has so far mainly focused on anatomical and functional networks as separate entities. The recently developed mathematical framework of multi-layer networks allows us to perform an analysis of the human brain where the structural and functional layers are considered together. In this work, we describe how to classify the subgraphs of a multiplex network, and we extend the motif analysis to networks with an arbitrary number of layers. We then extract multi-layer motifs in brain networks of healthy subjects by considering networks with two layers, anatomical and functional, respectively, obtained from diffusion and functional magnetic resonance imaging. Results indicate that subgraphs in which the presence of a physical connection between brain areas (links at the structural layer) coexists with a non-trivial positive correlation in their activities are statistically overabundant. Finally, we investigate the existence of a reinforcement mechanism between the two layers by looking at how the probability to find a link in one layer depends on the intensity of the connection in the other one. Showing that functional connectivity is non-trivially constrained by the underlying anatomical network, our work contributes to a better understanding of the interplay between the structure and function in the human brain.
Computational study of the fibril organization of polyglutamine repeats reveals a common motif identified in beta-helices.

PubMed

Zanuy, David; Gunasekaran, Kannan; Lesk, Arthur M; Nussinov, Ruth

2006-04-21

The formation of fibril aggregates by long polyglutamine sequences is assumed to play a major role in neurodegenerative diseases such as Huntington. Here, we model peptides rich in glutamine, through a series of molecular dynamics simulations. Starting from a rigid nanotube-like conformation, we have obtained a new conformational template that shares structural features of a tubular helix and of a beta-helix conformational organization. Our new model can be described as a super-helical arrangement of flat beta-sheet segments linked by planar turns or bends. Interestingly, our comprehensive analysis of the Protein Data Bank reveals that this is a common motif in beta-helices (termed beta-bend), although it has not been identified so far. The motif is based on the alternation of beta-sheet and helical conformation as the protein sequence is followed from the N to the C termini (beta-alpha(R)-beta-polyPro-beta). We further identify this motif in the ssNMR structure of the protofibril of the amyloidogenic peptide Abeta(1-40). The recurrence of the beta-bend suggests a general mode of connecting long parallel beta-sheet segments that would allow the growth of partially ordered fibril structures. The design allows the peptide backbone to change direction with a minimal loss of main chain hydrogen bonds. The identification of a coherent organization beyond that of the beta-sheet segments in different folds rich in parallel beta-sheets suggests a higher degree of ordered structure in protein fibrils, in agreement with their low solubility and dense molecular packing.
Competition between drum and quasi-planar structures in RhB18-: motifs for metallo-boronanotubes and metallo-borophenes.

PubMed

Jian, Tian; Li, Wan-Lu; Chen, Xin; Chen, Teng-Teng; Lopez, Gary V; Li, Jun; Wang, Lai-Sheng

2016-12-01

Metal-doped boron clusters provide new opportunities to design nanoclusters with interesting structures and bonding. A cobalt-doped boron cluster, CoB 18 - , has been observed recently to be planar and can be viewed as a motif for metallo-borophenes, whereas the D 9d drum isomer as a motif for metallo-boronanotubes is found to be much higher in energy. Hence, whether larger doped boron drums are possible is still an open question. Here we report that for RhB 18 - the drum and quasi-planar structures become much closer in energy and co-exist experimentally, revealing a competition between the metallo-boronanotube and metallo-borophene structures. Photoelectron spectroscopy of RhB 18 - shows a complicated spectral pattern, suggesting the presence of two isomers. Quantum chemistry studies indicate that the D 9d drum isomer and a quasi-planar isomer ( C s ) compete for the global minimum. The enhanced stability of the drum isomer in RhB 18 - is due to the less contracted Rh 4d orbitals, which can have favorable interactions with the B 18 drum motif. Chemical bonding analyses show that the quasi-planar isomer of RhB 18 - is aromatic with 10 π electrons, whereas the observed RhB 18 - drum cluster sets a new record for coordination number of eighteen among metal complexes. The current finding shows that the size of the boron drum can be tuned by appropriate metal dopants, suggesting that even larger boron drums with 5d, 6d transition metal, lanthanide or actinide metal atoms are possible.
Layered structures of organic/inorganic hybrid halide perovskites

NASA Astrophysics Data System (ADS)

Huan, Tran Doan; Tuoc, Vu Ngoc; Minh, Nguyen Viet

2016-03-01

Organic-inorganic hybrid halide perovskites, in which the A cations of an ABX3 perovskite are replaced by organic cations, may be used for photovoltaic and solar thermoelectric applications. In this contribution, we systematically study three lead-free hybrid perovskites, i.e., methylammonium tin iodide CH3NH3SnI3 , ammonium tin iodide NH4SnI3 , and formamidnium tin iodide HC (NH2)2SnI3 by first-principles calculations. We find that in addition to the commonly known motif in which the corner-shared SnI6 octahedra form a three-dimensional network, these materials may also favor a two-dimensional (layered) motif formed by alternating layers of the SnI6 octahedra and the organic cations. These two motifs are nearly equal in free energy and are separated by low barriers. These layered structures features many flat electronic bands near the band edges, making their electronic structures significantly different from those of the structural phases composed of three-dimension networks of SnI6 octahedra. Furthermore, because the electronic structures of HC (NH2)2SnI3 are found to be rather similar to those of CH3NH3SnI3 , formamidnium tin iodide may also be promising for the applications of methylammonium tin iodide.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Helander, Sara; Montecchio, Meri; Lemak, Alexander

Highlights: • We describe the structure of a novel fold in FKBP25 and HectD. • The new fold is named the Basic Tilted Helix Bundle (BTHB) domain. • A conserved basic surface patch is presented, suggesting a functional role. - Abstract: In this paper, we describe the structure of a N-terminal domain motif in nuclear-localized FKBP25{sub 1–73}, a member of the FKBP family, together with the structure of a sequence-related subdomain of the E3 ubiquitin ligase HectD1 that we show belongs to the same fold. This motif adopts a compact 5-helix bundle which we name the Basic Tilted Helix Bundlemore » (BTHB) domain. A positively charged surface patch, structurally centered around the tilted helix H4, is present in both FKBP25 and HectD1 and is conserved in both proteins, suggesting a conserved functional role. We provide detailed comparative analysis of the structures of the two proteins and their sequence similarities, and analysis of the interaction of the proposed FKBP25 binding protein YY1. We suggest that the basic motif in BTHB is involved in the observed DNA binding of FKBP25, and that the function of this domain can be affected by regulatory YY1 binding and/or interactions with adjacent domains.« less

Structural modelling and phylogenetic analyses of PgeIF4A2 (Eukaryotic translation initiation factor) from Pennisetum glaucum reveal signature motifs with a role in stress tolerance and development

PubMed Central

Agarwal, Aakrati; Mudgil, Yashwanti; Pandey, Saurabh; Fartyal, Dhirendra; Reddy, Malireddy K

2016-01-01

Eukaryotic translation initiation factor 4A (eIF4A) is an indispensable component of the translation machinery and also play a role in developmental processes and stress alleviation in plants and animals. Different eIF4A isoforms are present in the cytosol of the cell, namely, eIF4A1, eIF4A2, and eIF4A3 and their expression is tightly regulated in cap-dependent translation. We revealed the structural model of PgeIF4A2 protein using the crystal structure of Homo sapiens eIF4A3 (PDB ID: 2J0S) as template by Modeller 9.12. The resultant PgeIF4A2 model structure was refined by PROCHECK, ProSA, Verify3D and RMSD that showed the model structure is reliable with 77 % amino acid sequence identity with template. Investigation revealed two conserved signatures for ATP-dependent RNA Helicase DEAD-box conserved site (VLDEADEML) and RNA helicase DEAD-box type, Q-motif in sheet-turn-helix and α-helical region respectively. All these conserved motifs are responsible for response during developmental stages and stress tolerance in plants. PMID:28358146
Structural modelling and phylogenetic analyses of PgeIF4A2 (Eukaryotic translation initiation factor) from Pennisetum glaucum reveal signature motifs with a role in stress tolerance and development.

PubMed

Agarwal, Aakrati; Mudgil, Yashwanti; Pandey, Saurabh; Fartyal, Dhirendra; Reddy, Malireddy K

2016-01-01

Eukaryotic translation initiation factor 4A (eIF4A) is an indispensable component of the translation machinery and also play a role in developmental processes and stress alleviation in plants and animals. Different eIF4A isoforms are present in the cytosol of the cell, namely, eIF4A1, eIF4A2, and eIF4A3 and their expression is tightly regulated in cap-dependent translation. We revealed the structural model of PgeIF4A2 protein using the crystal structure of Homo sapiens eIF4A3 (PDB ID: 2J0S) as template by Modeller 9.12. The resultant PgeIF4A2 model structure was refined by PROCHECK, ProSA, Verify3D and RMSD that showed the model structure is reliable with 77 % amino acid sequence identity with template. Investigation revealed two conserved signatures for ATP-dependent RNA Helicase DEAD-box conserved site (VLDEADEML) and RNA helicase DEAD-box type, Q-motif in sheet-turn-helix and α-helical region respectively. All these conserved motifs are responsible for response during developmental stages and stress tolerance in plants.
Interaction of Cu(+) with cytosine and formation of i-motif-like C-M(+)-C complexes: alkali versus coinage metals.

PubMed

Gao, Juehan; Berden, Giel; Rodgers, M T; Oomens, Jos

2016-03-14

The Watson-Crick structure of DNA is among the most well-known molecular structures of our time. However, alternative base-pairing motifs are also known to occur, often depending on base sequence, pH, or the presence of cations. Pairing of cytosine (C) bases induced by the sharing of a single proton (C-H(+)-C) may give rise to the so-called i-motif, which occurs primarily in expanded trinucleotide repeats and the telomeric region of DNA, particularly at low pH. At physiological pH, silver cations were recently found to stabilize C dimers in a C-Ag(+)-C structure analogous to the hemiprotonated C-dimer. Here we use infrared ion spectroscopy in combination with density functional theory calculations at the B3LYP/6-311G+(2df,2p) level to show that copper in the 1+ oxidation state induces an analogous formation of C-Cu(+)-C structures. In contrast to protons and these transition metal ions, alkali metal ions induce a different dimer structure, where each ligand coordinates the alkali metal ion in a bidentate fashion in which the N3 and O2 atoms of both cytosine ligands coordinate to the metal ion, sacrificing hydrogen-bonding interactions between the ligands for improved chelation of the metal cation.
Multifunctionality and diversity of GDSL esterase/lipase gene family in rice (Oryza sativa L. japonica) genome: new insights from bioinformatics analysis

PubMed Central

2012-01-01

Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family in plant species include numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of identified putative conserved clade-common and -specific peptide motifs, and their location on the predicted protein three dimensional structure may possibly signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with protein three-dimensional model and location of the phylogenetic specific conserved motifs. The differential expression of some representative genes were confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures, and the expression profiling were analysed to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. With combination of the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create supported basis for the functional prediction of many members in the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
The Drosophila hnRNP F/H Homolog Glorund Uses Two Distinct RNA-Binding Modes to Diversify Target Recognition.

PubMed

Tamayo, Joel V; Teramoto, Takamasa; Chatterjee, Seema; Hall, Traci M Tanaka; Gavis, Elizabeth R

2017-04-04

The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo's RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subset of Glo's functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Structural constraints in the packaging of bluetongue virus genomic segments

PubMed Central

Burkhardt, Christiane; Sung, Po-Yu; Celma, Cristina C.

2014-01-01

The mechanism used by bluetongue virus (BTV) to ensure the sorting and packaging of its 10 genomic segments is still poorly understood. In this study, we investigated the packaging constraints for two BTV genomic segments from two different serotypes. Segment 4 (S4) of BTV serotype 9 was mutated sequentially and packaging of mutant ssRNAs was investigated by two newly developed RNA packaging assay systems, one in vivo and the other in vitro. Modelling of the mutated ssRNA followed by biochemical data analysis suggested that a conformational motif formed by interaction of the 5′ and 3′ ends of the molecule was necessary and sufficient for packaging. A similar structural signal was also identified in S8 of BTV serotype 1. Furthermore, the same conformational analysis of secondary structures for positive-sense ssRNAs was used to generate a chimeric segment that maintained the putative packaging motif but contained unrelated internal sequences. This chimeric segment was packaged successfully, confirming that the motif identified directs the correct packaging of the segment. PMID:24980574
The Drosophila hnRNP F/H homolog glorund uses two distinct RNA-binding modes to diversify target recognition

DOE PAGES

Tamayo, Joel V.; Teramoto, Takamasa; Chatterjee, Seema; ...

2017-04-04

The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subsetmore » of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Lastly, our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.« less
Intrinsically disordered proteins drive enamel formation via an evolutionarily conserved self-assembly motif.

PubMed

Wald, Tomas; Spoutil, Frantisek; Osickova, Adriana; Prochazkova, Michaela; Benada, Oldrich; Kasparek, Petr; Bumba, Ladislav; Klein, Ophir D; Sedlacek, Radislav; Sebo, Peter; Prochazka, Jan; Osicka, Radim

2017-02-28

The formation of mineralized tissues is governed by extracellular matrix proteins that assemble into a 3D organic matrix directing the deposition of hydroxyapatite. Although the formation of bones and dentin depends on the self-assembly of type I collagen via the Gly-X-Y motif, the molecular mechanism by which enamel matrix proteins (EMPs) assemble into the organic matrix remains poorly understood. Here we identified a Y/F-x-x-Y/L/F-x-Y/F motif, evolutionarily conserved from the first tetrapods to man, that is crucial for higher order structure self-assembly of the key intrinsically disordered EMPs, ameloblastin and amelogenin. Using targeted mutations in mice and high-resolution imaging, we show that impairment of ameloblastin self-assembly causes disorganization of the enamel organic matrix and yields enamel with disordered hydroxyapatite crystallites. These findings define a paradigm for the molecular mechanism by which the EMPs self-assemble into supramolecular structures and demonstrate that this process is crucial for organization of the organic matrix and formation of properly structured enamel.
The Drosophila hnRNP F/H Homolog Glorund Uses Two Distinct RNA-Binding Modes to Diversify Target Recognition

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tamayo, Joel V.; Teramoto, Takamasa; Chatterjee, Seema

The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subsetmore » of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.« less
The Drosophila hnRNP F/H homolog glorund uses two distinct RNA-binding modes to diversify target recognition

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tamayo, Joel V.; Teramoto, Takamasa; Chatterjee, Seema

The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subsetmore » of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Lastly, our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.« less
Structural Basis of PP2A Inhibition by Small t Antigen

PubMed Central

Cho, Uhn Soo; Morrone, Seamus; Sablina, Anna A; Arroyo, Jason D; Hahn, William C; Xu, Wenqing

2007-01-01

The SV40 small t antigen (ST) is a potent oncoprotein that perturbs the function of protein phosphatase 2A (PP2A). ST directly interacts with the PP2A scaffolding A subunit and alters PP2A activity by displacing regulatory B subunits from the A subunit. We have determined the crystal structure of full-length ST in complex with PP2A A subunit at 3.1 Å resolution. ST consists of an N-terminal J domain and a C-terminal unique domain that contains two zinc-binding motifs. Both the J domain and second zinc-binding motif interact with the intra-HEAT-repeat loops of HEAT repeats 3–7 of the A subunit, which overlaps with the binding site of the PP2A B56 subunit. Intriguingly, the first zinc-binding motif is in a position that may allow it to directly interact with and inhibit the phosphatase activity of the PP2A catalytic C subunit. These observations provide a structural basis for understanding the oncogenic functions of ST. PMID:17608567
Tyrocidine A Analogues Bearing the Planar d-Phe-2-Abz Turn Motif: How Conformation Impacts Bioactivity.

PubMed

Cameron, Alan J; Edwards, Patrick J B; Harjes, Elena; Sarojini, Vijayalekshmi

2017-12-14

The d-Phe-Pro β-turn of the cyclic β-hairpin antimicrobial decapeptide tyrocidine A, (Tyrc A) was substituted with the d-Phe-2-aminobenzoic acid (2-Abz) motif in a synthetic analogue (1). The NMR structure of 1 demonstrated that compound 1 retained the β-hairpin structure of Tyrc A with additional planarity, resulting in approximately 30-fold reduced hemolysis than Tyrc A. Although antibacterial activity was partially compromised, a single Gln to Lys substitution (2) restored activity equivalent to Tyrc A against S. aureus, enhanced activity against two Gram negative strains and maintained the reduced hemeloysis of 1. Analysis by transmission electron microscopy (TEM) suggested a membrane lytic mechanism of action for these peptides. Compound 2 also exhibits nanomolar antifungal activity in synergy with amphotericin B. The d-Phe-2-Abz turn may serve as a tool for the synthesis of structurally predictable β-hairpin libraries. Unlike traditional β-turn motifs such as d-Pro-Gly, both the 2-Abz and d-Phe rings may be further functionalized.
Complexity of the 5' Untranslated Region of EIF4A3, a Critical Factor for Craniofacial and Neural Development.

PubMed

Hsia, Gabriella S P; Musso, Camila M; Alvizi, Lucas; Brito, Luciano A; Kobayashi, Gerson S; Pavanello, Rita C M; Zatz, Mayana; Gardham, Alice; Wakeling, Emma; Zechi-Ceide, Roseli M; Bertola, Debora; Passos-Bueno, Maria Rita

2018-01-01

Repeats in coding and non-coding regions have increasingly been associated with many human genetic disorders, such as Richieri-Costa-Pereira syndrome (RCPS). RCPS, mostly characterized by midline cleft mandible, Robin sequence and limb defects, is an autosomal-recessive acrofacial dysostosis mainly reported in Brazilian patients. This disorder is caused by decreased levels of EIF4A3 , mostly due to an increased number of repeats at the EIF4A3 5'UTR. EIF4A3 5'UTR alleles are CG-rich and vary in size and organization of three types of motifs. An exclusive allelic pattern was identified among affected individuals, in which the CGCA-motif is the most prevalent, herein referred as "disease-associated CGCA-20nt motif." The origin of the pathogenic alleles containing the disease-associated motif, as well as the functional effects of the 5'UTR motifs on EIF4A3 expression, to date, are entirely unknown. Here, we characterized 43 different EIF4A3 5'UTR alleles in a cohort of 380 unaffected individuals. We identified eight heterozygous unaffected individuals harboring the disease-associated CGCA-20nt motif and our haplotype analyses indicate that there are more than one haplotype associated with RCPS. The combined analysis of number, motif organization and haplotypic diversity, as well as the observation of two apparently distinct haplotypes associated with the disease-associated CGCA-20nt motif, suggest that the RCPS alleles might have arisen from independent unequal crossing-over events between ancient alleles at least twice. Moreover, we have shown that the number and sequence of motifs in the 5'UTR region is associated with EIF4A3 repression, which is not mediated by CpG methylation. In conclusion, this study has shown that the large number of repeats in EIF4A3 does not represent a dynamic mutation and RCPS can arise in any population harboring alleles with the CGCA-20nt motif. We also provided further evidence that EIF4A3 5'UTR is a regulatory region and the size and sequence type of the repeats at 5'UTR may contribute to clinical variability in RCPS.
Finding specific RNA motifs: Function in a zeptomole world?

PubMed Central

KNIGHT, ROB; YARUS, MICHAEL

2003-01-01

We have developed a new method for estimating the abundance of any modular (piecewise) RNA motif within a longer random region. We have used this method to estimate the size of the active motifs available to modern SELEX experiments (picomoles of unique sequences) and to a plausible RNA World (zeptomoles of unique sequences: 1 zmole = 602 sequences). Unexpectedly, activities such as specific isoleucine binding are almost certainly present in zeptomoles of molecules, and even ribozymes such as self-cleavage motifs may appear (depending on assumptions about the minimal structures). The number of specified nucleotides is not the only important determinant of a motif’s rarity: The number of modules into which it is divided, and the details of this division, are also crucial. We propose three maxims for easily isolated motifs: the Maxim of Minimization, the Maxim of Multiplicity, and the Maxim of the Median. These maxims together state that selected motifs should be small and composed of as many separate, equally sized modules as possible. For evenly divided motifs with four modules, the largest accessible activity in picomole scale (1–1000 pmole) pools of length 100 is about 34 nucleotides; while for zeptomole scale (1–1000 zmole) pools it is about 20 specific nucleotides (50% probability of occurrence). This latter figure includes some ribozymes and aptamers. Consequently, an RNA metabolism apparently could have begun with only zeptomoles of RNA molecules. PMID:12554865
Arresting a Torsin ATPase Reshapes the Endoplasmic Reticulum*

PubMed Central

Rose, April E.; Zhao, Chenguang; Turner, Elizabeth M.; Steyer, Anna M.; Schlieker, Christian

2014-01-01

Torsins are membrane-tethered AAA+ ATPases residing in the nuclear envelope (NE) and endoplasmic reticulum (ER). Here, we show that the induction of a conditional, dominant-negative TorsinB variant provokes a profound reorganization of the endomembrane system into foci containing double membrane structures that are derived from the ER. These double-membrane sinusoidal structures are formed by compressing the ER lumen to a constant width of 15 nm, and are highly enriched in the ATPase activator LULL1. Further, we define an important role for a highly conserved aromatic motif at the C terminus of Torsins. Mutations in this motif perturb LULL1 binding, reduce ATPase activity, and profoundly limit the induction of sinusoidal structures. PMID:24275647
A natural product based DOS library of hybrid systems.

PubMed

Prabhu, Ganesh; Agarwal, Shalini; Sharma, Vijeta; Madurkar, Sanjay M; Munshi, Parthapratim; Singh, Shailja; Sen, Subhabrata

2015-05-05

Here we described a natural product inspired modular DOS strategy for the synthesis of a library of hybrid systems that are structurally and stereochemically disparate. The main scaffold is a pyrroloisoquinoline motif, that is synthesized from tandem Pictet-Spengler lactamization. The structural diversity is generated via "privileged scaffolds" that are attached at the appropriate site of the motif. Screening of the library compounds for their antiplasmodial activity against chloroquine sensitive 3D7 cells indicated few compounds with moderate activity (20-50 μM). A systematic comparison of structural intricacy between the library members and a natural product dataset obtained from ZINC(®) revealed comparable complexity. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
A proximity-based graph clustering method for the identification and application of transcription factor clusters.

PubMed

Spadafore, Maxwell; Najarian, Kayvan; Boyle, Alan P

2017-11-29

Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions. Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF. We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions. The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics.
Macrocyclic molecular rotors with bridged steroidal frameworks.

PubMed

Czajkowska-Szczykowska, Dorota; Rodríguez-Molina, Braulio; Magaña-Vergara, Nancy E; Santillan, Rosa; Morzycki, Jacek W; Garcia-Garibay, Miguel A

2012-11-16

In this work, we describe the synthesis and solid-state dynamics of isomeric molecular rotors 7E and 7Z, consisting of two androstane steroidal frameworks linked by the D rings by triple bonds at their C17 positions to a 1,4-phenylene rotator. They are also linked by the A rings by an alkenyl diester bridge to restrict the conformational flexibility of the molecules and reduce the number of potential crystalline arrays. The analysis of the resulting molecular structures and packing motifs offered insights of the internal dynamics that were later elucidated by means of line shape analyses of the spectral features obtained through variable-temperature solid-state (13)C NMR; such analysis revealed rotations in the solid state occurring at kilohertz frequency at room temperature.
Diversity Surveys and Evolutionary Relationships of aoxB Genes in Aerobic Arsenite-Oxidizing Bacteria▿ †

PubMed Central

Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N.; Garrido, Francis; Joulian, Catherine

2008-01-01

A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers. PMID:18502920
Molecular Dynamics Simulations Reveal an Interplay between SHAPE Reagent Binding and RNA Flexibility.

PubMed

Mlýnský, Vojtěch; Bussi, Giovanni

2018-01-18

The function of RNA molecules usually depends on their overall fold and on the presence of specific structural motifs. Chemical probing methods are routinely used in combination with nearest-neighbor models to determine RNA secondary structure. Among the available methods, SHAPE is relevant due to its capability to probe all RNA nucleotides and the possibility to be used in vivo. However, the structural determinants for SHAPE reactivity and its mechanism of reaction are still unclear. Here molecular dynamics simulations and enhanced sampling techniques are used to predict the accessibility of nucleotide analogs and larger RNA structural motifs to SHAPE reagents. We show that local RNA reconformations are crucial in allowing reagents to reach the 2'-OH group of a particular nucleotide and that sugar pucker is a major structural factor influencing SHAPE reactivity.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Kailang; Li, Weikai; Peng, Guiqing

NL63 coronavirus (NL63-CoV), a prevalent human respiratory virus, is the only group I coronavirus known to use angiotensin-converting enzyme 2 (ACE2) as its receptor. Incidentally, ACE2 is also used by group II SARS coronavirus (SARS-CoV). We investigated how different groups of coronaviruses recognize the same receptor, whereas homologous group I coronaviruses recognize different receptors. We determined the crystal structure of NL63-CoV spike protein receptor-binding domain (RBD) complexed with human ACE2. NL63-CoV RBD has a novel {beta}-sandwich core structure consisting of 2 layers of {beta}-sheets, presenting 3 discontinuous receptor-binding motifs (RBMs) to bind ACE2. NL63-CoV and SARS-CoV have no structural homologymore » in RBD cores or RBMs; yet the 2 viruses recognize common ACE2 regions, largely because of a 'virus-binding hotspot' on ACE2. Among group I coronaviruses, RBD cores are conserved but RBMs are variable, explaining how these viruses recognize different receptors. These results provide a structural basis for understanding viral evolution and virus-receptor interactions.« less
Energetics and Structure Prediction of the Network of Homo- and Hetero-Oligomers Formed by the Transmembrane Domains of the ErbB Receptor Family of Proteins

DTIC Science & Technology

2005-06-01

acid residue motif, Small-x-x-Large-G/A, consist- complexes is critical to understanding the signal ing of a small residue (Gly, Ala , Ser, Thr, or Pro...in transduction process. Recent structures of the the zero position, a large aliphatic residue ( Ala , Val, ligand-binding domains of the erbB...receptors Leu, or Ile) in position 3, followed by Gly or Ala in have begun to provide insight into the mechanisms position four.’ This motif was identified
HLA-G peptide preferences change in transformed cells: impact on the binding motif.

PubMed

Celik, Alexander A; Simper, Gwendolin S; Hiemisch, Wiebke; Blasczyk, Rainer; Bade-Döding, Christina

2018-03-30

HLA-G is known for its strictly restricted tissue distribution. HLA-G expression could be detected in immune privileged organs and many tumor entities such as leukemia, multiple myeloma, and non-Hodgkin and Hodgkin's lymphoma. This functional variability from mediation of immune tolerance to facilitation of tumor immune evasion strategies might translate to a differential NK cell inhibition between immune-privileged organs and tumor cells. The biophysical invariability of the HLA-G heavy chain and its contrary diversity in immunity implicates a strong influence of the bound peptides on the pHLA-G structure. The aim was to determine if HLA-G displays a tissue-specific peptide repertoire. Therefore, using soluble sHLA-G technology, we analyzed the K562 and HDLM-2 peptide repertoires. Although both cell lines possess a comparable proteome and recruit HLA-G-restricted peptides through the same peptide-loading pathway, the peptide features appear to be cell specific. HDLM-2 derived HLA-G peptides are anchored by an Arg at p1 and K562-derived peptides are anchored by a Lys. At p2, no anchor motif could be determined while peptides were anchored at pΩ with a Leu and showed an auxiliary anchor motif Pro at p3. To appreciate if the peptide anchor alterations are due to a cell-specific differential peptidome, we performed analysis of peptide availability within the different cell types. Yet, the comparison of the cell-specific proteome and HLA-G-restricted ligandome clearly demonstrates a tissue-specific peptide selection by HLA-G molecules. This exclusive and unexpected observation suggests an exquisite immune function of HLA-G.
Placement of molecules in (not out of) the cell

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dauter, Zbigniew, E-mail: dauter@anl.gov

2013-01-01

The importance of presenting macromolecular structures in unified, standard ways is discussed. To uniquely describe a crystal structure, it is sufficient to specify the crystal unit cell and symmetry, and describe the unique structural motif which is repeated by the space-group symmetry throughout the whole crystal. It is somewhat arbitrary how such a unique motif can be defined and positioned with respect to the unit-cell origin. As a result of such freedom, some isomorphous structures are presented in the Protein Data Bank in different locations and appear as if they have different atomic coordinates, despite being completely equivalent structurally. Thismore » may easily confuse those users of the PDB who are less familiar with crystallographic symmetry transformations. It would therefore be beneficial for the community of PDB users to introduce standard rules for locating crystal structures of macromolecules in the unit cells of various space groups.« less
Lattice-free prediction of three-dimensional structure of programmed DNA assemblies

PubMed Central

Pan, Keyao; Kim, Do-Nyun; Zhang, Fei; Adendorff, Matthew R.; Yan, Hao; Bathe, Mark

2014-01-01

DNA can be programmed to self-assemble into high molecular weight 3D assemblies with precise nanometer-scale structural features. Although numerous sequence design strategies exist to realize these assemblies in solution, there is currently no computational framework to predict their 3D structures on the basis of programmed underlying multi-way junction topologies constrained by DNA duplexes. Here, we introduce such an approach and apply it to assemblies designed using the canonical immobile four-way junction. The procedure is used to predict the 3D structure of high molecular weight planar and spherical ring-like origami objects, a tile-based sheet-like ribbon, and a 3D crystalline tensegrity motif, in quantitative agreement with experiments. Our framework provides a new approach to predict programmed nucleic acid 3D structure on the basis of prescribed secondary structure motifs, with possible application to the design of such assemblies for use in biomolecular and materials science. PMID:25470497
Helix–hairpin–helix motifs confer salt resistance and processivity on chimeric DNA polymerases

PubMed Central

Pavlov, Andrey R.; Belova, Galina I.; Kozyavkin, Sergei A.; Slesarev, Alexei I.

2002-01-01

Helix–hairpin–helix (HhH) is a widespread motif involved in sequence-nonspecific DNA binding. The majority of HhH motifs function as DNA-binding modules with typical occurrence of one HhH motif or one or two (HhH)2 domains in proteins. We recently identified 24 HhH motifs in DNA topoisomerase V (Topo V). Although these motifs are dispensable for the topoisomerase activity of Topo V, their removal narrows the salt concentration range for topoisomerase activity tenfold. Here, we demonstrate the utility of Topo V's HhH motifs for modulating DNA-binding properties of the Stoffel fragment of TaqDNA polymerase and Pfu DNA polymerase. Different HhH cassettes fused with either NH2 terminus or COOH terminus of DNA polymerases broaden the salt concentration range of the polymerase activity significantly (up to 0.5 M NaCl or 1.8 M potassium glutamate). We found that anions play a major role in the inhibition of DNA polymerase activity. The resistance of initial extension rates and the processivity of chimeric polymerases to salts depend on the structure of added HhH motifs. Regardless of the type of the construct, the thermal stability of chimeric Taq polymerases increases under the optimal ionic conditions, as compared with that of TaqDNA polymerase or its Stoffel fragment. Our approach to raise the salt tolerance, processivity, and thermostability of Taq and Pfu DNA polymerases may be applied to all pol1- and polB-type polymerases, as well as to other DNA processing enzymes. PMID:12368475
A ΩXaV motif in the Rift Valley fever virus NSs protein is essential for degrading p62, forming nuclear filaments and virulence

PubMed Central

Cyr, Normand; de la Fuente, Cynthia; Lecoq, Lauriane; Guendel, Irene; Chabot, Philippe R.; Kehn-Hall, Kylene; Omichinski, James G.

2015-01-01

Rift Valley fever virus (RVFV) is a single-stranded RNA virus capable of inducing fatal hemorrhagic fever in humans. A key component of RVFV virulence is its ability to form nuclear filaments through interactions between the viral nonstructural protein NSs and the host general transcription factor TFIIH. Here, we identify an interaction between a ΩXaV motif in NSs and the p62 subunit of TFIIH. This motif in NSs is similar to ΩXaV motifs found in nucleotide excision repair (NER) factors and transcription factors known to interact with p62. Structural and biophysical studies demonstrate that NSs binds to p62 in a similar manner as these other factors. Functional studies in RVFV-infected cells show that the ΩXaV motif is required for both nuclear filament formation and degradation of p62. Consistent with the fact that the RVFV can be distinguished from other Bunyaviridae-family viruses due to its ability to form nuclear filaments in infected cells, the motif is absent in the NSs proteins of other Bunyaviridae-family viruses. Taken together, our studies demonstrate that p62 binding to NSs through the ΩXaV motif is essential for degrading p62, forming nuclear filaments and enhancing RVFV virulence. In addition, these results show how the RVFV incorporates a simple motif into the NSs protein that enables it to functionally mimic host cell proteins that bind the p62 subunit of TFIIH. PMID:25918396
A ΩXaV motif in the Rift Valley fever virus NSs protein is essential for degrading p62, forming nuclear filaments and virulence.

PubMed

Cyr, Normand; de la Fuente, Cynthia; Lecoq, Lauriane; Guendel, Irene; Chabot, Philippe R; Kehn-Hall, Kylene; Omichinski, James G

2015-05-12

Rift Valley fever virus (RVFV) is a single-stranded RNA virus capable of inducing fatal hemorrhagic fever in humans. A key component of RVFV virulence is its ability to form nuclear filaments through interactions between the viral nonstructural protein NSs and the host general transcription factor TFIIH. Here, we identify an interaction between a ΩXaV motif in NSs and the p62 subunit of TFIIH. This motif in NSs is similar to ΩXaV motifs found in nucleotide excision repair (NER) factors and transcription factors known to interact with p62. Structural and biophysical studies demonstrate that NSs binds to p62 in a similar manner as these other factors. Functional studies in RVFV-infected cells show that the ΩXaV motif is required for both nuclear filament formation and degradation of p62. Consistent with the fact that the RVFV can be distinguished from other Bunyaviridae-family viruses due to its ability to form nuclear filaments in infected cells, the motif is absent in the NSs proteins of other Bunyaviridae-family viruses. Taken together, our studies demonstrate that p62 binding to NSs through the ΩXaV motif is essential for degrading p62, forming nuclear filaments and enhancing RVFV virulence. In addition, these results show how the RVFV incorporates a simple motif into the NSs protein that enables it to functionally mimic host cell proteins that bind the p62 subunit of TFIIH.
How many Coccolithovirus genotypes does it take to terminate an Emiliania huxleyi bloom?

PubMed

Highfield, Andrea; Evans, Claire; Walne, Anthony; Miller, Peter I; Schroeder, Declan C

2014-10-01

Giant viruses are known to be significant mortality agents of phytoplankton, often being implicated in the terminations of large Emiliania huxleyi blooms. We have previously shown the high temporal variability of E. huxleyi-infecting coccolithoviruses (EhVs) within a Norwegian fjord mesocosm. In the current study we investigated EhV dynamics within a naturally-occurring E. huxleyi bloom in the Western English Channel. Using denaturing gradient gel electrophoresis and marker gene sequencing, we uncovered a spatially highly dynamic Coccolithovirus population that was associated with a genetically stable E. huxleyi population as revealed by the major capsid protein gene (mcp) and coccolith morphology motif (CMM), respectively. Coccolithoviruses within the bloom were found to be variable with depth and unique virus populations were detected at different stations sampled indicating a complex network of EhV-host infections. This ultimately will have significant implications to the internal structure and longevity of ecologically important E. huxleyi blooms. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Membrane Curvature Sensing by Amphipathic Helices Is Modulated by the Surrounding Protein Backbone.

PubMed

Doucet, Christine M; Esmery, Nina; de Saint-Jean, Maud; Antonny, Bruno

2015-01-01

Membrane curvature is involved in numerous biological pathways like vesicle trafficking, endocytosis or nuclear pore complex assembly. In addition to its topological role, membrane curvature is sensed by specific proteins, enabling the coordination of biological processes in space and time. Amongst membrane curvature sensors are the ALPS (Amphipathic Lipid Packing Sensors). ALPS motifs are short peptides with peculiar amphipathic properties. They are found in proteins targeted to distinct curved membranes, mostly in the early secretory pathway. For instance, the ALPS motif of the golgin GMAP210 binds trafficking vesicles, while the ALPS motif of Nup133 targets nuclear pores. It is not clear if, besides curvature sensitivity, ALPS motifs also provide target specificity, or if other domains in the surrounding protein backbone are involved. To elucidate this aspect, we studied the subcellular localization of ALPS motifs outside their natural protein context. The ALPS motifs of GMAP210 or Nup133 were grafted on artificial fluorescent probes. Importantly, ALPS motifs are held in different positions and these contrasting architectures were mimicked by the fluorescent probes. The resulting chimeras recapitulated the original proteins localization, indicating that ALPS motifs are sufficient to specifically localize proteins. Modulating the electrostatic or hydrophobic content of Nup133 ALPS motif modified its avidity for cellular membranes but did not change its organelle targeting properties. In contrast, the structure of the backbone surrounding the helix strongly influenced targeting. In particular, introducing an artificial coiled-coil between ALPS and the fluorescent protein increased membrane curvature sensitivity. This coiled-coil domain also provided membrane curvature sensitivity to the amphipathic helix of Sar1. The degree of curvature sensitivity within the coiled-coil context remains correlated to the natural curvature sensitivity of the helices. This suggests that the chemistry of ALPS motifs is a key parameter for membrane curvature sensitivity, which can be further modulated by the surrounding protein backbone.
Dual role of Zn2+ in maintaining structural integrity and suppressing deacetylase activity of SIRT1.

PubMed

Chen, Lei; Feng, Yu; Zhou, Yinqiu; Zhu, Weiliang; Shen, Xu; Chen, Kaixian; Jiang, Hualiang; Liu, Dongxiang

2010-02-01

Zn(2+) directly participates in catalysis of histone deacetylase (HDAC) Classes I, II, IV enzymes while its role in HDAC Class III activity is not well established. Herein we investigated the effects of Zn(2+) on the deacetylase activity of sirtuin 1 (silent mating type information regulation 2 homolog 1, SIRT1). We found that the inherent Zn(2+) at the zinc-finger motif of SIRT1 is essential for the structural integrity and the deacetylase activity of SIRT1, whereas the exogenous Zn(2+) strongly inhibits the deacetylase activity with an IC(50) of 0.82muM for Zn(Gly)(2). SIRT1 activity suppressed by the exogenous Zn(2+) can be fully recovered by the metal chelator EDTA but not by the activator resveratrol. We also identified Zn(2+) as a noncompetitive inhibitor for the substrates of NAD(+) and the acetyl peptide P53-AMC. The 8-anilino-1-naphthalenesulfonic acid (ANS) fluorescence titration experiments and site-directed mutagenesis study suggested that the exogenous Zn(2+) binds to SIRT1 but not at the zinc-finger motif. These results indicate that Zn(2+) plays a dual role in SIRT1 activity. Inherent Zn(2+) at the zinc-finger motif is structurally related and essential for SIRT1 activity. On the other hand, Zn(2+) may also bind to another site different from the zinc-finger motif or the binding sites for the substrates or resveratrol and act as a potent inhibitor of SIRT1.
Complexation of imidazopyridine-based cations with a 24-crown-8 ether host: [2]pseudorotaxane and partially threaded structures.

PubMed

Moreno-Olivares, Surisadai I; Cervantes, Ruy; Tiburcio, Jorge

2013-11-01

A new series of linear molecules derived from 1,2-bis(imidazopyridin-2-yl)ethane can fully or partially penetrate the cavity of the dibenzo-24-crown-8 macrocycle to produce a new family of host-guest complexes. Protonation or alkylation of the nitrogen atoms on the pyridine rings led to an increase in the guest total positive charge up to 4+ and simultaneously generated two new recognition sites (pyridinium motifs) that are in competition with the 1,2-bis(benzimidazole)ethane motif for the crown ether. The relative position of the pyridine ring and the chemical nature of the N-substituent determined the preferred motif and the host-guest complex geometry: (i) for linear guests with relatively bulky groups (i.e., a benzyl substituent), the 1,2-bis(benzimidazole)ethane motif is favored, leading to a fully threaded complex with a [2]pseudorotaxane geometry; (ii) for small substituents, such as -H and -CH3 groups, regardless of the guest shape, the pyridinium motifs are preferred, leading to external partially threaded complexes in a 2:1 host to guest stoichiometry.
Human telomeric DNA: G-quadruplex, i-motif and Watson–Crick double helix

PubMed Central

Phan, Anh Tuân; Mergny, Jean-Louis

2002-01-01

Human telomeric DNA composed of (TTAGGG/CCCTAA)n repeats may form a classical Watson–Crick double helix. Each individual strand is also prone to quadruplex formation: the G-rich strand may adopt a G-quadruplex conformation involving G-quartets whereas the C-rich strand may fold into an i-motif based on intercalated C·C+ base pairs. Using an equimolar mixture of the telomeric oligonucleotides d[AGGG(TTAGGG)3] and d[(CCCTAA)3CCCT], we defined which structures existed and which would be the predominant species under a variety of experimental conditions. Under near-physiological conditions of pH, temperature and salt concentration, telomeric DNA was predominantly in a double-helix form. However, at lower pH values or higher temperatures, the G-quadruplex and/or the i-motif efficiently competed with the duplex. We also present kinetic and thermodynamic data for duplex association and for G-quadruplex/i-motif unfolding. PMID:12409451
The Thiamin Pyrophosphate-Motif

NASA Technical Reports Server (NTRS)

Dominiak, P.; Ciszak, E.

2003-01-01

Using databases the authors have identified a common thiamin pyrophosphate (TPP)-motif in the family of functionally diverse TPP-dependent enzymes. This common motif consists of multimeric organization of subunits and two catalytic centers. Each catalytic center (PP:PYR) is formed at the interface of the PP-domain binding the magnesium ion, pyrophosphate and amhopyrimidine ring of TPP, and the PYR-domain binding the aminopyrimidine ring of that cofactor. A pair of these catalytic centers constitutes the catalytic core (PP:PYR)(sub 2) within these enzymes. Analysis of the structural elements of this catalytic core reveals novel definition of the common amino acid sequences, which are GXPhiX(sub 4)(G)PhiXXGQ and GDGX(sub 25-30)NN in the PP-domain, and the EX(sub 4)(G)PhiXXGPhi in the PYR-domain, where Phi corresponds to a hydrophobic amino acid. This TPP-motif provides a novel tool for annotation of TPP-dependent enzymes useful in advancing functional proteomics.
Understanding the role of histidine in the GHSxG acyltransferase active site motif: Evidence for histidine stabilization of the malonyl-enzyme intermediate

DOE Office of Scientific and Technical Information (OSTI.GOV)

Poust, Sean; Yoon, Isu; Adams, Paul D.

Acyltransferases determine which extender units are incorporated into polyketide and fatty acid products. Thus, the ping-pong acyltransferase mechanism utilizes a serine in a conserved GHSxG motif. However, the role of the conserved histidine in this motif is poorly understood. We observed that a histidine to alanine mutation (H640A) in the GHSxG motif of the malonyl-CoA specific yersiniabactin acyltransferase results in an approximately seven-fold higher hydrolysis rate over the wildtype enzyme, while retaining transacylation activity. We propose two possibilities for the reduction in hydrolysis rate: either H640 structurally stabilizes the protein by hydrogen bonding with a conserved asparagine in the ferredoxin-likemore » subdomain of the protein, or a water-mediated hydrogen bond between H640 and the malonyl moiety stabilizes the malonyl-O-AT ester intermediate.« less
Understanding the role of histidine in the GHSxG acyltransferase active site motif: Evidence for histidine stabilization of the malonyl-enzyme intermediate

DOE PAGES

Poust, Sean; Yoon, Isu; Adams, Paul D.; ...

2014-10-06

Acyltransferases determine which extender units are incorporated into polyketide and fatty acid products. Thus, the ping-pong acyltransferase mechanism utilizes a serine in a conserved GHSxG motif. However, the role of the conserved histidine in this motif is poorly understood. We observed that a histidine to alanine mutation (H640A) in the GHSxG motif of the malonyl-CoA specific yersiniabactin acyltransferase results in an approximately seven-fold higher hydrolysis rate over the wildtype enzyme, while retaining transacylation activity. We propose two possibilities for the reduction in hydrolysis rate: either H640 structurally stabilizes the protein by hydrogen bonding with a conserved asparagine in the ferredoxin-likemore » subdomain of the protein, or a water-mediated hydrogen bond between H640 and the malonyl moiety stabilizes the malonyl-O-AT ester intermediate.« less
Identifying Atomic Scale Structure in Undoped/Doped Semicrystalline P3HT Using Inelastic Neutron Scattering

DOE PAGES

Harrelson, Thomas F.; Cheng, Yongqiang Q.; Li, Jun; ...

2017-03-07

The greatest advantage of organic materials is the ability to synthetically tune desired properties. However, structural heterogeneity often obfuscates the relationship between chemical structure and functional properties. Inelastic neutron scattering (INS) is sensitive to both local structure and chemical environment and provides atomic level details that cannot be obtained through other spectroscopic or diffraction methods. INS data are composed of a density of vibrational states with no selection rules, which means that every structural configuration is equally weighted in the spectrum. This allows the INS spectrum to be quantitatively decomposed into different structural motifs. Here in this paper we presentmore » INS measurements of the semiconducting polymer P3HT doped with F4TCNQ supported by density functional theory calculations to identify two dominant families of undoped crystalline structures and one dominant doped structural motif, in spite of considerable heterogeneity. The differences between the undoped and doped structures indicate that P3HT side chains flatten upon doping.« less
Search for global-minimum geometries of medium-sized germanium clusters. II. Motif-based low-lying clusters Ge21-Ge29

NASA Astrophysics Data System (ADS)

Yoo, S.; Zeng, X. C.

2006-05-01

We performed a constrained search for the geometries of low-lying neutral germanium clusters GeN in the size range of 21⩽N⩽29. The basin-hopping global optimization method is employed for the search. The potential-energy surface is computed based on the plane-wave pseudopotential density functional theory. A new series of low-lying clusters is found on the basis of several generic structural motifs identified previously for silicon clusters [S. Yoo and X. C. Zeng, J. Chem. Phys. 124, 054304 (2006)] as well as for smaller-sized germanium clusters [S. Bulusu et al., J. Chem. Phys. 122, 164305 (2005)]. Among the generic motifs examined, we found that two motifs stand out in producing most low-lying clusters, namely, the six/nine motif, a puckered-hexagonal-ring Ge6 unit attached to a tricapped trigonal prism Ge9, and the six/ten motif, a puckered-hexagonal-ring Ge6 unit attached to a bicapped antiprism Ge10. The low-lying clusters obtained are all prolate in shape and their energies are appreciably lower than the near-spherical low-energy clusters. This result is consistent with the ion-mobility measurement in that medium-sized germanium clusters detected are all prolate in shape until the size N ˜65.
Structure of Rot, a global regulator of virulence genes in Staphylococcus aureus.

PubMed

Zhu, Yuwei; Fan, Xiaojiao; Zhang, Xu; Jiang, Xuguang; Niu, Liwen; Teng, Maikun; Li, Xu

2014-09-01

Staphylococcus aureus is a highly versatile pathogen that can infect human tissue by producing a large arsenal of virulence factors that are tightly regulated by a complex regulatory network. Rot, which shares sequence similarity with SarA homologues, is a global regulator that regulates numerous virulence genes. However, the recognition model of Rot for the promoter region of target genes and the putative regulation mechanism remain elusive. In this study, the 1.77 Å resolution X-ray crystal structure of Rot is reported. The structure reveals that two Rot molecules form a compact homodimer, each of which contains a typical helix-turn-helix module and a β-hairpin motif connected by a flexible loop. Fluorescence polarization results indicate that Rot preferentially recognizes AT-rich dsDNA with ~30-base-pair nucleotides and that the conserved positively charged residues on the winged-helix motif are vital for binding to the AT-rich dsDNA. It is proposed that the DNA-recognition model of Rot may be similar to that of SarA, SarR and SarS, in which the helix-turn-helix motifs of each monomer interact with the major grooves of target dsDNA and the winged motifs contact the minor grooves. Interestingly, the structure shows that Rot adopts a novel dimerization model that differs from that of other SarA homologues. As expected, perturbation of the dimer interface abolishes the dsDNA-binding ability of Rot, suggesting that Rot functions as a dimer. In addition, the results have been further confirmed in vivo by measuring the transcriptional regulation of α-toxin, a major virulence factor produced by most S. aureus strains.
Structural and functional analysis of a FeoB A143S G5 loop mutant explains the accelerated GDP release rate.

PubMed

Guilfoyle, Amy P; Deshpande, Chandrika N; Vincent, Kimberley; Pedroso, Marcelo M; Schenk, Gerhard; Maher, Megan J; Jormakka, Mika

2014-05-01

GTPases (G proteins) hydrolyze the conversion of GTP to GDP and free phosphate, comprising an integral part of prokaryotic and eukaryotic signaling, protein biosynthesis and cell division, as well as membrane transport processes. The G protein cycle is brought to a halt after GTP hydrolysis, and requires the release of GDP before a new cycle can be initiated. For eukaryotic heterotrimeric Gαβγ proteins, the interaction with a membrane-bound G protein-coupled receptor catalyzes the release of GDP from the Gα subunit. Structural and functional studies have implicated one of the nucleotide binding sequence motifs, the G5 motif, as playing an integral part in this release mechanism. Indeed, a Gαs G5 mutant (A366S) was shown to have an accelerated GDP release rate, mimicking a G protein-coupled receptor catalyzed release state. In the present study, we investigate the role of the equivalent residue in the G5 motif (residue A143) in the prokaryotic membrane protein FeoB from Streptococcus thermophilus, which includes an N-terminal soluble G protein domain. The structure of this domain has previously been determined in the apo and GDP-bound states and in the presence of a transition state analogue, revealing conformational changes in the G5 motif. The A143 residue was mutated to a serine and analyzed with respect to changes in GTPase activity, nucleotide release rate, GDP affinity and structural alterations. We conclude that the identity of the residue at this position in the G5 loop plays a key role in the nucleotide release rate by allowing the correct positioning and hydrogen bonding of the nucleotide base. © 2014 FEBS.

Evolving nucleotide binding surfaces

NASA Technical Reports Server (NTRS)

Kieber-Emmons, T.; Rein, R.

1981-01-01

An analysis is presented of the stability and nature of binding of a nucleotide to several known dehydrogenases. The employed approach includes calculation of hydrophobic stabilization of the binding motif and its intermolecular interaction with the ligand. The evolutionary changes of the binding motif are studied by calculating the Euclidean deviation of the respective dehydrogenases. Attention is given to the possible structural elements involved in the origin of nucleotide recognition by non-coded primordial polypeptides.
Distribution of recombination hotspots in the human genome--a comparison of computer simulations with real data.

PubMed

Mackiewicz, Dorota; de Oliveira, Paulo Murilo Castro; Moss de Oliveira, Suzana; Cebrat, Stanisław

2013-01-01

Recombination is the main cause of genetic diversity. Thus, errors in this process can lead to chromosomal abnormalities. Recombination events are confined to narrow chromosome regions called hotspots in which characteristic DNA motifs are found. Genomic analyses have shown that both recombination hotspots and DNA motifs are distributed unevenly along human chromosomes and are much more frequent in the subtelomeric regions of chromosomes than in their central parts. Clusters of motifs roughly follow the distribution of recombination hotspots whereas single motifs show a negative correlation with the hotspot distribution. To model the phenomena related to recombination, we carried out computer Monte Carlo simulations of genome evolution. Computer simulations generated uneven distribution of hotspots with their domination in the subtelomeric regions of chromosomes. They also revealed that purifying selection eliminating defective alleles is strong enough to cause such hotspot distribution. After sufficiently long time of simulations, the structure of chromosomes reached a dynamic equilibrium, in which number and global distribution of both hotspots and defective alleles remained statistically unchanged, while their precise positions were shifted. This resembles the dynamic structure of human and chimpanzee genomes, where hotspots change their exact locations but the global distributions of recombination events are very similar.
Distribution of Recombination Hotspots in the Human Genome – A Comparison of Computer Simulations with Real Data

PubMed Central

Mackiewicz, Dorota; de Oliveira, Paulo Murilo Castro; Moss de Oliveira, Suzana; Cebrat, Stanisław

2013-01-01

Recombination is the main cause of genetic diversity. Thus, errors in this process can lead to chromosomal abnormalities. Recombination events are confined to narrow chromosome regions called hotspots in which characteristic DNA motifs are found. Genomic analyses have shown that both recombination hotspots and DNA motifs are distributed unevenly along human chromosomes and are much more frequent in the subtelomeric regions of chromosomes than in their central parts. Clusters of motifs roughly follow the distribution of recombination hotspots whereas single motifs show a negative correlation with the hotspot distribution. To model the phenomena related to recombination, we carried out computer Monte Carlo simulations of genome evolution. Computer simulations generated uneven distribution of hotspots with their domination in the subtelomeric regions of chromosomes. They also revealed that purifying selection eliminating defective alleles is strong enough to cause such hotspot distribution. After sufficiently long time of simulations, the structure of chromosomes reached a dynamic equilibrium, in which number and global distribution of both hotspots and defective alleles remained statistically unchanged, while their precise positions were shifted. This resembles the dynamic structure of human and chimpanzee genomes, where hotspots change their exact locations but the global distributions of recombination events are very similar. PMID:23776462
Application of 1-aminocyclohexane carboxylic acid to protein nanostructure computer design

PubMed Central

Rodríguez-Ropero, Francisco; Zanuy, David; Casanovas, Jordi; Nussinov, Ruth; Alemán, Carlos

2009-01-01

Conformationally restricted amino acids are promising candidates to serve as basic pieces in redesigned protein motifs which constitute the basic modules in synthetic nanoconstructs. Here we study the ability of constrained cyclic amino acid 1-aminocyclohexane-1-carboxylic acid (Ac6c) to stabilize highly regular β-helical motifs excised from naturally occurring proteins. Calculations indicate that the conformational flexibility observed in both the ring and the main chain is significantly higher than that detected for other 1-aminocycloalkane-1-carboxylic acid (Acnc, where n refers to the size of the ring) with smaller cycles. Incorporation of Ac6c into the flexible loops of β-helical motifs indicates that the stability of such excised building blocks as well as the nano-assemblies derived from them is significantly enhanced. Thus, the intrinsic Ac6c tendency to adopt folded conformations combined with the low structural strain of the cyclohexane ring confers the ability to both self-adapt to the β-helix motif and to stabilize the overall structure by absorbing part of its conformational fluctuations. Comparison with other Acnc residues indicates that the ability to adapt to the targeted position improves considerably with the ring size, i.e. when the rigidity introduced by the strain of the ring decreases. PMID:18201062
Substrate specificity and reaction kinetics of an X-motif ribozyme

PubMed Central

LAZAREV, DENIS; PUSKARZ, IZABELA; BREAKER, RONALD R.

2003-01-01

The X-motif is an in vitro-selected ribozyme that catalyzes RNA cleavage by an internal phosphoester transfer reaction. This ribozyme class is distinguished by the fact that it emerged as the dominant clone among at least 12 different classes of ribozymes when in vitro selection was conducted to favor the isolation of high-speed catalysts. We have examined the structural and kinetic properties of the X-motif in order to provide a framework for its application as an RNA-cleaving agent and to explore how this ribozyme catalyzes phosphoester transfer with a predicted rate constant that is similar to those exhibited by the four natural self-cleaving ribozymes. The secondary structure of the X-motif includes four stem elements that form a central unpaired junction. In a bimolecular format, two of these base-paired arms define the substrate specificity of the ribozyme and can be changed to target different RNAs for cleavage. The requirements for nucleotide identity at the cleavage site are GD, where D = G, A, or U and cleavage occurs between the two nucleotides. The ribozyme has an absolute requirement for a divalent cation cofactor and exhibits kinetic behavior that is consistent with the obligate binding of at least two metal ions. PMID:12756327
The B7-1 Cytoplasmic Tail Enhances Intracellular Transport and Mammalian Cell Surface Display of Chimeric Proteins in the Absence of a Linear ER Export Motif

PubMed Central

Lin, Yi-Chieh; Chen, Bing-Mae; Lu, Wei-Cheng; Su, Chien-I; Prijovich, Zeljko M.; Chung, Wen-Chuan; Wu, Pei-Yu; Chen, Kai-Chuan; Lee, I-Chiao; Juan, Ting-Yi; Roffler, Steve R.

2013-01-01

Membrane-tethered proteins (mammalian surface display) are increasingly being used for novel therapeutic and biotechnology applications. Maximizing surface expression of chimeric proteins on mammalian cells is important for these applications. We show that the cytoplasmic domain from the B7-1 antigen, a commonly used element for mammalian surface display, can enhance the intracellular transport and surface display of chimeric proteins in a Sar1 and Rab1 dependent fashion. However, mutational, alanine scanning and deletion analysis demonstrate the absence of linear ER export motifs in the B7 cytoplasmic domain. Rather, efficient intracellular transport correlated with the presence of predicted secondary structure in the cytoplasmic tail. Examination of the cytoplasmic domains of 984 human and 782 mouse type I transmembrane proteins revealed that many previously identified ER export motifs are rarely found in the cytoplasmic tail of type I transmembrane proteins. Our results suggest that efficient intracellular transport of B7 chimeric proteins is associated with the structure rather than to the presence of a linear ER export motif in the cytoplasmic tail, and indicate that short (less than ~ 10-20 amino acids) and unstructured cytoplasmic tails should be avoided to express high levels of chimeric proteins on mammalian cells. PMID:24073236
Novel ATPase activity of the polyprotein intermediate, Viral Protein genome-linked-Nuclear Inclusion-a protease, of Pepper vein banding potyvirus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mathur, Chhavi; Savithri, Handanahal S., E-mail: bchss@biochem.iisc.ernet.in

2012-10-12

Highlights: Black-Right-Pointing-Pointer Pepper vein banding potyvirus VPg harbors Walker motifs. Black-Right-Pointing-Pointer VPg exhibits ATPase activity in the presence of NIa-Pro. Black-Right-Pointing-Pointer Plausible structural and functional interplay between VPg and NIa-Pro. Black-Right-Pointing-Pointer Functional relevance of prolonged presence of VPg-Pro during infection. -- Abstract: Potyviruses temporally regulate their protein function by polyprotein processing. Previous studies have shown that VPg (Viral Protein genome-linked) of Pepper vein banding virus interacts with the NIa-Pro (Nuclear Inclusion-a protease) domain, and modulates the kinetics of the protease. In the present study, we report for the first time that VPg harbors the Walker motifs A and B, andmore » the presence of NIa-Pro, especially in cis (cleavage site (E191A) VPg-Pro mutant), is essential for manifestation of the ATPase activity. Mutation of Lys47 (Walker motif A) and Asp88:Glu89 (Walker motif B) to alanine in E191A VPg-Pro lead to reduced ATPase activity, confirming that this activity was inherent to VPg. We propose that potyviral VPg, established as an intrinsically disordered domain, undergoes plausible structural alterations upon interaction with globular NIa-Pro which induces the ATPase activity.« less
New paradigm in ankyrin repeats: Beyond protein-protein interaction module.

PubMed

Islam, Zeyaul; Nagampalli, Raghavendra Sashi Krishna; Fatima, Munazza Tamkeen; Ashraf, Ghulam Md

2018-04-01

Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. These are short, widespread structural motif of around 33 amino acids repeats in tandem, having a canonical helix-loop-helix fold, found individually or in combination with other domains. The multiplicity of structural pattern enables it to form assemblies of diverse sizes, required for their abilities to confer multiple binding and structural roles of proteins. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on the ANK has proposed novel structural information, especially protein-lipid, protein-sugar and protein-protein interaction. Self-assembly of these repeats was also shown to prevent the associated protein in forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discussed latest findings on how these proteins participate in various interactions to diversify the ANK roles in numerous biological processes, and explored the emerging and evolving field of designer ankyrins and its framework for protein engineering emphasizing on biotechnological applications. Copyright © 2017 Elsevier B.V. All rights reserved.
DNA octaplex formation with an I-motif of water-mediated A-quartets: reinterpretation of the crystal structure of d(GCGAAAGC).

PubMed

Sato, Yoshiteru; Mitomi, Kenta; Sunami, Tomoko; Kondo, Jiro; Takénaka, Akio

2006-12-01

The crystal structure of the tetragonal form of d(gcGAAAgc) has been revised and reasonably refined including the disordered residues. The two DNA strands form a base-intercalated duplex, and the four duplexes are assembled according to the crystallographic 222 symmetry to form an octaplex. In the central region, the eight strands are associated by I-motif of double A-quartets. Furthermore, eight hydrated-magnesium cations link the four duplexes to support the octaplex formation. Based on these structural features, a proposal that folding of d(GAAA)n, found in the non-coding region of genomes, into an octaplex can induce slippage during replication to facilitate length polymorphism is presented.
Crystal structure of a DEAD box protein from the hyperthermophile Methanococcus jannaschii

PubMed Central

Story, Randall M.; Li, Hong; Abelson, John N.

2001-01-01

We have determined the structure of a DEAD box putative RNA helicase from the hyperthermophile Methanococcus jannaschii. Like other helicases, the protein contains two α/β domains, each with a recA-like topology. Unlike other helicases, the protein exists as a dimer in the crystal. Through an interaction that resembles the dimer interface of insulin, the amino-terminal domain's 7-strand β-sheet is extended to 14 strands across the two molecules. Motifs conserved in the DEAD box family cluster in the cleft between domains, and many of their functions can be deduced by mutational data and by comparison with other helicase structures. Several lines of evidence suggest that motif III Ser-Ala-Thr may be involved in binding RNA. PMID:11171974
Creation of hybrid nanorods from sequences of natural trimeric fibrous proteins using the fibritin trimerization motif.

PubMed

Papanikolopoulou, Katerina; van Raaij, Mark J; Mitraki, Anna

2008-01-01

Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, beta-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple beta-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.
Role of sequence encoded κB DNA geometry in gene regulation by Dorsal

PubMed Central

Mrinal, Nirotpal; Tomar, Archana; Nagaraju, Javaregowda

2011-01-01

Many proteins of the Rel family can act as both transcriptional activators and repressors. However, mechanism that discerns the ‘activator/repressor’ functions of Rel-proteins such as Dorsal (Drosophila homologue of mammalian NFκB) is not understood. Using genomic, biophysical and biochemical approaches, we demonstrate that the underlying principle of this functional specificity lies in the ‘sequence-encoded structure’ of the κB-DNA. We show that Dorsal-binding motifs exist in distinct activator and repressor conformations. Molecular dynamics of DNA-Dorsal complexes revealed that repressor κB-motifs typically have A-tract and flexible conformation that facilitates interaction with co-repressors. Deformable structure of repressor motifs, is due to changes in the hydrogen bonding in A:T pair in the ‘A-tract’ core. The sixth nucleotide in the nonameric κB-motif, ‘A’ (A6) in the repressor motifs and ‘T’ (T6) in the activator motifs, is critical to confer this functional specificity as A6 → T6 mutation transformed flexible repressor conformation into a rigid activator conformation. These results highlight that ‘sequence encoded κB DNA-geometry’ regulates gene expression by exerting allosteric effect on binding of Rel proteins which in turn regulates interaction with co-regulators. Further, we identified and characterized putative repressor motifs in Dl-target genes, which can potentially aid in functional annotation of Dorsal gene regulatory network. PMID:21890896
Onco-Regulon: an integrated database and software suite for site specific targeting of transcription factors of cancer genes

PubMed Central

Tomar, Navneet; Mishra, Akhilesh; Mrinal, Nirotpal; Jayaram, B.

2016-01-01

Transcription factors (TFs) bind at multiple sites in the genome and regulate expression of many genes. Regulating TF binding in a gene specific manner remains a formidable challenge in drug discovery because the same binding motif may be present at multiple locations in the genome. Here, we present Onco-Regulon (http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm), an integrated database of regulatory motifs of cancer genes clubbed with Unique Sequence-Predictor (USP) a software suite that identifies unique sequences for each of these regulatory DNA motifs at the specified position in the genome. USP works by extending a given DNA motif, in 5′→3′, 3′ →5′ or both directions by adding one nucleotide at each step, and calculates the frequency of each extended motif in the genome by Frequency Counter programme. This step is iterated till the frequency of the extended motif becomes unity in the genome. Thus, for each given motif, we get three possible unique sequences. Closest Sequence Finder program predicts off-target drug binding in the genome. Inclusion of DNA-Protein structural information further makes Onco-Regulon a highly informative repository for gene specific drug development. We believe that Onco-Regulon will help researchers to design drugs which will bind to an exclusive site in the genome with no off-target effects, theoretically. Database URL: http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm PMID:27515825
Creation of Hybrid Nanorods From Sequences of Natural Trimeric Fibrous Proteins Using the Fibritin Trimerization Motif

NASA Astrophysics Data System (ADS)

Papanikolopoulou, Katerina; van Raaij, Mark J.; Mitraki, Anna

Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, β-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple β-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.
Structural polymorphism of a cytosine-rich DNA sequence forming i-motif structure: Exploring pH based biosensors.

PubMed

Ahmed, Saami; Kaushik, Mahima; Chaudhary, Swati; Kukreti, Shrikant

2018-05-01

Sequence recognition and conformational polymorphism enable DNA to emerge out as a substantial tool in fabricating the devices within nano-dimensions. These DNA associated nano devices work on the principle of conformational switches, which can be facilitated by many factors like sequence of DNA/RNA strand, change in pH or temperature, enzyme or ligand interactions etc. Thus, controlling these DNA conformational changes to acquire the desired function is significant for evolving DNA hybridization biosensor, used in genetic screening and molecular diagnosis. For exploring this conformational switching ability of cytosine-rich DNA oligonucleotides as a function of pH for their potential usage as biosensors, this study has been designed. A C-rich stretch of DNA sequence (5'-TCCCCCAATTAATTCCCCCA-3'; SG20c) has been investigated using UV-Thermal denaturation, poly-acrylamide gel electrophoresis and CD spectroscopy. The SG20c sequence is shown to adopt various topologies of i-motif structure at low pH. This pH dependent transition of SG20c from unstructured single strand to unimolecular and bimolecular i-motif structures can further be exploited for its utilization as switching on/off pH-based biosensors. Copyright © 2018. Published by Elsevier B.V.
Are Long-Range Structural Correlations Behind the Aggregration Phenomena of Polyglutamine Diseases?

PubMed Central

Moradi, Mahmoud; Babin, Volodymyr; Roland, Christopher; Sagui, Celeste

2012-01-01

We have characterized the conformational ensembles of polyglutamine peptides of various lengths (ranging from to ), both with and without the presence of a C-terminal polyproline hexapeptide. For this, we used state-of-the-art molecular dynamics simulations combined with a novel statistical analysis to characterize the various properties of the backbone dihedral angles and secondary structural motifs of the glutamine residues. For (i.e., just above the pathological length for Huntington's disease), the equilibrium conformations of the monomer consist primarily of disordered, compact structures with non-negligible -helical and turn content. We also observed a relatively small population of extended structures suitable for forming aggregates including - and -strands, and - and -hairpins. Most importantly, for we find that there exists a long-range correlation (ranging for at least residues) among the backbone dihedral angles of the Q residues. For polyglutamine peptides below the pathological length, the population of the extended strands and hairpins is considerably smaller, and the correlations are short-range (at most residues apart). Adding a C-terminal hexaproline to suppresses both the population of these rare motifs and the long-range correlation of the dihedral angles. We argue that the long-range correlation of the polyglutamine homopeptide, along with the presence of these rare motifs, could be responsible for its aggregation phenomena. PMID:22577357
Lessons from a tarantula: new insights into myosin interacting-heads motif evolution and its implications on disease.

PubMed

Alamo, Lorenzo; Pinto, Antonio; Sulbarán, Guidenn; Mavárez, Jesús; Padrón, Raúl

2017-09-04

Tarantula's leg muscle thick filament is the ideal model for the study of the structure and function of skeletal muscle thick filaments. Its analysis has given rise to a series of structural and functional studies, leading, among other things, to the discovery of the myosin interacting-heads motif (IHM). Further electron microscopy (EM) studies have shown the presence of IHM in frozen-hydrated and negatively stained thick filaments of striated, cardiac, and smooth muscle of bilaterians, most showing the IHM parallel to the filament axis. EM studies on negatively stained heavy meromyosin of different species have shown the presence of IHM on sponges, animals that lack muscle, extending the presence of IHM to metazoans. The IHM evolved about 800 MY ago in the ancestor of Metazoa, and independently with functional differences in the lineage leading to the slime mold Dictyostelium discoideum (Mycetozoa). This motif conveys important functional advantages, such as Ca 2+ regulation and ATP energy-saving mechanisms. Recent interest has focused on human IHM structure in order to understand the structural basis underlying various conditions and situations of scientific and medical interest: the hypertrophic and dilated cardiomyopathies, overfeeding control, aging and hormone deprival muscle weakness, drug design for schistosomiasis control, and conditioning exercise physiology for the training of power athletes.
Charge splitters and charge transport junctions based on guanine quadruplexes

NASA Astrophysics Data System (ADS)

Sha, Ruojie; Xiang, Limin; Liu, Chaoren; Balaeff, Alexander; Zhang, Yuqi; Zhang, Peng; Li, Yueqi; Beratan, David N.; Tao, Nongjian; Seeman, Nadrian C.

2018-04-01

Self-assembling circuit elements, such as current splitters or combiners at the molecular scale, require the design of building blocks with three or more terminals. A promising material for such building blocks is DNA, wherein multiple strands can self-assemble into multi-ended junctions, and nucleobase stacks can transport charge over long distances. However, nucleobase stacking is often disrupted at junction points, hindering electric charge transport between the two terminals of the junction. Here, we show that a guanine-quadruplex (G4) motif can be used as a connector element for a multi-ended DNA junction. By attaching specific terminal groups to the motif, we demonstrate that charges can enter the structure from one terminal at one end of a three-way G4 motif, and can exit from one of two terminals at the other end with minimal carrier transport attenuation. Moreover, we study four-way G4 junction structures by performing theoretical calculations to assist in the design and optimization of these connectors.
A novel Arg H52/Tyr H33 conservative motif in antibodies: A correlation between sequence of antibodies and antigen binding.

PubMed

Petrov, Artem; Arzhanik, Vladimir; Makarov, Gennady; Koliasnikov, Oleg

2016-08-01

Antibodies are the family of proteins, which are responsible for antigen recognition. The computational modeling of interaction between an antigen and an antibody is very important when crystallographic structure is unavailable. In this research, we have discovered the correlation between the amino acid sequence of antibody and its specific binding characteristics on the example of the novel conservative binding motif, which consists of four residues: Arg H52, Tyr H33, Thr H59, and Glu H61. These residues are specifically oriented in the binding site and interact with each other in a specific manner. The residues of the binding motif are involved in interaction strictly with negatively charged groups of antigens, and form a binding complex. Mechanism of interaction and characteristics of the complex were also discovered. The results of this research can be used to increase the accuracy of computational antibody-antigen interaction modeling and for post-modeling quality control of the modeled structures.
In Vitro Selection of pH-Activated DNA Nanostructures.

PubMed

Fong, Faye Yi; Oh, Seung Soo; Hawker, Craig J; Soh, H Tom

2016-12-05

We report the first in vitro selection of DNA nanostructures that switch their conformation when triggered by change in pH. Previously, most pH-active nanostructures were designed using known pH-active motifs, such as the i-motif or the triplex structure. In contrast, we performed de novo selections starting from a random library and generated nanostructures that can sequester and release Mipomersen, a clinically approved antisense DNA drug, in response to pH change. We demonstrate extraordinary pH-selectivity, releasing up to 714-fold more Mipomersen at pH 5.2 compared to pH 7.5. Interestingly, none of our nanostructures showed significant sequence similarity to known pH-sensitive motifs, suggesting that they may operate via novel structure-switching mechanisms. We believe our selection scheme is general and could be adopted for generating DNA nanostructures for many applications including drug delivery, sensors and pH-active surfaces. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

Cooling rate dependence of structural order in Ni 62 Nb 38 metallic glass

DOE PAGES

Wen, Tongqi; Sun, Yang; Ye, Beilin; ...

2018-01-31

In this article, molecular dynamics (MD) simulations are performed to study the structure of Ni 62Nb 38 bulk metallic glass at the atomistic level. Structural analysis based on the cluster alignment method is carried out and a new Ni-centered distorted-icosahedra (DISICO) motif is excavated. We show that the short-range order and medium-range order in the glass are enhanced with lower cooling rate. Almost 50% of the clusters around the Ni atoms in the well-annealed Ni 62Nb 38 glass sample from our MD simulations can be classified as DISICO. It is revealed that the structural distortion with respect to the perfectmore » icosahedra is driven by chemical ordering in the distorted region of the DISICO motif. The relationship between the structure, energy, and dynamics in this glass-forming alloy during the cooling and annealing processes is also established.« less
Cooling rate dependence of structural order in Ni 62 Nb 38 metallic glass

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wen, Tongqi; Sun, Yang; Ye, Beilin

In this article, molecular dynamics (MD) simulations are performed to study the structure of Ni 62Nb 38 bulk metallic glass at the atomistic level. Structural analysis based on the cluster alignment method is carried out and a new Ni-centered distorted-icosahedra (DISICO) motif is excavated. We show that the short-range order and medium-range order in the glass are enhanced with lower cooling rate. Almost 50% of the clusters around the Ni atoms in the well-annealed Ni 62Nb 38 glass sample from our MD simulations can be classified as DISICO. It is revealed that the structural distortion with respect to the perfectmore » icosahedra is driven by chemical ordering in the distorted region of the DISICO motif. The relationship between the structure, energy, and dynamics in this glass-forming alloy during the cooling and annealing processes is also established.« less
Influence of the Ag concentration on the medium-range order in a CuZrAlAg bulk metallic glass

DOE PAGES

Gammer, C.; Escher, B.; Ebner, C.; ...

2017-03-21

Fluctuation electron microscopy of bulk metallic glasses of CuZrAl(Ag) demonstrates that medium-range order is sensitive to minor compositional changes. Furthermore, by analyzing nanodiffraction patterns medium-range order is detected with crystal-like motifs based on the B2 CuZr structure and its distorted structures resembling the martensitic ones. This result thus demonstrates some structural homology between the metallic glass and its high temperature crystalline phase. The amount of medium-range order seems slightly affected with increasing Ag concentration (0, 2, 5 at.%) but the structural motifs of the medium-range ordered clusters become more diverse at the highest Ag concentration. The decrease of dominant clustersmore » is consistent with the destabilization of the B2 structure measured by calorimetry and accounts for the increased glass-forming ability.« less
Influence of the Ag concentration on the medium-range order in a CuZrAlAg bulk metallic glass

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gammer, C.; Escher, B.; Ebner, C.

Fluctuation electron microscopy of bulk metallic glasses of CuZrAl(Ag) demonstrates that medium-range order is sensitive to minor compositional changes. Furthermore, by analyzing nanodiffraction patterns medium-range order is detected with crystal-like motifs based on the B2 CuZr structure and its distorted structures resembling the martensitic ones. This result thus demonstrates some structural homology between the metallic glass and its high temperature crystalline phase. The amount of medium-range order seems slightly affected with increasing Ag concentration (0, 2, 5 at.%) but the structural motifs of the medium-range ordered clusters become more diverse at the highest Ag concentration. The decrease of dominant clustersmore » is consistent with the destabilization of the B2 structure measured by calorimetry and accounts for the increased glass-forming ability.« less
Cooling rate dependence of structural order in Ni62Nb38 metallic glass

NASA Astrophysics Data System (ADS)

Wen, Tongqi; Sun, Yang; Ye, Beilin; Tang, Ling; Yang, Zejin; Ho, Kai-Ming; Wang, Cai-Zhuang; Wang, Nan

2018-01-01

Molecular dynamics (MD) simulations are performed to study the structure of Ni62Nb38 bulk metallic glass at the atomistic level. Structural analysis based on the cluster alignment method is carried out and a new Ni-centered distorted-icosahedra (DISICO) motif is excavated. We show that the short-range order and medium-range order in the glass are enhanced with lower cooling rate. Almost 50% of the clusters around the Ni atoms in the well-annealed Ni62Nb38 glass sample from our MD simulations can be classified as DISICO. It is revealed that the structural distortion with respect to the perfect icosahedra is driven by chemical ordering in the distorted region of the DISICO motif. The relationship between the structure, energy, and dynamics in this glass-forming alloy during the cooling and annealing processes is also established.
Moss and liverwort xyloglucans contain galacturonic acid and are structurally distinct from the xyloglucans synthesized by hornworts and vascular plants.

PubMed

Peña, Maria J; Darvill, Alan G; Eberhard, Stefan; York, William S; O'Neill, Malcolm A

2008-11-01

Xyloglucan is a well-characterized hemicellulosic polysaccharide that is present in the cell walls of all seed-bearing plants. The cell walls of avascular and seedless vascular plants are also believed to contain xyloglucan. However, these xyloglucans have not been structurally characterized. This lack of information is an impediment to understanding changes in xyloglucan structure that occurred during land plant evolution. In this study, xyloglucans were isolated from the walls of avascular (liverworts, mosses, and hornworts) and seedless vascular plants (club and spike mosses and ferns and fern allies). Each xyloglucan was fragmented with a xyloglucan-specific endo-glucanase and the resulting oligosaccharides then structurally characterized using NMR spectroscopy, MALDI-TOF and electrospray mass spectrometry, and glycosyl-linkage and glycosyl residue composition analyses. Our data show that xyloglucan is present in the cell walls of all major divisions of land plants and that these xyloglucans have several common structural motifs. However, these polysaccharides are not identical because specific plant groups synthesize xyloglucans with unique structural motifs. For example, the moss Physcomitrella patens and the liverwort Marchantia polymorpha synthesize XXGGG- and XXGG-type xyloglucans, respectively, with sidechains that contain a beta-D-galactosyluronic acid and a branched xylosyl residue. By contrast, hornworts synthesize XXXG-type xyloglucans that are structurally homologous to the xyloglucans synthesized by many seed-bearing and seedless vascular plants. Our results increase our understanding of the evolution, diversity, and function of structural motifs in land-plant xyloglucans and provide support to the proposal that hornworts are sisters to the vascular plants.
Curated collection of yeast transcription factor DNA binding specificity data reveals novel structural and gene regulatory insights

PubMed Central

2011-01-01

Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060
Single crystal, vibrational and computational studies of Theophylline (a bronchodilator drug) and its chloride salt

NASA Astrophysics Data System (ADS)

Mary Novena, L.; Suresh Kumar, S.; Athimoolam, S.; Saminathan, K.; Sridhar, B.

2017-04-01

The crystal structure of Theophylline (TH) and Theophyillinium chloride monohydrate (THC) and its complete molecular structure analysis on theoretical and experimental methods is reported here. The hydrogen bonding studies were carried out as a special note of the present work. The electron density analyses of the compounds were also analyzed in view of the intermolecular interactions. Moreover, it is an ever first quantum chemical report of this drug (TH) and its chloride salt. In TH crystal, the water molecule connects the Theophylline molecules through Osbnd H⋯N hydrogen bond forming discrete D22(7) motif and dimeric ring R22(10) motif through Nsbnd H⋯O hydrogen bond. In THC, the two classical (Nsbnd H⋯O, Nsbnd H⋯Cl) and one non-classical (Csbnd H⋯O) hydrogen bonds produce two pentameric chain C55 (16) and C55(17) motifs. These two chain motifs are interconnected by Osbnd H⋯O hydrogen bond and cross linked by Nsbnd H⋯Cl and Osbnd H⋯Cl hydrogen bonds to produce octametric ring R88(27) and R88(28) motifs. The solubility test is carried out to enhance the drug solubility and the therapeutic effectiveness of the drug. Experimentally obtained vibrational wavenumbers are compared with the spectra obtained theoretically for both the compound. The strong intensity bands and the shifting of bands due to intermolecular hydrogen bonds are also investigated. The Mulliken atomic charges, HOMO-LUMO and thermodynamic properties are calculated using Density Functional Theory (DFT) and Hartree-Fock Theory (HF) using 6-311++G(d,p) basis set.
Structure-Based Mutational Analysis of the Hepatitis C Virus NS3 Helicase

PubMed Central

Tai, Chun-Ling; Pan, Wen-Ching; Liaw, Shwu-Huey; Yang, Ueng-Cheng; Hwang, Lih-Hwa; Chen, Ding-Shinn

2001-01-01

The carboxyl terminus of the hepatitis C virus (HCV) nonstructural protein 3 (NS3) possesses ATP-dependent RNA helicase activity. Based on the conserved sequence motifs and the crystal structures of the helicase domain, 17 mutants of the HCV NS3 helicase were generated. The ATP hydrolysis, RNA binding, and RNA unwinding activities of the mutant proteins were examined in vitro to determine the functional role of the mutated residues. The data revealed that Lys-210 in the Walker A motif and Asp-290, Glu-291, and His-293 in the Walker B motif were crucial to ATPase activity and that Thr-322 and Thr-324 in motif III and Arg-461 in motif VI significantly influenced ATPase activity. When the pairing between His-293 and Gln-460, referred to as gatekeepers, was replaced with the Asp-293/His-460 pair, which makes the NS3 helicase more like the DEAD helicase subgroup, ATPase activity was not restored. It thus indicated that the whole microenvironment surrounding the gatekeepers, rather than the residues per se, was important to the enzymatic activities. Arg-461 and Trp-501 are important residues for RNA binding, while Val-432 may only play a coadjutant role. The data demonstrated that RNA helicase activity was possibly abolished by the loss of ATPase activity or by reduced RNA binding activity. Nevertheless, a low threshold level of ATPase activity was found sufficient for helicase activity. Results in this study provide a valuable reference for efforts under way to develop anti-HCV therapeutic drugs targeting NS3. PMID:11483774
Extreme primary and secondary protein structure variability in the chimeric male-transmitted cytochrome c oxidase subunit II protein in freshwater mussels: Evidence for an elevated amino acid substitution rate in the face of domain-specific purifying selection

PubMed Central

2008-01-01

Background Freshwater unionoidean bivalves, and species representing two marine bivalve orders (Mytiloida and Veneroida), exhibit a mode of mtDNA inheritance involving distinct maternal (F) and paternal (M) transmission routes concomitant with highly divergent gender-associated mtDNA genomes. Additionally, male unionoidean bivalves have a ~550 bp 3' coding extension to the cox2 gene (Mcox2e), that is apparently absent from all other metazoan taxa. Results Our molecular sequence analyses of MCOX2e indicate that both the primary and secondary structures of the MCOX2e region are evolving much faster than other regions of the F and M COX2-COX1 gene junction. The near N-terminus ~2/3 of the MCOX2e region contains an interspecifically variable number of predicted transmembrane helices (TMH) and interhelical loops (IHL) whereas the C-terminus ~1/3 is relatively conserved and hydrophilic while containing conserved functional motifs. MCOX2e displays an overall pattern of purifying selection that leads to the preservation of TMH/IHL and C-terminus tail sub-regions. However, 14 amino acid positions in the MCOX2e TMH/IHL sub-region might be targeted by diversifying selection, each representing a site where there exists interspecific variation for the constituent amino acids residing in a TMH or IHL. Conclusion Our results indicate that Mcox2e is unique to unionoidean bivalves, likely the result of a single insertion event that took place over 65 MYA and that MCOX2e is functional. The predicted TMH number, length and position variability likely stems from substitution-based processes rather than the typically implicated insertion/deletion events. MCOX2e has relatively high rates of primary and secondary structure evolution, with some amino acid residues potentially subjected to site-specific positive selection, yet an overall pattern of purifying selection leading to the preservation of the TMH/IHL and hydrophilic C-terminus tail subregions. The more conserved C-terminus tail (relative to the TMH/IHL sub-region of MCOX2e) is likely biologically active because it contains functional motifs. The rapid evolution of primary and secondary structure in MCOX2e, combined with the action of both positive and purifying selection, provide supporting evidence for the hypothesis that MCOX2e has a novel reproductive function within unionoidean bivalves. All tolled, our data indicate that unionoidean bivalve MCOX2 is the first reported chimeric animal mtDNA-encoded protein. PMID:18513440
Ciliate telomerase RNA loop IV nucleotides promote hierarchical RNP assembly and holoenzyme stability.

PubMed

Robart, Aaron R; O'Connor, Catherine M; Collins, Kathleen

2010-03-01

Telomerase adds simple-sequence repeats to chromosome 3' ends to compensate for the loss of repeats with each round of genome replication. To accomplish this de novo DNA synthesis, telomerase uses a template within its integral RNA component. In addition to providing the template, the telomerase RNA subunit (TER) also harbors nontemplate motifs that contribute to the specialized telomerase catalytic cycle of reiterative repeat synthesis. Most nontemplate TER motifs function through linkage with the template, but in ciliate and vertebrate telomerases, a stem-loop motif binds telomerase reverse transcriptase (TERT) and reconstitutes full activity of the minimal recombinant TERT+TER RNP, even when physically separated from the template. Here, we resolve the functional requirements for this motif of ciliate TER in physiological RNP context using the Tetrahymena thermophila p65-TER-TERT core RNP reconstituted in vitro and the holoenzyme reconstituted in vivo. Contrary to expectation based on assays of the minimal recombinant RNP, we find that none of a panel of individual loop IV nucleotide substitutions impacts the profile of telomerase product synthesis when reconstituted as physiological core RNP or holoenzyme RNP. However, loop IV nucleotide substitutions do variably reduce assembly of TERT with the p65-TER complex in vitro and reduce the accumulation and stability of telomerase RNP in endogenous holoenzyme context. Our results point to a unifying model of a conformational activation role for this TER motif in the telomerase RNP enzyme.
Euglena gracilis and Trypanosomatids possess common patterns in predicted mitochondrial targeting presequences.

PubMed

Krnáčová, Katarína; Vesteg, Matej; Hampl, Vladimír; Vlček, Čestmír; Horváth, Anton

2012-10-01

Euglena gracilis possessing chloroplasts of secondary green algal origin and parasitic trypanosomatids Trypanosoma brucei, Trypanosoma cruzi and Leishmania major belong to the protist phylum Euglenozoa. Euglenozoa might be among the earliest eukaryotic branches bearing ancestral traits reminiscent of the last eukaryotic common ancestor (LECA) or missing features present in other eukaryotes. LECA most likely possessed mitochondria of endosymbiotic α-proteobacterial origin. In this study, we searched for the presence of homologs of mitochondria-targeted proteins from other organisms in the currently available EST dataset of E. gracilis. The common motifs in predicted N-terminal presequences and corresponding homologs from T. brucei, T. cruzi and L. major (if found) were analyzed. Other trypanosomatid mitochondrial protein precursor (e.g., those involved in RNA editing) were also included in the analysis. Mitochondrial presequences of E. gracilis and these trypanosomatids seem to be highly variable in sequence length (5-118 aa), but apparently share statistically significant similarities. In most cases, the common (M/L)RR motif is present at the N-terminus and it is probably responsible for recognition via import apparatus of mitochondrial outer membrane. Interestingly, this motif is present inside the predicted presequence region in some cases. In most presequences, this motif is followed by a hydrophobic region rich in alanine, leucine, and valine. In conclusion, either RR motif or arginine-rich region within hydrophobic aa-s present at the N-terminus of a preprotein can be sufficient signals for mitochondrial import irrespective of presequence length in Euglenozoa.
Members of the Meloidogyne avirulence protein family contain multiple plant ligand-like motifs.

PubMed

Rutter, William B; Hewezi, Tarek; Maier, Tom R; Mitchum, Melissa G; Davis, Eric L; Hussey, Richard S; Baum, Thomas J

2014-08-01

Sedentary plant-parasitic nematodes engage in complex interactions with their host plants by secreting effector proteins. Some effectors of both root-knot nematodes (Meloidogyne spp.) and cyst nematodes (Heterodera and Globodera spp.) mimic plant ligand proteins. Most prominently, cyst nematodes secrete effectors that mimic plant CLAVATA3/ESR-related (CLE) ligand proteins. However, only cyst nematodes have been shown to secrete such effectors and to utilize CLE ligand mimicry in their interactions with host plants. Here, we document the presence of ligand-like motifs in bona fide root-knot nematode effectors that are most similar to CLE peptides from plants and cyst nematodes. We have identified multiple tandem CLE-like motifs conserved within the previously identified Meloidogyne avirulence protein (MAP) family that are secreted from root-knot nematodes and have been shown to function in planta. By searching all 12 MAP family members from multiple Meloidogyne spp., we identified 43 repetitive CLE-like motifs composing 14 unique variants. At least one CLE-like motif was conserved in each MAP family member. Furthermore, we documented the presence of other conserved sequences that resemble the variable domains described in Heterodera and Globodera CLE effectors. These findings document that root-knot nematodes appear to use CLE ligand mimicry and point toward a common host node targeted by two evolutionarily diverse groups of nematodes. As a consequence, it is likely that CLE signaling pathways are important in other phytonematode pathosystems as well.
Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

PubMed

Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

2013-09-02

In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome-wide collection of reference RNA motif regulons is available in the RegPrecise database (http://regprecise.lbl.gov/).
Mechanisms of Zero-Lag Synchronization in Cortical Motifs

PubMed Central

Gollo, Leonardo L.; Mirasso, Claudio; Sporns, Olaf; Breakspear, Michael

2014-01-01

Zero-lag synchronization between distant cortical areas has been observed in a diversity of experimental data sets and between many different regions of the brain. Several computational mechanisms have been proposed to account for such isochronous synchronization in the presence of long conduction delays: Of these, the phenomenon of “dynamical relaying” – a mechanism that relies on a specific network motif – has proven to be the most robust with respect to parameter mismatch and system noise. Surprisingly, despite a contrary belief in the community, the common driving motif is an unreliable means of establishing zero-lag synchrony. Although dynamical relaying has been validated in empirical and computational studies, the deeper dynamical mechanisms and comparison to dynamics on other motifs is lacking. By systematically comparing synchronization on a variety of small motifs, we establish that the presence of a single reciprocally connected pair – a “resonance pair” – plays a crucial role in disambiguating those motifs that foster zero-lag synchrony in the presence of conduction delays (such as dynamical relaying) from those that do not (such as the common driving triad). Remarkably, minor structural changes to the common driving motif that incorporate a reciprocal pair recover robust zero-lag synchrony. The findings are observed in computational models of spiking neurons, populations of spiking neurons and neural mass models, and arise whether the oscillatory systems are periodic, chaotic, noise-free or driven by stochastic inputs. The influence of the resonance pair is also robust to parameter mismatch and asymmetrical time delays amongst the elements of the motif. We call this manner of facilitating zero-lag synchrony resonance-induced synchronization, outline the conditions for its occurrence, and propose that it may be a general mechanism to promote zero-lag synchrony in the brain. PMID:24763382
Unfolding Kinetics of the Human Telomere i-Motif Under a 10 pN Force Imposed by the α-Hemolysin Nanopore Identify Transient Folded-State Lifetimes at Physiological pH.

PubMed

Ding, Yun; Fleming, Aaron M; He, Lidong; Burrows, Cynthia J

2015-07-22

Cytosine (C)-rich DNA can adopt i-motif folds under acidic conditions, with the human telomere i-motif providing a well-studied example. The dimensions of this i-motif are appropriate for capture in the nanocavity of the α-hemolysin (α-HL) protein pore under an electrophoretic force. Interrogation of the current vs time (i-t) traces when the i-motif interacts with α-HL identified characteristic signals that were pH dependent. These features were evaluated from pH 5.0 to 7.2, a region surrounding the transition pH of the i-motif (6.1). When the i-motif without polynucleotide tails was studied at pH 5.0, the folded structure entered the nanocavity of α-HL from either the top or bottom face to yield characteristic current patterns. Addition of a 5' 25-mer poly-2'-deoxyadensosine tail allowed capture of the i-motif from the unfolded terminus, and this was used to analyze the pH dependency of unfolding. At pH values below the transition point, only folded strands were observed, and when the pH was increased above the transition pH, the number of folded events decreased, while the unfolded events increased. At pH 6.8 and 7.2 4% and 2% of the strands were still folded, respectively. The lifetimes for the folded states at pH 6.8 and 7.2 were 21 and 9 ms, respectively, at 160 mV electrophoretic force. These lifetimes are sufficiently long to affect enzymes operating on DNA. Furthermore, these transient lifetimes are readily obtained using the α-HL nanopore, a feature that is not easily achievable by other methods.
Two alternative ways of start site selection in human norovirus reinitiation of translation.

PubMed

Luttermann, Christine; Meyers, Gregor

2014-04-25

The calicivirus minor capsid protein VP2 is expressed via termination/reinitiation. This process depends on an upstream sequence element denoted termination upstream ribosomal binding site (TURBS). We have shown for feline calicivirus and rabbit hemorrhagic disease virus that the TURBS contains three sequence motifs essential for reinitiation. Motif 1 is conserved among caliciviruses and is complementary to a sequence in the 18 S rRNA leading to the model that hybridization between motif 1 and 18 S rRNA tethers the post-termination ribosome to the mRNA. Motif 2 and motif 2* are proposed to establish a secondary structure positioning the ribosome relative to the start site of the terminal ORF. Here, we analyzed human norovirus (huNV) sequences for the presence and importance of these motifs. The three motifs were identified by sequence analyses in the region upstream of the VP2 start site, and we showed that these motifs are essential for reinitiation of huNV VP2 translation. More detailed analyses revealed that the site of reinitiation is not fixed to a single codon and does not need to be an AUG, even though this codon is clearly preferred. Interestingly, we were able to show that reinitiation can occur at AUG codons downstream of the canonical start/stop site in huNV and feline calicivirus but not in rabbit hemorrhagic disease virus. Although reinitiation at the original start site is independent of the Kozak context, downstream initiation exhibits requirements for start site sequence context known for linear scanning. These analyses on start codon recognition give a more detailed insight into this fascinating mechanism of gene expression.
Comparative Analysis of AGPase Genes and Encoded Proteins in Eight Monocots and Three Dicots with Emphasis on Wheat

PubMed Central

Batra, Ritu; Saripalli, Gautam; Mohan, Amita; Gupta, Saurabh; Gill, Kulvinder S.; Varadwaj, Pritish K.; Balyan, Harindra S.; Gupta, Pushpendra K.

2017-01-01

ADP-glucose pyrophosphorylase (AGPase) is a heterotetrameric enzyme with two large subunits (LS) and two small subunits (SS). It plays a critical role in starch biosynthesis. We are reporting here detailed structure, function and evolution of the genes encoding the LS and the SS among monocots and dicots. “True” orthologs of maize Sh2 (AGPase LS) and Bt2 (AGPase SS) were identified in seven other monocots and three dicots; structure of the enzyme at protein level was also studied. Novel findings of the current study include the following: (i) at the DNA level, the genes controlling the SS are more conserved than those controlling the LS; the variation in both is mainly due to intron number, intron length and intron phase distribution; (ii) at protein level, the SS genes are more conserved relative to those for LS; (iii) “QTCL” motif present in SS showed evolutionary differences in AGPase belonging to wheat 7BS, T. urartu, rice and sorghum, while “LGGG” motif in LS was present in all species except T. urartu and chickpea; SS provides thermostability to AGPase, while LS is involved in regulation of AGPase activity; (iv) heterotetrameric structure of AGPase was predicted and analyzed in real time environment through molecular dynamics simulation for all the species; (v) several cis-acting regulatory elements were identified in the AGPase promoters with their possible role in regulating spatial and temporal expression (endosperm and leaf tissue) and also the expression, in response to abiotic stresses; and (vi) expression analysis revealed downregulation of both subunits under conditions of heat and drought stress. The results of the present study have allowed better understanding of structure and evolution of the genes and the encoded proteins and provided clues for exploitation of variability in these genes for engineering thermostable AGPase. PMID:28174576
Structural and Functional Studies of Fatty Acyl Adenylate Ligases from E. coli and L. pneumophila

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Z.; Swaminathan, S.; Zhou, R.

2011-02-18

Fatty acyl-AMP ligase (FAAL) is a new member of a family of adenylate-forming enzymes that were recently discovered in Mycobacterium tuberculosis. They are similar in sequence to fatty acyl-coenzyme A (CoA) ligases (FACLs). However, while FACLs perform a two-step catalytic reaction, AMP ligation followed by CoA ligation using ATP and CoA as cofactors, FAALs produce only the acyl adenylate and are unable to perform the second step. We report X-ray crystal structures of full-length FAAL from Escherichia coli (EcFAAL) and FAAL from Legionella pneumophila (LpFAAL) bound to acyl adenylate, determined at resolution limits of 3.0 and 1.85 {angstrom}, respectively. Themore » structures share a larger N-terminal domain and a smaller C-terminal domain, which together resemble the previously determined structures of FAAL and FACL proteins. Our two structures occur in quite different conformations. EcFAAL adopts the adenylate-forming conformation typical of FACLs, whereas LpFAAL exhibits a unique intermediate conformation. Both EcFAAL and LpFAAL have insertion motifs that distinguish them from the FACLs. Structures of EcFAAL and LpFAAL reveal detailed interactions between this insertion motif and the interdomain hinge region and with the C-terminal domain. We suggest that the insertion motifs support sufficient interdomain motions to allow substrate binding and product release during acyl adenylate formation, but they preclude CoA binding, thereby preventing CoA ligation.« less
Structural and Functional Studies of Fatty Acyl Adenylate Ligases from E. coli and L. pneumophila

DOE Office of Scientific and Technical Information (OSTI.GOV)

Z Zhang; R Zhou; J Sauder

2011-12-31

Fatty acyl-AMP ligase (FAAL) is a new member of a family of adenylate-forming enzymes that were recently discovered in Mycobacterium tuberculosis. They are similar in sequence to fatty acyl-coenzyme A (CoA) ligases (FACLs). However, while FACLs perform a two-step catalytic reaction, AMP ligation followed by CoA ligation using ATP and CoA as cofactors, FAALs produce only the acyl adenylate and are unable to perform the second step. We report X-ray crystal structures of full-length FAAL from Escherichia coli (EcFAAL) and FAAL from Legionella pneumophila (LpFAAL) bound to acyl adenylate, determined at resolution limits of 3.0 and 1.85 {angstrom}, respectively. Themore » structures share a larger N-terminal domain and a smaller C-terminal domain, which together resemble the previously determined structures of FAAL and FACL proteins. Our two structures occur in quite different conformations. EcFAAL adopts the adenylate-forming conformation typical of FACLs, whereas LpFAAL exhibits a unique intermediate conformation. Both EcFAAL and LpFAAL have insertion motifs that distinguish them from the FACLs. Structures of EcFAAL and LpFAAL reveal detailed interactions between this insertion motif and the interdomain hinge region and with the C-terminal domain. We suggest that the insertion motifs support sufficient interdomain motions to allow substrate binding and product release during acyl adenylate formation, but they preclude CoA binding, thereby preventing CoA ligation.« less

The mechanism of transforming diamond nanowires to carbon nanostructures.

PubMed

Sorkin, Anastassia; Su, Haibin

2014-01-24

The transformation of diamond nanowires (DNWs) with different diameters and geometries upon heating is investigated with density-functional-based tight-binding molecular dynamics. DNWs of {100} and {111} oriented cross-section with projected average line density between 7 and 20 atoms Å(-1) transform into carbon nanotubes (CNTs) under gradual heating up to 3500-4000 K. DNWs with projected average line density larger than 25 atoms Å(-1) transform into double-wall CNTs. The route of transformation into CNTs clearly exhibits three stages, with the intriguing intermediate structural motif of a carbon nanoscroll (CNS). Moreover, the morphology plays an important role in the transformation involving the CNS as one important intermediate motif to form CNTs. When starting with [Formula: see text] oriented DNWs with a square cross-section consisting of two {111} facets facing each other, one interesting structure with 'nano-bookshelf' shape emerges: a number of graphene 'shelves' located inside the CNT, bonding to the CNT walls with sp(3) hybridized atoms. The nano-bookshelf structures exist in a wide range of temperatures up to 3,000 K. The further transformation from nano-bookshelf structures depends on the strength of the joints connecting shelves with CNT walls. Notably, the nano-bookshelf structure can evolve into two end products: one is CNT via the CNS pathway, the other is graphene transformed directly from the nano-bookshelf structure at high temperature. This work sheds light on the microscopic insight of carbon nanostructure formation mechanisms with the featured motifs highlighted in the pathways.
Structure of Rhodococcus equi virulence-associated protein B (VapB) reveals an eight-stranded antiparallel β-barrel consisting of two Greek-key motifs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Geerds, Christina; Wohlmann, Jens; Haas, Albert

The structure of VapB, a member of the Vap protein family that is involved in virulence of the bacterial pathogen R. equi, was determined by SAD phasing and reveals an eight-stranded antiparallel β-barrel similar to avidin, suggestive of a binding function. Made up of two Greek-key motifs, the topology of VapB is unusual or even unique. Members of the virulence-associated protein (Vap) family from the pathogen Rhodococcus equi regulate virulence in an unknown manner. They do not share recognizable sequence homology with any protein of known structure. VapB and VapA are normally associated with isolates from pigs and horses, respectively.more » To contribute to a molecular understanding of Vap function, the crystal structure of a protease-resistant VapB fragment was determined at 1.4 Å resolution. The structure was solved by SAD phasing employing the anomalous signal of one endogenous S atom and two bound Co ions with low occupancy. VapB is an eight-stranded antiparallel β-barrel with a single helix. Structural similarity to avidins suggests a potential binding function. Unlike other eight- or ten-stranded β-barrels found in avidins, bacterial outer membrane proteins, fatty-acid-binding proteins and lysozyme inhibitors, Vaps do not have a next-neighbour arrangement but consist of two Greek-key motifs with strand order 41238567, suggesting an unusual or even unique topology.« less
Solution structure of CEH-37 homeodomain of the nematode Caenorhabditis elegans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Moon, Sunjin; Lee, Yong Woo; Kim, Woo Taek

Highlights: •We have determined solution structures of CEH-37 homedomain. •CEH-37 HD has a compact α-helical structure with HTH DNA binding motif. •Solution structure of CEH-37 HD shares its molecular topology with that of the homeodomain proteins. •Residues in the N-terminal region and HTH motif are important in binding to Caenorhabditis elegans telomeric DNA. •CEH-37 could play an important role in telomere function via DNA binding. -- Abstract: The nematode Caenorhabditis elegans protein CEH-37 belongs to the paired OTD/OTX family of homeobox-containing homeodomain proteins. CEH-37 shares sequence similarity with homeodomain proteins, although it specifically binds to double-stranded C. elegans telomeric DNA,more » which is unusual to homeodomain proteins. Here, we report the solution structure of CEH-37 homeodomain and molecular interaction with double-stranded C. elegans telomeric DNA using nuclear magnetic resonance (NMR) spectroscopy. NMR structure shows that CEH-37 homeodomain is composed of a flexible N-terminal region and three α-helices with a helix-turn-helix (HTH) DNA binding motif. Data from size-exclusion chromatography and fluorescence spectroscopy reveal that CEH-37 homeodomain interacts strongly with double-stranded C. elegans telomeric DNA. NMR titration experiments identified residues responsible for specific binding to nematode double-stranded telomeric DNA. These results suggest that C. elegans homeodomain protein, CEH-37 could play an important role in telomere function via DNA binding.« less
The Cluster Variation Method: A Primer for Neuroscientists.

PubMed

Maren, Alianna J

2016-09-30

Effective Brain-Computer Interfaces (BCIs) require that the time-varying activation patterns of 2-D neural ensembles be modelled. The cluster variation method (CVM) offers a means for the characterization of 2-D local pattern distributions. This paper provides neuroscientists and BCI researchers with a CVM tutorial that will help them to understand how the CVM statistical thermodynamics formulation can model 2-D pattern distributions expressing structural and functional dynamics in the brain. The premise is that local-in-time free energy minimization works alongside neural connectivity adaptation, supporting the development and stabilization of consistent stimulus-specific responsive activation patterns. The equilibrium distribution of local patterns, or configuration variables , is defined in terms of a single interaction enthalpy parameter ( h ) for the case of an equiprobable distribution of bistate (neural/neural ensemble) units. Thus, either one enthalpy parameter (or two, for the case of non-equiprobable distribution) yields equilibrium configuration variable values. Modeling 2-D neural activation distribution patterns with the representational layer of a computational engine, we can thus correlate variational free energy minimization with specific configuration variable distributions. The CVM triplet configuration variables also map well to the notion of a M = 3 functional motif. This paper addresses the special case of an equiprobable unit distribution, for which an analytic solution can be found.
The Cluster Variation Method: A Primer for Neuroscientists

PubMed Central

Maren, Alianna J.

2016-01-01

Effective Brain–Computer Interfaces (BCIs) require that the time-varying activation patterns of 2-D neural ensembles be modelled. The cluster variation method (CVM) offers a means for the characterization of 2-D local pattern distributions. This paper provides neuroscientists and BCI researchers with a CVM tutorial that will help them to understand how the CVM statistical thermodynamics formulation can model 2-D pattern distributions expressing structural and functional dynamics in the brain. The premise is that local-in-time free energy minimization works alongside neural connectivity adaptation, supporting the development and stabilization of consistent stimulus-specific responsive activation patterns. The equilibrium distribution of local patterns, or configuration variables, is defined in terms of a single interaction enthalpy parameter (h) for the case of an equiprobable distribution of bistate (neural/neural ensemble) units. Thus, either one enthalpy parameter (or two, for the case of non-equiprobable distribution) yields equilibrium configuration variable values. Modeling 2-D neural activation distribution patterns with the representational layer of a computational engine, we can thus correlate variational free energy minimization with specific configuration variable distributions. The CVM triplet configuration variables also map well to the notion of a M = 3 functional motif. This paper addresses the special case of an equiprobable unit distribution, for which an analytic solution can be found. PMID:27706022
Development of designed site-directed pseudopeptide-peptido-mimetic immunogens as novel minimal subunit-vaccine candidates for malaria.

PubMed

Lozano, José Manuel; Lesmes, Liliana P; Carreño, Luisa F; Gallego, Gina M; Patarroyo, Manuel Elkin

2010-12-06

Synthetic vaccines constitute the most promising tools for controlling and preventing infectious diseases. When synthetic immunogens are designed from the pathogen native sequences, these are normally poorly immunogenic and do not induce protection, as demonstrated in our research. After attempting many synthetic strategies for improving the immunogenicity properties of these sequences, the approach consisting of identifying high binding motifs present in those, and then performing specific changes on amino-acids belonging to such motifs, has proven to be a workable strategy. In addition, other strategies consisting of chemically introducing non-natural constraints to the backbone topology of the molecule and modifying the α-carbon asymmetry are becoming valuable tools to be considered in this pursuit. Non-natural structural constraints to the peptide backbone can be achieved by introducing peptide bond isosters such as reduced amides, partially retro or retro-inverso modifications or even including urea motifs. The second can be obtained by strategically replacing L-amino-acids with their enantiomeric forms for obtaining both structurally site-directed designed immunogens as potential vaccine candidates and their Ig structural molecular images, both having immuno-therapeutic effects for preventing and controlling malaria.
Structural basis for corepressor assembly by the orphan nuclear receptor TLX

PubMed Central

Zhou, X. Edward; He, Yuanzheng; Searose-Xu, Kelvin; Zhang, Chun-Li; Tsai, Chih-Cheng; Melcher, Karsten

2015-01-01

The orphan nuclear receptor TLX regulates neural stem cell self-renewal in the adult brain and functions primarily as a transcription repressor through recruitment of Atrophin corepressors, which bind to TLX via a conserved peptide motif termed the Atro box. Here we report crystal structures of the human and insect TLX ligand-binding domain in complex with Atro box peptides. In these structures, TLX adopts an autorepressed conformation in which its helix H12 occupies the coactivator-binding groove. Unexpectedly, H12 in this autorepressed conformation forms a novel binding pocket with residues from helix H3 that accommodates a short helix formed by the conserved ALXXLXXY motif of the Atro box. Mutations that weaken the TLX–Atrophin interaction compromise the repressive activity of TLX, demonstrating that this interaction is required for Atrophin to confer repressor activity to TLX. Moreover, the autorepressed conformation is conserved in the repressor class of orphan nuclear receptors, and mutations of corresponding residues in other members of this class of receptors diminish their repressor activities. Together, our results establish the functional conservation of the autorepressed conformation and define a key sequence motif in the Atro box that is essential for TLX-mediated repression. PMID:25691470
Novel Inhibitor Cystine Knot Peptides from Momordica charantia

PubMed Central

Clark, Richard J.; Tang, Jun; Zeng, Guang-Zhi; Franco, Octavio L.; Cantacessi, Cinzia; Craik, David J.; Daly, Norelle L.; Tan, Ning-Hua

2013-01-01

Two new peptides, MCh-1 and MCh-2, along with three known trypsin inhibitors (MCTI-I, MCTI-II and MCTI-III), were isolated from the seeds of the tropical vine Momordica charantia. The sequences of the peptides were determined using mass spectrometry and NMR spectroscopy. Using a strategy involving partial reduction and stepwise alkylation of the peptides, followed by enzymatic digestion and tandem mass spectrometry sequencing, the disulfide connectivity of MCh-1 was elucidated to be CysI-CysIV, CysII-CysV and CysIII-CysVI. The three-dimensional structures of MCh-1 and MCh-2 were determined using NMR spectroscopy and found to contain the inhibitor cystine knot (ICK) motif. The sequences of the novel peptides differ significantly from peptides previously isolated from this plant. Therefore, this study expands the known peptide diversity in M. charantia and the range of sequences that can be accommodated by the ICK motif. Furthermore, we show that a stable two-disulfide intermediate is involved in the oxidative folding of MCh-1. This disulfide intermediate is structurally homologous to the proposed ancestral fold of ICK peptides, and provides a possible pathway for the evolution of this structural motif, which is highly prevalent in nature. PMID:24116036
Controlled Growth of Ceria Nanoarrays on Anatase Titania Powder: A Bottom-up Physical Picture.

PubMed

Kim, Hyun You; Hybertsen, Mark S; Liu, Ping

2017-01-11

The leading edge of catalysis research motivates physical understanding of the growth of nanoscale oxide structures on different supporting oxide materials that are themselves also nanostructured. This research opens up for consideration a diverse range of facets on the support material, versus the single facet typically involved in wide-area growth of thin films. Here, we study the growth of ceria nanoarchitectures on practical anatase titania powders as a showcase inspired by recent experiments. Density functional theory (DFT)-based methods are employed to characterize and rationalize the broad array of low energy nanostructures that emerge. Using a bottom-up approach, we are able to identify and characterize the underlying mechanisms for the facet-dependent growth of various ceria motifs on anatase titania based on formation energy. These motifs include 0D clusters, 1D chains, 2D plates, and 3D nanoparticles. The ceria growth mode and morphology are determined by the interplay of several factors including the role of the common cation valence, the interface template effect for different facets of the anatase support, enhanced ionic binding for more compact ceria motifs, and the local structural flexibility of oxygen ions in bridging the interface between anatase and ceria structures.
A unique PDZ domain and arrestin-like fold interaction reveals mechanistic details of endocytic recycling by SNX27-retromer.

PubMed

Gallon, Matthew; Clairfeuille, Thomas; Steinberg, Florian; Mas, Caroline; Ghai, Rajesh; Sessions, Richard B; Teasdale, Rohan D; Collins, Brett M; Cullen, Peter J

2014-09-02

The sorting nexin 27 (SNX27)-retromer complex is a major regulator of endosome-to-plasma membrane recycling of transmembrane cargos that contain a PSD95, Dlg1, zo-1 (PDZ)-binding motif. Here we describe the core interaction in SNX27-retromer assembly and its functional relevance for cargo sorting. Crystal structures and NMR experiments reveal that an exposed β-hairpin in the SNX27 PDZ domain engages a groove in the arrestin-like structure of the vacuolar protein sorting 26A (VPS26A) retromer subunit. The structure establishes how the SNX27 PDZ domain simultaneously binds PDZ-binding motifs and retromer-associated VPS26. Importantly, VPS26A binding increases the affinity of the SNX27 PDZ domain for PDZ- binding motifs by an order of magnitude, revealing cooperativity in cargo selection. With disruption of SNX27 and retromer function linked to synaptic dysfunction and neurodegenerative disease, our work provides the first step, to our knowledge, in the molecular description of this important sorting complex, and more broadly describes a unique interaction between a PDZ domain and an arrestin-like fold.
Structural basis for corepressor assembly by the orphan nuclear receptor TLX

DOE PAGES

Zhi, Xiaoyong; Zhou, X. Edward; He, Yuanzheng; ...

2015-02-15

The orphan nuclear receptor TLX regulates neural stem cell self-renewal in the adult brain and functions primarily as a transcription repressor through recruitment of Atrophin corepressors, which bind to TLX via a conserved peptide motif termed the Atro box. Here we report crystal structures of the human and insect TLX ligand-binding domain in complex with Atro box peptides. In these structures, TLX adopts an autorepressed conformation in which its helix H12 occupies the coactivator-binding groove. Unexpectedly, H12 in this autorepressed conformation forms a novel binding pocket with residues from helix H3 that accommodates a short helix formed by the conservedmore » ALXXLXXY motif of the Atro box. Mutations that weaken the TLX–Atrophin interaction compromise the repressive activity of TLX, demonstrating that this interaction is required for Atrophin to confer repressor activity to TLX. Moreover, the autorepressed conformation is conserved in the repressor class of orphan nuclear receptors, and mutations of corresponding residues in other members of this class of receptors diminish their repressor activities. Together, our results establish the functional conservation of the autorepressed conformation and define a key sequence motif in the Atro box that is essential for TLX-mediated repression.« less
Identification and characterization of a calmodulin binding domain in the plasma membrane Ca2+-ATPase from Trypanosoma equiperdum.

PubMed

Ramírez-Iglesias, José Rubén; Pérez-Gordones, María Carolina; Del Castillo, Jesús Rafael; Mijares, Alfredo; Benaim, Gustavo; Mendoza, Marta

2018-05-09

The plasma membrane Ca 2+ -ATPase (PMCA) from trypanosomatids lacks a classical calmodulin (CaM) binding domain, although CaM stimulated activities have been detected by biochemical assays. Recently we proposed that the Trypanosoma equiperdum CaM-sensitive PMCA (TePMCA) contains a potential 1-18 CaM-binding motif at the C-terminal region of the pump. In the present study, we evaluated the potential CaM-binding motifs using CaM from Trypanosoma cruzi and either the recombinant full length TePMCA C-terminal sequence (P14) or synthetic peptides comprising different regions of the C-terminal domain. We demonstrated that P14 and a synthetic peptide corresponding to residues 1037-1062 (which contains the predicted 1-18 binding motif) competed efficiently for binding to TcCaM, exhibiting similar IC 50 s of 200 nM. A stable complex of this peptide and TcCaM was formed in the presence of Ca 2+ , as determined by native-polyacrylamide gel electrophoresis. A predicted structure obtained by molecular docking showed an interaction of the 1-18 binding motif with the Ca 2+ /CaM complex. Moreover, when the peptide was incubated with CaM and Ca 2+ , a blue shift in the tryptophan fluorescence spectrum (from 350 to 329 nm) was observed. Substitutions at W 1039 and F 1056 , strongly decreased both CaM-peptide interaction and the complex assembly. Our results demonstrated the presence of a functional 1-18 motif at the TePMCA C-terminal domain. Furthermore, on the basis of spectrofluorometric assays and the resulting structure modeled by docking we propose that the L 1042 and W 1060 residues might also participate as anchors to form a 1-4-18-22 motif. Copyright © 2018 Elsevier B.V. All rights reserved.
[Structure and evolution of the eukaryotic FANCJ-like proteins].

PubMed

Wuhe, Jike; Zefeng, Wu; Sanhong, Fan; Xuguang, Xi

2015-02-01

The FANCJ-like protein family is a class of ATP-dependent helicases that can catalytically unwind duplex DNA along the 5'-3' direction. It is involved in the processes of DNA damage repair, homologous recombination and G-quadruplex DNA unwinding, and plays a critical role in maintaining genome integrity. In this study, we systemically analyzed FNACJ-like proteins from 47 eukaryotic species and discussed their sequences diversity, origin and evolution, motif organization patterns and spatial structure differences. Four members of FNACJ-like proteins, including XPD, CHL1, RTEL1 and FANCJ, were found in eukaryotes, but some of them were seriously deficient in most fungi and some insects. For example, the Zygomycota fungi lost RTEL1, Basidiomycota and Ascomycota fungi lost RTEL1 and FANCJ, and Diptera insect lost FANCJ. FANCJ-like proteins contain canonical motor domains HD1 and HD2, and the HD1 domain further integrates with three unique domains Fe-S, Arch and Extra-D. Fe-S and Arch domains are relatively conservative in all members of the family, but the Extra-D domain is lost in XPD and differs from one another in rest members. There are 7, 10 and 2 specific motifs found from the three unique domains respectively, while 5 and 12 specific motifs are found from HD1 and HD2 domains except the conserved motifs reported previously. By analyzing the arrangement pattern of these specific motifs, we found that RTEL1 and FANCJ are more closer and share two specific motifs Vb2 and Vc in HD2 domain, which are likely related with their G-quadruplex DNA unwinding activity. The evidence of evolution showed that FACNJ-like proteins were originated from a helicase, which has a HD1 domain inserted by extra Fe-S domain and Arch domain. By three continuous gene duplication events and followed specialization, eukaryotes finally possessed the current four members of FANCJ-like proteins.
Three 3D hybrid networks based on octamolybdates and different Cu{sup I}/Cu{sup II}-bis(triazole) motifs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Chun-Jing; Pang, Hai-Jun; Tang, Qun

2010-12-15

Three 3D compounds based on octamolybdate clusters and various Cu{sup I}/Cu{sup II}-bis(triazole) motifs, [Cu{sup I}{sub 2}btb][{beta}-Mo{sub 8}O{sub 26}]{sub 0.5} (1), [Cu{sup I}{sub 2}btpe][{beta}-Mo{sub 8}O{sub 26}]{sub 0.5} (2), and [Cu{sup II}(btpe){sub 2}][{beta}-Mo{sub 8}O{sub 26}]{sub 0.5} (3) [btb=1,4-bis(1,2,4-triazol-1-yl)butane, btpe=1,5-bis(1,2,4-triazol-1-yl)pentane], were isolated via tuning flexible ligand spacer length and metal coordination preferences. In 1, the copper(I)-btb motif is a one-dimensional (1D) chain which is further linked by hexadentate {beta}-[Mo{sub 8}O{sub 26}]{sup 4-} clusters via coordinating to Cu{sup I} cations giving a 3D structure. In 2, the copper(I)-btpe motif exhibits a 'stairs'-like [Cu{sup I}{sub 2}btpe]{sup 2+} sheet, and the tetradentate {beta}-[Mo{sub 8}O{sub 26}]{sup 4-}more » clusters interact with two neighboring [Cu{sup I}{sub 2}btpe]{sup 2+} sheets constructing a 3D framework. In 3, the copper(II)-btpe motif possesses a novel (2D{yields}3D) interdigitated structure, which is further connected by the tetradentate {beta}-[Mo{sub 8}O{sub 26}]{sup 4-} clusters forming a 3D framework. The thermal stability and luminescent properties of 1-3 are investigated in the solid state. -- Graphical abstract: Three 3D compounds based on {beta}-[Mo{sub 8}O{sub 26}]{sup 4-} clusters with different Cu{sup I}/Cu{sup II}-bis(triazole) motifs were synthesized by regularly tuning flexible ligand spacer length and metal coordination preferences. Display Omitted« less
Crystal structure of heterodimeric hexaprenyl diphosphate synthase from Micrococcus luteus B-P 26 reveals that the small subunit is directly involved in the product chain length regulation.

PubMed

Sasaki, Daisuke; Fujihashi, Masahiro; Okuyama, Naomi; Kobayashi, Yukiko; Noike, Motoyoshi; Koyama, Tanetoshi; Miki, Kunio

2011-02-04

Hexaprenyl diphosphate synthase from Micrococcus luteus B-P 26 (Ml-HexPPs) is a heterooligomeric type trans-prenyltransferase catalyzing consecutive head-to-tail condensations of three molecules of isopentenyl diphosphates (C(5)) on a farnesyl diphosphate (FPP; C(15)) to form an (all-E) hexaprenyl diphosphate (HexPP; C(30)). Ml-HexPPs is known to function as a heterodimer of two different subunits, small and large subunits called HexA and HexB, respectively. Compared with homooligomeric trans-prenyltransferases, the molecular mechanism of heterooligomeric trans-prenyltransferases is not yet clearly understood, particularly with respect to the role of the small subunits lacking the catalytic motifs conserved in most known trans-prenyltransferases. We have determined the crystal structure of Ml-HexPPs both in the substrate-free form and in complex with 7,11-dimethyl-2,6,10-dodecatrien-1-yl diphosphate ammonium salt (3-DesMe-FPP), an analog of FPP. The structure of HexB is composed of mostly antiparallel α-helices joined by connecting loops. Two aspartate-rich motifs (designated the first and second aspartate-rich motifs) and the other characteristic motifs in HexB are located around the diphosphate part of 3-DesMe-FPP. Despite the very low amino acid sequence identity and the distinct polypeptide chain lengths between HexA and HexB, the structure of HexA is quite similar to that of HexB. The aliphatic tail of 3-DesMe-FPP is accommodated in a large hydrophobic cleft starting from HexB and penetrating to the inside of HexA. These structural features suggest that HexB catalyzes the condensation reactions and that HexA is directly involved in the product chain length control in cooperation with HexB.
Investigating diversity and possible functions of G-quadruplexes in regulatory regions of maize genes

USDA-ARS?s Scientific Manuscript database

G4-quadruplexes are reversible DNA structures that likely function in gene regulation, but exactly how they work is not known. G4 DNA can be predicted from sequence motifs such as the pattern G-G-G-N(1,7)-G-G-G-N(1,7)-G-G-G-N(1,7)-G-G-G-N(1,7). In the maize genome, G4 motifs were found to occupy ...
β-hairpin-mediated nucleation of polyglutamine amyloid formation

PubMed Central

Kar, Karunakar; Hoop, Cody L.; Drombosky, Kenneth W.; Baker, Matthew A.; Kodali, Ravindra; Arduini, Irene; van der Wel, Patrick C. A.; Horne, W. Seth; Wetzel, Ronald

2013-01-01

The conformational preferences of polyglutamine (polyQ) sequences are of major interest because of their central importance in the expanded CAG repeat diseases that include Huntington’s disease (HD). Here we explore the response of various biophysical parameters to the introduction of β-hairpin motifs within polyQ sequences. These motifs (trpzip, disulfide, D-Pro-Gly, Coulombic attraction, L-Pro-Gly) enhance formation rates and stabilities of amyloid fibrils with degrees of effectiveness well-correlated with their known abilities to enhance β-hairpin formation in other peptides. These changes led to decreases in the critical nucleus for amyloid formation from a value of n* = 4 for a simple, unbroken Q23 sequence to approximate unitary n* values for similar length polyQs containing β-hairpin motifs. At the same time, the morphologies, secondary structures, and bioactivities of the resulting fibrils were essentially unchanged from simple polyQ aggregates. In particular, the signature pattern of SSNMR 13C Gln resonances that appears to be unique to polyQ amyloid is replicated exactly in fibrils from a β-hairpin polyQ. Importantly, while β-hairpin motifs do produce enhancements in the equilibrium constant for nucleation in aggregation reactions, these Kn* values remain quite low (~ 10−10) and there is no evidence for significant embellishment of β-structure within the monomer ensemble. The results indicate an important role for β-turns in the nucleation mechanism and structure of polyQ amyloid and have implications for the nature of the toxic species in expanded CAG repeat diseases. PMID:23353826
La-related protein 1 (LARP1) binds the mRNA cap, blocking eIF4F assembly on TOP mRNAs.

PubMed

Lahr, Roni M; Fonseca, Bruno D; Ciotti, Gabrielle E; Al-Ashtal, Hiba A; Jia, Jian-Jun; Niklaus, Marius R; Blagden, Sarah P; Alain, Tommy; Berman, Andrea J

2017-04-07

The 5'terminal oligopyrimidine (5'TOP) motif is a cis -regulatory RNA element located immediately downstream of the 7-methylguanosine [m 7 G] cap of TOP mRNAs, which encode ribosomal proteins and translation factors. In eukaryotes, this motif coordinates the synchronous and stoichiometric expression of the protein components of the translation machinery. La-related protein 1 (LARP1) binds TOP mRNAs, regulating their stability and translation. We present crystal structures of the human LARP1 DM15 region in complex with a 5'TOP motif, a cap analog (m 7 GTP), and a capped cytidine (m 7 GpppC), resolved to 2.6, 1.8 and 1.7 Å, respectively. Our binding, competition, and immunoprecipitation data corroborate and elaborate on the mechanism of 5'TOP motif binding by LARP1. We show that LARP1 directly binds the cap and adjacent 5'TOP motif of TOP mRNAs, effectively impeding access of eIF4E to the cap and preventing eIF4F assembly. Thus, LARP1 is a specialized TOP mRNA cap-binding protein that controls ribosome biogenesis.
Local Higher-Order Graph Clustering

PubMed Central

Yin, Hao; Benson, Austin R.; Leskovec, Jure; Gleich, David F.

2018-01-01

Local graph clustering methods aim to find a cluster of nodes by exploring a small region of the graph. These methods are attractive because they enable targeted clustering around a given seed node and are faster than traditional global graph clustering methods because their runtime does not depend on the size of the input graph. However, current local graph partitioning methods are not designed to account for the higher-order structures crucial to the network, nor can they effectively handle directed networks. Here we introduce a new class of local graph clustering methods that address these issues by incorporating higher-order network information captured by small subgraphs, also called network motifs. We develop the Motif-based Approximate Personalized PageRank (MAPPR) algorithm that finds clusters containing a seed node with minimal motif conductance, a generalization of the conductance metric for network motifs. We generalize existing theory to prove the fast running time (independent of the size of the graph) and obtain theoretical guarantees on the cluster quality (in terms of motif conductance). We also develop a theory of node neighborhoods for finding sets that have small motif conductance, and apply these results to the case of finding good seed nodes to use as input to the MAPPR algorithm. Experimental validation on community detection tasks in both synthetic and real-world networks, shows that our new framework MAPPR outperforms the current edge-based personalized PageRank methodology. PMID:29770258
Binding properties of SUMO-interacting motifs (SIMs) in yeast.

PubMed

Jardin, Christophe; Horn, Anselm H C; Sticht, Heinrich

2015-03-01

Small ubiquitin-like modifier (SUMO) conjugation and interaction play an essential role in many cellular processes. A large number of yeast proteins is known to interact non-covalently with SUMO via short SUMO-interacting motifs (SIMs), but the structural details of this interaction are yet poorly characterized. In the present work, sequence analysis of a large dataset of 148 yeast SIMs revealed the existence of a hydrophobic core binding motif and a preference for acidic residues either within or adjacent to the core motif. Thus the sequence properties of yeast SIMs are highly similar to those described for human. Molecular dynamics simulations were performed to investigate the binding preferences for four representative SIM peptides differing in the number and distribution of acidic residues. Furthermore, the relative stability of two previously observed alternative binding orientations (parallel, antiparallel) was assessed. For all SIMs investigated, the antiparallel binding mode remained stable in the simulations and the SIMs were tightly bound via their hydrophobic core residues supplemented by polar interactions of the acidic residues. In contrary, the stability of the parallel binding mode is more dependent on the sequence features of the SIM motif like the number and position of acidic residues or the presence of additional adjacent interaction motifs. This information should be helpful to enhance the prediction of SIMs and their binding properties in different organisms to facilitate the reconstruction of the SUMO interactome.

Classic Nuclear Localization Signals and a Novel Nuclear Localization Motif Are Required for Nuclear Transport of Porcine Parvovirus Capsid Proteins

PubMed Central

Boisvert, Maude; Bouchard-Lévesque, Véronique; Fernandes, Sandra

2014-01-01

ABSTRACT Nuclear targeting of capsid proteins (VPs) is important for genome delivery and precedes assembly in the replication cycle of porcine parvovirus (PPV). Clusters of basic amino acids, corresponding to potential nuclear localization signals (NLS), were found only in the unique region of VP1 (VP1up, for VP1 unique part). Of the five identified basic regions (BR), three were important for nuclear localization of VP1up: BR1 was a classic Pat7 NLS, and the combination of BR4 and BR5 was a classic bipartite NLS. These NLS were essential for viral replication. VP2, the major capsid protein, lacked these NLS and contained no region with more than two basic amino acids in proximity. However, three regions of basic clusters were identified in the folded protein, assembled into a trimeric structure. Mutagenesis experiments showed that only one of these three regions was involved in VP2 transport to the nucleus. This structural NLS, termed the nuclear localization motif (NLM), is located inside the assembled capsid and thus can be used to transport trimers to the nucleus in late steps of infection but not for virions in initial infection steps. The two NLS of VP1up are located in the N-terminal part of the protein, externalized from the capsid during endosomal transit, exposing them for nuclear targeting during early steps of infection. Globally, the determinants of nuclear transport of structural proteins of PPV were different from those of closely related parvoviruses. IMPORTANCE Most DNA viruses use the nucleus for their replication cycle. Thus, structural proteins need to be targeted to this cellular compartment at two distinct steps of the infection: in early steps to deliver viral genomes to the nucleus and in late steps to assemble new viruses. Nuclear targeting of proteins depends on the recognition of a stretch of basic amino acids by cellular transport proteins. This study reports the identification of two classic nuclear localization signals in the minor capsid protein (VP1) of porcine parvovirus. The major protein (VP2) nuclear localization was shown to depend on a complex structural motif. This motif can be used as a strategy by the virus to avoid transport of incorrectly folded proteins and to selectively import assembled trimers into the nucleus. Structural nuclear localization motifs can also be important for nuclear proteins without a classic basic amino acid stretch, including multimeric cellular proteins. PMID:25078698
Junctions between i-motif tetramers in supramolecular structures

PubMed Central

Guittet, Eric; Renciuk, Daniel; Leroy, Jean-Louis

2012-01-01

The symmetry of i-motif tetramers gives to cytidine-rich oligonucleotides the capacity to associate into supramolecular structures (sms). In order to determine how the tetramers are linked together in such structures, we have measured by gel filtration chromatography and NMR the formation and dissociation kinetics of sms built by oligonucleotides containing two short C stretches separated by a non-cytidine-base. We show that a stretch of only two cytidines either at the 3′- or 5′-end is long enough to link the tetramers into sms. The analysis of the properties of sms formed by oligonucleotides differing by the length of the oligo-C stretches, the sequence orientation and the nature of the non-C base provides a model of the junction connecting the tetramers in sms. PMID:22362739
Efficient sequential and parallel algorithms for finding edit distance based motifs.

PubMed

Pal, Soumitra; Xiao, Peng; Rajasekaran, Sanguthevar

2016-08-18

Motif search is an important step in extracting meaningful patterns from biological data. The general problem of motif search is intractable and there is a pressing need to develop efficient, exact and approximation algorithms to solve this problem. In this paper, we present several novel, exact, sequential and parallel algorithms for solving the (l,d) Edit-distance-based Motif Search (EMS) problem: given two integers l,d and n biological strings, find all strings of length l that appear in each input string with atmost d errors of types substitution, insertion and deletion. One popular technique to solve the problem is to explore for each input string the set of all possible l-mers that belong to the d-neighborhood of any substring of the input string and output those which are common for all input strings. We introduce a novel and provably efficient neighborhood exploration technique. We show that it is enough to consider the candidates in neighborhood which are at a distance exactly d. We compactly represent these candidate motifs using wildcard characters and efficiently explore them with very few repetitions. Our sequential algorithm uses a trie based data structure to efficiently store and sort the candidate motifs. Our parallel algorithm in a multi-core shared memory setting uses arrays for storing and a novel modification of radix-sort for sorting the candidate motifs. The algorithms for EMS are customarily evaluated on several challenging instances such as (8,1), (12,2), (16,3), (20,4), and so on. The best previously known algorithm, EMS1, is sequential and in estimated 3 days solves up to instance (16,3). Our sequential algorithms are more than 20 times faster on (16,3). On other hard instances such as (9,2), (11,3), (13,4), our algorithms are much faster. Our parallel algorithm has more than 600 % scaling performance while using 16 threads. Our algorithms have pushed up the state-of-the-art of EMS solvers and we believe that the techniques introduced in this paper are also applicable to other motif search problems such as Planted Motif Search (PMS) and Simple Motif Search (SMS).
A Dbf4p BRCA1 C-Terminal-Like Domain Required for the Response to Replication Fork Arrest in Budding Yeast

PubMed Central

Gabrielse, Carrie; Miller, Charles T.; McConnell, Kristopher H.; DeWard, Aaron; Fox, Catherine A.; Weinreich, Michael

2006-01-01

Dbf4p is an essential regulatory subunit of the Cdc7p kinase required for the initiation of DNA replication. Cdc7p and Dbf4p orthologs have also been shown to function in the response to DNA damage. A previous Dbf4p multiple sequence alignment identified a conserved ∼40-residue N-terminal region with similarity to the BRCA1 C-terminal (BRCT) motif called “motif N.” BRCT motifs encode ∼100-amino-acid domains involved in the DNA damage response. We have identified an expanded and conserved ∼100-residue N-terminal region of Dbf4p that includes motif N but is capable of encoding a single BRCT-like domain. Dbf4p orthologs diverge from the BRCT motif at the C terminus but may encode a similar secondary structure in this region. We have therefore called this the BRCT and DBF4 similarity (BRDF) motif. The principal role of this Dbf4p motif was in the response to replication fork (RF) arrest; however, it was not required for cell cycle progression, activation of Cdc7p kinase activity, or interaction with the origin recognition complex (ORC) postulated to recruit Cdc7p–Dbf4p to origins. Rad53p likely directly phosphorylated Dbf4p in response to RF arrest and Dbf4p was required for Rad53p abundance. Rad53p and Dbf4p therefore cooperated to coordinate a robust cellular response to RF arrest. PMID:16547092
Structural characterization of viral epitopes recognized by broadly cross-reactive antibodies.

PubMed

Lee, Peter S; Wilson, Ian A

2015-01-01

Influenza hemagglutinin (HA) is the major surface glycoprotein on influenza viruses and mediates viral attachment and subsequent fusion with host cells. The HA is the major target of the immune response, but due to its high level of variability, as evidenced by substantial antigenic diversity, it had been historically considered to elicit only a narrow, strain-specific antibody response. However, a recent explosion in the discovery of broadly neutralizing antibodies (bnAbs) to influenza virus has identified two major supersites of vulnerability on the HA through structural characterization of HA-antibody complexes. These commonly targeted epitopes are involved with receptor binding as well as the fusion machinery and, hence, are functionally conserved and less prone to mutation. These bnAbs can neutralize viruses by blocking infection or the spread of infection by preventing progeny release. Structural analyses of these bnAbs show they exhibit striking similarities and trends in recognition of the HA and use recurring recognition motifs, despite substantial differences in their germline genes. This information can be utilized in design of novel therapeutics as well as in immunogens for improved vaccines with greater breadth and efficacy.
Stabilised DNA secondary structures with increasing transcription localise hypermutable bases for somatic hypermutation in IGHV3-23.

PubMed

Duvvuri, Bhargavi; Duvvuri, Venkata R; Wu, Jianhong; Wu, Gillian E

2012-07-01

Somatic hypermutation (SHM) mediated by activation-induced cytidine deaminase (AID) is a transcription-coupled mechanism most responsible for generating high affinity antibodies. An issue remaining enigmatic in SHM is how AID is preferentially targeted during transcription to hypermutable bases in its substrates (WRC motifs) on both DNA strands. AID targets only single stranded DNA. By modelling the dynamical behaviour of IGHV3-23 DNA, a commonly used human variable gene segment, we observed that hypermutable bases on the non-transcribed strand are paired whereas those on transcribed strand are mostly unpaired. Hypermutable bases (both paired and unpaired) are made accessible to AID in stabilised secondary structures formed with increasing transcription levels. This observation provides a rationale for the hypermutable bases on both the strands of DNA being targeted to a similar extent despite having differences in unpairedness. We propose that increasing transcription and RNAP II stalling resulting in the formation and stabilisation of stem-loop structures with AID hotspots in negatively supercoiled region can localise the hypermutable bases of both strands of DNA, to AID-mediated SHM.
Identification of interfaces involved in weak interactions with application to F-actin-aldolase rafts.

PubMed

Hu, Guiqing; Taylor, Dianne W; Liu, Jun; Taylor, Kenneth A

2018-03-01

Macromolecular interactions occur with widely varying affinities. Strong interactions form well defined interfaces but weak interactions are more dynamic and variable. Weak interactions can collectively lead to large structures such as microvilli via cooperativity and are often the precursors of much stronger interactions, e.g. the initial actin-myosin interaction during muscle contraction. Electron tomography combined with subvolume alignment and classification is an ideal method for the study of weak interactions because a 3-D image is obtained for the individual interactions, which subsequently are characterized collectively. Here we describe a method to characterize heterogeneous F-actin-aldolase interactions in 2-D rafts using electron tomography. By forming separate averages of the two constituents and fitting an atomic structure to each average, together with the alignment information which relates the raw motif to the average, an atomic model of each crosslink is determined and a frequency map of contact residues is computed. The approach should be applicable to any large structure composed of constituents that interact weakly and heterogeneously. Copyright © 2017 Elsevier Inc. All rights reserved.
Matching 4.7-Å XRD Spacing in Amelogenin Nanoribbons and Enamel Matrix

PubMed Central

Sanii, B.; Martinez-Avila, O.; Simpliciano, C.; Zuckermann, R.N.; Habelitz, S.

2014-01-01

The recent discovery of conditions that induce nanoribbon structures of amelogenin protein in vitro raises questions about their role in enamel formation. Nanoribbons of recombinant human full-length amelogenin (rH174) are about 17 nm wide and self-align into parallel bundles; thus, they could act as templates for crystallization of nanofibrous apatite comprising dental enamel. Here we analyzed the secondary structures of nanoribbon amelogenin by x-ray diffraction (XRD) and Fourier transform infrared spectroscopy (FTIR) and tested if the structural motif matches previous data on the organic matrix of enamel. XRD analysis showed that a peak corresponding to 4.7 Å is present in nanoribbons of amelogenin. In addition, FTIR analysis showed that amelogenin in the form of nanoribbons was comprised of β-sheets by up to 75%, while amelogenin nanospheres had predominantly random-coil structure. The observation of a 4.7-Å XRD spacing confirms the presence of β-sheets and illustrates structural parallels between the in vitro assemblies and structural motifs in developing enamel. PMID:25048248
Three-Dimensional RNA Structure of the Major HIV-1 Packaging Signal Region

PubMed Central

Stephenson, James D.; Li, Haitao; Kenyon, Julia C.; Symmons, Martyn; Klenerman, Dave; Lever, Andrew M.L.

2013-01-01

Summary HIV-1 genomic RNA has a noncoding 5′ region containing sequential conserved structural motifs that control many parts of the life cycle. Very limited data exist on their three-dimensional (3D) conformation and, hence, how they work structurally. To assemble a working model, we experimentally reassessed secondary structure elements of a 240-nt region and used single-molecule distances, derived from fluorescence resonance energy transfer, between defined locations in these elements as restraints to drive folding of the secondary structure into a 3D model with an estimated resolution below 10 Å. The folded 3D model satisfying the data is consensual with short nuclear-magnetic-resonance-solved regions and reveals previously unpredicted motifs, offering insight into earlier functional assays. It is a 3D representation of this entire region, with implications for RNA dimerization and protein binding during regulatory steps. The structural information of this highly conserved region of the virus has the potential to reveal promising therapeutic targets. PMID:23685210
Structural Determination of Functional Domains in Early B-cell Factor (EBF) Family of Transcription Factors Reveals Similarities to Rel DNA-binding Proteins and a Novel Dimerization Motif*

PubMed Central

Siponen, Marina I.; Wisniewska, Magdalena; Lehtiö, Lari; Johansson, Ida; Svensson, Linda; Raszewski, Grzegorz; Nilsson, Lennart; Sigvardsson, Mikael; Berglund, Helena

2010-01-01

The early B-cell factor (EBF) transcription factors are central regulators of development in several organs and tissues. This protein family shows low sequence similarity to other protein families, which is why structural information for the functional domains of these proteins is crucial to understand their biochemical features. We have used a modular approach to determine the crystal structures of the structured domains in the EBF family. The DNA binding domain reveals a striking resemblance to the DNA binding domains of the Rel homology superfamily of transcription factors but contains a unique zinc binding structure, termed zinc knuckle. Further the EBF proteins contain an IPT/TIG domain and an atypical helix-loop-helix domain with a novel type of dimerization motif. The data presented here provide insights into unique structural features of the EBF proteins and open possibilities for detailed molecular investigations of this important transcription factor family. PMID:20592035
Interlocked DNA nanostructures controlled by a reversible logic circuit.

PubMed

Li, Tao; Lohmann, Finn; Famulok, Michael

2014-09-17

DNA nanostructures constitute attractive devices for logic computing and nanomechanics. An emerging interest is to integrate these two fields and devise intelligent DNA nanorobots. Here we report a reversible logic circuit built on the programmable assembly of a double-stranded (ds) DNA [3]pseudocatenane that serves as a rigid scaffold to position two separate branched-out head-motifs, a bimolecular i-motif and a G-quadruplex. The G-quadruplex only forms when preceded by the assembly of the i-motif. The formation of the latter, in turn, requires acidic pH and unhindered mobility of the head-motif containing dsDNA nanorings with respect to the central ring to which they are interlocked, triggered by release oligodeoxynucleotides. We employ these features to convert the structural changes into Boolean operations with fluorescence labelling. The nanostructure behaves as a reversible logic circuit consisting of tandem YES and AND gates. Such reversible logic circuits integrated into functional nanodevices may guide future intelligent DNA nanorobots to manipulate cascade reactions in biological systems.
Interlocked DNA nanostructures controlled by a reversible logic circuit

PubMed Central

Li, Tao; Lohmann, Finn; Famulok, Michael

2014-01-01

DNA nanostructures constitute attractive devices for logic computing and nanomechanics. An emerging interest is to integrate these two fields and devise intelligent DNA nanorobots. Here we report a reversible logic circuit built on the programmable assembly of a double-stranded (ds) DNA [3]pseudocatenane that serves as a rigid scaffold to position two separate branched-out head-motifs, a bimolecular i-motif and a G-quadruplex. The G-quadruplex only forms when preceded by the assembly of the i-motif. The formation of the latter, in turn, requires acidic pH and unhindered mobility of the head-motif containing dsDNA nanorings with respect to the central ring to which they are interlocked, triggered by release oligodeoxynucleotides. We employ these features to convert the structural changes into Boolean operations with fluorescence labelling. The nanostructure behaves as a reversible logic circuit consisting of tandem YES and AND gates. Such reversible logic circuits integrated into functional nanodevices may guide future intelligent DNA nanorobots to manipulate cascade reactions in biological systems. PMID:25229207
Selective integrin endocytosis is driven by interactions between the integrin α-chain and AP2

PubMed Central

De Franceschi, Nicola; Arjonen, Antti; Elkhatib, Nadia; Denessiouk, Konstantin; Wrobel, Antoni G; Wilson, Thomas A; Pouwels, Jeroen; Montagnac, Guillaume; Owen, David J; Ivaska, Johanna

2016-01-01

Integrins are heterodimeric cell-surface adhesion molecules comprising one of possible 18 α-chains and one of possible 8 β-chains. They control a range of cell functions in a matrix- and ligand-specific manner. Integrins can be internalised by clathrin-mediated endocytosis (CME) through β subunit-based motifs found in all integrin heterodimers. However, whether specific integrin heterodimers can be selectively endocytosed was unknown. Here, we found that a subset of α subunits contain an evolutionarily conserved and functional YxxΦ motif directing integrins to selective internalisation by the most abundant endocytic clathrin adaptor, AP2. We determined the structure of the human integrin α4-tail motif in complex with AP2 C-µ2 subunit and confirmed the interaction by isothermal titration calorimetry. Mutagenesis of the motif impaired selective heterodimer endocytosis and attenuated integrin-mediated cell migration. We propose that integrins evolved to enable selective integrin-receptor turnover in response to changing matrix conditions. PMID:26779610
Selective integrin endocytosis is driven by interactions between the integrin α-chain and AP2.

PubMed

De Franceschi, Nicola; Arjonen, Antti; Elkhatib, Nadia; Denessiouk, Konstantin; Wrobel, Antoni G; Wilson, Thomas A; Pouwels, Jeroen; Montagnac, Guillaume; Owen, David J; Ivaska, Johanna

2016-02-01

Integrins are heterodimeric cell-surface adhesion molecules comprising one of 18 possible α-chains and one of eight possible β-chains. They control a range of cell functions in a matrix- and ligand-specific manner. Integrins can be internalized by clathrin-mediated endocytosis (CME) through β subunit-based motifs found in all integrin heterodimers. However, whether specific integrin heterodimers can be selectively endocytosed was unknown. Here, we found that a subset of α subunits contain an evolutionarily conserved and functional YxxΦ motif directing integrins to selective internalization by the most abundant endocytic clathrin adaptor, AP2. We determined the structure of the human integrin α4-tail motif in complex with the AP2 C-μ2 subunit and confirmed the interaction by isothermal titration calorimetry. Mutagenesis of the motif impaired selective heterodimer endocytosis and attenuated integrin-mediated cell migration. We propose that integrins evolved to enable selective integrin-receptor turnover in response to changing matrix conditions.
TRIM67 Protein Negatively Regulates Ras Activity through Degradation of 80K-H and Induces Neuritogenesis*

PubMed Central

Yaguchi, Hiroaki; Okumura, Fumihiko; Takahashi, Hidehisa; Kano, Takahiro; Kameda, Hiroyuki; Uchigashima, Motokazu; Tanaka, Shinya; Watanabe, Masahiko; Sasaki, Hidenao; Hatakeyama, Shigetsugu

2012-01-01

Tripartite motif (TRIM)-containing proteins, which are defined by the presence of a common domain structure composed of a RING finger, one or two B-box motifs and a coiled-coil motif, are involved in many biological processes including innate immunity, viral infection, carcinogenesis, and development. Here we show that TRIM67, which has a TRIM motif, an FN3 domain and a SPRY domain, is highly expressed in the cerebellum and that TRIM67 interacts with PRG-1 and 80K-H, which is involved in the Ras-mediated signaling pathway. Ectopic expression of TRIM67 results in degradation of endogenous 80K-H and attenuation of cell proliferation and enhances neuritogenesis in the neuroblastoma cell line N1E-115. Furthermore, morphological and biological changes caused by knockdown of 80K-H are similar to those observed by overexpression of TRIM67. These findings suggest that TRIM67 regulates Ras signaling via degradation of 80K-H, leading to neural differentiation including neuritogenesis. PMID:22337885
TRIM67 protein negatively regulates Ras activity through degradation of 80K-H and induces neuritogenesis.

PubMed

Yaguchi, Hiroaki; Okumura, Fumihiko; Takahashi, Hidehisa; Kano, Takahiro; Kameda, Hiroyuki; Uchigashima, Motokazu; Tanaka, Shinya; Watanabe, Masahiko; Sasaki, Hidenao; Hatakeyama, Shigetsugu

2012-04-06

Tripartite motif (TRIM)-containing proteins, which are defined by the presence of a common domain structure composed of a RING finger, one or two B-box motifs and a coiled-coil motif, are involved in many biological processes including innate immunity, viral infection, carcinogenesis, and development. Here we show that TRIM67, which has a TRIM motif, an FN3 domain and a SPRY domain, is highly expressed in the cerebellum and that TRIM67 interacts with PRG-1 and 80K-H, which is involved in the Ras-mediated signaling pathway. Ectopic expression of TRIM67 results in degradation of endogenous 80K-H and attenuation of cell proliferation and enhances neuritogenesis in the neuroblastoma cell line N1E-115. Furthermore, morphological and biological changes caused by knockdown of 80K-H are similar to those observed by overexpression of TRIM67. These findings suggest that TRIM67 regulates Ras signaling via degradation of 80K-H, leading to neural differentiation including neuritogenesis.
Process-based network decomposition reveals backbone motif structure

PubMed Central

Wang, Guanyu; Du, Chenghang; Chen, Hao; Simha, Rahul; Rong, Yongwu; Xiao, Yi; Zeng, Chen

2010-01-01

A central challenge in systems biology today is to understand the network of interactions among biomolecules and, especially, the organizing principles underlying such networks. Recent analysis of known networks has identified small motifs that occur ubiquitously, suggesting that larger networks might be constructed in the manner of electronic circuits by assembling groups of these smaller modules. Using a unique process-based approach to analyzing such networks, we show for two cell-cycle networks that each of these networks contains a giant backbone motif spanning all the network nodes that provides the main functional response. The backbone is in fact the smallest network capable of providing the desired functionality. Furthermore, the remaining edges in the network form smaller motifs whose role is to confer stability properties rather than provide function. The process-based approach used in the above analysis has additional benefits: It is scalable, analytic (resulting in a single analyzable expression that describes the behavior), and computationally efficient (all possible minimal networks for a biological process can be identified and enumerated). PMID:20498084
The bioactive acidic serine- and aspartate-rich motif peptide.

PubMed

Minamizaki, Tomoko; Yoshiko, Yuji

2015-01-01

The organic component of the bone matrix comprises 40% dry weight of bone. The organic component is mostly composed of type I collagen and small amounts of non-collagenous proteins (NCPs) (10-15% of the total bone protein content). The small integrin-binding ligand N-linked glycoprotein (SIBLING) family, a NCP, is considered to play a key role in bone mineralization. SIBLING family of proteins share common structural features and includes the arginine-glycine-aspartic acid (RGD) motif and acidic serine- and aspartic acid-rich motif (ASARM). Clinical manifestations of gene mutations and/or genetically modified mice indicate that SIBLINGs play diverse roles in bone and extraskeletal tissues. ASARM peptides might not be primary responsible for the functional diversity of SIBLINGs, but this motif is suggested to be a key domain of SIBLINGs. However, the exact function of ASARM peptides is poorly understood. In this article, we discuss the considerable progress made in understanding the role of ASARM as a bioactive peptide.
Self-Assembled Coacervates of Chitosan and an Insect Cuticle Protein Containing a Rebers-Riddiford Motif.

PubMed

Vaclaw, M Coleman; Sprouse, Patricia A; Dittmer, Neal T; Ghazvini, Saba; Middaugh, C Russell; Kanost, Michael R; Gehrke, Stevin H; Dhar, Prajnaparamita

2018-05-09

The interactions among biomacromolecules within insect cuticle may offer new motifs for biomimetic material design. CPR27 is an abundant protein in the rigid cuticle of the elytron from Tribolium castaneum. CPR27 contains the Rebers-Riddiford (RR) motif, which is hypothesized to bind chitin. In this study, active magnetic microrheology coupled with microscopy and protein particle analysis techniques were used to correlate alterations in the viscosity of chitosan solutions with changes in solution microstructure. Addition of CPR27 to chitosan solutions led to a 3-fold drop in viscosity. This change was accompanied by the presence of micrometer-sized coacervate particles in solution. Coacervate formation had a strong dependence on chitosan concentration. Analysis showed the existence of a critical CPR27 concentration beyond which a significant increase in particle count was observed. These effects were not observed when a non-RR cuticular protein, CP30, was tested, providing evidence of a structure-function relationship related to the RR motif.
Involvement of a putative substrate binding site in the biogenesis and assembly of phosphatidylserine decarboxylase 1 from Saccharomyces cerevisiae.

PubMed

Di Bartolomeo, Francesca; Doan, Kim Nguyen; Athenstaedt, Karin; Becker, Thomas; Daum, Günther

2017-07-01

In the yeast Saccharomyces cerevisiae, the mitochondrial phosphatidylserine decarboxylase 1 (Psd1p) produces the largest amount of cellular phosphatidylethanolamine (PE). Psd1p is synthesized as a larger precursor on cytosolic ribosomes and then imported into mitochondria in a three-step processing event leading to the formation of an α-subunit and a β-subunit. The α-subunit harbors a highly conserved motif, which was proposed to be involved in phosphatidylserine (PS) binding. Here, we present a molecular analysis of this consensus motif for the function of Psd1p by using Psd1p variants bearing either deletions or point mutations in this region. Our data show that mutations in this motif affect processing and stability of Psd1p, and consequently the enzyme's activity. Thus, we conclude that this consensus motif is essential for structural integrity and processing of Psd1p. Copyright © 2017 Elsevier B.V. All rights reserved.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.