sequence dependent structure: Topics by Science.gov

Sample records for sequence dependent structure

Improved Model for Predicting the Free Energy Contribution of Dinucleotide Bulges to RNA Duplex Stability.

PubMed

Tomcho, Jeremy C; Tillman, Magdalena R; Znosko, Brent M

2015-09-01

Predicting the secondary structure of RNA is an intermediate in predicting RNA three-dimensional structure. Commonly, determining RNA secondary structure from sequence uses free energy minimization and nearest neighbor parameters. Current algorithms utilize a sequence-independent model to predict free energy contributions of dinucleotide bulges. To determine if a sequence-dependent model would be more accurate, short RNA duplexes containing dinucleotide bulges with different sequences and nearest neighbor combinations were optically melted to derive thermodynamic parameters. These data suggested energy contributions of dinucleotide bulges were sequence-dependent, and a sequence-dependent model was derived. This model assigns free energy penalties based on the identity of nucleotides in the bulge (3.06 kcal/mol for two purines, 2.93 kcal/mol for two pyrimidines, 2.71 kcal/mol for 5'-purine-pyrimidine-3', and 2.41 kcal/mol for 5'-pyrimidine-purine-3'). The predictive model also includes a 0.45 kcal/mol penalty for an A-U pair adjacent to the bulge and a -0.28 kcal/mol bonus for a G-U pair adjacent to the bulge. The new sequence-dependent model results in predicted values within, on average, 0.17 kcal/mol of experimental values, a significant improvement over the sequence-independent model. This model and new experimental values can be incorporated into algorithms that predict RNA stability and secondary structure from sequence.
Processing multiple non-adjacent dependencies: evidence from sequence learning

PubMed Central

de Vries, Meinou H.; Petersson, Karl Magnus; Geukes, Sebastian; Zwitserlood, Pienie; Christiansen, Morten H.

2012-01-01

Processing non-adjacent dependencies is considered to be one of the hallmarks of human language. Assuming that sequence-learning tasks provide a useful way to tap natural-language-processing mechanisms, we cross-modally combined serial reaction time and artificial-grammar learning paradigms to investigate the processing of multiple nested (A1A2A3B3B2B1) and crossed dependencies (A1A2A3B1B2B3), containing either three or two dependencies. Both reaction times and prediction errors highlighted problems with processing the middle dependency in nested structures (A1A2A3B3_B1), reminiscent of the ‘missing-verb effect’ observed in English and French, but not with crossed structures (A1A2A3B1_B3). Prior linguistic experience did not play a major role: native speakers of German and Dutch—which permit nested and crossed dependencies, respectively—showed a similar pattern of results for sequences with three dependencies. As for sequences with two dependencies, reaction times and prediction errors were similar for both nested and crossed dependencies. The results suggest that constraints on the processing of multiple non-adjacent dependencies are determined by the specific ordering of the non-adjacent dependencies (i.e. nested or crossed), as well as the number of non-adjacent dependencies to be resolved (i.e. two or three). Furthermore, these constraints may not be specific to language but instead derive from limitations on structured sequence learning. PMID:22688641
Domain-specific learning of grammatical structure in musical and phonological sequences.

PubMed

Bly, Benjamin Martin; Carrión, Ricardo E; Rasch, Björn

2009-01-01

Artificial grammar learning depends on acquisition of abstract structural representations rather than domain-specific representational constraints, or so many studies tell us. Using an artificial grammar task, we compared learning performance in two stimulus domains in which respondents have differing tacit prior knowledge. We found that despite grammatically identical sequence structures, learning was better for harmonically related chord sequences than for letter name sequences or harmonically unrelated chord sequences. We also found transfer effects within the musical and letter name tasks, but not across the domains. We conclude that knowledge acquired in implicit learning depends not only on abstract features of structured stimuli, but that the learning of regularities is in some respects domain-specific and strongly linked to particular features of the stimulus domain.
The Thiamine-Pyrophosphate-Motif

NASA Technical Reports Server (NTRS)

Ciszak, Ewa; Dominiak, Paulina

2004-01-01

Thiamin pyrophosphate (TPP), a derivative of vitamin B1, is a cofactor for enzymes performing catalysis in pathways of energy production including the well known decarboxylation of a-keto acid dehydrogenases followed by transketolation. TPP-dependent enzymes constitute a structurally and functionally diverse group exhibiting multimeric subunit organization, multiple domains and two chemically equivalent catalytic centers. Annotation of functional TPP-dependcnt enzymes, therefore, has not been trivial due to low sequence similarity related to this complex organization. Our approach to analysis of structures of known TPP-dependent enzymes reveals for the first time features common to this group, which we have termed the TPP-motif. The TPP-motif consists of specific spatial arrangements of structural elements and their specific contacts to provide for a flip-flop, or alternate site, enzymatic mechanism of action. Analysis of structural elements entrained in the flip-flop action displayed by TPP-dependent enzymes reveals a novel definition of the common amino acid sequences. These sequences allow for annotation of TPP-dependent enzymes, thus advancing functional proteomics. Further details of three-dimensional structures of TPP-dependent enzymes will be discussed.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor L.; Brow, Mary Ann D.; Dahlberg, James E.

2007-12-11

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow; Mary Ann D.; Dahlberg, James E.

2010-11-09

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2000-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Nucleic acid detection assays

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann; Dahlberg, James E.

2005-04-05

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Elucidation of Peptide-Directed Palladium Surface Structure for Biologically Tunable Nanocatalysts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bedford, Nicholas M.; Ramezani-Dakhel, Hadi; Slocik, Joseph M.

Peptide-enabled synthesis of inorganic nanostructures represents an avenue to access catalytic materials with tunable and optimized properties. This is achieved via peptide complexity and programmability that is missing in traditional ligands for catalytic nanomaterials. Unfortunately, there is limited information available to correlate peptide sequence to particle structure and catalytic activity to date. As such, the application of peptide-enabled nanocatalysts remains limited to trial and error approaches. In this paper, a hybrid experimental and computational approach is introduced to systematically elucidate biomolecule-dependent structure/function relationships for peptide-capped Pd nanocatalysts. Synchrotron X-ray techniques were used to uncover substantial particle surface structural disorder, whichmore » was dependent upon the amino acid sequence of the peptide capping ligand. Nanocatalyst configurations were then determined directly from experimental data using reverse Monte Carlo methods and further refined using molecular dynamics simulation, obtaining thermodynamically stable peptide-Pd nanoparticle configurations. Sequence-dependent catalytic property differences for C-C coupling and olefin hydrogenation were then eluddated by identification of the catalytic active sites at the atomic level and quantitative prediction of relative reaction rates. This hybrid methodology provides a clear route to determine peptide-dependent structure/function relationships, enabling the generation of guidelines for catalyst design through rational tailoring of peptide sequences« less
Elucidation of peptide-directed palladium surface structure for biologically tunable nanocatalysts.

PubMed

Bedford, Nicholas M; Ramezani-Dakhel, Hadi; Slocik, Joseph M; Briggs, Beverly D; Ren, Yang; Frenkel, Anatoly I; Petkov, Valeri; Heinz, Hendrik; Naik, Rajesh R; Knecht, Marc R

2015-05-26

Peptide-enabled synthesis of inorganic nanostructures represents an avenue to access catalytic materials with tunable and optimized properties. This is achieved via peptide complexity and programmability that is missing in traditional ligands for catalytic nanomaterials. Unfortunately, there is limited information available to correlate peptide sequence to particle structure and catalytic activity to date. As such, the application of peptide-enabled nanocatalysts remains limited to trial and error approaches. In this paper, a hybrid experimental and computational approach is introduced to systematically elucidate biomolecule-dependent structure/function relationships for peptide-capped Pd nanocatalysts. Synchrotron X-ray techniques were used to uncover substantial particle surface structural disorder, which was dependent upon the amino acid sequence of the peptide capping ligand. Nanocatalyst configurations were then determined directly from experimental data using reverse Monte Carlo methods and further refined using molecular dynamics simulation, obtaining thermodynamically stable peptide-Pd nanoparticle configurations. Sequence-dependent catalytic property differences for C-C coupling and olefin hydrogenation were then elucidated by identification of the catalytic active sites at the atomic level and quantitative prediction of relative reaction rates. This hybrid methodology provides a clear route to determine peptide-dependent structure/function relationships, enabling the generation of guidelines for catalyst design through rational tailoring of peptide sequences.
Sequence-dependent effects in drug-DNA interaction: the crystal structure of Hoechst 33258 bound to the d(CGCAAATTTGCG)2 duplex.

PubMed Central

Spink, N; Brown, D G; Skelly, J V; Neidle, S

1994-01-01

The bis-benzimidazole drug Hoechst 33258 has been co-crystallized with the dodecanucleotide sequence d(CGCAAATTTGCG)2. The structure has been solved by molecular replacement and refined to an R factor of 18.5% for 2125 reflections collected on a Xentronics area detector. The drug is bound in the minor groove, at the five base-pair site 5'-ATTTG and is in a unique orientation. This is displaced by one base pair in the 5' direction compared to previously-determined structures of this drug with the sequence d(CGCGAATTCGCG)2. Reasons for this difference in behaviour are discussed in terms of several sequence-dependent structural features of the DNA, with particular reference to differences in propeller twist and minor-groove width. Images PMID:7515488
[Influence of "prehistory" of sequential movements of the right and the left hand on reproduction: coding of positions, movements and sequence structure].

PubMed

Bobrova, E V; Liakhovetskiĭ, V A; Borshchevskaia, E R

2011-01-01

The dependence of errors during reproduction of a sequence of hand movements without visual feedback on the previous right- and left-hand performance ("prehistory") and on positions in space of sequence elements (random or ordered by the explicit rule) was analyzed. It was shown that the preceding information about the ordered positions of the sequence elements was used during right-hand movements, whereas left-hand movements were performed with involvement of the information about the random sequence. The data testify to a central mechanism of the analysis of spatial structure of sequence elements. This mechanism activates movement coding specific for the left hemisphere (vector coding) in case of an ordered sequence structure and positional coding specific for the right hemisphere in case of a random sequence structure.
StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase.

PubMed

Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L

2011-06-02

Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.
Evolutionarily conserved regions and hydrophobic contacts at the superfamily level: The case of the fold-type I, pyridoxal-5′-phosphate-dependent enzymes

PubMed Central

Paiardini, Alessandro; Bossa, Francesco; Pascarella, Stefano

2004-01-01

The wealth of biological information provided by structural and genomic projects opens new prospects of understanding life and evolution at the molecular level. In this work, it is shown how computational approaches can be exploited to pinpoint protein structural features that remain invariant upon long evolutionary periods in the fold-type I, PLP-dependent enzymes. A nonredundant set of 23 superposed crystallographic structures belonging to this superfamily was built. Members of this family typically display high-structural conservation despite low-sequence identity. For each structure, a multiple-sequence alignment of orthologous sequences was obtained, and the 23 alignments were merged using the structural information to obtain a comprehensive multiple alignment of 921 sequences of fold-type I enzymes. The structurally conserved regions (SCRs), the evolutionarily conserved residues, and the conserved hydrophobic contacts (CHCs) were extracted from this data set, using both sequence and structural information. The results of this study identified a structural pattern of hydrophobic contacts shared by all of the superfamily members of fold-type I enzymes and involved in native interactions. This profile highlights the presence of a nucleus for this fold, in which residues participating in the most conserved native interactions exhibit preferential evolutionary conservation, that correlates significantly (r = 0.70) with the extent of mean hydrophobic contact value of their apolar fraction. PMID:15498941
Sequence periodicity in nucleosomal DNA and intrinsic curvature.

PubMed

Nair, T Murlidharan

2010-05-17

Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy the more rigid the DNA helix, thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understand their sequence-dependent structures. Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequence from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. The higher order structures associated with nucleosomes of C.elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results points to the possibility of context dependent curvature of varying degrees to be associated with nucleosomal DNA.
Detection of nucleic acid sequences by invader-directed cleavage

DOEpatents

Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.
Sequence-Dependent Structure/Function Relationships of Catalytic Peptide-Enabled Gold Nanoparticles Generated under Ambient Synthetic Conditions.

PubMed

Bedford, Nicholas M; Hughes, Zak E; Tang, Zhenghua; Li, Yue; Briggs, Beverly D; Ren, Yang; Swihart, Mark T; Petkov, Valeri G; Naik, Rajesh R; Knecht, Marc R; Walsh, Tiffany R

2016-01-20

Peptide-enabled nanoparticle (NP) synthesis routes can create and/or assemble functional nanomaterials under environmentally friendly conditions, with properties dictated by complex interactions at the biotic/abiotic interface. Manipulation of this interface through sequence modification can provide the capability for material properties to be tailored to create enhanced materials for energy, catalysis, and sensing applications. Fully realizing the potential of these materials requires a comprehensive understanding of sequence-dependent structure/function relationships that is presently lacking. In this work, the atomic-scale structures of a series of peptide-capped Au NPs are determined using a combination of atomic pair distribution function analysis of high-energy X-ray diffraction data and advanced molecular dynamics (MD) simulations. The Au NPs produced with different peptide sequences exhibit varying degrees of catalytic activity for the exemplar reaction 4-nitrophenol reduction. The experimentally derived atomic-scale NP configurations reveal sequence-dependent differences in structural order at the NP surface. Replica exchange with solute-tempering MD simulations are then used to predict the morphology of the peptide overlayer on these Au NPs and identify factors determining the structure/catalytic properties relationship. We show that the amount of exposed Au surface, the underlying surface structural disorder, and the interaction strength of the peptide with the Au surface all influence catalytic performance. A simplified computational prediction of catalytic performance is developed that can potentially serve as a screening tool for future studies. Our approach provides a platform for broadening the analysis of catalytic peptide-enabled metallic NP systems, potentially allowing for the development of rational design rules for property enhancement.
StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zemla, A; Lang, D; Kostova, T

2010-11-29

Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitatemore » the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that shared structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position.« less

Sequencing of Dust Filter Production Process Using Design Structure Matrix (DSM)

NASA Astrophysics Data System (ADS)

Sari, R. M.; Matondang, A. R.; Syahputri, K.; Anizar; Siregar, I.; Rizkya, I.; Ursula, C.

2018-01-01

Metal casting company produces machinery spare part for manufactures. One of the product produced is dust filter. Most of palm oil mill used this product. Since it is used in most of palm oil mill, company often have problems to address this product. One of problem is the disordered of production process. It carried out by the job sequencing. The important job that should be solved first, least implement, while less important job and could be completed later, implemented first. Design Structure Matrix (DSM) used to analyse and determine priorities in the production process. DSM analysis is sort of production process through dependency sequencing. The result of dependency sequences shows the sequence process according to the inter-process linkage considering before and after activities. Finally, it demonstrates their activities to the coupled activities for metal smelting, refining, grinding, cutting container castings, metal expenditure of molds, metal casting, coating processes, and manufacture of molds of sand.
Sequence periodicity in nucleosomal DNA and intrinsic curvature

PubMed Central

2010-01-01

Background Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy the more rigid the DNA helix, thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understand their sequence-dependent structures. Results Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequence from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. Conclusions The higher order structures associated with nucleosomes of C.elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results points to the possibility of context dependent curvature of varying degrees to be associated with nucleosomal DNA. PMID:20487515
Sequence-Dependent Structure/Function Relationships of Catalytic Peptide-Enabled Gold Nanoparticles Generated under Ambient Synthetic Conditions

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bedford, Nicholas M.; Hughes, Zak E.; Tang, Zhenghua

Peptide-enabled nanoparticle (NP) synthesis routes can create and/or assemble functional nanomaterials under environmentally friendly conditions, with properties dictated by complex interactions at the biotic/abiotic interface. Manipulation of this interface through sequence modification can provide the capability for material properties to be tailored to create enhanced materials for energy, catalysis, and sensing applications. Fully realizing the potential of these materials requires a comprehensive understanding of sequence-dependent structure/function relationships that is presently lacking. In this work, the atomic-scale structures of a series of peptide-capped Au NPs are determined using a combination of atomic pair distribution function analysis of high-energy X-ray diffraction datamore » and advanced molecular dynamics (MD) simulations. The Au NPs produced with different peptide sequences exhibit varying degrees of catalytic activity for the exemplar reaction 4-nitrophenol reduction. The experimentally derived atomic-scale NP configurations reveal sequence-dependent differences in structural order at the NP surface. Replica exchange with solute-tempering MD simulations are then used to predict the morphology of the peptide overlayer on these Au NPs and identify factors determining the structure/catalytic properties relationship. We show that the amount of exposed Au surface, the underlying surface structural disorder, and the interaction strength of the peptide with the Au surface all influence catalytic performance. A simplified computational prediction of catalytic performance is developed that can potentially serve as a screening tool for future studies. Our approach provides a platform for broadening the analysis of catalytic peptide-enabled metallic NP systems, potentially allowing for the development of rational design rules for property enhancement.« less
NMR studies on the structure and dynamics of lac operator DNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, S.C.

Nuclear Magnetic Resonance spectroscopy was used to elucidate the relationships between structure, dynamics and function of the gene regulatory sequence corresponding to the lactose operon operator of Escherichia coli. The length of the DNA fragments examined varied from 13 to 36 base pair, containing all or part of the operator sequence. These DNA fragments are either derived genetically or synthesized chemically. Resonances of the imino protons were assigned by one dimensional inter-base pair nuclear Overhauser enhancement (NOE) measurements. Imino proton exchange rates were measured by saturation recovery methods. Results from the kinetic measurements show an interesting dynamic heterogeneity with amore » maximum opening rate centered about a GTG/CAC sequence which correlates with the biological function of the operator DNA. This particular three base pair sequence occurs frequently and often symmetrically in prokaryotic nd eukaryotic DNA sites where one anticipates specific protein interaction for gene regulation. The observed sequence dependent imino proton exchange rate may be a reflection of variation of the local structure of regulatory DNA. The results also indicate that the observed imino proton exchange rates are length dependent.« less
Sequence-similar, structure-dissimilar protein pairs in the PDB.

PubMed

Kosloff, Mickey; Kolodny, Rachel

2008-05-01

It is often assumed that in the Protein Data Bank (PDB), two proteins with similar sequences will also have similar structures. Accordingly, it has proved useful to develop subsets of the PDB from which "redundant" structures have been removed, based on a sequence-based criterion for similarity. Similarly, when predicting protein structure using homology modeling, if a template structure for modeling a target sequence is selected by sequence alone, this implicitly assumes that all sequence-similar templates are equivalent. Here, we show that this assumption is often not correct and that standard approaches to create subsets of the PDB can lead to the loss of structurally and functionally important information. We have carried out sequence-based structural superpositions and geometry-based structural alignments of a large number of protein pairs to determine the extent to which sequence similarity ensures structural similarity. We find many examples where two proteins that are similar in sequence have structures that differ significantly from one another. The source of the structural differences usually has a functional basis. The number of such proteins pairs that are identified and the magnitude of the dissimilarity depend on the approach that is used to calculate the differences; in particular sequence-based structure superpositioning will identify a larger number of structurally dissimilar pairs than geometry-based structural alignments. When two sequences can be aligned in a statistically meaningful way, sequence-based structural superpositioning provides a meaningful measure of structural differences. This approach and geometry-based structure alignments reveal somewhat different information and one or the other might be preferable in a given application. Our results suggest that in some cases, notably homology modeling, the common use of nonredundant datasets, culled from the PDB based on sequence, may mask important structural and functional information. We have established a data base of sequence-similar, structurally dissimilar protein pairs that will help address this problem (http://luna.bioc.columbia.edu/rachel/seqsimstrdiff.htm).
Complex structural behavior of oligopurine-oligopyrimidine sequence cloned within the supercoiled plasmid.

PubMed Central

Parniewski, P; Galazka, G; Wilk, A; Klysik, J

1989-01-01

Synthetic sequence GATCC(AG)7ATCG(AT)4CG(AG)7 was cloned into plasmid and its structural behavior under the influence of supercoiling was analysed by chemical modification at variety of experimental conditions. It was found that this sequence adopts at least two different non-B conformations depending on -delta and pH values. Moreover, 12 nucleotide long non-pur.pyr spacer region separating two identical (AG)7 blocks does not provide a significant energy barrier protecting against unusual structures formation. Images PMID:2644622
Sequence-Mandated, Distinct Assembly of Giant Molecules

DOE PAGES

Zhang, Wei; Lu, Xinlin; Mao, Jialin; ...

2017-10-24

Although controlling the primary structure of synthetic polymers is itself a great challenge, the potential of sequence control for tailoring hierarchical structures remains to be exploited, especially in the creation of new and unconventional phases. A series of model amphiphilic chain-like giant molecules was designed and synthesized by interconnecting both hydrophobic and hydrophilic molecular nanoparticles in precisely defined sequence and composition to investigate their sequence-dependent phase structures. Not only compositional variation changed the self-assembled supramolecular phases, but also specific sequences induce unconventional phase formation, including Frank-Kasper phases. The formation mechanism was attributed to the conformational change driven by the collectivemore » hydrogen bonding and the sequence-mandated topology of the molecules. Lastly, these results show that sequence control in synthetic polymers can have a dramatic impact on polymer properties and self-assembly.« less
Sequence-Mandated, Distinct Assembly of Giant Molecules

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Wei; Lu, Xinlin; Mao, Jialin

Although controlling the primary structure of synthetic polymers is itself a great challenge, the potential of sequence control for tailoring hierarchical structures remains to be exploited, especially in the creation of new and unconventional phases. A series of model amphiphilic chain-like giant molecules was designed and synthesized by interconnecting both hydrophobic and hydrophilic molecular nanoparticles in precisely defined sequence and composition to investigate their sequence-dependent phase structures. Not only compositional variation changed the self-assembled supramolecular phases, but also specific sequences induce unconventional phase formation, including Frank-Kasper phases. The formation mechanism was attributed to the conformational change driven by the collectivemore » hydrogen bonding and the sequence-mandated topology of the molecules. Lastly, these results show that sequence control in synthetic polymers can have a dramatic impact on polymer properties and self-assembly.« less
Predicting PDZ domain mediated protein interactions from structure

PubMed Central

2013-01-01

Background PDZ domains are structural protein domains that recognize simple linear amino acid motifs, often at protein C-termini, and mediate protein-protein interactions (PPIs) in important biological processes, such as ion channel regulation, cell polarity and neural development. PDZ domain-peptide interaction predictors have been developed based on domain and peptide sequence information. Since domain structure is known to influence binding specificity, we hypothesized that structural information could be used to predict new interactions compared to sequence-based predictors. Results We developed a novel computational predictor of PDZ domain and C-terminal peptide interactions using a support vector machine trained with PDZ domain structure and peptide sequence information. Performance was estimated using extensive cross validation testing. We used the structure-based predictor to scan the human proteome for ligands of 218 PDZ domains and show that the predictions correspond to known PDZ domain-peptide interactions and PPIs in curated databases. The structure-based predictor is complementary to the sequence-based predictor, finding unique known and novel PPIs, and is less dependent on training–testing domain sequence similarity. We used a functional enrichment analysis of our hits to create a predicted map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes, some only found by the structure-based predictor. Based on this analysis, we predict novel PDZ domain involvement in xenobiotic metabolism and suggest new interactions for other processes including wound healing and Wnt signalling. Conclusions We built a structure-based predictor of PDZ domain-peptide interactions, which can be used to scan C-terminal proteomes for PDZ interactions. We also show that the structure-based predictor finds many known PDZ mediated PPIs in human that were not found by our previous sequence-based predictor and is less dependent on training–testing domain sequence similarity. Using both predictors, we defined a functional map of human PDZ domain biology and predict novel PDZ domain function. Users may access our structure-based and previous sequence-based predictors at http://webservice.baderlab.org/domains/POW. PMID:23336252
Streptococcal phosphoenolpyruvate-sugar phosphotransferase system: amino acid sequence and site of ATP-dependent phosphorylation of HPr

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deutscher, J.; Pevec, B.; Beyreuther, K.

1986-10-21

The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolyptic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system,more » HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S faecalis has now been determined. (/sup 32/P)P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.« less
Sequence-dependent DNA deformability studied using molecular dynamics simulations.

PubMed

Fujii, Satoshi; Kono, Hidetoshi; Takenaka, Shigeori; Go, Nobuhiro; Sarai, Akinori

2007-01-01

Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained by the simulations is consistent with that by the crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can highly depend on the flanking base pairs. The conformations of xATx tetramers show the most rigidity and are not affected by the flanking base pairs and the xYRx show by contrast the greatest flexibility and change their conformations depending on the base pairs at both ends, suggesting tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA and provide insight into the mechanism of indirect readout during protein-DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein-DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.
Quantification of loading in biomechanical testing: the influence of dissection sequence.

PubMed

Funabashi, Martha; El-Rich, Marwan; Prasad, Narasimha; Kawchuk, Gregory N

2015-09-18

Sequential dissection is a technique used to investigate loads experienced by articular tissues. When the joint of interest is tested in an unconstrained manner, its kinematics change with each tissue removal. To address this limitation, sufficiently rigid robots are used to constrain joint kinematics. While this approach can quantify loads experienced by each tissue, it does not assure similar results when removal order is changed. Specifically, structure loading is assumed to be independent of removal order if the structure behaves linearly (i.e. principle of superposition applies), but dependent on removal order when response is affected by material and/or geometry nonlinearities and/or viscoelasticiy (e.g. biological tissues). Therefore, this experiment was conducted to evaluate if structure loading created through robotic testing is dependent on the order in which connectors are removed. Six identical models were 3D printed. Each model was composed of 2 rigid bodies and 3 connecting structures with nonlinear time-dependent behavior. To these models, pure rotations were applied about a predefined static center of rotation using a parallel robot. A unique dissection sequence was used for each of the six models and the same movements applied robotically after each dissection. When comparing the moments experienced by each structure between different removal sequences, a statistically significant difference (p<0.05) was observed. These results suggest that even in an optimized environment, the sequence in which nonlinear viscoelastic structures are removed influence model loading. These findings support prior work suggesting that tissue loads obtained from robotic testing are specific to removal order. Copyright © 2015 Elsevier Ltd. All rights reserved.
Molecular phylogeny of 21 tropical bamboo species reconstructed by integrating non-coding internal transcribed spacer (ITS1 and 2) sequences and their consensus secondary structure.

PubMed

Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita

2017-06-01

The unavailability of the reproductive structure and unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21-tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive individual sequence-structure matrix to gain more resolution on the phylogenetic reconstruction. The results showed that ITS sequence-structure is the reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny over the sole sequence based one underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the only sequence-based analysis for phylogenetic relationship assessment of bamboo.
A Generative Angular Model of Protein Structure Evolution

PubMed Central

Golden, Michael; García-Portugués, Eduardo; Sørensen, Michael; Mardia, Kanti V.; Hamelryck, Thomas; Hein, Jotun

2017-01-01

Abstract Recently described stochastic models of protein evolution have demonstrated that the inclusion of structural information in addition to amino acid sequences leads to a more reliable estimation of evolutionary parameters. We present a generative, evolutionary model of protein structure and sequence that is valid on a local length scale. The model concerns the local dependencies between sequence and structure evolution in a pair of homologous proteins. The evolutionary trajectory between the two structures in the protein pair is treated as a random walk in dihedral angle space, which is modeled using a novel angular diffusion process on the two-dimensional torus. Coupling sequence and structure evolution in our model allows for modeling both “smooth” conformational changes and “catastrophic” conformational jumps, conditioned on the amino acid changes. The model has interpretable parameters and is comparatively more realistic than previous stochastic models, providing new insights into the relationship between sequence and structure evolution. For example, using the trained model we were able to identify an apparent sequence–structure evolutionary motif present in a large number of homologous protein pairs. The generative nature of our model enables us to evaluate its validity and its ability to simulate aspects of protein evolution conditioned on an amino acid sequence, a related amino acid sequence, a related structure or any combination thereof. PMID:28453724
Functional specificity of a Hox protein mediated by the recognition of minor groove structure.

PubMed

Joshi, Rohit; Passner, Jonathan M; Rohs, Remo; Jain, Rinku; Sosinsky, Alona; Crickmore, Michael A; Jacob, Vinitha; Aggarwal, Aneel K; Honig, Barry; Mann, Richard S

2007-11-02

The recognition of specific DNA-binding sites by transcription factors is a critical yet poorly understood step in the control of gene expression. Members of the Hox family of transcription factors bind DNA by making nearly identical major groove contacts via the recognition helices of their homeodomains. In vivo specificity, however, often depends on extended and unstructured regions that link Hox homeodomains to a DNA-bound cofactor, Extradenticle (Exd). Using a combination of structure determination, computational analysis, and in vitro and in vivo assays, we show that Hox proteins recognize specific Hox-Exd binding sites via residues located in these extended regions that insert into the minor groove but only when presented with the correct DNA sequence. Our results suggest that these residues, which are conserved in a paralog-specific manner, confer specificity by recognizing a sequence-dependent DNA structure instead of directly reading a specific DNA sequence.
Fluorescent DNA-templated silver nanoclusters

NASA Astrophysics Data System (ADS)

Lin, Ruoqian

Because of the ultra-small size and biocompatibility of silver nanoclusters, they have attracted much research interest for their applications in biolabeling. Among the many ways of synthesizing silver nanoclusters, DNA templated method is particularly attractive---the high tunability of DNA sequences provides another degree of freedom for controlling the chemical and photophysical properties. However, systematic studies about how DNA sequences and concentrations are controlling the photophysical properties are still lacking. The aim of this thesis is to investigate the binding mechanisms of silver clusters binding and single stranded DNAs. Here in this thesis, we report synthesis and characterization of DNA-templated silver nanoclusters and provide a systematic interrogation of the effects of DNA concentrations and sequences, including lengths and secondary structures. We performed a series of syntheses utilizing five different sequences to explore the optimal synthesis condition. By characterizing samples with UV-vis and fluorescence spectroscopy, we achieved the most proper reactants ratio and synthesis conditions. Two of them were chosen for further concentration dependence studies and sequence dependence studies. We found that cytosine-rich sequences are more likely to produce silver nanoclusters with stronger fluorescence signals; however, sequences with hairpin secondary structures are more capable in stabilizing silver nanoclusters. In addition, the fluorescence peak emission intensities and wavelengths of the DNA templated silver clusters have sequence dependent fingerprints. This potentially can be applied to sequence sensing in the future. However all the current conclusions are not warranted; there is still difficulty in formulating general rules in DNA strand design and silver nanocluster production. Further investigation of more sequences could solve these questions in the future.
SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.

PubMed

Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael

2018-05-25

Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.
The structure of cell adhesion molecule uvomorulin. Insights into the molecular mechanism of Ca2+-dependent cell adhesion.

PubMed Central

Ringwald, M; Schuh, R; Vestweber, D; Eistetter, H; Lottspeich, F; Engel, J; Dölz, R; Jähnig, F; Epplen, J; Mayer, S

1987-01-01

We have determined the amino acid sequence of the Ca2+-dependent cell adhesion molecule uvomorulin as it appears on the cell surface. The extracellular part of the molecule exhibits three internally repeated domains of 112 residues which are most likely generated by gene duplication. Each of the repeated domains contains two highly conserved units which could represent putative Ca2+-binding sites. Secondary structure predictions suggest that the putative Ca2+-binding units are located in external loops at the surface of the protein. The protein sequence exhibits a single membrane-spanning region and a cytoplasmic domain. Sequence comparison reveals extensive homology to the chicken L-CAM. Both uvomorulin and L-CAM are identical in 65% of their entire amino acid sequence suggesting a common origin for both CAMs. Images Fig. 1. Fig. 4. Fig. 7. PMID:3501370
Underwound DNA under Tension: Structure, Elasticity, and Sequence-Dependent Behaviors

NASA Astrophysics Data System (ADS)

Sheinin, Maxim Y.; Forth, Scott; Marko, John F.; Wang, Michelle D.

2011-09-01

DNA melting under torsion plays an important role in a wide variety of cellular processes. In the present Letter, we have investigated DNA melting at the single-molecule level using an angular optical trap. By directly measuring force, extension, torque, and angle of DNA, we determined the structural and elastic parameters of torsionally melted DNA. Our data reveal that under moderate forces, the melted DNA assumes a left-handed structure as opposed to an open bubble conformation and is highly torsionally compliant. We have also discovered that at low forces melted DNA properties are highly dependent on DNA sequence. These results provide a more comprehensive picture of the global DNA force-torque phase diagram.
Productive mRNA stem loop-mediated transcriptional slippage: Crucial features in common with intrinsic terminators.

PubMed

Penno, Christophe; Sharma, Virag; Coakley, Arthur; O'Connell Motherway, Mary; van Sinderen, Douwe; Lubkowska, Lucyna; Kireeva, Maria L; Kashlev, Mikhail; Baranov, Pavel V; Atkins, John F

2015-04-21

Escherichia coli and yeast DNA-dependent RNA polymerases are shown to mediate efficient nascent transcript stem loop formation-dependent RNA-DNA hybrid realignment. The realignment was discovered on the heteropolymeric sequence T5C5 and yields transcripts lacking a C residue within a corresponding U5C4. The sequence studied is derived from a Roseiflexus insertion sequence (IS) element where the resulting transcriptional slippage is required for transposase synthesis. The stability of the RNA structure, the proximity of the stem loop to the slippage site, the length and composition of the slippage site motif, and the identity of its 3' adjacent nucleotides (nt) are crucial for transcripts lacking a single C. In many respects, the RNA structure requirements for this slippage resemble those for hairpin-dependent transcription termination. In a purified in vitro system, the slippage efficiency ranges from 5% to 75% depending on the concentration ratios of the nucleotides specified by the slippage sequence and the 3' nt context. The only previous proposal of stem loop mediated slippage, which was in Ebola virus expression, was based on incorrect data interpretation. We propose a mechanical slippage model involving the RNAP translocation state as the main motor in slippage directionality and efficiency. It is distinct from previously described models, including the one proposed for paramyxovirus, where following random movement efficiency is mainly dependent on the stability of the new realigned hybrid. In broadening the scope for utilization of transcription slippage for gene expression, the stimulatory structure provides parallels with programmed ribosomal frameshifting at the translation level.

p53 Specifically Binds Triplex DNA In Vitro and in Cells

PubMed Central

Brázdová, Marie; Tichý, Vlastimil; Helma, Robert; Bažantová, Pavla; Polášková, Alena; Krejčí, Aneta; Petr, Marek; Navrátilová, Lucie; Tichá, Olga; Nejedlý, Karel; Bennink, Martin L.; Subramaniam, Vinod; Bábková, Zuzana; Martínek, Tomáš; Lexa, Matej; Adámik, Matej

2016-01-01

Triplex DNA is implicated in a wide range of biological activities, including regulation of gene expression and genomic instability leading to cancer. The tumor suppressor p53 is a central regulator of cell fate in response to different type of insults. Sequence and structure specific modes of DNA recognition are core attributes of the p53 protein. The focus of this work is the structure-specific binding of p53 to DNA containing triplex-forming sequences in vitro and in cells and the effect on p53-driven transcription. This is the first DNA binding study of full-length p53 and its deletion variants to both intermolecular and intramolecular T.A.T triplexes. We demonstrate that the interaction of p53 with intermolecular T.A.T triplex is comparable to the recognition of CTG-hairpin non-B DNA structure. Using deletion mutants we determined the C-terminal DNA binding domain of p53 to be crucial for triplex recognition. Furthermore, strong p53 recognition of intramolecular T.A.T triplexes (H-DNA), stabilized by negative superhelicity in plasmid DNA, was detected by competition and immunoprecipitation experiments, and visualized by AFM. Moreover, chromatin immunoprecipitation revealed p53 binding T.A.T forming sequence in vivo. Enhanced reporter transactivation by p53 on insertion of triplex forming sequence into plasmid with p53 consensus sequence was observed by luciferase reporter assays. In-silico scan of human regulatory regions for the simultaneous presence of both consensus sequence and T.A.T motifs identified a set of candidate p53 target genes and p53-dependent activation of several of them (ABCG5, ENOX1, INSR, MCC, NFAT5) was confirmed by RT-qPCR. Our results show that T.A.T triplex comprises a new class of p53 binding sites targeted by p53 in a DNA structure-dependent mode in vitro and in cells. The contribution of p53 DNA structure-dependent binding to the regulation of transcription is discussed. PMID:27907175
Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis.

PubMed

Du, Yushen; Wu, Nicholas C; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting; Sun, Ren

2016-11-01

Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. To fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available. Copyright © 2016 Du et al.
Sequence and Structure Dependent DNA-DNA Interactions

NASA Astrophysics Data System (ADS)

Kopchick, Benjamin; Qiu, Xiangyun

Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention and ``random'' DNA sequences are typically used in biophysical studies. However, ~50% of human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present series of quantitative measurements of the DNA-DNA forces with the osmotic stress method on different DNA sequences, from short repeats to the most frequent sequences in genome, and to modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate the causalities in terms of the differences in hydration shell and DNA surface structures.
Length-independent structural similarities enrich the antibody CDR canonical class model.

PubMed

Nowak, Jaroslaw; Baker, Terry; Georges, Guy; Kelm, Sebastian; Klostermann, Stefan; Shi, Jiye; Sridharan, Sudharsan; Deane, Charlotte M

2016-01-01

Complementarity-determining regions (CDRs) are antibody loops that make up the antigen binding site. Here, we show that all CDR types have structurally similar loops of different lengths. Based on these findings, we created length-independent canonical classes for the non-H3 CDRs. Our length variable structural clusters show strong sequence patterns suggesting either that they evolved from the same original structure or result from some form of convergence. We find that our length-independent method not only clusters a larger number of CDRs, but also predicts canonical class from sequence better than the standard length-dependent approach. To demonstrate the usefulness of our findings, we predicted cluster membership of CDR-L3 sequences from 3 next-generation sequencing datasets of the antibody repertoire (over 1,000,000 sequences). Using the length-independent clusters, we can structurally classify an additional 135,000 sequences, which represents a ∼20% improvement over the standard approach. This suggests that our length-independent canonical classes might be a highly prevalent feature of antibody space, and could substantially improve our ability to accurately predict the structure of novel CDRs identified by next-generation sequencing.
Sequence dependent aggregation of peptides and fibril formation

NASA Astrophysics Data System (ADS)

Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

2017-09-01

Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.
Influence of DNA sequence on the structure of minicircles under torsional stress

PubMed Central

Wang, Qian; Irobalieva, Rossitza N.; Chiu, Wah; Schmid, Michael F.; Fogg, Jonathan M.; Zechiedrich, Lynn

2017-01-01

Abstract The sequence dependence of the conformational distribution of DNA under various levels of torsional stress is an important unsolved problem. Combining theory and coarse-grained simulations shows that the DNA sequence and a structural correlation due to topology constraints of a circle are the main factors that dictate the 3D structure of a 336 bp DNA minicircle under torsional stress. We found that DNA minicircle topoisomers can have multiple bend locations under high torsional stress and that the positions of these sharp bends are determined by the sequence, and by a positive mechanical correlation along the sequence. We showed that simulations and theory are able to provide sequence-specific information about individual DNA minicircles observed by cryo-electron tomography (cryo-ET). We provided a sequence-specific cryo-ET tomogram fitting of DNA minicircles, registering the sequence within the geometric features. Our results indicate that the conformational distribution of minicircles under torsional stress can be designed, which has important implications for using minicircle DNA for gene therapy. PMID:28609782
Structure and Sequence Search on Aptamer-Protein Docking

NASA Astrophysics Data System (ADS)

Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie

2015-03-01

Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in the living systems, especially through gene regulation. However, short nucleic acids sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally-determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer-binding, we use GPU-enabled molecular dynamics simulations to both examine the conformational flexibility of the aptamer in the absence of binding to thrombin, and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.
Overcoming Sequence Misalignments with Weighted Structural Superposition

PubMed Central

Khazanov, Nickolay A.; Damm-Ganamet, Kelly L.; Quang, Daniel X.; Carlson, Heather A.

2012-01-01

An appropriate structural superposition identifies similarities and differences between homologous proteins that are not evident from sequence alignments alone. We have coupled our Gaussian-weighted RMSD (wRMSD) tool with a sequence aligner and seed extension (SE) algorithm to create a robust technique for overlaying structures and aligning sequences of homologous proteins (HwRMSD). HwRMSD overcomes errors in the initial sequence alignment that would normally propagate into a standard RMSD overlay. SE can generate a corrected sequence alignment from the improved structural superposition obtained by wRMSD. HwRMSD’s robust performance and its superiority over standard RMSD are demonstrated over a range of homologous proteins. Its better overlay results in corrected sequence alignments with good agreement to HOMSTRAD. Finally, HwRMSD is compared to established structural alignment methods: FATCAT, SSM, CE, and Dalilite. Most methods are comparable at placing residue pairs within 2 Å, but HwRMSD places many more residue pairs within 1 Å, providing a clear advantage. Such high accuracy is essential in drug design, where small distances can have a large impact on computational predictions. This level of accuracy is also needed to correct sequence alignments in an automated fashion, especially for omics-scale analysis. HwRMSD can align homologs with low sequence identity and large conformational differences, cases where both sequence-based and structural-based methods may fail. The HwRMSD pipeline overcomes the dependency of structural overlays on initial sequence pairing and removes the need to determine the best sequence-alignment method, substitution matrix, and gap parameters for each unique pair of homologs. PMID:22733542
Evolution of ribozymes in the presence of a mineral surface

PubMed Central

Stephenson, James D.; Popović, Milena; Bristow, Thomas F.

2016-01-01

Mineral surfaces are often proposed as the sites of critical processes in the emergence of life. Clay minerals in particular are thought to play significant roles in the origin of life including polymerizing, concentrating, organizing, and protecting biopolymers. In these scenarios, the impact of minerals on biopolymer folding is expected to influence evolutionary processes. These processes include both the initial emergence of functional structures in the presence of the mineral and the subsequent transition away from the mineral-associated niche. The initial evolution of function depends upon the number and distribution of sequences capable of functioning in the presence of the mineral, and the transition to new environments depends upon the overlap between sequences that evolve on the mineral surface and sequences that can perform the same functions in the mineral's absence. To examine these processes, we evolved self-cleaving ribozymes in vitro in the presence or absence of Na-saturated montmorillonite clay mineral particles. Starting from a shared population of random sequences, RNA populations were evolved in parallel, along separate evolutionary trajectories. Comparative sequence analysis and activity assays show that the impact of this clay mineral on functional structure selection was minimal; it neither prevented common structures from emerging, nor did it promote the emergence of new structures. This suggests that montmorillonite does not improve RNA's ability to evolve functional structures; however, it also suggests that RNAs that do evolve in contact with montmorillonite retain the same structures in mineral-free environments, potentially facilitating an evolutionary transition away from a mineral-associated niche. PMID:27793980
Pairwise Sequence Alignment Library

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jeff Daily, PNNL

2015-05-20

Vector extensions, such as SSE, have been part of the x86 CPU since the 1990s, with applications in graphics, signal processing, and scientific applications. Although many algorithms and applications can naturally benefit from automatic vectorization techniques, there are still many that are difficult to vectorize due to their dependence on irregular data structures, dense branch operations, or data dependencies. Sequence alignment, one of the most widely used operations in bioinformatics workflows, has a computational footprint that features complex data dependencies. The trend of widening vector registers adversely affects the state-of-the-art sequence alignment algorithm based on striped data layouts. Therefore, amore » novel SIMD implementation of a parallel scan-based sequence alignment algorithm that can better exploit wider SIMD units was implemented as part of the Parallel Sequence Alignment Library (parasail). Parasail features: Reference implementations of all known vectorized sequence alignment approaches. Implementations of Smith Waterman (SW), semi-global (SG), and Needleman Wunsch (NW) sequence alignment algorithms. Implementations across all modern CPU instruction sets including AVX2 and KNC. Language interfaces for C/C++ and Python.« less
Surveying unsteady flows by means of movie sequences - A case study

NASA Astrophysics Data System (ADS)

Freymuth, P.; Bank, W.; Finaish, F.

Photographic surveying techniques and their results are presented for vortical pattern development in unsteady two-dimensional flows, which depends on a multitude of parameters that have heretofore hampered broad investigation, in order to delineate the more important parametric dependencies. Samples are given from 100 films representing over 2000 sequences consisting of 400,000 photographic frames. Attention is given to the problems posed by resolution of time and lateral dimensions, spanwise vortical structure, and the dependence of angle of attack on Reynolds number and flow geometry.
PSS-3D1D: an improved 3D1D profile method of protein fold recognition for the annotation of twilight zone sequences.

PubMed

Ganesan, K; Parthasarathy, S

2011-12-01

Annotation of any newly determined protein sequence depends on the pairwise sequence identity with known sequences. However, for the twilight zone sequences which have only 15-25% identity, the pair-wise comparison methods are inadequate and the annotation becomes a challenging task. Such sequences can be annotated by using methods that recognize their fold. Bowie et al. described a 3D1D profile method in which the amino acid sequences that fold into a known 3D structure are identified by their compatibility to that known 3D structure. We have improved the above method by using the predicted secondary structure information and employ it for fold recognition from the twilight zone sequences. In our Protein Secondary Structure 3D1D (PSS-3D1D) method, a score (w) for the predicted secondary structure of the query sequence is included in finding the compatibility of the query sequence to the known fold 3D structures. In the benchmarks, the PSS-3D1D method shows a maximum of 21% improvement in predicting correctly the α + β class of folds from the sequences with twilight zone level of identity, when compared with the 3D1D profile method. Hence, the PSS-3D1D method could offer more clues than the 3D1D method for the annotation of twilight zone sequences. The web based PSS-3D1D method is freely available in the PredictFold server at http://bioinfo.bdu.ac.in/servers/ .
In the Absence of Writhe, DNA Relieves Torsional Stress with Localized, Sequence-Dependent Structural Failure to Preserve B-form

DOE Office of Scientific and Technical Information (OSTI.GOV)

Randall, Graham L.; Zechiedrich, E. L.; Pettitt, Bernard M.

2009-09-01

To understand how underwinding and overwinding the DNA helix affects its structure, we simulated 19 independent DNA systems with fixed degrees of twist using molecular dynamics in a system that does not allow writhe. Underwinding DNA induced spontaneous, sequence-dependent base flipping and local denaturation, while overwinding DNA induced the formation of Pauling-like DNA (P-DNA). The winding resulted in a bimodal state simultaneously including local structural failure and B-form DNA for both underwinding and extreme overwinding. Our simulations suggest that base flipping and local denaturation may provide a landscape influencing protein recognition of DNA sequence to affect, for examples, replication, transcriptionmore » and recombination. Additionally, our findings help explain results from singlemolecule experiments and demonstrate that elastic rod models are strictly valid on average only for unstressed or overwound DNA up to P-DNA formation. Finally, our data support a model in which base flipping can result from torsional stress.« less
De Novo Protein Structure Prediction

NASA Astrophysics Data System (ADS)

Hung, Ling-Hong; Ngan, Shing-Chung; Samudrala, Ram

An unparalleled amount of sequence data is being made available from large-scale genome sequencing efforts. The data provide a shortcut to the determination of the function of a gene of interest, as long as there is an existing sequenced gene with similar sequence and of known function. This has spurred structural genomic initiatives with the goal of determining as many protein folds as possible (Brenner and Levitt, 2000; Burley, 2000; Brenner, 2001; Heinemann et al., 2001). The purpose of this is twofold: First, the structure of a gene product can often lead to direct inference of its function. Second, since the function of a protein is dependent on its structure, direct comparison of the structures of gene products can be more sensitive than the comparison of sequences of genes for detecting homology. Presently, structural determination by crystallography and NMR techniques is still slow and expensive in terms of manpower and resources, despite attempts to automate the processes. Computer structure prediction algorithms, while not providing the accuracy of the traditional techniques, are extremely quick and inexpensive and can provide useful low-resolution data for structure comparisons (Bonneau and Baker, 2001). Given the immense number of structures which the structural genomic projects are attempting to solve, there would be a considerable gain even if the computer structure prediction approach were applicable to a subset of proteins.
STAT1:DNA sequence-dependent binding modulation by phosphorylation, protein:protein interactions and small-molecule inhibition

PubMed Central

Bonham, Andrew J.; Wenta, Nikola; Osslund, Leah M.; Prussin, Aaron J.; Vinkemeier, Uwe; Reich, Norbert O.

2013-01-01

The DNA-binding specificity and affinity of the dimeric human transcription factor (TF) STAT1, were assessed by total internal reflectance fluorescence protein-binding microarrays (TIRF-PBM) to evaluate the effects of protein phosphorylation, higher-order polymerization and small-molecule inhibition. Active, phosphorylated STAT1 showed binding preferences consistent with prior characterization, whereas unphosphorylated STAT1 showed a weak-binding preference for one-half of the GAS consensus site, consistent with recent models of STAT1 structure and function in response to phosphorylation. This altered-binding preference was further tested by use of the inhibitor LLL3, which we show to disrupt STAT1 binding in a sequence-dependent fashion. To determine if this sequence-dependence is specific to STAT1 and not a general feature of human TF biology, the TF Myc/Max was analysed and tested with the inhibitor Mycro3. Myc/Max inhibition by Mycro3 is sequence independent, suggesting that the sequence-dependent inhibition of STAT1 may be specific to this system and a useful target for future inhibitor design. PMID:23180800
Contingency Table Browser - prediction of early stage protein structure.

PubMed

Kalinowska, Barbara; Krzykalski, Artur; Roterman, Irena

2015-01-01

The Early Stage (ES) intermediate represents the starting structure in protein folding simulations based on the Fuzzy Oil Drop (FOD) model. The accuracy of FOD predictions is greatly dependent on the accuracy of the chosen intermediate. A suitable intermediate can be constructed using the sequence-structure relationship information contained in the so-called contingency table - this table expresses the likelihood of encountering various structural motifs for each tetrapeptide fragment in the amino acid sequence. The limited accuracy with which such structures could previously be predicted provided the motivation for a more indepth study of the contingency table itself. The Contingency Table Browser is a tool which can visualize, search and analyze the table. Our work presents possible applications of Contingency Table Browser, among them - analysis of specific protein sequences from the point of view of their structural ambiguity.
Iterative refinement of structure-based sequence alignments by Seed Extension

PubMed Central

Kim, Changhoon; Tai, Chin-Hsien; Lee, Byungkook

2009-01-01

Background Accurate sequence alignment is required in many bioinformatics applications but, when sequence similarity is low, it is difficult to obtain accurate alignments based on sequence similarity alone. The accuracy improves when the structures are available, but current structure-based sequence alignment procedures still mis-align substantial numbers of residues. In order to correct such errors, we previously explored the possibility of replacing the residue-based dynamic programming algorithm in structure alignment procedures with the Seed Extension algorithm, which does not use a gap penalty. Here, we describe a new procedure called RSE (Refinement with Seed Extension) that iteratively refines a structure-based sequence alignment. Results RSE uses SE (Seed Extension) in its core, which is an algorithm that we reported recently for obtaining a sequence alignment from two superimposed structures. The RSE procedure was evaluated by comparing the correctly aligned fractions of residues before and after the refinement of the structure-based sequence alignments produced by popular programs. CE, DaliLite, FAST, LOCK2, MATRAS, MATT, TM-align, SHEBA and VAST were included in this analysis and the NCBI's CDD root node set was used as the reference alignments. RSE improved the average accuracy of sequence alignments for all programs tested when no shift error was allowed. The amount of improvement varied depending on the program. The average improvements were small for DaliLite and MATRAS but about 5% for CE and VAST. More substantial improvements have been seen in many individual cases. The additional computation times required for the refinements were negligible compared to the times taken by the structure alignment programs. Conclusion RSE is a computationally inexpensive way of improving the accuracy of a structure-based sequence alignment. It can be used as a standalone procedure following a regular structure-based sequence alignment or to replace the traditional iterative refinement procedures based on residue-level dynamic programming algorithm in many structure alignment programs. PMID:19589133
Identification of a conserved branched RNA structure that functions as a factor-independent terminator.

PubMed

Johnson, Christopher M; Chen, Yuqing; Lee, Heejin; Ke, Ailong; Weaver, Keith E; Dunny, Gary M

2014-03-04

Anti-Q is a small RNA encoded on pCF10, an antibiotic resistance plasmid of Enterococcus faecalis, which negatively regulates conjugation of the plasmid. In this study we sought to understand how Anti-Q is generated relative to larger transcripts of the same operon. We found that Anti-Q folds into a branched structure that functions as a factor-independent terminator. In vitro and in vivo, termination is dependent on the integrity of this structure as well as the presence of a 3' polyuridine tract, but is not dependent on other downstream sequences. In vitro, terminated transcripts are released from RNA polymerase after synthesis. In vivo, a mutant with reduced termination efficiency demonstrated loss of tight control of conjugation function. A search of bacterial genomes revealed the presence of sequences that encode Anti-Q-like RNA structures. In vitro and in vivo experiments demonstrated that one of these functions as a terminator. This work reveals a previously unappreciated flexibility in the structure of factor-independent terminators and identifies a mechanism for generation of functional small RNAs; it should also inform annotation of bacterial sequence features, such as terminators, functional sRNAs, and operons.
Identification of a conserved branched RNA structure that functions as a factor-independent terminator

PubMed Central

Johnson, Christopher M.; Chen, Yuqing; Lee, Heejin; Ke, Ailong; Weaver, Keith E.; Dunny, Gary M.

2014-01-01

Anti-Q is a small RNA encoded on pCF10, an antibiotic resistance plasmid of Enterococcus faecalis, which negatively regulates conjugation of the plasmid. In this study we sought to understand how Anti-Q is generated relative to larger transcripts of the same operon. We found that Anti-Q folds into a branched structure that functions as a factor-independent terminator. In vitro and in vivo, termination is dependent on the integrity of this structure as well as the presence of a 3′ polyuridine tract, but is not dependent on other downstream sequences. In vitro, terminated transcripts are released from RNA polymerase after synthesis. In vivo, a mutant with reduced termination efficiency demonstrated loss of tight control of conjugation function. A search of bacterial genomes revealed the presence of sequences that encode Anti-Q–like RNA structures. In vitro and in vivo experiments demonstrated that one of these functions as a terminator. This work reveals a previously unappreciated flexibility in the structure of factor-independent terminators and identifies a mechanism for generation of functional small RNAs; it should also inform annotation of bacterial sequence features, such as terminators, functional sRNAs, and operons. PMID:24550474
Role of Sequence and Structural Polymorphism on the Mechanical Properties of Amyloid Fibrils

PubMed Central

Kim, Jae In; Na, Sungsoo; Eom, Kilho

2014-01-01

Amyloid fibrils playing a critical role in disease expression, have recently been found to exhibit the excellent mechanical properties such as elastic modulus in the order of 10 GPa, which is comparable to that of other mechanical proteins such as microtubule, actin filament, and spider silk. These remarkable mechanical properties of amyloid fibrils are correlated with their functional role in disease expression. This suggests the importance in understanding how these excellent mechanical properties are originated through self-assembly process that may depend on the amino acid sequence. However, the sequence-structure-property relationship of amyloid fibrils has not been fully understood yet. In this work, we characterize the mechanical properties of human islet amyloid polypeptide (hIAPP) fibrils with respect to their molecular structures as well as their amino acid sequence by using all-atom explicit water molecular dynamics (MD) simulation. The simulation result suggests that the remarkable bending rigidity of amyloid fibrils can be achieved through a specific self-aggregation pattern such as antiparallel stacking of β strands (peptide chain). Moreover, we have shown that a single point mutation of hIAPP chain constituting a hIAPP fibril significantly affects the thermodynamic stability of hIAPP fibril formed by parallel stacking of peptide chain, and that a single point mutation results in a significant change in the bending rigidity of hIAPP fibrils formed by antiparallel stacking of β strands. This clearly elucidates the role of amino acid sequence on not only the equilibrium conformations of amyloid fibrils but also their mechanical properties. Our study sheds light on sequence-structure-property relationships of amyloid fibrils, which suggests that the mechanical properties of amyloid fibrils are encoded in their sequence-dependent molecular architecture. PMID:24551113

Using chaos to generate variations on movement sequences

NASA Astrophysics Data System (ADS)

Bradley, Elizabeth; Stuart, Joshua

1998-12-01

We describe a method for introducing variations into predefined motion sequences using a chaotic symbol-sequence reordering technique. A progression of symbols representing the body positions in a dance piece, martial arts form, or other motion sequence is mapped onto a chaotic trajectory, establishing a symbolic dynamics that links the movement sequence and the attractor structure. A variation on the original piece is created by generating a trajectory with slightly different initial conditions, inverting the mapping, and using special corpus-based graph-theoretic interpolation schemes to smooth any abrupt transitions. Sensitive dependence guarantees that the variation is different from the original; the attractor structure and the symbolic dynamics guarantee that the two resemble one another in both aesthetic and mathematical senses.
Sequence-dependent modelling of local DNA bending phenomena: curvature prediction and vibrational analysis.

PubMed

Vlahovicek, K; Munteanu, M G; Pongor, S

1999-01-01

Bending is a local conformational micropolymorphism of DNA in which the original B-DNA structure is only distorted but not extensively modified. Bending can be predicted by simple static geometry models as well as by a recently developed elastic model that incorporate sequence dependent anisotropic bendability (SDAB). The SDAB model qualitatively explains phenomena including affinity of protein binding, kinking, as well as sequence-dependent vibrational properties of DNA. The vibrational properties of DNA segments can be studied by finite element analysis of a model subjected to an initial bending moment. The frequency spectrum is obtained by applying Fourier analysis to the displacement values in the time domain. This analysis shows that the spectrum of the bending vibrations quite sensitively depends on the sequence, for example the spectrum of a curved sequence is characteristically different from the spectrum of straight sequence motifs of identical basepair composition. Curvature distributions are genome-specific, and pronounced differences are found between protein-coding and regulatory regions, respectively, that is, sites of extreme curvature and/or bendability are less frequent in protein-coding regions. A WWW server is set up for the prediction of curvature and generation of 3D models from DNA sequences (http:@www.icgeb.trieste.it/dna).
Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Nucleic acid detection kits

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann; Kwiatkowski, Robert W.; Vavra, Stephanie H.

2005-03-29

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of nucleic acid from various viruses in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages 02

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G; Lyamichev, Victor I; Mast, Andrea L; Brow, Mary Ann D

2012-10-16

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
A Novel Bioinformatics Strategy to Analyze Microbial Big Sequence Data for Efficient Knowledge Discovery: Batch-Learning Self-Organizing Map (BLSOM).

PubMed

Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi

2013-11-20

With the remarkable increase of genomic sequence data of microorganisms, novel tools are needed for comprehensive analyses of the big sequence data available. The self-organizing map (SOM) is an effective tool for clustering and visualizing high-dimensional data, such as oligonucleotide composition on one map. By modifying the conventional SOM, we developed batch-learning SOM (BLSOM), which allowed classification of sequence fragments (e.g., 1 kb) according to phylotypes, solely depending on oligonucleotide composition. Metagenomics studies of uncultivable microorganisms in clinical and environmental samples should allow extensive surveys of genes important in life sciences. BLSOM is most suitable for phylogenetic assignment of metagenomic sequences, because fragmental sequences can be clustered according to phylotypes, solely depending on oligonucleotide composition. We first constructed oligonucleotide BLSOMs for all available sequences from genomes of known species, and by mapping metagenomic sequences on these large-scale BLSOMs, we can predict phylotypes of individual metagenomic sequences, revealing a microbial community structure of uncultured microorganisms, including viruses. BLSOM has shown that influenza viruses isolated from humans and birds clearly differ in oligonucleotide composition. Based on this host-dependent oligonucleotide composition, we have proposed strategies for predicting directional changes of virus sequences and for surveilling potentially hazardous strains when introduced into humans from non-human sources.
In vitro fluorescence studies of transcription factor IIB-DNA interaction.

PubMed

Górecki, Andrzej; Figiel, Małgorzata; Dziedzicka-Wasylewska, Marta

2015-01-01

General transcription factor TFIIB is one of the basal constituents of the preinitiation complex of eukaryotic RNA polymerase II, acting as a bridge between the preinitiation complex and the polymerase, and binding promoter DNA in an asymmetric manner, thereby defining the direction of the transcription. Methods of fluorescence spectroscopy together with circular dichroism spectroscopy were used to observe conformational changes in the structure of recombinant human TFIIB after binding to specific DNA sequence. To facilitate the exploration of the structural changes, several site-directed mutations have been introduced altering the fluorescence properties of the protein. Our observations showed that binding of specific DNA sequences changed the protein structure and dynamics, and TFIIB may exist in two conformational states, which can be described by a different microenvironment of W52. Fluorescence studies using both intrinsic and exogenous fluorophores showed that these changes significantly depended on the recognition sequence and concerned various regions of the protein, including those interacting with other transcription factors and RNA polymerase II. DNA binding can cause rearrangements in regions of proteins interacting with the polymerase in a manner dependent on the recognized sequences, and therefore, influence the gene expression.
The right inferior frontal gyrus processes nested non-local dependencies in music.

PubMed

Cheung, Vincent K M; Meyer, Lars; Friederici, Angela D; Koelsch, Stefan

2018-02-28

Complex auditory sequences known as music have often been described as hierarchically structured. This permits the existence of non-local dependencies, which relate elements of a sequence beyond their temporal sequential order. Previous studies in music have reported differential activity in the inferior frontal gyrus (IFG) when comparing regular and irregular chord-transitions based on theories in Western tonal harmony. However, it is unclear if the observed activity reflects the interpretation of hierarchical structure as the effects are confounded by local irregularity. Using functional magnetic resonance imaging (fMRI), we found that violations to non-local dependencies in nested sequences of three-tone musical motifs in musicians elicited increased activity in the right IFG. This is in contrast to similar studies in language which typically report the left IFG in processing grammatical syntax. Effects of increasing auditory working demands are moreover reflected by distributed activity in frontal and parietal regions. Our study therefore demonstrates the role of the right IFG in processing non-local dependencies in music, and suggests that hierarchical processing in different cognitive domains relies on similar mechanisms that are subserved by domain-selective neuronal subpopulations.
Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA

PubMed Central

Oberstrass, Florian C.; Fernandes, Louis E.; Bryant, Zev

2012-01-01

B-DNA becomes unstable under superhelical stress and is able to adopt a wide range of alternative conformations including strand-separated DNA and Z-DNA. Localized sequence-dependent structural transitions are important for the regulation of biological processes such as DNA replication and transcription. To directly probe the effect of sequence on structural transitions driven by torque, we have measured the torsional response of a panel of DNA sequences using single molecule assays that employ nanosphere rotational probes to achieve high torque resolution. The responses of Z-forming d(pGpC)n sequences match our predictions based on a theoretical treatment of cooperative transitions in helical polymers. “Bubble” templates containing 50–100 bp mismatch regions show cooperative structural transitions similar to B-DNA, although less torque is required to disrupt strand–strand interactions. Our mechanical measurements, including direct characterization of the torsional rigidity of strand-separated DNA, establish a framework for quantitative predictions of the complex torsional response of arbitrary sequences in their biological context. PMID:22474350
Power law tails in phylogenetic systems.

PubMed

Qin, Chongli; Colwell, Lucy J

2018-01-23

Covariance analysis of protein sequence alignments uses coevolving pairs of sequence positions to predict features of protein structure and function. However, current methods ignore the phylogenetic relationships between sequences, potentially corrupting the identification of covarying positions. Here, we use random matrix theory to demonstrate the existence of a power law tail that distinguishes the spectrum of covariance caused by phylogeny from that caused by structural interactions. The power law is essentially independent of the phylogenetic tree topology, depending on just two parameters-the sequence length and the average branch length. We demonstrate that these power law tails are ubiquitous in the large protein sequence alignments used to predict contacts in 3D structure, as predicted by our theory. This suggests that to decouple phylogenetic effects from the interactions between sequence distal sites that control biological function, it is necessary to remove or down-weight the eigenvectors of the covariance matrix with largest eigenvalues. We confirm that truncating these eigenvectors improves contact prediction.
Polytypism in the ground state structure of the Lennard-Jonesium.

PubMed

Pártay, Lívia B; Ortner, Christoph; Bartók, Albert P; Pickard, Chris J; Csányi, Gábor

2017-07-26

We present a systematic study of the stability of nineteen different periodic structures using the finite range Lennard-Jones potential model discussing the effects of pressure, potential truncation, cutoff distance and Lennard-Jones exponents. The structures considered are the hexagonal close packed (hcp), face centred cubic (fcc) and seventeen other polytype stacking sequences, such as dhcp and 9R. We found that at certain pressure and cutoff distance values, neither fcc nor hcp is the ground state structure as previously documented, but different polytypic sequences. This behaviour shows a strong dependence on the way the tail of the potential is truncated.
Learning Temporal Statistics for Sensory Predictions in Aging.

PubMed

Luft, Caroline Di Bernardi; Baker, Rosalind; Goldstone, Aimee; Zhang, Yang; Kourtzi, Zoe

2016-03-01

Predicting future events based on previous knowledge about the environment is critical for successful everyday interactions. Here, we ask which brain regions support our ability to predict the future based on implicit knowledge about the past in young and older age. Combining behavioral and fMRI measurements, we test whether training on structured temporal sequences improves the ability to predict upcoming sensory events; we then compare brain regions involved in learning predictive structures between young and older adults. Our behavioral results demonstrate that exposure to temporal sequences without feedback facilitates the ability of young and older adults to predict the orientation of an upcoming stimulus. Our fMRI results provide evidence for the involvement of corticostriatal regions in learning predictive structures in both young and older learners. In particular, we showed learning-dependent fMRI responses for structured sequences in frontoparietal regions and the striatum (putamen) for young adults. However, for older adults, learning-dependent activations were observed mainly in subcortical (putamen, thalamus) regions but were weaker in frontoparietal regions. Significant correlations of learning-dependent behavioral and fMRI changes in these regions suggest a strong link between brain activations and behavioral improvement rather than general overactivation. Thus, our findings suggest that predicting future events based on knowledge of temporal statistics engages brain regions involved in implicit learning in both young and older adults.
COMPUTER SIMULATION STUDY OF AMYLOID FIBRIL FORMATION BY PALINDROMIC SEQUENCES IN PRION PEPTIDES

PubMed Central

Wagoner, Victoria; Cheon, Mookyung; Chang, Iksoo; Hall, Carol

2011-01-01

We simulate the aggregation of large systems containing palindromic peptides from the Syrian hamster prion protein SHaPrP 113–120 (AGAAAAGA) and the mouse prion protein MoPrP 111–120 (VAGAAAAGAV) and eight sequence variations: GAAAAAAG, (AG)4, A8, GAAAGAAA, A10, V10, GAVAAAAVAG, and VAVAAAAVAV The first two peptides are thought to act as the Velcro that holds the parent prion proteins together in amyloid structures and can form fibrils themselves. Kinetic events along the fibrillization pathway influence the types of structures that occur and variations in the sequence affect aggregation kinetics and fibrillar structure. Discontinuous molecular dynamics simulations using the PRIME20 force field are performed on systems containing 48 peptides starting from a random coil configuration. Depending on the sequence, fibrillar structures form spontaneously over a range of temperatures, below which amorphous aggregates form and above which no aggregation occurs. AGAAAAGA forms well organized fibrillar structures whereas VAGAAAAGAV forms less well organized structures that are partially fibrillar and partially amorphous. The degree of order in the fibrillar structure stems in part from the types of kinetic events leading up to its formation, with AGAAAAGA forming less amorphous structures early in the simulation than VAGAAAAGAV. The ability to form fibrils increases as the chain length and the length of the stretch of hydrophobic residues increase. However as the hydrophobicity of the sequence increases, the ability to form well-ordered structures decreases. Thus, longer hydrophobic sequences form slightly disordered aggregates that are partially fibrillar and partially amorphous. Subtle changes in sequence result in slightly different fibril structures. PMID:21557317
Structural basis of toxicity and immunity in contact-dependent growth inhibition (CDI) systems.

PubMed

Morse, Robert P; Nikolakakis, Kiel C; Willett, Julia L E; Gerrick, Elias; Low, David A; Hayes, Christopher S; Goulding, Celia W

2012-12-26

Contact-dependent growth inhibition (CDI) systems encode polymorphic toxin/immunity proteins that mediate competition between neighboring bacterial cells. We present crystal structures of CDI toxin/immunity complexes from Escherichia coli EC869 and Burkholderia pseudomallei 1026b. Despite sharing little sequence identity, the toxin domains are structurally similar and have homology to endonucleases. The EC869 toxin is a Zn(2+)-dependent DNase capable of completely degrading the genomes of target cells, whereas the Bp1026b toxin cleaves the aminoacyl acceptor stems of tRNA molecules. Each immunity protein binds and inactivates its cognate toxin in a unique manner. The EC869 toxin/immunity complex is stabilized through an unusual β-augmentation interaction. In contrast, the Bp1026b immunity protein exploits shape and charge complementarity to occlude the toxin active site. These structures represent the initial glimpse into the CDI toxin/immunity network, illustrating how sequence-diverse toxins adopt convergent folds yet retain distinct binding interactions with cognate immunity proteins. Moreover, we present visual demonstration of CDI toxin delivery into a target cell.
Maximum-Likelihood Detection Of Noncoherent CPM

NASA Technical Reports Server (NTRS)

Divsalar, Dariush; Simon, Marvin K.

1993-01-01

Simplified detectors proposed for use in maximum-likelihood-sequence detection of symbols in alphabet of size M transmitted by uncoded, full-response continuous phase modulation over radio channel with additive white Gaussian noise. Structures of receivers derived from particular interpretation of maximum-likelihood metrics. Receivers include front ends, structures of which depends only on M, analogous to those in receivers of coherent CPM. Parts of receivers following front ends have structures, complexity of which would depend on N.
Structural and functional analyses of Mycobacterium tuberculosis Rv3315c-encoded metal-dependent homotetrameric cytidine deaminase.

PubMed

Sánchez-Quitian, Zilpa A; Schneider, Cristopher Z; Ducati, Rodrigo G; de Azevedo, Walter F; Bloch, Carlos; Basso, Luiz A; Santos, Diógenes S

2010-03-01

The emergence of drug-resistant strains of Mycobacterium tuberculosis, the causative agent of tuberculosis, has exacerbated the treatment and control of this disease. Cytidine deaminase (CDA) is a pyrimidine salvage pathway enzyme that recycles cytidine and 2'-deoxycytidine for uridine and 2'-deoxyuridine synthesis, respectively. A probable M. tuberculosis CDA-coding sequence (cdd, Rv3315c) was cloned, sequenced, expressed in Escherichia coli BL21(DE3), and purified to homogeneity. Mass spectrometry, N-terminal amino acid sequencing, gel filtration chromatography, and metal analysis of M. tuberculosis CDA (MtCDA) were carried out. These results and multiple sequence alignment demonstrate that MtCDA is a homotetrameric Zn(2+)-dependent metalloenzyme. Steady-state kinetic measurements yielded the following parameters: K(m)=1004 microM and k(cat)=4.8s(-1) for cytidine, and K(m)=1059 microM and k(cat)=3.5s(-1) for 2'-deoxycytidine. The pH dependence of k(cat) and k(cat)/K(M) for cytidine indicate that protonation of a single ionizable group with apparent pK(a) value of 4.3 abolishes activity, and protonation of a group with pK(a) value of 4.7 reduces binding. MtCDA was crystallized and crystal diffracted at 2.0 A resolution. Analysis of the crystallographic structure indicated the presence of a Zn(2+) coordinated by three conserved cysteines and the structure exhibits the canonical cytidine deaminase fold. (c) 2009 Elsevier Inc. All rights reserved.
Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

PubMed

Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

2017-04-01

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Applying Agrep to r-NSA to solve multiple sequences approximate matching.

PubMed

Ni, Bing; Wong, Man-Hon; Lam, Chi-Fai David; Leung, Kwong-Sak

2014-01-01

This paper addresses the approximate matching problem in a database consisting of multiple DNA sequences, where the proposed approach applies Agrep to a new truncated suffix array, r-NSA. The construction time of the structure is linear to the database size, and the computations of indexing a substring in the structure are constant. The number of characters processed in applying Agrep is analysed theoretically, and the theoretical upper-bound can approximate closely the empirical number of characters, which is obtained through enumerating the characters in the actual structure built. Experiments are carried out using (synthetic) random DNA sequences, as well as (real) genome sequences including Hepatitis-B Virus and X-chromosome. Experimental results show that, compared to the straight-forward approach that applies Agrep to multiple sequences individually, the proposed approach solves the matching problem in much shorter time. The speed-up of our approach depends on the sequence patterns, and for highly similar homologous genome sequences, which are the common cases in real-life genomes, it can be up to several orders of magnitude.
Sequence Dependent Interactions Between DNA and Single-Walled Carbon Nanotubes

NASA Astrophysics Data System (ADS)

Roxbury, Daniel

It is known that single-stranded DNA adopts a helical wrap around a single-walled carbon nanotube (SWCNT), forming a water-dispersible hybrid molecule. The ability to sort mixtures of SWCNTs based on chirality (electronic species) has recently been demonstrated using special short DNA sequences that recognize certain matching SWCNTs of specific chirality. This thesis investigates the intricacies of DNA-SWCNT sequence-specific interactions through both experimental and molecular simulation studies. The DNA-SWCNT binding strengths were experimentally quantified by studying the kinetics of DNA replacement by a surfactant on the surface of particular SWCNTs. Recognition ability was found to correlate strongly with measured binding strength, e.g. DNA sequence (TAT)4 was found to bind 20 times stronger to the (6,5)-SWCNT than sequence (TAT)4T. Next, using replica exchange molecular dynamics (REMD) simulations, equilibrium structures formed by (a) single-strands and (b) multiple-strands of 12-mer oligonucleotides adsorbed on various SWCNTs were explored. A number of structural motifs were discovered in which the DNA strand wraps around the SWCNT and 'stitches' to itself via hydrogen bonding. Great variability among equilibrium structures was observed and shown to be directly influenced by DNA sequence and SWCNT type. For example, the (6,5)-SWCNT DNA recognition sequence, (TAT)4, was found to wrap in a tight single-stranded right-handed helical conformation. In contrast, DNA sequence T12 forms a beta-barrel left-handed structure on the same SWCNT. These are the first theoretical indications that DNA-based SWCNT selectivity can arise on a molecular level. In a biomedical collaboration with the Mayo Clinic, pathways for DNA-SWCNT internalization into healthy human endothelial cells were explored. Through absorbance spectroscopy, TEM imaging, and confocal fluorescence microscopy, we showed that intracellular concentrations of SWCNTs far exceeded those of the incubation solution, which suggested an energy-dependent pathway. Additionally, by means of pharmacological inhibition and vector-induced gene knockout studies, the DNA-SWCNTs were shown to enter the cells via Rac1-mediated macropinocytosis.

RNA-dependent RNA polymerase of hepatitis C virus binds to its coding region RNA stem-loop structure, 5BSL3.2, and its negative strand.

PubMed

Kanamori, Hiroshi; Yuhashi, Kazuhito; Ohnishi, Shin; Koike, Kazuhiko; Kodama, Tatsuhiko

2010-05-01

The hepatitis C virus NS5B RNA-dependent RNA polymerase (RdRp) is a key enzyme involved in viral replication. Interaction between NS5B RdRp and the viral RNA sequence is likely to be an important step in viral RNA replication. The C-terminal half of the NS5B-coding sequence, which contains the important cis-acting replication element, has been identified as an NS5B-binding sequence. In the present study, we confirm the specific binding of NS5B to one of the RNA stem-loop structures in the region, 5BSL3.2. In addition, we show that NS5B binds to the complementary strand of 5BSL3.2 (5BSL3.2N). The bulge structure of 5BSL3.2N was shown to be indispensable for tight binding to NS5B. In vitro RdRp activity was inhibited by 5BSL3.2N, indicating the importance of the RNA element in the polymerization by RdRp. These results suggest the involvement of the RNA stem-loop structure of the negative strand in the replication process.
An unbiased adaptive sampling algorithm for the exploration of RNA mutational landscapes under evolutionary pressure.

PubMed

Waldispühl, Jérôme; Ponty, Yann

2011-11-01

The analysis of the relationship between sequences and structures (i.e., how mutations affect structures and reciprocally how structures influence mutations) is essential to decipher the principles driving molecular evolution, to infer the origins of genetic diseases, and to develop bioengineering applications such as the design of artificial molecules. Because their structures can be predicted from the sequence data only, RNA molecules provide a good framework to study this sequence-structure relationship. We recently introduced a suite of algorithms called RNAmutants which allows a complete exploration of RNA sequence-structure maps in polynomial time and space. Formally, RNAmutants takes an input sequence (or seed) to compute the Boltzmann-weighted ensembles of mutants with exactly k mutations, and sample mutations from these ensembles. However, this approach suffers from major limitations. Indeed, since the Boltzmann probabilities of the mutations depend of the free energy of the structures, RNAmutants has difficulties to sample mutant sequences with low G+C-contents. In this article, we introduce an unbiased adaptive sampling algorithm that enables RNAmutants to sample regions of the mutational landscape poorly covered by classical algorithms. We applied these methods to sample mutations with low G+C-contents. These adaptive sampling techniques can be easily adapted to explore other regions of the sequence and structural landscapes which are difficult to sample. Importantly, these algorithms come at a minimal computational cost. We demonstrate the insights offered by these techniques on studies of complete RNA sequence structures maps of sizes up to 40 nucleotides. Our results indicate that the G+C-content has a strong influence on the size and shape of the evolutionary accessible sequence and structural spaces. In particular, we show that low G+C-contents favor the apparition of internal loops and thus possibly the synthesis of tertiary structure motifs. On the other hand, high G+C-contents significantly reduce the size of the evolutionary accessible mutational landscapes.
Evolutionary profiles from the QR factorization of multiple sequence alignments

PubMed Central

Sethi, Anurag; O'Donoghue, Patrick; Luthey-Schulten, Zaida

2005-01-01

We present an algorithm to generate complete evolutionary profiles that represent the topology of the molecular phylogenetic tree of the homologous group. The method, based on the multidimensional QR factorization of numerically encoded multiple sequence alignments, removes redundancy from the alignments and orders the protein sequences by increasing linear dependence, resulting in the identification of a minimal basis set of sequences that spans the evolutionary space of the homologous group of proteins. We observe a general trend that these smaller, more evolutionarily balanced profiles have comparable and, in many cases, better performance in database searches than conventional profiles containing hundreds of sequences, constructed in an iterative and computationally intensive procedure. For more diverse families or superfamilies, with sequence identity <30%, structural alignments, based purely on the geometry of the protein structures, provide better alignments than pure sequence-based methods. Merging the structure and sequence information allows the construction of accurate profiles for distantly related groups. These structure-based profiles outperformed other sequence-based methods for finding distant homologs and were used to identify a putative class II cysteinyl-tRNA synthetase (CysRS) in several archaea that eluded previous annotation studies. Phylogenetic analysis showed the putative class II CysRSs to be a monophyletic group and homology modeling revealed a constellation of active site residues similar to that in the known class I CysRS. PMID:15741270
Structure of homeodomain-leucine zipper/DNA complexes studied using hydroxyl radical cleavage of DNA and methylation interference.

PubMed

Tron, Adriana E; Comelli, Raúl N; Gonzalez, Daniel H

2005-12-27

Homeodomain-leucine zipper (HD-Zip) proteins, unlike most homeodomain proteins, bind a pseudopalindromic DNA sequence as dimers. We have investigated the structure of the DNA complexes formed by two HD-Zip proteins with different nucleotide preferences at the central position of the binding site using footprinting and interference methods. The results indicate that the respective complexes are not symmetric, with the strand bearing a central purine (top strand) showing higher protection around the central region and the bottom strand protected toward the 3' end. Binding to a sequence with a nonpreferred central base pair produces a decrease in protection in either the top or the bottom strand, depending upon the protein. Modeling studies derived from the complex formed by the monomeric Antennapedia homeodomain with DNA indicate that in the HD-Zip/DNA complex the recognition helix of one of the monomers is displaced within the major groove respective to the other one. This monomer seems to lose contacts with a part of the recognition sequence upon binding to the nonpreferred site. The results show that the structure of the complex formed by HD-Zip proteins with DNA is dependent upon both protein intrinsic characteristics and the nucleotides present at the central position of the recognition sequence.
Comprehensive analysis of RNA-protein interactions by high-throughput sequencing-RNA affinity profiling.

PubMed

Tome, Jacob M; Ozer, Abdullah; Pagano, John M; Gheba, Dan; Schroth, Gary P; Lis, John T

2014-06-01

RNA-protein interactions play critical roles in gene regulation, but methods to quantitatively analyze these interactions at a large scale are lacking. We have developed a high-throughput sequencing-RNA affinity profiling (HiTS-RAP) assay by adapting a high-throughput DNA sequencer to quantify the binding of fluorescently labeled protein to millions of RNAs anchored to sequenced cDNA templates. Using HiTS-RAP, we measured the affinity of mutagenized libraries of GFP-binding and NELF-E-binding aptamers to their respective targets and identified critical regions of interaction. Mutations additively affected the affinity of the NELF-E-binding aptamer, whose interaction depended mainly on a single-stranded RNA motif, but not that of the GFP aptamer, whose interaction depended primarily on secondary structure.
Evaluating the accuracy of SHAPE-directed RNA secondary structure predictions

PubMed Central

Sükösd, Zsuzsanna; Swenson, M. Shel; Kjems, Jørgen; Heitsch, Christine E.

2013-01-01

Recent advances in RNA structure determination include using data from high-throughput probing experiments to improve thermodynamic prediction accuracy. We evaluate the extent and nature of improvements in data-directed predictions for a diverse set of 16S/18S ribosomal sequences using a stochastic model of experimental SHAPE data. The average accuracy for 1000 data-directed predictions always improves over the original minimum free energy (MFE) structure. However, the amount of improvement varies with the sequence, exhibiting a correlation with MFE accuracy. Further analysis of this correlation shows that accurate MFE base pairs are typically preserved in a data-directed prediction, whereas inaccurate ones are not. Thus, the positive predictive value of common base pairs is consistently higher than the directed prediction accuracy. Finally, we confirm sequence dependencies in the directability of thermodynamic predictions and investigate the potential for greater accuracy improvements in the worst performing test sequence. PMID:23325843
cgDNA: a software package for the prediction of sequence-dependent coarse-grain free energies of B-form DNA.

PubMed

Petkevičiūtė, D; Pasi, M; Gonzalez, O; Maddocks, J H

2014-11-10

cgDNA is a package for the prediction of sequence-dependent configuration-space free energies for B-form DNA at the coarse-grain level of rigid bases. For a fragment of any given length and sequence, cgDNA calculates the configuration of the associated free energy minimizer, i.e. the relative positions and orientations of each base, along with a stiffness matrix, which together govern differences in free energies. The model predicts non-local (i.e. beyond base-pair step) sequence dependence of the free energy minimizer. Configurations can be input or output in either the Curves+ definition of the usual helical DNA structural variables, or as a PDB file of coordinates of base atoms. We illustrate the cgDNA package by comparing predictions of free energy minimizers from (a) the cgDNA model, (b) time-averaged atomistic molecular dynamics (or MD) simulations, and (c) NMR or X-ray experimental observation, for (i) the Dickerson-Drew dodecamer and (ii) three oligomers containing A-tracts. The cgDNA predictions are rather close to those of the MD simulations, but many orders of magnitude faster to compute. Both the cgDNA and MD predictions are in reasonable agreement with the available experimental data. Our conclusion is that cgDNA can serve as a highly efficient tool for studying structural variations in B-form DNA over a wide range of sequences. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sequence-Dependent Self-Assembly and Structural Diversity of Islet Amyloid Polypeptide-Derived β-Sheet Fibrils

DOE PAGES

Wang, Shih-Ting; Lin, Yiyang; Spencer, Ryan K.; ...

2017-08-03

Determining the structural origins of amyloid fibrillation is essential for understanding both the pathology of amyloidosis and the rational design of inhibitors to prevent or reverse amyloid formation. In this work, the decisive roles of peptide structures on amyloid self-assembly and morphological diversity were investigated by the design of eight amyloidogenic peptides derived from islet amyloid polypeptide. Among the segments, two distinct morphologies were highlighted in the form of twisted and planar (untwisted) ribbons with varied diameters, thicknesses, and lengths. In particular, transformation of amyloid fibrils from twisted ribbons into untwisted structures was triggered by substitution of the C-terminal serinemore » with threonine, where the side chain methyl group was responsible for the distinct morphological change. This effect was confirmed following serine substitution with alanine and valine and was ascribed to the restriction of intersheet torsional strain through the increased hydrophobic interactions and hydrogen bonding. We also studied the variation of fibril morphology (i.e., association and helicity) and peptide aggregation propensity by increasing the hydrophobicity of the peptide side group, capping the N-terminus, and extending sequence length. Lastly, we anticipate that our insights into sequence-dependent fibrillation and morphological diversity will shed light on the structural interpretation of amyloidogenesis and development of structure-specific imaging agents and aggregation inhibitors.« less
A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.

PubMed

Etchebest, C; Benros, C; Bornot, A; Camproux, A-C; de Brevern, A G

2007-11-01

Protein sequence world is considerably larger than structure world. In consequence, numerous non-related sequences may adopt similar 3D folds and different kinds of amino acids may thus be found in similar 3D structures. By grouping together the 20 amino acids into a smaller number of representative residues with similar features, sequence world simplification may be achieved. This clustering hence defines a reduced amino acid alphabet (reduced AAA). Numerous works have shown that protein 3D structures are composed of a limited number of building blocks, defining a structural alphabet. We previously identified such an alphabet composed of 16 representative structural motifs (5-residues length) called Protein Blocks (PBs). This alphabet permits to translate the structure (3D) in sequence of PBs (1D). Based on these two concepts, reduced AAA and PBs, we analyzed the distributions of the different kinds of amino acids and their equivalences in the structural context. Different reduced sets were considered. Recurrent amino acid associations were found in all the local structures while other were specific of some local structures (PBs) (e.g Cysteine, Histidine, Threonine and Serine for the alpha-helix Ncap). Some similar associations are found in other reduced AAAs, e.g Ile with Val, or hydrophobic aromatic residues Trp with Phe and Tyr. We put into evidence interesting alternative associations. This highlights the dependence on the information considered (sequence or structure). This approach, equivalent to a substitution matrix, could be useful for designing protein sequence with different features (for instance adaptation to environment) while preserving mainly the 3D fold.
Does TATA matter? A structural exploration of the selectivity determinants in its complexes with TATA box-binding protein.

PubMed Central

Pastor, N; Pardo, L; Weinstein, H

1997-01-01

The binding of the TATA box-binding protein (TBP) to a TATA sequence in DNA is essential for eukaryotic basal transcription. TBP binds in the minor groove of DNA, causing a large distortion of the DNA helix. Given the apparent stereochemical equivalence of AT and TA basepairs in the minor groove, DNA deformability must play a significant role in binding site selection, because not all AT-rich sequences are bound effectively by TBP. To gain insight into the precise role that the properties of the TATA sequence have in determining the specificity of the DNA substrates of TBP, the solution structure and dynamics of seven DNA dodecamers have been studied by using molecular dynamics simulations. The analysis of the structural properties of basepair steps in these TATA sequences suggests a reason for the preference for alternating pyrimidine-purine (YR) sequences, but indicates that these properties cannot be the sole determinant of the sequence specificity of TBP. Rather, recognition depends on the interplay between the inherent deformability of the DNA and steric complementarity at the molecular interface. Images FIGURE 2 PMID:9251783
CircularLogo: A lightweight web application to visualize intra-motif dependencies.

PubMed

Ye, Zhenqing; Ma, Tao; Kalmbach, Michael T; Dasari, Surendra; Kocher, Jean-Pierre A; Wang, Liguo

2017-05-22

The sequence logo has been widely used to represent DNA or RNA motifs for more than three decades. Despite its intelligibility and intuitiveness, the traditional sequence logo is unable to display the intra-motif dependencies and therefore is insufficient to fully characterize nucleotide motifs. Many methods have been developed to quantify the intra-motif dependencies, but fewer tools are available for visualization. We developed CircularLogo, a web-based interactive application, which is able to not only visualize the position-specific nucleotide consensus and diversity but also display the intra-motif dependencies. Applying CircularLogo to HNF6 binding sites and tRNA sequences demonstrated its ability to show intra-motif dependencies and intuitively reveal biomolecular structure. CircularLogo is implemented in JavaScript and Python based on the Django web framework. The program's source code and user's manual are freely available at http://circularlogo.sourceforge.net . CircularLogo web server can be accessed from http://bioinformaticstools.mayo.edu/circularlogo/index.html . CircularLogo is an innovative web application that is specifically designed to visualize and interactively explore intra-motif dependencies.
Downregulation of viral RNA translation by hepatitis C virus non-structural protein NS5A requires the poly(U/UC) sequence in the 3' UTR.

PubMed

Hoffman, Brett; Li, Zhubing; Liu, Qiang

2015-08-01

Hepatitis C virus (HCV) non-structural protein 5A (NS5A) is essential for viral replication; however, its effect on HCV RNA translation remains controversial partially due to the use of reporters lacking the 3' UTR, where NS5A binds to the poly(U/UC) sequence. We investigated the role of NS5A in HCV translation using a monocistronic RNA containing a Renilla luciferase gene flanked by the HCV UTRs. We found that NS5A downregulated viral RNA translation in a dose-dependent manner. This downregulation required both the 5' and 3' UTRs of HCV because substitution of either sequence with the 5' and 3' UTRs of enterovirus 71 or a cap structure at the 5' end eliminated the effects of NS5A on translation. Translation of the HCV genomic RNA was also downregulated by NS5A. The inhibition of HCV translation by NS5A required the poly(U/UC) sequence in the 3' UTR as NS5A did not affect translation when it was deleted. In addition, we showed that, whilst the amphipathic α-helix of NS5A has no effect on viral translation, the three domains of NS5A can inhibit translation independently, also dependent on the presence of the poly(U/UC) sequence in the 3' UTR. These results suggested that NS5A downregulated HCV RNA translation through a mechanism involving the poly(U/UC) sequence in the 3' UTR.
On the Impact of Widening Vector Registers on Sequence Alignment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Daily, Jeffrey A.; Kalyanaraman, Anantharaman; Krishnamoorthy, Sriram

2016-09-22

Vector extensions, such as SSE, have been part of the x86 since the 1990s, with applications in graphics, signal processing, and scientific applications. Although many algorithms and applications can naturally benefit from automatic vectorization techniques, there are still many that are difficult to vectorize due to their dependence on irregular data structures, dense branch operations, or data dependencies. Sequence alignment, one of the most widely used operations in bioinformatics workflows, has a computational footprint that features complex data dependencies. In this paper, we demonstrate that the trend of widening vector registers adversely affects the state-of-the-art sequence alignment algorithm based onmore » striped data layouts. We present a practically efficient SIMD implementation of a parallel scan based sequence alignment algorithm that can better exploit wider SIMD units. We conduct comprehensive workload and use case analyses to characterize the relative behavior of the striped and scan approaches and identify the best choice of algorithm based on input length and SIMD width.« less
Tidal dissipation in rotating low-mass stars and implications for the orbital evolution of close-in planets. I. From the PMS to the RGB at solar metallicity

NASA Astrophysics Data System (ADS)

Gallet, F.; Bolmont, E.; Mathis, S.; Charbonnel, C.; Amard, L.

2017-08-01

Context. Star-planet interactions must be taken into account in stellar models to understand the dynamical evolution of close-in planets. The dependence of the tidal interactions on the structural and rotational evolution of the star is of particular importance and should be correctly treated. Aims: We quantify how tidal dissipation in the convective envelope of rotating low-mass stars evolves from the pre-main sequence up to the red-giant branch depending on the initial stellar mass. We investigate the consequences of this evolution on planetary orbital evolution. Methods: We couple the tidal dissipation formalism previously described to the stellar evolution code STAREVOL and apply this coupling to rotating stars with masses between 0.3 and 1.4 M⊙. As a first step, this formalism assumes a simplified bi-layer stellar structure with corresponding averaged densities for the radiative core and the convective envelope. We use a frequency-averaged treatment of the dissipation of tidal inertial waves in the convection zone (but neglect the dissipation of tidal gravity waves in the radiation zone). In addition, we generalize a recent work by following the orbital evolution of close-in planets using the new tidal dissipation predictions for advanced phases of stellar evolution. Results: On the pre-main sequence the evolution of tidal dissipation is controlled by the evolution of the internal structure of the contracting star. On the main sequence it is strongly driven by the variation of surface rotation that is impacted by magnetized stellar winds braking. The main effect of taking into account the rotational evolution of the stars is to lower the tidal dissipation strength by about four orders of magnitude on the main sequence, compared to a normalized dissipation rate that only takes into account structural changes. Conclusions: The evolution of the dissipation strongly depends on the evolution of the internal structure and rotation of the star. From the pre-main sequence up to the tip of the red-giant branch, it varies by several orders of magnitude, with strong consequences for the orbital evolution of close-in massive planets. These effects are the strongest during the pre-main sequence, implying that the planets are mainly sensitive to the star's early history.
The Conservation of Structure and Mechanism of Catalytic Action in a Family of Thiamin Pyrophosphate (TPP)-dependent Enzymes

NASA Technical Reports Server (NTRS)

Dominiak, P.; Ciszak, Ewa

2004-01-01

Thiamin pyrophosphate (TPP)-dependent enzymes are a divergent family of TPP and metal ion binding proteins that perform a wide range of functions with the common decarboxylation steps of a -(O=)C-C(OH)- fragment of alpha-ketoacids and alpha- hydroxyaldehydes. To determine how structure and catalytic action are conserved in the context of large sequence differences existing within this family of enzymes, we have carried out an analysis of TPP-dependent enzymes of known structures. The common structure of TPP-dependent enzymes is formed at the interface of four alpha/beta domains from at least two subunits, which provide for two metal and TPP-binding sites. Residues around these catalytic sites are conserved for functional purpose, while those further away from TPP are conserved for structural reasons. Together they provide a network of contacts required for flip-flop catalytic action within TPP-dependent enzymes. Thus our analysis defines a TPP-action motif that is proposed for annotating TPP-dependent enzymes for advancing functional proteomics.
Double-stranded RNA interferes in a sequence-specific manner with the infection of representative members of the two viroid families

DOE Office of Scientific and Technical Information (OSTI.GOV)

Carbonell, Alberto; Martinez de Alba, Angel-Emilio; Flores, Ricardo

2008-02-05

Infection by viroids, non-protein-coding circular RNAs, occurs with the accumulation of 21-24 nt viroid-derived small RNAs (vd-sRNAs) with characteristic properties of small interfering RNAs (siRNAs) associated to RNA silencing. The vd-sRNAs most likely derive from dicer-like (DCL) enzymes acting on viroid-specific dsRNA, the key elicitor of RNA silencing, or on the highly structured genomic RNA. Previously, viral dsRNAs delivered mechanically or agroinoculated have been shown to interfere with virus infection in a sequence-specific manner. Here, we report similar results with members of the two families of nuclear- and chloroplast-replicating viroids. Moreover, homologous vd-sRNAs co-delivered mechanically also interfered with one ofmore » the viroids examined. The interference was sequence-specific, temperature-dependent and, in some cases, also dependent on the dose of the co-inoculated dsRNA or vd-sRNAs. The sequence-specific nature of these effects suggests the involvement of the RNA induced silencing complex (RISC), which provides sequence specificity to RNA silencing machinery. Therefore, viroid titer in natural infections might be regulated by the concerted action of DCL and RISC. Viroids could have evolved their secondary structure as a compromise between resistance to DCL and RISC, which act preferentially against RNAs with compact and relaxed secondary structures, respectively. In addition, compartmentation, association with proteins or active replication might also help viroids to elude their host RNA silencing machinery.« less
The role of molecular structure of sugar-phosphate backbone and nucleic acid bases in the formation of single-stranded and double-stranded DNA structures.

PubMed

Poltev, Valeri; Anisimov, Victor M; Danilov, Victor I; Garcia, Dolores; Sanchez, Carolina; Deriabina, Alexandra; Gonzalez, Eduardo; Rivas, Francisco; Polteva, Nina

2014-06-01

Our previous DFT computations of deoxydinucleoside monophosphate complexes with Na(+)-ions (dDMPs) have demonstrated that the main characteristics of Watson-Crick (WC) right-handed duplex families are predefined in the local energy minima of dDMPs. In this work, we study the mechanisms of contribution of chemically monotonous sugar-phosphate backbone and the bases into the double helix irregularity. Geometry optimization of sugar-phosphate backbone produces energy minima matching the WC DNA conformations. Studying the conformational variability of dDMPs in response to sequence permutation, we found that simple replacement of bases in the previously fully optimized dDMPs, e.g. by constructing Pyr-Pur from Pur-Pyr, and Pur-Pyr from Pyr-Pur sequences, while retaining the backbone geometry, automatically produces the mutual base position characteristic of the target sequence. Based on that, we infer that the directionality and the preferable regions of the sugar-phosphate torsions, combined with the difference of purines from pyrimidines in ring shape, determines the sequence dependence of the structure of WC DNA. No such sequence dependence exists in dDMPs corresponding to other DNA conformations (e.g., Z-family and Hoogsteen duplexes). Unlike other duplexes, WC helix is unique by its ability to match the local energy minima of the free single strand to the preferable conformations of the duplex. Copyright © 2013 Wiley Periodicals, Inc.
Percolation in random-Sierpiński carpets: A real space renormalization group approach

NASA Astrophysics Data System (ADS)

Perreau, Michel; Peiro, Joaquina; Berthier, Serge

1996-11-01

The site percolation transition in random Sierpiński carpets is investigated by real space renormalization. The fixed point is not unique like in regular translationally invariant lattices, but depends on the number k of segmentation steps of the generation process of the fractal. It is shown that, for each scale invariance ratio n, the sequence of fixed points pn,k is increasing with k, and converges when k-->∞ toward a limit pn strictly less than 1. Moreover, in such scale invariant structures, the percolation threshold does not depend only on the scale invariance ratio n, but also on the scale. The sequence pn,k and pn are calculated for n=4, 8, 16, 32, and 64, and for k=1 to k=11, and k=∞. The corresponding thermal exponent sequence νn,k is calculated for n=8 and 16, and for k=1 to k=5, and k=∞. Suggestions are made for an experimental test in physical self-similar structures.
An analysis by metabolic labelling of the encephalomyocarditis virus ribosomal frameshifting efficiency and stimulators.

PubMed

Ling, Roger; Firth, Andrew E

2017-08-01

Programmed -1 ribosomal frameshifting is a mechanism of gene expression whereby specific signals within messenger RNAs direct a proportion of ribosomes to shift -1 nt and continue translating in the new reading frame. Such frameshifting normally depends on an RNA structure stimulator 3'-adjacent to a 'slippery' heptanucleotide shift site sequence. Recently we identified an unusual frameshifting mechanism in encephalomyocarditis virus, where the stimulator involves a trans-acting virus protein. Thus, in contrast to other examples of -1 frameshifting, the efficiency of frameshifting in encephalomyocarditis virus is best studied in the context of virus infection. Here we use metabolic labelling to analyse the frameshifting efficiency of wild-type and mutant viruses. Confirming previous results, frameshifting depends on a G_GUU_UUU shift site sequence and a 3'-adjacent stem-loop structure, but is not appreciably affected by the 'StopGo' sequence present ~30 nt upstream. At late timepoints, frameshifting was estimated to be 46-76 % efficient.
Improving the realism of white matter numerical phantoms: a step towards a better understanding of the influence of structural disorders in diffusion MRI

NASA Astrophysics Data System (ADS)

Ginsburger, Kévin; Poupon, Fabrice; Beaujoin, Justine; Estournet, Delphine; Matuschke, Felix; Mangin, Jean-François; Axer, Markus; Poupon, Cyril

2018-02-01

White matter is composed of irregularly packed axons leading to a structural disorder in the extra-axonal space. Diffusion MRI experiments using oscillating gradient spin echo sequences have shown that the diffusivity transverse to axons in this extra-axonal space is dependent on the frequency of the employed sequence. In this study, we observe the same frequency-dependence using 3D simulations of the diffusion process in disordered media. We design a novel white matter numerical phantom generation algorithm which constructs biomimicking geometric configurations with few design parameters, and enables to control the level of disorder of the generated phantoms. The influence of various geometrical parameters present in white matter, such as global angular dispersion, tortuosity, presence of Ranvier nodes, beading, on the extra-cellular perpendicular diffusivity frequency dependence was investigated by simulating the diffusion process in numerical phantoms of increasing complexity and fitting the resulting simulated diffusion MR signal attenuation with an adequate analytical model designed for trapezoidal OGSE sequences. This work suggests that angular dispersion and especially beading have non-negligible effects on this extracellular diffusion metrics that may be measured using standard OGSE DW-MRI clinical protocols.

Distribution and Features of the Six Classes of Peroxiredoxins

PubMed Central

Poole, Leslie B.; Nelson, Kimberly J.

2016-01-01

Peroxiredoxins are cysteine-dependent peroxide reductases that group into 6 different, structurally discernable classes. In 2011, our research team reported the application of a bioinformatic approach called active site profiling to extract active site-proximal sequence segments from the 29 distinct, structurally-characterized peroxiredoxins available at the time. These extracted sequences were then used to create unique profiles for the six groups which were subsequently used to search GenBank(nr), allowing identification of ∼3500 peroxiredoxin sequences and their respective subgroups. Summarized in this minireview are the features and phylogenetic distributions of each of these peroxiredoxin subgroups; an example is also provided illustrating the use of the web accessible, searchable database known as PREX to identify subfamily-specific peroxiredoxin sequences for the organism Vitis vinifera (grape). PMID:26810075
Structural hot spots for the solubility of globular proteins

PubMed Central

Ganesan, Ashok; Siekierska, Aleksandra; Beerten, Jacinte; Brams, Marijke; Van Durme, Joost; De Baets, Greet; Van der Kant, Rob; Gallardo, Rodrigo; Ramakers, Meine; Langenberg, Tobias; Wilkinson, Hannah; De Smet, Frederik; Ulens, Chris; Rousseau, Frederic; Schymkowitz, Joost

2016-01-01

Natural selection shapes protein solubility to physiological requirements and recombinant applications that require higher protein concentrations are often problematic. This raises the question whether the solubility of natural protein sequences can be improved. We here show an anti-correlation between the number of aggregation prone regions (APRs) in a protein sequence and its solubility, suggesting that mutational suppression of APRs provides a simple strategy to increase protein solubility. We show that mutations at specific positions within a protein structure can act as APR suppressors without affecting protein stability. These hot spots for protein solubility are both structure and sequence dependent but can be computationally predicted. We demonstrate this by reducing the aggregation of human α-galactosidase and protective antigen of Bacillus anthracis through mutation. Our results indicate that many proteins possess hot spots allowing to adapt protein solubility independently of structure and function. PMID:26905391
Benchmarking Inverse Statistical Approaches for Protein Structure and Design with Exactly Solvable Models.

PubMed

Jacquin, Hugo; Gilson, Amy; Shakhnovich, Eugene; Cocco, Simona; Monasson, Rémi

2016-05-01

Inverse statistical approaches to determine protein structure and function from Multiple Sequence Alignments (MSA) are emerging as powerful tools in computational biology. However the underlying assumptions of the relationship between the inferred effective Potts Hamiltonian and real protein structure and energetics remain untested so far. Here we use lattice protein model (LP) to benchmark those inverse statistical approaches. We build MSA of highly stable sequences in target LP structures, and infer the effective pairwise Potts Hamiltonians from those MSA. We find that inferred Potts Hamiltonians reproduce many important aspects of 'true' LP structures and energetics. Careful analysis reveals that effective pairwise couplings in inferred Potts Hamiltonians depend not only on the energetics of the native structure but also on competing folds; in particular, the coupling values reflect both positive design (stabilization of native conformation) and negative design (destabilization of competing folds). In addition to providing detailed structural information, the inferred Potts models used as protein Hamiltonian for design of new sequences are able to generate with high probability completely new sequences with the desired folds, which is not possible using independent-site models. Those are remarkable results as the effective LP Hamiltonians used to generate MSA are not simple pairwise models due to the competition between the folds. Our findings elucidate the reasons for the success of inverse approaches to the modelling of proteins from sequence data, and their limitations.
Deriving high-resolution protein backbone structure propensities from all crystal data using the information maximization device.

PubMed

Solis, Armando D

2014-01-01

The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download.
The ion-induced folding of the hammerhead ribozyme: core sequence changes that perturb folding into the active conformation.

PubMed Central

Bassi, G S; Murchie, A I; Lilley, D M

1996-01-01

The hammerhead ribozyme undergoes an ion-dependent folding process into the active conformation. We find that the folding can be blocked at specific stages by changes of sequence or functionality within the core. In the the absence of added metal ions, the global structure of the hammerhead is extended, with a large angle subtended between stems I and II. No core sequence changes appear to alter this geometry, consistent with an unstructured core under these conditions. Upon addition of low concentrations of magnesium ions, the hammerhead folds by an association of stems II and III, to include a large angle between them. This stage is inhibited or altered by mutations within the oligopurine sequence lying between stems II and III, and folding is completely prevented by an A14G mutation. Further increase in magnesium ion concentration brings about a second stage of folding in the natural sequence hammerhead, involving a reorientation of stem I, which rotates around into the same direction of stem II. Because this transition occurs over the same range of magnesium ion concentration over which the hammerhead ribozyme becomes active, it is likely that the final conformation is most closely related to the active form of the structure. Magnesium ion-dependent folding into this conformation is prevented by changes at G5, notably removal of the 2'-hydroxyl group and replacement of the base by cytidine. The ability to dissect the folding process by means of sequence changes suggests that two separate ion-dependent stages are involved in the folding of the hammerhead ribozyme into the active conformation. PMID:8752086
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring

PubMed Central

2012-01-01

Background Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. Results The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appear to play an important role in the molecular structure and/or function. Conclusions Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family. PMID:22793672
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring.

PubMed

Durston, Kirk K; Chiu, David Ky; Wong, Andrew Kc; Li, Gary Cl

2012-07-13

Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appear to play an important role in the molecular structure and/or function. Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family.
Femtosecond laser-induced periodic surface structures on silicon upon polarization controlled two-color double-pulse irradiation.

PubMed

Höhm, Sandra; Herzlieb, Marcel; Rosenfeld, Arkadi; Krüger, Jörg; Bonse, Jörn

2015-01-12

Two-color double-fs-pulse experiments were performed on silicon wafers to study the temporally distributed energy deposition in the formation of laser-induced periodic surface structures (LIPSS). A Mach-Zehnder interferometer generated parallel or cross-polarized double-pulse sequences at 400 and 800 nm wavelength, with inter-pulse delays up to a few picoseconds between the sub-ablation 50-fs-pulses. Multiple two-color double-pulse sequences were collinearly focused by a spherical mirror to the sample. The resulting LIPSS characteristics (periods, areas) were analyzed by scanning electron microscopy. A wavelength-dependent plasmonic mechanism is proposed to explain the delay-dependence of the LIPSS. These two-color experiments extend previous single-color studies and prove the importance of the ultrafast energy deposition for LIPSS formation.
A folded viral noncoding RNA blocks host cell exoribonucleases through a conformationally dynamic RNA structure.

PubMed

Steckelberg, Anna-Lena; Akiyama, Benjamin M; Costantino, David A; Sit, Tim L; Nix, Jay C; Kieft, Jeffrey S

2018-06-19

Folded RNA elements that block processive 5' → 3' cellular exoribonucleases (xrRNAs) to produce biologically active viral noncoding RNAs have been discovered in flaviviruses, potentially revealing a new mode of RNA maturation. However, whether this RNA structure-dependent mechanism exists elsewhere and, if so, whether a singular RNA fold is required, have been unclear. Here we demonstrate the existence of authentic RNA structure-dependent xrRNAs in dianthoviruses, plant-infecting viruses unrelated to animal-infecting flaviviruses. These xrRNAs have no sequence similarity to known xrRNAs; thus, we used a combination of biochemistry and virology to characterize their sequence requirements and mechanism of stopping exoribonucleases. By solving the structure of a dianthovirus xrRNA by X-ray crystallography, we reveal a complex fold that is very different from that of the flavivirus xrRNAs. However, both versions of xrRNAs contain a unique topological feature, a pseudoknot that creates a protective ring around the 5' end of the RNA structure; this may be a defining structural feature of xrRNAs. Single-molecule FRET experiments reveal that the dianthovirus xrRNAs undergo conformational changes and can use "codegradational remodeling," exploiting the exoribonucleases' degradation-linked helicase activity to help form their resistant structure; such a mechanism has not previously been reported. Convergent evolution has created RNA structure-dependent exoribonuclease resistance in different contexts, which establishes it as a general RNA maturation mechanism and defines xrRNAs as an authentic functional class of RNAs.
T box riboswitches in Actinobacteria: Translational regulation via novel tRNA interactions

PubMed Central

Sherwood, Anna V.; Grundy, Frank J.; Henkin, Tina M.

2015-01-01

The T box riboswitch regulates many amino acid-related genes in Gram-positive bacteria. T box riboswitch-mediated gene regulation was shown previously to occur at the level of transcription attenuation via structural rearrangements in the 5′ untranslated (leader) region of the mRNA in response to binding of a specific uncharged tRNA. In this study, a novel group of isoleucyl-tRNA synthetase gene (ileS) T box leader sequences found in organisms of the phylum Actinobacteria was investigated. The Stem I domains of these RNAs lack several highly conserved elements that are essential for interaction with the tRNA ligand in other T box RNAs. Many of these RNAs were predicted to regulate gene expression at the level of translation initiation through tRNA-dependent stabilization of a helix that sequesters a sequence complementary to the Shine–Dalgarno (SD) sequence, thus freeing the SD sequence for ribosome binding and translation initiation. We demonstrated specific binding to the cognate tRNAIle and tRNAIle-dependent structural rearrangements consistent with regulation at the level of translation initiation, providing the first biochemical demonstration, to our knowledge, of translational regulation in a T box riboswitch. PMID:25583497
Crystal structure and sequence-dependent conformation of the A.G mispaired oligonucleotide d(CGCAAGCTGGCG).

PubMed Central

Webster, G D; Sanderson, M R; Skelly, J V; Neidle, S; Swann, P F; Li, B F; Tickle, I J

1990-01-01

The crystal structure of the dodecanucleotide d(CGCAAGCTGGCG) has been determined to a resolution of 2.5 A and refined to an R factor of 19.3% for 1710 reflections. The sequence crystallizes as a B-type double helix, with two G(anti).A(syn) base pairs. These are stabilized by three-center hydrogen bonds to pyrimidines that induce perturbations in base-pair geometry. The central AGCT region of the helix has a wide (greater than 6 A) minor groove. PMID:2395870
Influences of the molecular fuel structure on combustion reactions towards soot precursors in selected alkane and alkene flames.

PubMed

Ruwe, Lena; Moshammer, Kai; Hansen, Nils; Kohse-Höinghaus, Katharina

2018-04-25

In this study, we experimentally investigate the high-temperature oxidation kinetics of n-pentane, 1-pentene and 2-methyl-2-butene (2M2B) in a combustion environment using flame-sampling molecular beam mass spectrometry. The selected C5 fuels are prototypes for linear and branched, saturated and unsaturated fuel components, featuring different C-C and C-H bond structures. It is shown that the formation tendency of species, such as polycyclic aromatic hydrocarbons (PAHs), yielded through mass growth reactions increases drastically in the sequence n-pentane < 1-pentene < 2M2B. This comparative study enables valuable insights into fuel-dependent reaction sequences of the gas-phase combustion mechanism that provide explanations for the observed difference in the PAH formation tendency. First, we investigate the fuel-structure-dependent formation of small hydrocarbon species that are yielded as intermediate species during the fuel decomposition, because these species are at the origin of the subsequent mass growth reaction pathways. Second, we review typical PAH formation reactions inspecting repetitive growth sequences in dependence of the molecular fuel structure. Third, we discuss how differences in the intermediate species pool influence the formation reactions of key aromatic ring species that are important for the PAH growth process underlying soot formation. As a main result it was found that for the fuels featuring a C[double bond, length as m-dash]C double bond, the chemistry of their allylic fuel radicals and their decomposition products strongly influences the combination reactions to the initially formed aromatic ring species and as a consequence, the PAH formation tendency.
Protein family clustering for structural genomics.

PubMed

Yan, Yongpan; Moult, John

2005-10-28

A major goal of structural genomics is the provision of a structural template for a large fraction of protein domains. The magnitude of this task depends on the number and nature of protein sequence families. With a large number of bacterial genomes now fully sequenced, it is possible to obtain improved estimates of the number and diversity of families in that kingdom. We have used an automated clustering procedure to group all sequences in a set of genomes into protein families. Bench-marking shows the clustering method is sensitive at detecting remote family members, and has a low level of false positives. This comprehensive protein family set has been used to address the following questions. (1) What is the structure coverage for currently known families? (2) How will the number of known apparent families grow as more genomes are sequenced? (3) What is a practical strategy for maximizing structure coverage in future? Our study indicates that approximately 20% of known families with three or more members currently have a representative structure. The study indicates also that the number of apparent protein families will be considerably larger than previously thought: We estimate that, by the criteria of this work, there will be about 250,000 protein families when 1000 microbial genomes have been sequenced. However, the vast majority of these families will be small, and it will be possible to obtain structural templates for 70-80% of protein domains with an achievable number of representative structures, by systematically sampling the larger families.
The computational linguistics of biological sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Searls, D.

1995-12-31

This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. Protein sequences are analogous in many respects, particularly their folding behavior. Proteins have a much richer variety of interactions, but in theory the same linguistic principles could come to bear in describing dependencies between distant residues that arise by virtue of three-dimensional structure. This tutorial will concentrate on nucleic acid sequences.
A parallel implementation of the Wuchty algorithm with additional experimental filters to more thoroughly explore RNA conformational space.

PubMed

Stone, Jonathan W; Bleckley, Samuel; Lavelle, Sean; Schroeder, Susan J

2015-01-01

We present new modifications to the Wuchty algorithm in order to better define and explore possible conformations for an RNA sequence. The new features, including parallelization, energy-independent lonely pair constraints, context-dependent chemical probing constraints, helix filters, and optional multibranch loops, provide useful tools for exploring the landscape of RNA folding. Chemical probing alone may not necessarily define a single unique structure. The helix filters and optional multibranch loops are global constraints on RNA structure that are an especially useful tool for generating models of encapsidated viral RNA for which cryoelectron microscopy or crystallography data may be available. The computations generate a combinatorially complete set of structures near a free energy minimum and thus provide data on the density and diversity of structures near the bottom of a folding funnel for an RNA sequence. The conformational landscapes for some RNA sequences may resemble a low, wide basin rather than a steep funnel that converges to a single structure.
Cloning and characterization of a Prevotella melaninogenica hemolysin.

PubMed Central

Allison, H E; Hillman, J D

1997-01-01

Hemolysins have been proven to be important virulence factors in many medically relevant pathogenic organisms. Their production has also been implicated in the etiology of periodontal disease. Hemolytic strain 361B of Prevotella melaninogenica, a putative etiologic agent of periodontal disease, was used in this study. The cloning, sequencing, and characterization of phyA, the structural gene for a P. melaninogenica hemolysin, is described. No extensive sequence homology could be identified between phyA and any reported sequence at either the nucleotide or amino acid level. As predicted from sequence analysis, this gene produces a 39-kDa protein which has hemolytic activity as measured by zymogram analysis. Unlike many Ca2+-dependent bacterial hemolysins, both the cloned and native PhyA proteins were enhanced by the presence of EDTA in a dose-dependent fashion with 40 mM EDTA allowing maximum activity. Ca2+ and Mg2+ were found to be inhibitory. The hemolytic activity also was found to have a dose-dependent endpoint. Through recovery of hemolytic activity from a spent reaction, this endpoint was shown to be the result of end product inhibition. This is the first report describing the cloning and sequencing of a gene from P. melaninogenica. PMID:9199448
Cloning and characterization of a Prevotella melaninogenica hemolysin.

PubMed

Allison, H E; Hillman, J D

1997-07-01

Hemolysins have been proven to be important virulence factors in many medically relevant pathogenic organisms. Their production has also been implicated in the etiology of periodontal disease. Hemolytic strain 361B of Prevotella melaninogenica, a putative etiologic agent of periodontal disease, was used in this study. The cloning, sequencing, and characterization of phyA, the structural gene for a P. melaninogenica hemolysin, is described. No extensive sequence homology could be identified between phyA and any reported sequence at either the nucleotide or amino acid level. As predicted from sequence analysis, this gene produces a 39-kDa protein which has hemolytic activity as measured by zymogram analysis. Unlike many Ca2+-dependent bacterial hemolysins, both the cloned and native PhyA proteins were enhanced by the presence of EDTA in a dose-dependent fashion with 40 mM EDTA allowing maximum activity. Ca2+ and Mg2+ were found to be inhibitory. The hemolytic activity also was found to have a dose-dependent endpoint. Through recovery of hemolytic activity from a spent reaction, this endpoint was shown to be the result of end product inhibition. This is the first report describing the cloning and sequencing of a gene from P. melaninogenica.
The Murine Norovirus Core Subgenomic RNA Promoter Consists of a Stable Stem-Loop That Can Direct Accurate Initiation of RNA Synthesis

PubMed Central

Yunus, Muhammad Amir; Lin, Xiaoyan; Bailey, Dalan; Karakasiliotis, Ioannis; Chaudhry, Yasmin; Vashist, Surender; Zhang, Guo; Thorne, Lucy; Kao, C. Cheng

2014-01-01

ABSTRACT All members of the Caliciviridae family of viruses produce a subgenomic RNA during infection. The subgenomic RNA typically encodes only the major and minor capsid proteins, but in murine norovirus (MNV), the subgenomic RNA also encodes the VF1 protein, which functions to suppress host innate immune responses. To date, the mechanism of norovirus subgenomic RNA synthesis has not been characterized. We have previously described the presence of an evolutionarily conserved RNA stem-loop structure on the negative-sense RNA, the complementary sequence of which codes for the viral RNA-dependent RNA polymerase (NS7). The conserved stem-loop is positioned 6 nucleotides 3′ of the start site of the subgenomic RNA in all caliciviruses. We demonstrate that the conserved stem-loop is essential for MNV viability. Mutant MNV RNAs with substitutions in the stem-loop replicated poorly until they accumulated mutations that revert to restore the stem-loop sequence and/or structure. The stem-loop sequence functions in a noncoding context, as it was possible to restore the replication of an MNV mutant by introducing an additional copy of the stem-loop between the NS7- and VP1-coding regions. Finally, in vitro biochemical data suggest that the stem-loop sequence is sufficient for the initiation of viral RNA synthesis by the recombinant MNV RNA-dependent RNA polymerase, confirming that the stem-loop forms the core of the norovirus subgenomic promoter. IMPORTANCE Noroviruses are a significant cause of viral gastroenteritis, and it is important to understand the mechanism of norovirus RNA synthesis. Here we describe the identification of an RNA stem-loop structure that functions as the core of the norovirus subgenomic RNA promoter in cells and in vitro. This work provides new insights into the molecular mechanisms of norovirus RNA synthesis and the sequences that determine the recognition of viral RNA by the RNA-dependent RNA polymerase. PMID:25392209
The murine norovirus core subgenomic RNA promoter consists of a stable stem-loop that can direct accurate initiation of RNA synthesis.

PubMed

Yunus, Muhammad Amir; Lin, Xiaoyan; Bailey, Dalan; Karakasiliotis, Ioannis; Chaudhry, Yasmin; Vashist, Surender; Zhang, Guo; Thorne, Lucy; Kao, C Cheng; Goodfellow, Ian

2015-01-15

All members of the Caliciviridae family of viruses produce a subgenomic RNA during infection. The subgenomic RNA typically encodes only the major and minor capsid proteins, but in murine norovirus (MNV), the subgenomic RNA also encodes the VF1 protein, which functions to suppress host innate immune responses. To date, the mechanism of norovirus subgenomic RNA synthesis has not been characterized. We have previously described the presence of an evolutionarily conserved RNA stem-loop structure on the negative-sense RNA, the complementary sequence of which codes for the viral RNA-dependent RNA polymerase (NS7). The conserved stem-loop is positioned 6 nucleotides 3' of the start site of the subgenomic RNA in all caliciviruses. We demonstrate that the conserved stem-loop is essential for MNV viability. Mutant MNV RNAs with substitutions in the stem-loop replicated poorly until they accumulated mutations that revert to restore the stem-loop sequence and/or structure. The stem-loop sequence functions in a noncoding context, as it was possible to restore the replication of an MNV mutant by introducing an additional copy of the stem-loop between the NS7- and VP1-coding regions. Finally, in vitro biochemical data suggest that the stem-loop sequence is sufficient for the initiation of viral RNA synthesis by the recombinant MNV RNA-dependent RNA polymerase, confirming that the stem-loop forms the core of the norovirus subgenomic promoter. Noroviruses are a significant cause of viral gastroenteritis, and it is important to understand the mechanism of norovirus RNA synthesis. Here we describe the identification of an RNA stem-loop structure that functions as the core of the norovirus subgenomic RNA promoter in cells and in vitro. This work provides new insights into the molecular mechanisms of norovirus RNA synthesis and the sequences that determine the recognition of viral RNA by the RNA-dependent RNA polymerase. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Sequence dependence of electron-induced DNA strand breakage revealed by DNA nanoarrays

PubMed Central

Keller, Adrian; Rackwitz, Jenny; Cauët, Emilie; Liévin, Jacques; Körzdörfer, Thomas; Rotaru, Alexandru; Gothelf, Kurt V.; Besenbacher, Flemming; Bald, Ilko

2014-01-01

The electronic structure of DNA is determined by its nucleotide sequence, which is for instance exploited in molecular electronics. Here we demonstrate that also the DNA strand breakage induced by low-energy electrons (18 eV) depends on the nucleotide sequence. To determine the absolute cross sections for electron induced single strand breaks in specific 13 mer oligonucleotides we used atomic force microscopy analysis of DNA origami based DNA nanoarrays. We investigated the DNA sequences 5′-TT(XYX)3TT with X = A, G, C and Y = T, BrU 5-bromouracil and found absolute strand break cross sections between 2.66 · 10−14 cm2 and 7.06 · 10−14 cm2. The highest cross section was found for 5′-TT(ATA)3TT and 5′-TT(ABrUA)3TT, respectively. BrU is a radiosensitizer, which was discussed to be used in cancer radiation therapy. The replacement of T by BrU into the investigated DNA sequences leads to a slight increase of the absolute strand break cross sections resulting in sequence-dependent enhancement factors between 1.14 and 1.66. Nevertheless, the variation of strand break cross sections due to the specific nucleotide sequence is considerably higher. Thus, the present results suggest the development of targeted radiosensitizers for cancer radiation therapy. PMID:25487346

Effect of temperature on terahertz photonic and omnidirectional band gaps in one-dimensional quasi-periodic photonic crystals composed of semiconductor InSb.

PubMed

Singh, Bipin K; Pandey, Praveen C

2016-07-20

Engineering of thermally tunable terahertz photonic and omnidirectional bandgaps has been demonstrated theoretically in one-dimensional quasi-periodic photonic crystals (PCs) containing semiconductor and dielectric materials. The considered quasi-periodic structures are taken in the form of Fibonacci, Thue-Morse, and double periodic sequences. We have shown that the photonic and omnidirectional bandgaps in the quasi-periodic structures with semiconductor constituents are strongly depend on the temperature, thickness of the constituted semiconductor and dielectric material layers, and generations of the quasi-periodic sequences. It has been found that the number of photonic bandgaps increases with layer thickness and generation of the quasi-periodic sequences. Omnidirectional bandgaps in the structures have also been obtained. Results show that the bandwidths of photonic and omnidirectional bandgaps are tunable by changing the temperature and lattice parameters of the structures. The generation of quasi-periodic sequences can also change the properties of photonic and omnidirectional bandgaps remarkably. The frequency range of the photonic and omnidirectional bandgaps can be tuned by the change of temperature and layer thickness of the considered quasi-periodic structures. This work will be useful to design tunable terahertz PC devices.
DNA interactions with a Methylene Blue redox indicator depend on the DNA length and are sequence specific.

PubMed

Farjami, Elaheh; Clima, Lilia; Gothelf, Kurt V; Ferapontova, Elena E

2010-06-01

A DNA molecular beacon approach was used for the analysis of interactions between DNA and Methylene Blue (MB) as a redox indicator of a hybridization event. DNA hairpin structures of different length and guanine (G) content were immobilized onto gold electrodes in their folded states through the alkanethiol linker at the 5'-end. Binding of MB to the folded hairpin DNA was electrochemically studied and compared with binding to the duplex structure formed by hybridization of the hairpin DNA to a complementary DNA strand. Variation of the electrochemical signal from the DNA-MB complex was shown to depend primarily on the DNA length and sequence used: the G-C base pairs were the preferential sites of MB binding in the duplex. For short 20 nts long DNA sequences, the increased electrochemical response from MB bound to the duplex structure was consistent with the increased amount of bound and electrochemically readable MB molecules (i.e. MB molecules that are available for the electron transfer (ET) reaction with the electrode). With longer DNA sequences, the balance between the amounts of the electrochemically readable MB molecules bound to the hairpin DNA and to the hybrid was opposite: a part of the MB molecules bound to the long-sequence DNA duplex seem to be electrochemically mute due to long ET distance. The increasing electrochemical response from MB bound to the short-length DNA hybrid contrasts with the decreasing signal from MB bound to the long-length DNA hybrid and allows an "off"-"on" genosensor development.
Conservation of Fold and Topology of Functional Elements in Thiamin Pyrophosphate Enzymes

NASA Technical Reports Server (NTRS)

Dominiak, P.; Ciszak, E. M.

2005-01-01

Thiamin pyrophosphate (TPP)-dependent enzymes are a highly divergent family of proteins binding both TPP and metal ions. They perform decarboxylation-hydroxyaldehydes. Prior -ketoacids and of a common - (O=)C-C(OH)- fragment of to knowledge of three-dimensional structures of these enzmes, the GDGY25-30NN sequence was used to identify these enzymes. Subsequently, a number of structural studies on those enzymes revealed multi-subunit organization and the features of the two duplicate cofactor binding sites. Analyzing the structures of 44 structurally known enzymes, we found that the common structure of these enzymes is reduced to 180-220 amino acid long fragments of two PP and two PYR domains that form the [PP:PYR]2 binding center of two cofactor molecules. The structures of PP and PYR are arranged in a similar fold-sheet with triplets of helices on both sides.Dconsisting of a six-stranded Residues surrounding the cofactors are not strictly conserved, but they provide the same interatomic contacts required for the catalytic functions that these enzymes perform while maintaining interactive structural integrity. These structural and functional amino acids are topological counterparts located in the same positions of the conserved fold of sets of PP and PYR domains. Additional parallels include short fragments of sequences that link these amino acids to the fold and function. This report on the structural commonalities amongst TPP dependent enzymes is thought to contribute new approaches to annotation that may assist in advancing the functional proteomics of TPP dependent enzymes, and trace their complexity within evolutionary context.
Structures of Bacterial Biosynthetic Arginine Decarboxylases

DOE Office of Scientific and Technical Information (OSTI.GOV)

F Forouhar; S Lew; J Seetharaman

2011-12-31

Biosynthetic arginine decarboxylase (ADC; also known as SpeA) plays an important role in the biosynthesis of polyamines from arginine in bacteria and plants. SpeA is a pyridoxal-5'-phosphate (PLP)-dependent enzyme and shares weak sequence homology with several other PLP-dependent decarboxylases. Here, the crystal structure of PLP-bound SpeA from Campylobacter jejuni is reported at 3.0 {angstrom} resolution and that of Escherichia coli SpeA in complex with a sulfate ion is reported at 3.1 {angstrom} resolution. The structure of the SpeA monomer contains two large domains, an N-terminal TIM-barrel domain followed by a {beta}-sandwich domain, as well as two smaller helical domains. Themore » TIM-barrel and {beta}-sandwich domains share structural homology with several other PLP-dependent decarboxylases, even though the sequence conservation among these enzymes is less than 25%. A similar tetramer is observed for both C. jejuni and E. coli SpeA, composed of two dimers of tightly associated monomers. The active site of SpeA is located at the interface of this dimer and is formed by residues from the TIM-barrel domain of one monomer and a highly conserved loop in the {beta}-sandwich domain of the other monomer. The PLP cofactor is recognized by hydrogen-bonding, {pi}-stacking and van der Waals interactions.« less
Competition between B-Z and B-L transitions in a single DNA molecule: Computational studies

NASA Astrophysics Data System (ADS)

Kwon, Ah-Young; Nam, Gi-Moon; Johner, Albert; Kim, Seyong; Hong, Seok-Cheol; Lee, Nam-Kyung

2016-02-01

Under negative torsion, DNA adopts left-handed helical forms, such as Z-DNA and L-DNA. Using the random copolymer model developed for a wormlike chain, we represent a single DNA molecule with structural heterogeneity as a helical chain consisting of monomers which can be characterized by different helical senses and pitches. By Monte Carlo simulation, where we take into account bending and twist fluctuations explicitly, we study sequence dependence of B-Z transitions under torsional stress and tension focusing on the interaction with B-L transitions. We consider core sequences, (GC) n repeats or (TG) n repeats, which can interconvert between the right-handed B form and the left-handed Z form, imbedded in a random sequence, which can convert to left-handed L form with different (tension dependent) helical pitch. We show that Z-DNA formation from the (GC) n sequence is always supported by unwinding torsional stress but Z-DNA formation from the (TG) n sequence, which are more costly to convert but numerous, can be strongly influenced by the quenched disorder in the surrounding random sequence.
Is there a domain-general cognitive structuring system? Evidence from structural priming across music, math, action descriptions, and language.

PubMed

Van de Cavey, Joris; Hartsuiker, Robert J

2016-01-01

Cognitive processing in many domains (e.g., sentence comprehension, music listening, and math solving) requires sequential information to be organized into an integrational structure. There appears to be some overlap in integrational processing across domains, as shown by cross-domain interference effects when for example linguistic and musical stimuli are jointly presented (Koelsch, Gunter, Wittfoth, & Sammler, 2005; Slevc, Rosenberg, & Patel, 2009). These findings support theories of overlapping resources for integrational processing across domains (cfr. SSIRH Patel, 2003; SWM, Kljajevic, 2010). However, there are some limitations to the studies mentioned above, such as the frequent use of unnaturalistic integrational difficulties. In recent years, the idea has risen that evidence for domain-generality in structural processing might also be yielded though priming paradigms (cfr. Scheepers, 2003). The rationale behind this is that integrational processing across domains regularly requires the processing of dependencies across short or long distances in the sequence, involving respectively less or more syntactic working memory resources (cfr. SWM, Kljajevic, 2010), and such processing decisions might persist over time. However, whereas recent studies have shown suggestive priming of integrational structure between language and arithmetics (though often dependent on arithmetic performance, cfr. Scheepers et al., 2011; Scheepers & Sturt, 2014), it remains to be investigated to what extent we can also find evidence for priming in other domains, such as music and action (cfr. SWM, Kljajevic, 2010). Experiment 1a showed structural priming from the processing of musical sequences onto the position in the sentence structure (early or late) to which a relative clause was attached in subsequent sentence completion. Importantly, Experiment 1b showed that a similar structural manipulation based on non-hierarchically ordered color sequences did not yield any priming effect, suggesting that the priming effect is not based on linear order, but integrational dependency. Finally, Experiment 2 presented primes in four domains (relative clause sentences, music, mathematics, and structured descriptions of actions), and consistently showed priming within and across domains. These findings provide clear evidence for domain-general structural processing mechanisms. Copyright © 2015 Elsevier B.V. All rights reserved.
Optimization of rotamers prior to template minimization improves stability predictions made by computational protein design.

PubMed

Davey, James A; Chica, Roberto A

2015-04-01

Computational protein design (CPD) predictions are highly dependent on the structure of the input template used. However, it is unclear how small differences in template geometry translate to large differences in stability prediction accuracy. Herein, we explored how structural changes to the input template affect the outcome of stability predictions by CPD. To do this, we prepared alternate templates by Rotamer Optimization followed by energy Minimization (ROM) and used them to recapitulate the stability of 84 protein G domain β1 mutant sequences. In the ROM process, side-chain rotamers for wild-type (WT) or mutant sequences are optimized on crystal or nuclear magnetic resonance (NMR) structures prior to template minimization, resulting in alternate structures termed ROM templates. We show that use of ROM templates prepared from sequences known to be stable results predominantly in improved prediction accuracy compared to using the minimized crystal or NMR structures. Conversely, ROM templates prepared from sequences that are less stable than the WT reduce prediction accuracy by increasing the number of false positives. These observed changes in prediction outcomes are attributed to differences in side-chain contacts made by rotamers in ROM templates. Finally, we show that ROM templates prepared from sequences that are unfolded or that adopt a nonnative fold result in the selective enrichment of sequences that are also unfolded or that adopt a nonnative fold, respectively. Our results demonstrate the existence of a rotamer bias caused by the input template that can be harnessed to skew predictions toward sequences displaying desired characteristics. © 2014 The Protein Society.
Sequence- and Temperature-Dependent Properties of Unfolded and Disordered Proteins from Atomistic Simulations.

PubMed

Zerze, Gül H; Best, Robert B; Mittal, Jeetain

2015-11-19

We use all-atom molecular simulation with explicit solvent to study the properties of selected intrinsically disordered proteins and unfolded states of foldable proteins, which include chain dimensions and shape, secondary structure propensity, solvent accessible surface area, and contact formation. We find that the qualitative scaling behavior of the chains matches expectations from theory under ambient conditions. In particular, unfolded globular proteins tend to be more collapsed under the same conditions than charged disordered sequences of the same length. However, inclusion of explicit solvent in addition naturally captures temperature-dependent solvation effects, which results in an initial collapse of the chains as temperature is increased, in qualitative agreement with experiment. There is a universal origin to the collapse, revealed in the change of hydration of individual residues as a function of temperature: namely, that the initial collapse is driven by unfavorable solvation free energy of individual residues, which in turn has a strong temperature dependence. We also observe that in unfolded globular proteins, increased temperature also initially favors formation of native-like (rather than non-native-like) structure. Our results help to establish how sequence encodes the degree of intrinsic disorder or order as well as its response to changes in environmental conditions.
Subtelomeric Rearrangements and Copy Number Variations in People with Intellectual Disabilities

ERIC Educational Resources Information Center

Christofolini, D. M.; De Paula Ramos, M. A.; Kulikowski, L. D.; Da Silva Bellucco, F. T.; Belangero, S. I. N.; Brunoni, D.; Melaragno, M. I.

2010-01-01

Background: The most prevalent type of structural variation in the human genome is represented by copy number variations that can affect transcription levels, sequence, structure and function of genes. Method: In the present study, we used the multiplex ligation-dependent probe amplification (MLPA) technique and quantitative PCR for the detection…
Network Analysis of Protein Adaptation: Modeling the Functional Impact of Multiple Mutations

PubMed Central

Beleva Guthrie, Violeta; Masica, David L; Fraser, Andrew; Federico, Joseph; Fan, Yunfan; Camps, Manel; Karchin, Rachel

2018-01-01

Abstract The evolution of new biochemical activities frequently involves complex dependencies between mutations and rapid evolutionary radiation. Mutation co-occurrence and covariation have previously been used to identify compensating mutations that are the result of physical contacts and preserve protein function and fold. Here, we model pairwise functional dependencies and higher order interactions that enable evolution of new protein functions. We use a network model to find complex dependencies between mutations resulting from evolutionary trade-offs and pleiotropic effects. We present a method to construct these networks and to identify functionally interacting mutations in both extant and reconstructed ancestral sequences (Network Analysis of Protein Adaptation). The time ordering of mutations can be incorporated into the networks through phylogenetic reconstruction. We apply NAPA to three distantly homologous β-lactamase protein clusters (TEM, CTX-M-3, and OXA-51), each of which has experienced recent evolutionary radiation under substantially different selective pressures. By analyzing the network properties of each protein cluster, we identify key adaptive mutations, positive pairwise interactions, different adaptive solutions to the same selective pressure, and complex evolutionary trajectories likely to increase protein fitness. We also present evidence that incorporating information from phylogenetic reconstruction and ancestral sequence inference can reduce the number of spurious links in the network, whereas preserving overall network community structure. The analysis does not require structural or biochemical data. In contrast to function-preserving mutation dependencies, which are frequently from structural contacts, gain-of-function mutation dependencies are most commonly between residues distal in protein structure. PMID:29522102
Structure prediction of Fe(II) 2-oxoglutarate dioxygenase from a psychrophilic yeast Glaciozyma antarctica PI12

NASA Astrophysics Data System (ADS)

Yusof, Nik Yusnoraini; Bakar, Farah Diba Abu; Mahadi, Nor Muhammad; Raih, Mohd Firdaus; Murad, Abdul Munir Abdul

2015-09-01

A cDNA encoding Fe(II) 2-oxoglutarate (2OG) dependent dioxygenases was isolated from psychrophilic yeast, Glaciozyma antarctica PI12. We have successfully amplified 1,029 bp cDNA sequence that encodes 342 amino acid with predicted molecular weight 38 kDa. The prediction protein was analysed using various bioinformatics tools to explore the properties of the protein. Based on a BLAST search analysis, the Fe2OX amino acid sequence showed 61% identity to the sequence of oxoglutarate/iron-dependent oxygenase from Rhodosporidium toruloides NP11. SignalP prediction showed that the Fe2OX protein contains no putative signal peptide, which suggests that this enzyme most probably localised intracellularly.The structure of Fe2OX was predicted by homology modelling using MODELLER9v11. The model with the lowest objective function was selected from hundred models generated using MODELLER9v11. Analysis of the structure revealed the longer loop at Fe2OX from G.antarctica that might be responsible for the flexibility of the structure, which contributes to its adaptation to low temperatures. Fe2OX hold a highly conserved Fe(II) binding HXD/E…H triad motif. The binding site for 2-oxoglutarate was found conserved for Arg280 among reported studies, however the Phe268 was found to be different in Fe2OX.
Structure and dynamics of single hydrophobic/ionic heteropolymers at the vapor-liquid interface of water.

PubMed

Vembanur, Srivathsan; Venkateshwaran, Vasudevan; Garde, Shekhar

2014-04-29

We focus on the conformational stability, structure, and dynamics of hydrophobic/charged homopolymers and heteropolymers at the vapor-liquid interface of water using extensive molecular dynamics simulations. Hydrophobic polymers collapse into globular structures in bulk water but unfold and sample a broad range of conformations at the vapor-liquid interface of water. We show that adding a pair of charges to a hydrophobic polymer at the interface can dramatically change its conformations, stabilizing hairpinlike structures, with molecular details depending on the location of the charged pair in the sequence. The translational dynamics of homopolymers and heteropolymers are also different, whereas the homopolymers skate on the interface with low drag, the tendency of charged groups to remain hydrated pulls the heteropolymers toward the liquid side of the interface, thus pinning them, increasing drag, and slowing the translational dynamics. The conformational dynamics of heteropolymers are also slower than that of the homopolymer and depend on the location of the charged groups in the sequence. Conformational dynamics are most restricted for the end-charged heteropolymer and speed up as the charge pair is moved toward the center of the sequence. We rationalize these trends using the fundamental understanding of the effects of the interface on primitive pair-level interactions between two hydrophobic groups and between oppositely charged ions in its vicinity.
Community detection in sequence similarity networks based on attribute clustering

DOE PAGES

Chowdhary, Janamejaya; Loeffler, Frank E.; Smith, Jeremy C.

2017-07-24

Networks are powerful tools for the presentation and analysis of interactions in multi-component systems. A commonly studied mesoscopic feature of networks is their community structure, which arises from grouping together similar nodes into one community and dissimilar nodes into separate communities. Here in this paper, the community structure of protein sequence similarity networks is determined with a new method: Attribute Clustering Dependent Communities (ACDC). Sequence similarity has hitherto typically been quantified by the alignment score or its expectation value. However, pair alignments with the same score or expectation value cannot thus be differentiated. To overcome this deficiency, the method constructs,more » for pair alignments, an extended alignment metric, the link attribute vector, which includes the score and other alignment characteristics. Rescaling components of the attribute vectors qualitatively identifies a systematic variation of sequence similarity within protein superfamilies. The problem of community detection is then mapped to clustering the link attribute vectors, selection of an optimal subset of links and community structure refinement based on the partition density of the network. ACDC-predicted communities are found to be in good agreement with gold standard sequence databases for which the "ground truth" community structures (or families) are known. ACDC is therefore a community detection method for sequence similarity networks based entirely on pair similarity information. A serial implementation of ACDC is available from https://cmb.ornl.gov/resources/developments« less
Community detection in sequence similarity networks based on attribute clustering

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chowdhary, Janamejaya; Loeffler, Frank E.; Smith, Jeremy C.

Networks are powerful tools for the presentation and analysis of interactions in multi-component systems. A commonly studied mesoscopic feature of networks is their community structure, which arises from grouping together similar nodes into one community and dissimilar nodes into separate communities. Here in this paper, the community structure of protein sequence similarity networks is determined with a new method: Attribute Clustering Dependent Communities (ACDC). Sequence similarity has hitherto typically been quantified by the alignment score or its expectation value. However, pair alignments with the same score or expectation value cannot thus be differentiated. To overcome this deficiency, the method constructs,more » for pair alignments, an extended alignment metric, the link attribute vector, which includes the score and other alignment characteristics. Rescaling components of the attribute vectors qualitatively identifies a systematic variation of sequence similarity within protein superfamilies. The problem of community detection is then mapped to clustering the link attribute vectors, selection of an optimal subset of links and community structure refinement based on the partition density of the network. ACDC-predicted communities are found to be in good agreement with gold standard sequence databases for which the "ground truth" community structures (or families) are known. ACDC is therefore a community detection method for sequence similarity networks based entirely on pair similarity information. A serial implementation of ACDC is available from https://cmb.ornl.gov/resources/developments« less
Independent Evolution of Six Families of Halogenating Enzymes.

PubMed

Xu, Gangming; Wang, Bin-Gui

2016-01-01

Halogenated natural products are widespread in the environment, and the halogen atoms are typically vital to their bioactivities. Thus far, six families of halogenating enzymes have been identified: cofactor-free haloperoxidases (HPO), vanadium-dependent haloperoxidases (V-HPO), heme iron-dependent haloperoxidases (HI-HPO), non-heme iron-dependent halogenases (NI-HG), flavin-dependent halogenases (F-HG), and S-adenosyl-L-methionine (SAM)-dependent halogenases (S-HG). However, these halogenating enzymes with similar biological functions but distinct structures might have evolved independently. Phylogenetic and structural analyses suggest that the HPO, V-HPO, HI-HPO, NI-HG, F-HG, and S-HG enzyme families may have evolutionary relationships to the α/β hydrolases, acid phosphatases, peroxidases, chemotaxis phosphatases, oxidoreductases, and SAM hydroxide adenosyltransferases, respectively. These halogenating enzymes have established sequence homology, structural conservation, and mechanistic features within each family. Understanding the distinct evolutionary history of these halogenating enzymes will provide further insights into the study of their catalytic mechanisms and halogenation specificity.
Target Site Recognition by a Diversity-Generating Retroelement

PubMed Central

Guo, Huatao; Tse, Longping V.; Nieh, Angela W.; Czornyj, Elizabeth; Williams, Steven; Oukil, Sabrina; Liu, Vincent B.; Miller, Jeff F.

2011-01-01

Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification. PMID:22194701
Structural changes induced by binding of the high-mobility group I protein to a mouse satellite DNA sequence.

PubMed Central

Slama-Schwok, A; Zakrzewska, K; Léger, G; Leroux, Y; Takahashi, M; Käs, E; Debey, P

2000-01-01

Using spectroscopic methods, we have studied the structural changes induced in both protein and DNA upon binding of the High-Mobility Group I (HMG-I) protein to a 21-bp sequence derived from mouse satellite DNA. We show that these structural changes depend on the stoichiometry of the protein/DNA complexes formed, as determined by Job plots derived from experiments using pyrene-labeled duplexes. Circular dichroism and melting temperature experiments extended in the far ultraviolet range show that while native HMG-I is mainly random coiled in solution, it adopts a beta-turn conformation upon forming a 1:1 complex in which the protein first binds to one of two dA.dT stretches present in the duplex. HMG-I structure in the 1:1 complex is dependent on the sequence of its DNA target. A 3:1 HMG-I/DNA complex can also form and is characterized by a small increase in the DNA natural bend and/or compaction coupled to a change in the protein conformation, as determined from fluorescence resonance energy transfer (FRET) experiments. In addition, a peptide corresponding to an extended DNA-binding domain of HMG-I induces an ordered condensation of DNA duplexes. Based on the constraints derived from pyrene excimer measurements, we present a model of these nucleated structures. Our results illustrate an extreme case of protein structure induced by DNA conformation that may bear on the evolutionary conservation of the DNA-binding motifs of HMG-I. We discuss the functional relevance of the structural flexibility of HMG-I associated with the nature of its DNA targets and the implications of the binding stoichiometry for several aspects of chromatin structure and gene regulation. PMID:10777751
Sequence and structural characterization of Trx-Grx type of monothiol glutaredoxins from Ashbya gossypii.

PubMed

Yadav, Saurabh; Kumari, Pragati; Kushwaha, Hemant Ritturaj

2013-01-01

Glutaredoxins are enzymatic antioxidants which are small, ubiquitous, glutathione dependent and essentially classified under thioredoxin-fold superfamily. Glutaredoxins are classified into two types: dithiol and monothiol. Monothiol glutaredoxins which carry the signature "CGFS" as a redox active motif is known for its role in oxidative stress, inside the cell. In the present analysis, the 138 amino acid long monothiol glutaredoxin, AgGRX1 from Ashbya gossypii was identified and has been used for the analysis. The multiple sequence alignment of the AgGRX1 protein sequence revealed the characteristic motif of typical monothiol glutaredoxin as observed in various other organisms. The proposed structure of the AgGRX1 protein was used to analyze signature folds related to the thioredoxin superfamily. Further, the study highlighted the structural features pertaining to the complex mechanism of glutathione docking and interacting residues.
Protein Tertiary Structure Prediction Based on Main Chain Angle Using a Hybrid Bees Colony Optimization Algorithm

NASA Astrophysics Data System (ADS)

Mahmood, Zakaria N.; Mahmuddin, Massudi; Mahmood, Mohammed Nooraldeen

Encoding proteins of amino acid sequence to predict classified into their respective families and subfamilies is important research area. However for a given protein, knowing the exact action whether hormonal, enzymatic, transmembranal or nuclear receptors does not depend solely on amino acid sequence but on the way the amino acid thread folds as well. This study provides a prototype system that able to predict a protein tertiary structure. Several methods are used to develop and evaluate the system to produce better accuracy in protein 3D structure prediction. The Bees Optimization algorithm which inspired from the honey bees food foraging method, is used in the searching phase. In this study, the experiment is conducted on short sequence proteins that have been used by the previous researches using well-known tools. The proposed approach shows a promising result.
The dependence of the tunneling characteristic on the electronic energy bands and the carrier’s states of Graphene superlattice

NASA Astrophysics Data System (ADS)

Yang, C. H.; Shen, G. Z.; Ao, Z. M.; Xu, Y. W.

2016-09-01

Using the transfer matrix method, the carrier tunneling properties in graphene superlattice generated by the Thue-Morse sequence and Kolakoski sequence are investigated. The positions and strength of the transmission can be modulated by the barrier structures, the incident energy and angle, the height and width of the potential. These carriers tunneling characteristic can be understood from the energy band structures in the corresponding superlattice systems and the carrier’s states in well/barriers. The transmission peaks above the critical incident angle rely on the carrier’s resonance in the well regions. The structural diversity can modulate the electronic and transport properties, thus expanding its applications.

Spatio-Temporal Structure, Path Characteristics, and Perceptual Grouping in Immediate Serial Spatial Recall

PubMed Central

De Lillo, Carlo; Kirby, Melissa; Poole, Daniel

2016-01-01

Immediate serial spatial recall measures the ability to retain sequences of locations in short-term memory and is considered the spatial equivalent of digit span. It is tested by requiring participants to reproduce sequences of movements performed by an experimenter or displayed on a monitor. Different organizational factors dramatically affect serial spatial recall but they are often confounded or underspecified. Untangling them is crucial for the characterization of working-memory models and for establishing the contribution of structure and memory capacity to spatial span. We report five experiments assessing the relative role and independence of factors that have been reported in the literature. Experiment 1 disentangled the effects of spatial clustering and path-length by manipulating the distance of items displayed on a touchscreen monitor. Long-path sequences segregated by spatial clusters were compared with short-path sequences not segregated by clusters. Recall was more accurate for sequences segregated by clusters independently from path-length. Experiment 2 featured conditions where temporal pauses were introduced between or within cluster boundaries during the presentation of sequences with the same paths. Thus, the temporal structure of the sequences was either consistent or inconsistent with a hierarchical representation based on segmentation by spatial clusters but the effect of structure could not be confounded with effects of path-characteristics. Pauses at cluster boundaries yielded more accurate recall, as predicted by a hierarchical model. In Experiment 3, the systematic manipulation of sequence structure, path-length, and presence of path-crossings of sequences showed that structure explained most of the variance, followed by the presence/absence of path-crossings, and path-length. Experiments 4 and 5 replicated the results of the previous experiments in immersive virtual reality navigation tasks where the viewpoint of the observer changed dynamically during encoding and recall. This suggested that the effects of structure in spatial span are not dependent on perceptual grouping processes induced by the aerial view of the stimulus array typically afforded by spatial recall tasks. These results demonstrate the independence of coding strategies based on structure from effects of path characteristics and perceptual grouping in immediate serial spatial recall. PMID:27891101
Structural polymorphism of a cytosine-rich DNA sequence forming i-motif structure: Exploring pH based biosensors.

PubMed

Ahmed, Saami; Kaushik, Mahima; Chaudhary, Swati; Kukreti, Shrikant

2018-05-01

Sequence recognition and conformational polymorphism enable DNA to emerge out as a substantial tool in fabricating the devices within nano-dimensions. These DNA associated nano devices work on the principle of conformational switches, which can be facilitated by many factors like sequence of DNA/RNA strand, change in pH or temperature, enzyme or ligand interactions etc. Thus, controlling these DNA conformational changes to acquire the desired function is significant for evolving DNA hybridization biosensor, used in genetic screening and molecular diagnosis. For exploring this conformational switching ability of cytosine-rich DNA oligonucleotides as a function of pH for their potential usage as biosensors, this study has been designed. A C-rich stretch of DNA sequence (5'-TCCCCCAATTAATTCCCCCA-3'; SG20c) has been investigated using UV-Thermal denaturation, poly-acrylamide gel electrophoresis and CD spectroscopy. The SG20c sequence is shown to adopt various topologies of i-motif structure at low pH. This pH dependent transition of SG20c from unstructured single strand to unimolecular and bimolecular i-motif structures can further be exploited for its utilization as switching on/off pH-based biosensors. Copyright © 2018. Published by Elsevier B.V.
Sites of instability in the human TCF3 (E2A) gene adopt G-quadruplex DNA structures in vitro

PubMed Central

Williams, Jonathan D.; Fleetwood, Sara; Berroyer, Alexandra; Kim, Nayun; Larson, Erik D.

2015-01-01

The formation of highly stable four-stranded DNA, called G-quadruplex (G4), promotes site-specific genome instability. G4 DNA structures fold from repetitive guanine sequences, and increasing experimental evidence connects G4 sequence motifs with specific gene rearrangements. The human transcription factor 3 (TCF3) gene (also termed E2A) is subject to genetic instability associated with severe disease, most notably a common translocation event t(1;19) associated with acute lymphoblastic leukemia. The sites of instability in TCF3 are not randomly distributed, but focused to certain sequences. We asked if G4 DNA formation could explain why TCF3 is prone to recombination and mutagenesis. Here we demonstrate that sequences surrounding the major t(1;19) break site and a region associated with copy number variations both contain G4 sequence motifs. The motifs identified readily adopt G4 DNA structures that are stable enough to interfere with DNA synthesis in physiological salt conditions in vitro. When introduced into the yeast genome, TCF3 G4 motifs promoted gross chromosomal rearrangements in a transcription-dependent manner. Our results provide a molecular rationale for the site-specific instability of human TCF3, suggesting that G4 DNA structures contribute to oncogenic DNA breaks and recombination. PMID:26029241
Mapping RNA Structure In Vitro with SHAPE Chemistry and Next-Generation Sequencing (SHAPE-Seq).

PubMed

Watters, Kyle E; Lucks, Julius B

2016-01-01

Mapping RNA structure with selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry has proven to be a versatile method for characterizing RNA structure in a variety of contexts. SHAPE reagents covalently modify RNAs in a structure-dependent manner to create adducts at the 2'-OH group of the ribose backbone at nucleotides that are structurally flexible. The positions of these adducts are detected using reverse transcriptase (RT) primer extension, which stops one nucleotide before the modification, to create a pool of cDNAs whose lengths reflect the location of SHAPE modification. Quantification of the cDNA pools is used to estimate the "reactivity" of each nucleotide in an RNA molecule to the SHAPE reagent. High reactivities indicate nucleotides that are structurally flexible, while low reactivities indicate nucleotides that are inflexible. These SHAPE reactivities can then be used to infer RNA structures by restraining RNA structure prediction algorithms. Here, we provide a state-of-the-art protocol describing how to perform in vitro RNA structure probing with SHAPE chemistry using next-generation sequencing to quantify cDNA pools and estimate reactivities (SHAPE-Seq). The use of next-generation sequencing allows for higher throughput, more consistent data analysis, and multiplexing capabilities. The technique described herein, SHAPE-Seq v2.0, uses a universal reverse transcription priming site that is ligated to the RNA after SHAPE modification. The introduced priming site allows for the structural analysis of an RNA independent of its sequence.
Relative stability of major types of beta-turns as a function of amino acid composition: a study based on Ab initio energetic and natural abundance data.

PubMed

Perczel, András; Jákli, Imre; McAllister, Michael A; Csizmadia, Imre G

2003-06-06

Folding properties of small globular proteins are determined by their amino acid sequence (primary structure). This holds both for local (secondary structure) and for global conformational features of linear polypeptides and proteins composed from natural amino acid derivatives. It thus provides the rational basis of structure prediction algorithms. The shortest secondary structure element, the beta-turn, most typically adopts either a type I or a type II form, depending on the amino acid composition. Herein we investigate the sequence-dependent folding stability of both major types of beta-turns using simple dipeptide models (-Xxx-Yyy-). Gas-phase ab initio properties of 16 carefully selected and suitably protected dipeptide models (for example Val-Ser, Ala-Gly, Ser-Ser) were studied. For each backbone fold most probable side-chain conformers were considered. Fully optimized 321G RHF molecular structures were employed in medium level [B3LYP/6-311++G(d,p)//RHF/3-21G] energy calculations to estimate relative populations of the different backbone conformers. Our results show that the preference for beta-turn forms as calculated by quantum mechanics and observed in Xray determined proteins correlates significantly.
The primary structure of fatty-acid-binding protein from nurse shark liver. Structural and evolutionary relationship to the mammalian fatty-acid-binding protein family.

PubMed

Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M

1992-02-01

The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (61-53% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs. The presence in shark liver of an FABP which differs substantially in primary structure from mammalian liver FABP, while being closely related to the FABP expressed in mammalian heart muscle, peripheral nerve myelin and adipocytes, opens a further dimension regarding the question of the existence of structure-dependent and tissue-specific specialization of FABP function in lipid metabolism.
A 3D sequence-independent representation of the protein data bank.

PubMed

Fischer, D; Tsai, C J; Nussinov, R; Wolfson, H

1995-10-01

Here we address the following questions. How many structurally different entries are there in the Protein Data Bank (PDB)? How do the proteins populate the structural universe? To investigate these questions a structurally non-redundant set of representative entries was selected from the PDB. Construction of such a dataset is not trivial: (i) the considerable size of the PDB requires a large number of comparisons (there were more than 3250 structures of protein chains available in May 1994); (ii) the PDB is highly redundant, containing many structurally similar entries, not necessarily with significant sequence homology, and (iii) there is no clear-cut definition of structural similarity. The latter depend on the criteria and methods used. Here, we analyze structural similarity ignoring protein topology. To date, representative sets have been selected either by hand, by sequence comparison techniques which ignore the three-dimensional (3D) structures of the proteins or by using sequence comparisons followed by linear structural comparison (i.e. the topology, or the sequential order of the chains, is enforced in the structural comparison). Here we describe a 3D sequence-independent automated and efficient method to obtain a representative set of protein molecules from the PDB which contains all unique structures and which is structurally non-redundant. The method has two novel features. The first is the use of strictly structural criteria in the selection process without taking into account the sequence information. To this end we employ a fast structural comparison algorithm which requires on average approximately 2 s per pairwise comparison on a workstation. The second novel feature is the iterative application of a heuristic clustering algorithm that greatly reduces the number of comparisons required. We obtain a representative set of 220 chains with resolution better than 3.0 A, or 268 chains including lower resolution entries, NMR entries and models. The resulting set can serve as a basis for extensive structural classification and studies of 3D recurring motifs and of sequence-structure relationships. The clustering algorithm succeeds in classifying into the same structural family chains with no significant sequence homology, e.g. all the globins in one single group, all the trypsin-like serine proteases in another or all the immunoglobulin-like folds into a third. In addition, unexpected structural similarities of interest have been automatically detected between pairs of chains. A cluster analysis of the representative structures demonstrates the way the "structural universe' is populated.
Evolution of sparsity and modularity in a model of protein allostery

NASA Astrophysics Data System (ADS)

Hemery, Mathieu; Rivoire, Olivier

2015-04-01

The sequence of a protein is not only constrained by its physical and biochemical properties under current selection, but also by features of its past evolutionary history. Understanding the extent and the form that these evolutionary constraints may take is important to interpret the information in protein sequences. To study this problem, we introduce a simple but physical model of protein evolution where selection targets allostery, the functional coupling of distal sites on protein surfaces. This model shows how the geometrical organization of couplings between amino acids within a protein structure can depend crucially on its evolutionary history. In particular, two scenarios are found to generate a spatial concentration of functional constraints: high mutation rates and fluctuating selective pressures. This second scenario offers a plausible explanation for the high tolerance of natural proteins to mutations and for the spatial organization of their least tolerant amino acids, as revealed by sequence analysis and mutagenesis experiments. It also implies a faculty to adapt to new selective pressures that is consistent with observations. The model illustrates how several independent functional modules may emerge within the same protein structure, depending on the nature of past environmental fluctuations. Our model thus relates the evolutionary history of proteins to the geometry of their functional constraints, with implications for decoding and engineering protein sequences.
Functional Evolution of PLP-dependent Enzymes based on Active-Site Structural Similarities

PubMed Central

Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert

2014-01-01

Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5’-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the Comparison of Protein Active Site Structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. PMID:24920327
Functional evolution of PLP-dependent enzymes based on active-site structural similarities.

PubMed

Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert

2014-10-01

Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5'-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the comparison of protein active site structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional-fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. © 2014 Wiley Periodicals, Inc.
There is Diversity in Disorder-"In all Chaos there is a Cosmos, in all Disorder a Secret Order".

PubMed

Nielsen, Jakob T; Mulder, Frans A A

2016-01-01

The protein universe consists of a continuum of structures ranging from full order to complete disorder. As the structured part of the proteome has been intensively studied, stably folded proteins are increasingly well documented and understood. However, proteins that are fully, or in large part, disordered are much less well characterized. Here we collected NMR chemical shifts in a small database for 117 protein sequences that are known to contain disorder. We demonstrate that NMR chemical shift data can be brought to bear as an exquisite judge of protein disorder at the residue level, and help in validation. With the help of secondary chemical shift analysis we demonstrate that the proteins in the database span the full spectrum of disorder, but still, largely segregate into two classes; disordered with small segments of order scattered along the sequence, and structured with small segments of disorder inserted between the different structured regions. A detailed analysis reveals that the distribution of order/disorder along the sequence shows a complex and asymmetric distribution, that is highly protein-dependent. Access to ratified training data further suggests an avenue to improving prediction of disorder from sequence.
Probing the Structures of Viral RNA Regulatory Elements with SHAPE and Related Methodologies

PubMed Central

Rausch, Jason W.; Sztuba-Solinska, Joanna; Le Grice, Stuart F. J.

2018-01-01

Viral RNAs were selected by evolution to possess maximum functionality in a minimal sequence. Depending on the classification of the virus and the type of RNA in question, viral RNAs must alternately be replicated, spliced, transcribed, transported from the nucleus into the cytoplasm, translated and/or packaged into nascent virions, and in most cases, provide the sequence and structural determinants to facilitate these processes. One consequence of this compact multifunctionality is that viral RNA structures can be exquisitely complex, often involving intermolecular interactions with RNA or protein, intramolecular interactions between sequence segments separated by several thousands of nucleotides, or specialized motifs such as pseudoknots or kissing loops. The fluidity of viral RNA structure can also present a challenge when attempting to characterize it, as genomic RNAs especially are likely to sample numerous conformations at various stages of the virus life cycle. Here we review advances in chemoenzymatic structure probing that have made it possible to address such challenges with respect to cis-acting elements, full-length viral genomes and long non-coding RNAs that play a major role in regulating viral gene expression. PMID:29375504
Sequence-dependent DNA flexibility mediates DNase I cleavage.

PubMed

Heddi, Brahim; Abi-Ghanem, Josephine; Lavigne, Marc; Hartmann, Brigitte

2010-01-08

Understanding the preference of nonspecific proteins for certain DNA structural features requires an accurate description of the properties of free DNA, especially regarding their possible predisposition to adopt a conformation that favors the formation of a complex. Exploiting previous exhaustive NMR studies performed on free DNA oligomers, we investigated the molecular basis of DNase I sensitivity under conditions where DNase I binding limits the probability of cleavage. We showed that cleavage intensity was correlated with adjacent 3' phosphate linkage flexibility, monitored by (31)P chemical shifts. Examining NMR-refined DNA structures highlighted that sequence-dependent flexible phosphates were associated with large minor groove variations that may promote the affinity of DNase I, according to relevant DNA-protein complexes. In sum, this work demonstrates that specificity in DNA-DNase I interaction is mediated by DNA flexibility, which influences the induced-fit transitions required to form productive complexes.
Complete convergence of randomly weighted END sequences and its application.

PubMed

Li, Penghua; Li, Xiaoqin; Wu, Kehan

2017-01-01

We investigate the complete convergence of partial sums of randomly weighted extended negatively dependent (END) random variables. Some results of complete moment convergence, complete convergence and the strong law of large numbers for this dependent structure are obtained. As an application, we study the convergence of the state observers of linear-time-invariant systems. Our results extend the corresponding earlier ones.
Making the Bend: DNA Tertiary Structure and Protein-DNA Interactions

PubMed Central

Harteis, Sabrina; Schneider, Sabine

2014-01-01

DNA structure functions as an overlapping code to the DNA sequence. Rapid progress in understanding the role of DNA structure in gene regulation, DNA damage recognition and genome stability has been made. The three dimensional structure of both proteins and DNA plays a crucial role for their specific interaction, and proteins can recognise the chemical signature of DNA sequence (“base readout”) as well as the intrinsic DNA structure (“shape recognition”). These recognition mechanisms do not exist in isolation but, depending on the individual interaction partners, are combined to various extents. Driving force for the interaction between protein and DNA remain the unique thermodynamics of each individual DNA-protein pair. In this review we focus on the structures and conformations adopted by DNA, both influenced by and influencing the specific interaction with the corresponding protein binding partner, as well as their underlying thermodynamics. PMID:25026169
A multivariate prediction model for Rho-dependent termination of transcription.

PubMed

Nadiras, Cédric; Eveno, Eric; Schwartz, Annie; Figueroa-Bossi, Nara; Boudvillain, Marc

2018-06-21

Bacterial transcription termination proceeds via two main mechanisms triggered either by simple, well-conserved (intrinsic) nucleic acid motifs or by the motor protein Rho. Although bacterial genomes can harbor hundreds of termination signals of either type, only intrinsic terminators are reliably predicted. Computational tools to detect the more complex and diversiform Rho-dependent terminators are lacking. To tackle this issue, we devised a prediction method based on Orthogonal Projections to Latent Structures Discriminant Analysis [OPLS-DA] of a large set of in vitro termination data. Using previously uncharacterized genomic sequences for biochemical evaluation and OPLS-DA, we identified new Rho-dependent signals and quantitative sequence descriptors with significant predictive value. Most relevant descriptors specify features of transcript C>G skewness, secondary structure, and richness in regularly-spaced 5'CC/UC dinucleotides that are consistent with known principles for Rho-RNA interaction. Descriptors collectively warrant OPLS-DA predictions of Rho-dependent termination with a ∼85% success rate. Scanning of the Escherichia coli genome with the OPLS-DA model identifies significantly more termination-competent regions than anticipated from transcriptomics and predicts that regions intrinsically refractory to Rho are primarily located in open reading frames. Altogether, this work delineates features important for Rho activity and describes the first method able to predict Rho-dependent terminators in bacterial genomes.
Understanding the structural and dynamic consequences of DNA epigenetic modifications: Computational insights into cytosine methylation and hydroxymethylation

PubMed Central

Carvalho, Alexandra T P; Gouveia, Leonor; Kanna, Charan Raju; Wärmländer, Sebastian K T S; Platts, Jamie A; Kamerlin, Shina Caroline Lynn

2014-01-01

We report a series of molecular dynamics (MD) simulations of up to a microsecond combined simulation time designed to probe epigenetically modified DNA sequences. More specifically, by monitoring the effects of methylation and hydroxymethylation of cytosine in different DNA sequences, we show, for the first time, that DNA epigenetic modifications change the molecule's dynamical landscape, increasing the propensity of DNA toward different values of twist and/or roll/tilt angles (in relation to the unmodified DNA) at the modification sites. Moreover, both the extent and position of different modifications have significant effects on the amount of structural variation observed. We propose that these conformational differences, which are dependent on the sequence environment, can provide specificity for protein binding. PMID:25625845
Polymeric peptide pigments with sequence-encoded properties

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lampel, Ayala; McPhee, Scott A.; Park, Hang-Ah

Melanins are a family of heterogeneous polymeric pigments that provide ultraviolet (UV) light protection, structural support, coloration, and free radical scavenging. Formed by oxidative oligomerization of catecholic small molecules, the physical properties of melanins are influenced by covalent and noncovalent disorder. We report the use of tyrosine-containing tripeptides as tunable precursors for polymeric pigments. In these structures, phenols are presented in a (supra-)molecular context dictated by the positions of the amino acids in the peptide sequence. Oxidative polymerization can be tuned in a sequence-dependent manner, resulting in peptide sequence–encoded properties such as UV absorbance, morphology, coloration, and electrochemical properties overmore » a considerable range. Short peptides have low barriers to application and can be easily scaled, suggesting near-term applications in cosmetics and biomedicine.« less
Recognition of the DNA sequence by an inorganic crystal surface

PubMed Central

Sampaolese, Beatrice; Bergia, Anna; Scipioni, Anita; Zuccheri, Giampaolo; Savino, Maria; Samorì, Bruno; De Santis, Pasquale

2002-01-01

The sequence-dependent curvature is generally recognized as an important and biologically relevant property of DNA because it is involved in the formation and stability of association complexes with proteins. When a DNA tract, intrinsically curved for the periodical recurrence on the same strand of A-tracts phased with the B-DNA periodicity, is deposited on a flat surface, it exposes to that surface either a T- or an A-rich face. The surface of a freshly cleaved mica crystal recognizes those two faces and preferentially interacts with the former one. Statistical analysis of scanning force microscopy (SFM) images provides evidence of this recognition between an inorganic crystal surface and nanoscale structures of double-stranded DNA. This finding could open the way toward the use of the sequence-dependent adhesion to specific crystal faces for nanotechnological purposes. PMID:12361979
Opposite consequences of two transcription pauses caused by an intrinsic terminator oligo(U): antitermination versus termination by bacteriophage T7 RNA polymerase.

PubMed

Lee, Sooncheol; Kang, Changwon

2011-05-06

The RNA oligo(U) sequence, along with an immediately preceding RNA hairpin structure, is an essential cis-acting element for bacterial class I intrinsic termination. This sequence not only causes a pause in transcription during the beginning of the termination process but also facilitates transcript release at the end of the process. In this study, the oligo(U) sequence of the bacteriophage T7 intrinsic terminator Tφ, rather than the hairpin structure, induced pauses of phage T7 RNA polymerase not only at the termination site, triggering a termination process, but also 3 bp upstream, exerting an antitermination effect. The upstream pause presumably allowed RNA to form a thermodynamically more stable secondary structure rather than a terminator hairpin and to persist because the 5'-half of the terminator hairpin-forming sequence could be sequestered by a farther upstream sequence via sequence-specific hybridization, prohibiting formation of the terminator hairpin and termination. The putative antiterminator RNA structure lacked several base pairs essential for termination when probed using RNases A, T1, and V1. When the antiterminator was destabilized by incorporation of IMP into nascent RNA at G residue positions, antitermination was abolished. Furthermore, antitermination strength increased with more stable antiterminator secondary structures and longer pauses. Thus, the oligo(U)-mediated pause prior to the termination site can exert a cis-acting antitermination activity on intrinsic terminator Tφ, and the termination efficiency depends primarily on the termination-interfering pause that precedes the termination-facilitating pause at the termination site.

Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis

PubMed Central

Du, Yushen; Wu, Nicholas C.; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting

2016-01-01

ABSTRACT Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. PMID:27803181
An Investigation of G-Quadruplex Structural Polymorphism in the Human Telomere Using a Combined Approach of Hydrodynamic Bead Modeling and Molecular Dynamics Simulation

PubMed Central

2015-01-01

Guanine-rich oligonucleotides can adopt noncanonical tertiary structures known as G-quadruplexes, which can exist in different forms depending on experimental conditions. High-resolution structural methods, such as X-ray crystallography and NMR spectroscopy, have been of limited usefulness in resolving the inherent structural polymorphism associated with G-quadruplex formation. The lack of, or the ambiguous nature of, currently available high-resolution structural data, in turn, has severely hindered investigations into the nature of these structures and their interactions with small-molecule inhibitors. We have used molecular dynamics in conjunction with hydrodynamic bead modeling to study the structures of the human telomeric G-quadruplex-forming sequences at the atomic level. We demonstrated that molecular dynamics can reproduce experimental hydrodynamic measurements and thus can be a powerful tool in the structural study of existing G-quadruplex sequences or in the prediction of new G-quadruplex structures. PMID:24779348
Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

PubMed

Mizianty, Marcin J; Kurgan, Lukasz

2009-12-13

Knowledge of structural class is used by numerous methods for identification of structural/functional characteristics of proteins and could be used for the detection of remote homologues, particularly for chains that share twilight-zone similarity. In contrast to existing sequence-based structural class predictors, which target four major classes and which are designed for high identity sequences, we predict seven classes from sequences that share twilight-zone identity with the training sequences. The proposed MODular Approach to Structural class prediction (MODAS) method is unique as it allows for selection of any subset of the classes. MODAS is also the first to utilize a novel, custom-built feature-based sequence representation that combines evolutionary profiles and predicted secondary structure. The features quantify information relevant to the definition of the classes including conservation of residues and arrangement and number of helix/strand segments. Our comprehensive design considers 8 feature selection methods and 4 classifiers to develop Support Vector Machine-based classifiers that are tailored for each of the seven classes. Tests on 5 twilight-zone and 1 high-similarity benchmark datasets and comparison with over two dozens of modern competing predictors show that MODAS provides the best overall accuracy that ranges between 80% and 96.7% (83.5% for the twilight-zone datasets), depending on the dataset. This translates into 19% and 8% error rate reduction when compared against the best performing competing method on two largest datasets. The proposed predictor provides accurate predictions at 58% accuracy for membrane proteins class, which is not considered by majority of existing methods, in spite that this class accounts for only 2% of the data. Our predictive model is analyzed to demonstrate how and why the input features are associated with the corresponding classes. The improved predictions stem from the novel features that express collocation of the secondary structure segments in the protein sequence and that combine evolutionary and secondary structure information. Our work demonstrates that conservation and arrangement of the secondary structure segments predicted along the protein chain can successfully predict structural classes which are defined based on the spatial arrangement of the secondary structures. A web server is available at http://biomine.ece.ualberta.ca/MODAS/.
Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences

PubMed Central

2009-01-01

Background Knowledge of structural class is used by numerous methods for identification of structural/functional characteristics of proteins and could be used for the detection of remote homologues, particularly for chains that share twilight-zone similarity. In contrast to existing sequence-based structural class predictors, which target four major classes and which are designed for high identity sequences, we predict seven classes from sequences that share twilight-zone identity with the training sequences. Results The proposed MODular Approach to Structural class prediction (MODAS) method is unique as it allows for selection of any subset of the classes. MODAS is also the first to utilize a novel, custom-built feature-based sequence representation that combines evolutionary profiles and predicted secondary structure. The features quantify information relevant to the definition of the classes including conservation of residues and arrangement and number of helix/strand segments. Our comprehensive design considers 8 feature selection methods and 4 classifiers to develop Support Vector Machine-based classifiers that are tailored for each of the seven classes. Tests on 5 twilight-zone and 1 high-similarity benchmark datasets and comparison with over two dozens of modern competing predictors show that MODAS provides the best overall accuracy that ranges between 80% and 96.7% (83.5% for the twilight-zone datasets), depending on the dataset. This translates into 19% and 8% error rate reduction when compared against the best performing competing method on two largest datasets. The proposed predictor provides accurate predictions at 58% accuracy for membrane proteins class, which is not considered by majority of existing methods, in spite that this class accounts for only 2% of the data. Our predictive model is analyzed to demonstrate how and why the input features are associated with the corresponding classes. Conclusions The improved predictions stem from the novel features that express collocation of the secondary structure segments in the protein sequence and that combine evolutionary and secondary structure information. Our work demonstrates that conservation and arrangement of the secondary structure segments predicted along the protein chain can successfully predict structural classes which are defined based on the spatial arrangement of the secondary structures. A web server is available at http://biomine.ece.ualberta.ca/MODAS/. PMID:20003388
Geometric phase coded metasurface: from polarization dependent directive electromagnetic wave scattering to diffusion-like scattering.

PubMed

Chen, Ke; Feng, Yijun; Yang, Zhongjie; Cui, Li; Zhao, Junming; Zhu, Bo; Jiang, Tian

2016-10-24

Ultrathin metasurface compromising various sub-wavelength meta-particles offers promising advantages in controlling electromagnetic wave by spatially manipulating the wavefront characteristics across the interface. The recently proposed digital coding metasurface could even simplify the design and optimization procedures due to the digitalization of the meta-particle geometry. However, current attempts to implement the digital metasurface still utilize several structural meta-particles to obtain certain electromagnetic responses, and requiring time-consuming optimization especially in multi-bits coding designs. In this regard, we present herein utilizing geometric phase based single structured meta-particle with various orientations to achieve either 1-bit or multi-bits digital metasurface. Particular electromagnetic wave scattering patterns dependent on the incident polarizations can be tailored by the encoded metasurfaces with regular sequences. On the contrast, polarization insensitive diffusion-like scattering can also been successfully achieved by digital metasurface encoded with randomly distributed coding sequences leading to substantial suppression of backward scattering in a broadband microwave frequency. The proposed digital metasurfaces provide simple designs and reveal new opportunities for controlling electromagnetic wave scattering with or without polarization dependence.
Geometric phase coded metasurface: from polarization dependent directive electromagnetic wave scattering to diffusion-like scattering

PubMed Central

Chen, Ke; Feng, Yijun; Yang, Zhongjie; Cui, Li; Zhao, Junming; Zhu, Bo; Jiang, Tian

2016-01-01

Ultrathin metasurface compromising various sub-wavelength meta-particles offers promising advantages in controlling electromagnetic wave by spatially manipulating the wavefront characteristics across the interface. The recently proposed digital coding metasurface could even simplify the design and optimization procedures due to the digitalization of the meta-particle geometry. However, current attempts to implement the digital metasurface still utilize several structural meta-particles to obtain certain electromagnetic responses, and requiring time-consuming optimization especially in multi-bits coding designs. In this regard, we present herein utilizing geometric phase based single structured meta-particle with various orientations to achieve either 1-bit or multi-bits digital metasurface. Particular electromagnetic wave scattering patterns dependent on the incident polarizations can be tailored by the encoded metasurfaces with regular sequences. On the contrast, polarization insensitive diffusion-like scattering can also been successfully achieved by digital metasurface encoded with randomly distributed coding sequences leading to substantial suppression of backward scattering in a broadband microwave frequency. The proposed digital metasurfaces provide simple designs and reveal new opportunities for controlling electromagnetic wave scattering with or without polarization dependence. PMID:27775064
Triphasic spike-timing-dependent plasticity organizes networks to produce robust sequences of neural activity

PubMed Central

Waddington, Amelia; Appleby, Peter A.; De Kamps, Marc; Cohen, Netta

2012-01-01

Synfire chains have long been proposed to generate precisely timed sequences of neural activity. Such activity has been linked to numerous neural functions including sensory encoding, cognitive and motor responses. In particular, it has been argued that synfire chains underlie the precise spatiotemporal firing patterns that control song production in a variety of songbirds. Previous studies have suggested that the development of synfire chains requires either initial sparse connectivity or strong topological constraints, in addition to any synaptic learning rules. Here, we show that this necessity can be removed by using a previously reported but hitherto unconsidered spike-timing-dependent plasticity (STDP) rule and activity-dependent excitability. Under this rule the network develops stable synfire chains that possess a non-trivial, scalable multi-layer structure, in which relative layer sizes appear to follow a universal function. Using computational modeling and a coarse grained random walk model, we demonstrate the role of the STDP rule in growing, molding and stabilizing the chain, and link model parameters to the resulting structure. PMID:23162457
SCHEMA computational design of virus capsid chimeras: calibrating how genome packaging, protection, and transduction correlate with calculated structural disruption.

PubMed

Ho, Michelle L; Adler, Benjamin A; Torre, Michael L; Silberg, Jonathan J; Suh, Junghae

2013-12-20

Adeno-associated virus (AAV) recombination can result in chimeric capsid protein subunits whose ability to assemble into an oligomeric capsid, package a genome, and transduce cells depends on the inheritance of sequence from different AAV parents. To develop quantitative design principles for guiding site-directed recombination of AAV capsids, we have examined how capsid structural perturbations predicted by the SCHEMA algorithm correlate with experimental measurements of disruption in seventeen chimeric capsid proteins. In our small chimera population, created by recombining AAV serotypes 2 and 4, we found that protection of viral genomes and cellular transduction were inversely related to calculated disruption of the capsid structure. Interestingly, however, we did not observe a correlation between genome packaging and calculated structural disruption; a majority of the chimeric capsid proteins formed at least partially assembled capsids and more than half packaged genomes, including those with the highest SCHEMA disruption. These results suggest that the sequence space accessed by recombination of divergent AAV serotypes is rich in capsid chimeras that assemble into 60-mer capsids and package viral genomes. Overall, the SCHEMA algorithm may be useful for delineating quantitative design principles to guide the creation of libraries enriched in genome-protecting virus nanoparticles that can effectively transduce cells. Such improvements to the virus design process may help advance not only gene therapy applications but also other bionanotechnologies dependent upon the development of viruses with new sequences and functions.
SCHEMA computational design of virus capsid chimeras: calibrating how genome packaging, protection, and transduction correlate with calculated structural disruption

PubMed Central

Ho, Michelle L.; Adler, Benjamin A.; Torre, Michael L.; Silberg, Jonathan J.; Suh, Junghae

2013-01-01

Adeno-associated virus (AAV) recombination can result in chimeric capsid protein subunits whose ability to assemble into an oligomeric capsid, package a genome, and transduce cells depends on the inheritance of sequence from different AAV parents. To develop quantitative design principles for guiding site-directed recombination of AAV capsids, we have examined how capsid structural perturbations predicted by the SCHEMA algorithm correlate with experimental measurements of disruption in seventeen chimeric capsid proteins. In our small chimera population, created by recombining AAV serotypes 2 and 4, we found that protection of viral genomes and cellular transduction were inversely related to calculated disruption of the capsid structure. Interestingly, however, we did not observe a correlation between genome packaging and calculated structural disruption; a majority of the chimeric capsid proteins formed at least partially assembled capsids and more than half packaged genomes, including those with the highest SCHEMA disruption. These results suggest that the sequence space accessed by recombination of divergent AAV serotypes is rich in capsid chimeras that assemble into 60-mer capsids and package viral genomes. Overall, the SCHEMA algorithm may be useful for delineating quantitative design principles to guide the creation of libraries enriched in genome-protecting virus nanoparticles that can effectively transduce cells. Such improvements to the virus design process may help advance not only gene therapy applications, but also other bionanotechnologies dependent upon the development of viruses with new sequences and functions. PMID:23899192
Preservation of protein clefts in comparative models.

PubMed

Piedra, David; Lois, Sergi; de la Cruz, Xavier

2008-01-16

Comparative, or homology, modelling of protein structures is the most widely used prediction method when the target protein has homologues of known structure. Given that the quality of a model may vary greatly, several studies have been devoted to identifying the factors that influence modelling results. These studies usually consider the protein as a whole, and only a few provide a separate discussion of the behaviour of biologically relevant features of the protein. Given the value of the latter for many applications, here we extended previous work by analysing the preservation of native protein clefts in homology models. We chose to examine clefts because of their role in protein function/structure, as they are usually the locus of protein-protein interactions, host the enzymes' active site, or, in the case of protein domains, can also be the locus of domain-domain interactions that lead to the structure of the whole protein. We studied how the largest cleft of a protein varies in comparative models. To this end, we analysed a set of 53507 homology models that cover the whole sequence identity range, with a special emphasis on medium and low similarities. More precisely we examined how cleft quality - measured using six complementary parameters related to both global shape and local atomic environment, depends on the sequence identity between target and template proteins. In addition to this general analysis, we also explored the impact of a number of factors on cleft quality, and found that the relationship between quality and sequence identity varies depending on cleft rank amongst the set of protein clefts (when ordered according to size), and number of aligned residues. We have examined cleft quality in homology models at a range of seq.id. levels. Our results provide a detailed view of how quality is affected by distinct parameters and thus may help the user of comparative modelling to determine the final quality and applicability of his/her cleft models. In addition, the large variability in model quality that we observed within each sequence bin, with good models present even at low sequence identities (between 20% and 30%), indicates that properly developed identification methods could be used to recover good cleft models in this sequence range.
Predicting 3D structure and stability of RNA pseudoknots in monovalent and divalent ion solutions.

PubMed

Shi, Ya-Zhou; Jin, Lei; Feng, Chen-Jie; Tan, Ya-Lan; Tan, Zhi-Jie

2018-06-01

RNA pseudoknots are a kind of minimal RNA tertiary structural motifs, and their three-dimensional (3D) structures and stability play essential roles in a variety of biological functions. Therefore, to predict 3D structures and stability of RNA pseudoknots is essential for understanding their functions. In the work, we employed our previously developed coarse-grained model with implicit salt to make extensive predictions and comprehensive analyses on the 3D structures and stability for RNA pseudoknots in monovalent/divalent ion solutions. The comparisons with available experimental data show that our model can successfully predict the 3D structures of RNA pseudoknots from their sequences, and can also make reliable predictions for the stability of RNA pseudoknots with different lengths and sequences over a wide range of monovalent/divalent ion concentrations. Furthermore, we made comprehensive analyses on the unfolding pathway for various RNA pseudoknots in ion solutions. Our analyses for extensive pseudokonts and the wide range of monovalent/divalent ion concentrations verify that the unfolding pathway of RNA pseudoknots is mainly dependent on the relative stability of unfolded intermediate states, and show that the unfolding pathway of RNA pseudoknots can be significantly modulated by their sequences and solution ion conditions.
Evidence for the Concerted Evolution between Short Linear Protein Motifs and Their Flanking Regions

PubMed Central

Chica, Claudia; Diella, Francesca; Gibson, Toby J.

2009-01-01

Background Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids. Results The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions. Conclusion The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise. PMID:19584925
Hydrodynamic Radii of Intrinsically Disordered Proteins Determined from Experimental Polyproline II Propensities

PubMed Central

Tomasso, Maria E.; Tarver, Micheal J.; Devarajan, Deepa; Whitten, Steven T.

2016-01-01

The properties of disordered proteins are thought to depend on intrinsic conformational propensities for polyproline II (PP II) structure. While intrinsic PP II propensities have been measured for the common biological amino acids in short peptides, the ability of these experimentally determined propensities to quantitatively reproduce structural behavior in intrinsically disordered proteins (IDPs) has not been established. Presented here are results from molecular simulations of disordered proteins showing that the hydrodynamic radius (R h) can be predicted from experimental PP II propensities with good agreement, even when charge-based considerations are omitted. The simulations demonstrate that R h and chain propensity for PP II structure are linked via a simple power-law scaling relationship, which was tested using the experimental R h of 22 IDPs covering a wide range of peptide lengths, net charge, and sequence composition. Charge effects on R h were found to be generally weak when compared to PP II effects on R h. Results from this study indicate that the hydrodynamic dimensions of IDPs are evidence of considerable sequence-dependent backbone propensities for PP II structure that qualitatively, if not quantitatively, match conformational propensities measured in peptides. PMID:26727467
Histoimmunogenetics Markup Language 1.0: Reporting next generation sequencing-based HLA and KIR genotyping.

PubMed

Milius, Robert P; Heuer, Michael; Valiga, Daniel; Doroschak, Kathryn J; Kennedy, Caleb J; Bolon, Yung-Tsi; Schneider, Joel; Pollack, Jane; Kim, Hwa Ran; Cereb, Nezih; Hollenbach, Jill A; Mack, Steven J; Maiers, Martin

2015-12-01

We present an electronic format for exchanging data for HLA and KIR genotyping with extensions for next-generation sequencing (NGS). This format addresses NGS data exchange by refining the Histoimmunogenetics Markup Language (HML) to conform to the proposed Minimum Information for Reporting Immunogenomic NGS Genotyping (MIRING) reporting guidelines (miring.immunogenomics.org). Our refinements of HML include two major additions. First, NGS is supported by new XML structures to capture additional NGS data and metadata required to produce a genotyping result, including analysis-dependent (dynamic) and method-dependent (static) components. A full genotype, consensus sequence, and the surrounding metadata are included directly, while the raw sequence reads and platform documentation are externally referenced. Second, genotype ambiguity is fully represented by integrating Genotype List Strings, which use a hierarchical set of delimiters to represent allele and genotype ambiguity in a complete and accurate fashion. HML also continues to enable the transmission of legacy methods (e.g. site-specific oligonucleotide, sequence-specific priming, and Sequence Based Typing (SBT)), adding features such as allowing multiple group-specific sequencing primers, and fully leveraging techniques that combine multiple methods to obtain a single result, such as SBT integrated with NGS. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Functional evolution and structural conservation in chimeric cytochromes p450: calibrating a structure-guided approach.

PubMed

Otey, Christopher R; Silberg, Jonathan J; Voigt, Christopher A; Endelman, Jeffrey B; Bandara, Geethani; Arnold, Frances H

2004-03-01

Recombination generates chimeric proteins whose ability to fold depends on minimizing structural perturbations that result when portions of the sequence are inherited from different parents. These chimeric sequences can display functional properties characteristic of the parents or acquire entirely new functions. Seventeen chimeras were generated from two CYP102 members of the functionally diverse cytochrome p450 family. Chimeras predicted to have limited structural disruption, as defined by the SCHEMA algorithm, displayed CO binding spectra characteristic of folded p450s. Even this small population exhibited significant functional diversity: chimeras displayed altered substrate specificities, a wide range in thermostabilities, up to a 40-fold increase in peroxidase activity, and ability to hydroxylate a substrate toward which neither parent heme domain shows detectable activity. These results suggest that SCHEMA-guided recombination can be used to generate diverse p450s for exploring function evolution within the p450 structural framework.
Structural and functional characterization of a calcium-activated cation channel from Tsukamurella paurometabola

NASA Astrophysics Data System (ADS)

Dhakshnamoorthy, Balasundaresan; Rohaim, Ahmed; Rui, Huan; Blachowicz, Lydia; Roux, Benoît

2016-09-01

The selectivity filter is an essential functional element of K+ channels that is highly conserved both in terms of its primary sequence and its three-dimensional structure. Here, we investigate the properties of an ion channel from the Gram-positive bacterium Tsukamurella paurometabola with a selectivity filter formed by an uncommon proline-rich sequence. Electrophysiological recordings show that it is a non-selective cation channel and that its activity depends on Ca2+ concentration. In the crystal structure, the selectivity filter adopts a novel conformation with Ca2+ ions bound within the filter near the pore helix where they are coordinated by backbone oxygen atoms, a recurrent motif found in multiple proteins. The binding of Ca2+ ion in the selectivity filter controls the widening of the pore as shown in crystal structures and in molecular dynamics simulations. The structural, functional and computational data provide a characterization of this calcium-gated cationic channel.
Hidden Markov model-derived structural alphabet for proteins: the learning of protein local shapes captures sequence specificity.

PubMed

Camproux, A C; Tufféry, P

2005-08-05

Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence.
From a marine neuropeptide to antimicrobial pseudopeptides containing aza-β(3)-amino acids: structure and activity

PubMed Central

Laurencin, Mathieu; Legrand, Baptiste; Duval, Emilie; Henry, Joël; Baudy-Floc'H, Michèle; Zatylny-Gaudin, Céline; Bondon, Arnaud

2012-01-01

Incorporation of aza-β3-amino acids into endogenous neuropeptide from mollusks (ALSGDAFLRF-NH2) with weak antimicrobial activities allows us to design new AMPs sequences. We find that, depending on the nature of the substitution, these could result either in inactive pseudopeptides or in a drastic enhancement of the antimicrobial activity without high cytotoxicity resulted. Structural studies perform by NMR and circular dichroism on the pseudopeptides show the impact of aza-β3-amino acids on the peptide structures. We obtain the first three-dimensional structures of pseudopeptides containing aza-β3-amino acids in aqueous micellar SDS and demonstrate that hydrazino turn can be formed in aqueous solution. Overall, these results demonstrate the ability to modulate AMPs activities through structural modifications induced by the nature and the position of these amino acid analogs in the peptide sequences. PMID:22320306
Modeling coding-sequence evolution within the context of residue solvent accessibility.

PubMed

Scherrer, Michael P; Meyer, Austin G; Wilke, Claus O

2012-09-12

Protein structure mediates site-specific patterns of sequence divergence. In particular, residues in the core of a protein (solvent-inaccessible residues) tend to be more evolutionarily conserved than residues on the surface (solvent-accessible residues). Here, we present a model of sequence evolution that explicitly accounts for the relative solvent accessibility of each residue in a protein. Our model is a variant of the Goldman-Yang 1994 (GY94) model in which all model parameters can be functions of the relative solvent accessibility (RSA) of a residue. We apply this model to a data set comprised of nearly 600 yeast genes, and find that an evolutionary-rate ratio ω that varies linearly with RSA provides a better model fit than an RSA-independent ω or an ω that is estimated separately in individual RSA bins. We further show that the branch length t and the transition-transverion ratio κ also vary with RSA. The RSA-dependent GY94 model performs better than an RSA-dependent Muse-Gaut 1994 (MG94) model in which the synonymous and non-synonymous rates individually are linear functions of RSA. Finally, protein core size affects the slope of the linear relationship between ω and RSA, and gene expression level affects both the intercept and the slope. Structure-aware models of sequence evolution provide a significantly better fit than traditional models that neglect structure. The linear relationship between ω and RSA implies that genes are better characterized by their ω slope and intercept than by just their mean ω.
A charge-dependent mechanism is responsible for the dynamic accumulation of proteins inside nucleoli.

PubMed

Musinova, Yana R; Kananykhina, Eugenia Y; Potashnikova, Daria M; Lisitsyna, Olga M; Sheval, Eugene V

2015-01-01

The majority of known nucleolar proteins are freely exchanged between the nucleolus and the surrounding nucleoplasm. One way proteins are retained in the nucleoli is by the presence of specific amino acid sequences, namely nucleolar localization signals (NoLSs). The mechanism by which NoLSs retain proteins inside the nucleoli is still unclear. Here, we present data showing that the charge-dependent (electrostatic) interactions of NoLSs with nucleolar components lead to nucleolar accumulation as follows: (i) known NoLSs are enriched in positively charged amino acids, but the NoLS structure is highly heterogeneous, and it is not possible to identify a consensus sequence for this type of signal; (ii) in two analyzed proteins (NF-κB-inducing kinase and HIV-1 Tat), the NoLS corresponds to a region that is enriched for positively charged amino acid residues; substituting charged amino acids with non-charged ones reduced the nucleolar accumulation in proportion to the charge reduction, and nucleolar accumulation efficiency was strongly correlated with the predicted charge of the tested sequences; and (iii) sequences containing only lysine or arginine residues (which were referred to as imitative NoLSs, or iNoLSs) are accumulated in the nucleoli in a charge-dependent manner. The results of experiments with iNoLSs suggested that charge-dependent accumulation inside the nucleoli was dependent on interactions with nucleolar RNAs. The results of this work are consistent with the hypothesis that nucleolar protein accumulation by NoLSs can be determined by the electrostatic interaction of positively charged regions with nucleolar RNAs rather than by any sequence-specific mechanism. Copyright © 2014 Elsevier B.V. All rights reserved.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Man, Viet Hoang; Pan, Feng; Sagui, Celeste, E-mail: sagui@ncsu.edu

We explore the use of a fast laser melting simulation approach combined with atomistic molecular dynamics simulations in order to determine the melting and healing responses of B-DNA and Z-DNA dodecamers with the same d(5′-CGCGCGCGCGCG-3′){sub 2} sequence. The frequency of the laser pulse is specifically tuned to disrupt Watson-Crick hydrogen bonds, thus inducing melting of the DNA duplexes. Subsequently, the structures relax and partially refold, depending on the field strength. In addition to the inherent interest of the nonequilibrium melting process, we propose that fast melting by an infrared laser pulse could be used as a technique for a fastmore » comparison of relative stabilities of same-sequence oligonucleotides with different secondary structures with full atomistic detail of the structures and solvent. This could be particularly useful for nonstandard secondary structures involving non-canonical base pairs, mismatches, etc.« less
In vitro selection of high temperature Zn(2+)-dependent DNAzymes.

PubMed

Nelson, Kevin E; Bruesehoff, Peter J; Lu, Yi

2005-08-01

In vitro selection of Zn(2+)-dependent RNA-cleaving DNAzymes with activity at 90 degrees C has yielded a diverse spool of selected sequences. The RNA cleavage efficiency was found in all cases to be specific for Zn(2+) over Pb(2+), Ca(2+), Cd(2+), Co(2+), Hg(2+), and Mg(2+). The Zn(2+)-dependent activity assay of the most active sequence showed that the DNAzyme possesses an apparent Zn(2+)-binding dissociation constant of 234 muM and that its activity increases with increasing temperatures from 50-90 degrees C. A fit of the Arrhenius plot data gave E(a) = 15.3 kcal mol(-1). Surprisingly, the selected Zn(2+)-dependent DNAzymes showed only a modest (approximately 3-fold) activity enhancement over the background rate of cleavage of random sequences containing a single embedded ribonucleotide within an otherwise DNA oligonucleotide. The result is attributable to the ability of DNA to sustain cleavage activity at high temperature with minimal secondary structure when Zn(2+) is present. Since this effect is highly specific for Zn(2+), this metal ion may play a special role in molecular evolution of nucleic acids at high temperature.
Independent Evolution of Six Families of Halogenating Enzymes

PubMed Central

Xu, Gangming; Wang, Bin-Gui

2016-01-01

Halogenated natural products are widespread in the environment, and the halogen atoms are typically vital to their bioactivities. Thus far, six families of halogenating enzymes have been identified: cofactor-free haloperoxidases (HPO), vanadium-dependent haloperoxidases (V-HPO), heme iron-dependent haloperoxidases (HI-HPO), non-heme iron-dependent halogenases (NI-HG), flavin-dependent halogenases (F-HG), and S-adenosyl-L-methionine (SAM)-dependent halogenases (S-HG). However, these halogenating enzymes with similar biological functions but distinct structures might have evolved independently. Phylogenetic and structural analyses suggest that the HPO, V-HPO, HI-HPO, NI-HG, F-HG, and S-HG enzyme families may have evolutionary relationships to the α/β hydrolases, acid phosphatases, peroxidases, chemotaxis phosphatases, oxidoreductases, and SAM hydroxide adenosyltransferases, respectively. These halogenating enzymes have established sequence homology, structural conservation, and mechanistic features within each family. Understanding the distinct evolutionary history of these halogenating enzymes will provide further insights into the study of their catalytic mechanisms and halogenation specificity. PMID:27153321
JDet: interactive calculation and visualization of function-related conservation patterns in multiple sequence alignments and structures.

PubMed

Muth, Thilo; García-Martín, Juan A; Rausell, Antonio; Juan, David; Valencia, Alfonso; Pazos, Florencio

2012-02-15

We have implemented in a single package all the features required for extracting, visualizing and manipulating fully conserved positions as well as those with a family-dependent conservation pattern in multiple sequence alignments. The program allows, among other things, to run different methods for extracting these positions, combine the results and visualize them in protein 3D structures and sequence spaces. JDet is a multiplatform application written in Java. It is freely available, including the source code, at http://csbg.cnb.csic.es/JDet. The package includes two of our recently developed programs for detecting functional positions in protein alignments (Xdet and S3Det), and support for other methods can be added as plug-ins. A help file and a guided tutorial for JDet are also available.
Development of a novel biosensing system based on the structural change of a polymerized guanine-quadruplex DNA nanostructure.

PubMed

Morita, Yo; Yoshida, Wataru; Savory, Nasa; Han, Sung Woong; Tera, Masayuki; Nagasawa, Kazuo; Nakamura, Chikashi; Sode, Koji; Ikebukuro, Kazunori

2011-08-15

By inserting an adenosine aptamer into an aptamer that forms a G-quadruplex, we developed an adaptor molecule, named the Gq-switch, which links an electrode with flavin adenine dinucleotide-dependent glucose dehydrogenase (FADGDH) that is capable of transferring electron to a electrode directly. First, we selected an FADGDH-binding aptamer and identified that its sequence is composed of two blocks of consecutive six guanine bases and it forms a polymerized G-quadruplex structure. Then, we inserted a sequence of an adenosine aptamer between the two blocks of consecutive guanine bases, and we found it also bound to adenosine. Then we named it as Gq-switch. In the absence of adenosine, the Gq-switch-FADGDH complex forms a 30-nm high bulb-shaped structure that changes in the presence of adenosine to give an 8-nm high wire-shaped structure. This structural change brings the FADGDH sufficiently close to the electrode for electron transfer to occur, and the adenosine can be detected from the current produced by the FADGDH. Adenosine was successfully detected with a concentration dependency using the Gq-switch-FADGDH complex immobilized Au electrode by measuring response current to the addition of glucose. Copyright © 2011 Elsevier B.V. All rights reserved.
Distinct Mechanisms of Nuclease-Directed DNA-Structure-Induced Genetic Instability in Cancer Genomes.

PubMed

Zhao, Junhua; Wang, Guliang; Del Mundo, Imee M; McKinney, Jennifer A; Lu, Xiuli; Bacolla, Albino; Boulware, Stephen B; Zhang, Changsheng; Zhang, Haihua; Ren, Pengyu; Freudenreich, Catherine H; Vasquez, Karen M

2018-01-30

Sequences with the capacity to adopt alternative DNA structures have been implicated in cancer etiology; however, the mechanisms are unclear. For example, H-DNA-forming sequences within oncogenes have been shown to stimulate genetic instability in mammals. Here, we report that H-DNA-forming sequences are enriched at translocation breakpoints in human cancer genomes, further implicating them in cancer etiology. H-DNA-induced mutations were suppressed in human cells deficient in the nucleotide excision repair nucleases, ERCC1-XPF and XPG, but were stimulated in cells deficient in FEN1, a replication-related endonuclease. Further, we found that these nucleases cleaved H-DNA conformations, and the interactions of modeled H-DNA with ERCC1-XPF, XPG, and FEN1 proteins were explored at the sub-molecular level. The results suggest mechanisms of genetic instability triggered by H-DNA through distinct structure-specific, cleavage-based replication-independent and replication-dependent pathways, providing critical evidence for a role of the DNA structure itself in the etiology of cancer and other human diseases. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
In silico approaches reveal the potential for DNA sequence-dependent histone octamer affinity to influence chromatin structure in vivo.

PubMed

Fraser, Ross M; Allan, James; Simmen, Martin W

2006-12-08

Nucleosome positioning signals embedded within the DNA sequence have the potential to influence the detailed structure of the higher-order chromatin fibre. In two previous studies of long stretches of DNA, encompassing the chicken beta-globin and ovine beta-lactoglobulin genes, respectively, we mapped the relative affinity of every site for the core histone octamer. In both cases a periodic arrangement of the in vitro positioning sites suggests that they might influence the folding of a nucleosome chain into higher-order structure; this hypothesis was borne out in the case of the beta-lactoglobulin gene, where the distribution of the in vitro positioning sites is related to the positions nucleosomes actually occupy in sheep liver cells. Here, we have exploited the in vitro nucleosome positioning datasets to simulate nucleosomal organisation using in silico approaches. We use the high-resolution, quantitative positioning maps to define a one-dimensional positioning energy lattice, which can be populated with a defined number of nucleosomes. Monte Carlo techniques are employed to simulate the behaviour of the model at equilibrium to produce a set of configurations, which provide a probability-based occupancy map. Employing a variety of techniques we show that the occupancy maps are a sensitive function of the histone octamer density (nucleosome repeat length) and find that a minimal change in this property can produce dramatic localised changes in structure. Although simulations generally give rise to regular periodic nucleosomal arrangements, they often show octamer density-dependent discontinuities, which tend to co-localise with sequences that adopt distinctive chromatin structure in vivo. Furthermore, the overall organisation of simulated chromatin structures are more closely related to the situation in vivo than is the original in vitro positioning data, particularly at a nucleosome density corresponding to the in vivo state. Although our model is simplified, we argue that it provides a unique insight into the influence that DNA sequence can have in determining chromatin structure and could serve as a useful basis for the incorporation of other parameters.
RNAdualPF: software to compute the dual partition function with sample applications in molecular evolution theory.

PubMed

Garcia-Martin, Juan Antonio; Bayegan, Amir H; Dotu, Ivan; Clote, Peter

2016-10-19

RNA inverse folding is the problem of finding one or more sequences that fold into a user-specified target structure s 0 , i.e. whose minimum free energy secondary structure is identical to the target s 0 . Here we consider the ensemble of all RNA sequences that have low free energy with respect to a given target s 0 . We introduce the program RNAdualPF, which computes the dual partition function Z ∗ , defined as the sum of Boltzmann factors exp(-E(a,s 0 )/RT) of all RNA nucleotide sequences a compatible with target structure s 0 . Using RNAdualPF, we efficiently sample RNA sequences that approximately fold into s 0 , where additionally the user can specify IUPAC sequence constraints at certain positions, and whether to include dangles (energy terms for stacked, single-stranded nucleotides). Moreover, since we also compute the dual partition function Z ∗ (k) over all sequences having GC-content k, the user can require that all sampled sequences have a precise, specified GC-content. Using Z ∗ , we compute the dual expected energy 〈E ∗ 〉, and use it to show that natural RNAs from the Rfam 12.0 database have higher minimum free energy than expected, thus suggesting that functional RNAs are under evolutionary pressure to be only marginally thermodynamically stable. We show that C. elegans precursor microRNA (pre-miRNA) is significantly non-robust with respect to mutations, by comparing the robustness of each wild type pre-miRNA sequence with 2000 [resp. 500] sequences of the same GC-content generated by RNAdualPF, which approximately [resp. exactly] fold into the wild type target structure. We confirm and strengthen earlier findings that precursor microRNAs and bacterial small noncoding RNAs display plasticity, a measure of structural diversity. We describe RNAdualPF, which rapidly computes the dual partition function Z ∗ and samples sequences having low energy with respect to a target structure, allowing sequence constraints and specified GC-content. Using different inverse folding software, another group had earlier shown that pre-miRNA is mutationally robust, even controlling for compositional bias. Our opposite conclusion suggests a cautionary note that computationally based insights into molecular evolution may heavily depend on the software used. C/C++-software for RNAdualPF is available at http://bioinformatics.bc.edu/clotelab/RNAdualPF .
Main immunogenic region structure promotes binding of conformation-dependent myasthenia gravis autoantibodies, nicotinic acetylcholine receptor conformation maturation, and agonist sensitivity

PubMed Central

Luo, Jie; Taylor, Palmer; Losen, Mario; de Baets, Marc H.; Shelton, G. Diane; Lindstrom, Jon

2009-01-01

The main immunogenic region (MIR) is a conformation-dependent region at the extracellular apex of α1 subunits of muscle nicotinic acetylcholine receptor (AChR) that is the target of half or more of the autoantibodies to muscle AChRs in human myasthenia gravis and rat experimental autoimmune myasthenia gravis. By making chimeras of human α1 subunits with α7 subunits, both MIR epitopes recognized by rat mAbs and by the patient-derived human mAb 637 to the MIR were determined to consist of two discontiguous sequences, which are adjacent only in the native conformation. The MIR, including loop α1 67–76 in combination with the N-terminal α helix α1 1–14, conferred high-affinity binding for most rat mAbs to the MIR. However, an additional sequence corresponding to α1 15–32 was required for high-affinity binding of human mAb 637. A water soluble chimera of Aplysia acetylcholine binding protein with the same α1 MIR sequences substituted was recognized by a majority of human, feline, and canine MG sera. The presence of the α1 MIR sequences in α1/α7 chimeras greatly promoted AChR expression and significantly altered the sensitivity to activation. This reveals a structural and functional, as well as antigenic, significance of the MIR. PMID:19890000
Progressive levels of physical dependence to tobacco coincide with changes in the anterior cingulum bundle microstructure.

PubMed

Huang, Wei; DiFranza, Joseph R; Kennedy, David N; Zhang, Nanyin; Ziedonis, Douglas; Ursprung, Sanouri; King, Jean A

2013-01-01

The tobacco withdrawal syndrome indicates the development of neurophysiologic dependence. Clinical evidence indicates that neurophysiologic dependence develops through a set sequence of symptom presentation that can be assessed with a new 3-item survey measure of wanting, craving, and needing tobacco, the Level of Physical Dependence (PD). This study sought to determine if advancing neurophysiologic dependence as measured by the Level of PD correlates with characteristics of white matter structure measured by Fractional Anisotropy (FA). Diffusion-MRI based FA and diffusion tensor imaging probabilistic tractography were used to evaluate 11 smokers and 10 nonsmokers. FA was also examined in relation to two additional measures of dependence severity, the Hooked on Nicotine Checklist (HONC), and the Fagerström Test for Nicotine Dependence (FTND). Among smokers, FA in the left anterior cingulate bundle (ACb) correlated negatively with the Level of PD (r = -0.68, p = 0.02) and HONC scores (r = -0.65, p = 0.03), but the correlation for the FTND did not reach statistical significance (r = -49, p = 0.12). With advancing Levels of PD, the density of streamlines between the ACb and precuneus increased (r = -0.67, p<0.05) and those between the ACb and white matter projecting to the superior-frontal cortex (r = -0.86, p = 0.0006) decreased significantly. The correlations between neural structure and both the clinical Level of PD survey measure and the HONC suggest that the Level of PD and the HONC may reflect the microstructural integrity of white matter, as influenced by tobacco abuse. Given that the Level of PD is measuring a sequence of symptoms of neurophysiologic dependence that develops over time, the correlation between the Level of PD and neural structure suggests that these features might represent neuroplastic changes that develop over time to support the development of neurophysiologic dependence.
Purification and sequence characterization of chondroitin sulfate and dermatan sulfate from fishes.

PubMed

Lin, Na; Mo, Xiaoli; Yang, Yang; Zhang, Hong

2017-04-01

Chondroitin sulfate (CS) and dermatan sulfate (DS) were extracted and purified from skins or bones of salmon (Salmo salar), snakehead (Channa argus), monkfish (Lophius litulon) and skipjack tuna (Katsuwonus pelamis). Size, structural sequences and sulfate groups of oligosaccharides in the purified CS and DS could be characterized and identified using high performance liquid chromatography (HPLC) combined with Orbitrap mass spectrometry. CS and DS chain structure varies depending on origin, but motif structure appears consistent. Structures of CS and DS oligosaccharides with different size and sulfate groups were compared between fishes and other animals, and results showed that some minor differences of special structures could be identified by hydrophilic interaction chromatography-liquid chromatography-fourier transform-mass/mass spectrometry (HILIC-LC-FT-MS/MS). For example, data showed that salmon and skipjack CS had a higher percentage content of high-level sulfated oligosaccharides than that porcine CS. In addition, structural information of different origins of CS and DS was analyzed by principal component analysis (PCA) and results showed that CS and DS samples could be differentiated according to their molecular conformation and oligosaccharide fragments information. Understanding CS and DS structure derived from different origins may lead to the production of CS or DS with unique disaccharides or oligosaccharides sequence composition and biological functions.
Optimal packaging of FIV genomic RNA depends upon a conserved long-range interaction and a palindromic sequence within gag.

PubMed

Rizvi, Tahir A; Kenyon, Julia C; Ali, Jahabar; Aktar, Suriya J; Phillip, Pretty S; Ghazawi, Akela; Mustafa, Farah; Lever, Andrew M L

2010-10-15

The feline immunodeficiency virus (FIV) is a lentivirus that is related to human immunodeficiency virus (HIV), causing a similar pathology in cats. It is a potential small animal model for AIDS and the FIV-based vectors are also being pursued for human gene therapy. Previous studies have mapped the FIV packaging signal (ψ) to two or more discontinuous regions within the 5' 511 nt of the genomic RNA and structural analyses have determined its secondary structure. The 5' and 3' sequences within ψ region interact through extensive long-range interactions (LRIs), including a conserved heptanucleotide interaction between R/U5 and gag. Other secondary structural elements identified include a conserved 150 nt stem-loop (SL2) and a small palindromic stem-loop within gag open reading frame that might act as a viral dimerization initiation site. We have performed extensive mutational analysis of these sequences and structures and ascertained their importance in FIV packaging using a trans-complementation assay. Disrupting the conserved heptanucleotide LRI to prevent base pairing between R/U5 and gag reduced packaging by 2.8-5.5 fold. Restoration of pairing using an alternative, non-wild type (wt) LRI sequence restored RNA packaging and propagation to wt levels, suggesting that it is the structure of the LRI, rather than its sequence, that is important for FIV packaging. Disrupting the palindrome within gag reduced packaging by 1.5-3-fold, but substitution with a different palindromic sequence did not restore packaging completely, suggesting that the sequence of this region as well as its palindromic nature is important. Mutation of individual regions of SL2 did not have a pronounced effect on FIV packaging, suggesting that either it is the structure of SL2 as a whole that is necessary for optimal packaging, or that there is redundancy within this structure. The mutational analysis presented here has further validated the previously predicted RNA secondary structure of FIV ψ. Copyright © 2010 Elsevier Ltd. All rights reserved.
Deciphering the Hidden Informational Content of Protein Sequences

PubMed Central

Liu, Ming; Hua, Qing-xin; Hu, Shi-Quan; Jia, Wenhua; Yang, Yanwu; Saith, Sunil Evan; Whittaker, Jonathan; Arvan, Peter; Weiss, Michael A.

2010-01-01

Protein sequences encode both structure and foldability. Whereas the interrelationship of sequence and structure has been extensively investigated, the origins of folding efficiency are enigmatic. We demonstrate that the folding of proinsulin requires a flexible N-terminal hydrophobic residue that is dispensable for the structure, activity, and stability of the mature hormone. This residue (PheB1 in placental mammals) is variably positioned within crystal structures and exhibits 1H NMR motional narrowing in solution. Despite such flexibility, its deletion impaired insulin chain combination and led in cell culture to formation of non-native disulfide isomers with impaired secretion of the variant proinsulin. Cellular folding and secretion were maintained by hydrophobic substitutions at B1 but markedly perturbed by polar or charged side chains. We propose that, during folding, a hydrophobic side chain at B1 anchors transient long-range interactions by a flexible N-terminal arm (residues B1–B8) to mediate kinetic or thermodynamic partitioning among disulfide intermediates. Evidence for the overall contribution of the arm to folding was obtained by alanine scanning mutagenesis. Together, our findings demonstrate that efficient folding of proinsulin requires N-terminal sequences that are dispensable in the native state. Such arm-dependent folding can be abrogated by mutations associated with β-cell dysfunction and neonatal diabetes mellitus. PMID:20663888
Structural Studies of Geosmin Synthase, a Bifunctional Sesquiterpene Synthase with Alpha-Alpha Domain Architecture that Catalyzes a Unique Cyclization-Fragmentation Reaction Sequence

PubMed Central

Harris, Golda G.; Lombardi, Patrick M.; Pemberton, Travis A.; Matsui, Tsutomu; Weiss, Thomas M.; Cole, Kathryn E.; Köksal, Mustafa; Murphy, Frank V.; Vedula, L. Sangeetha; Chou, Wayne K.W.; Cane, David E.; Christianson, David W.

2015-01-01

Geosmin synthase from Streptomyces coelicolor (ScGS) catalyzes an unusual, metal-dependent terpenoid cyclization and fragmentation reaction sequence. Two distinct active sites are required for catalysis: the N-terminal domain catalyzes the ionization and cyclization of farnesyl diphosphate to form germacradienol and inorganic pyrophosphate (PPi), and the C-terminal domain catalyzes the protonation, cyclization, and fragmentation of germacradienol to form geosmin and acetone through a retro-Prins reaction. A unique αα domain architecture is predicted for ScGS based on amino acid sequence: each domain contains the metal-binding motifs typical of a class I terpenoid cyclase, and each domain requires Mg2+ for catalysis. Here, we report the X-ray crystal structure of the unliganded N-terminal domain of ScGS and the structure of its complex with 3 Mg2+ ions and alendronate. These structures highlight conformational changes required for active site closure and catalysis. Although neither full-length ScGS nor constructs of the C-terminal domain could be crystallized, homology models of the C-terminal domain were constructed based on ~36% sequence identity with the N-terminal domain. Small-angle X-ray scattering experiments yield low resolution molecular envelopes into which the N-terminal domain crystal structure and the C-terminal domain homology model were fit, suggesting possible αα domain architectures as frameworks for bifunctional catalysis. PMID:26598179
Chirality- and sequence-selective successive self-sorting via specific homo- and complementary-duplex formations

PubMed Central

Makiguchi, Wataru; Tanabe, Junki; Yamada, Hidekazu; Iida, Hiroki; Taura, Daisuke; Ousaka, Naoki; Yashima, Eiji

2015-01-01

Self-recognition and self-discrimination within complex mixtures are of fundamental importance in biological systems, which entirely rely on the preprogrammed monomer sequences and homochirality of biological macromolecules. Here we report artificial chirality- and sequence-selective successive self-sorting of chiral dimeric strands bearing carboxylic acid or amidine groups joined by chiral amide linkers with different sequences through homo- and complementary-duplex formations. A mixture of carboxylic acid dimers linked by racemic-1,2-cyclohexane bis-amides with different amide sequences (NHCO or CONH) self-associate to form homoduplexes in a completely sequence-selective way, the structures of which are different from each other depending on the linker amide sequences. The further addition of an enantiopure amide-linked amidine dimer to a mixture of the racemic carboxylic acid dimers resulted in the formation of a single optically pure complementary duplex with a 100% diastereoselectivity and complete sequence specificity stabilized by the amidinium–carboxylate salt bridges, leading to the perfect chirality- and sequence-selective duplex formation. PMID:26051291
Structure of human POFUT2: insights into thrombospondin type 1 repeat fold and O-fucosylation

PubMed Central

Chen, Chun-I; Keusch, Jeremy J; Klein, Dominique; Hess, Daniel; Hofsteenge, Jan; Gut, Heinz

2012-01-01

Protein O-fucosylation is a post-translational modification found on serine/threonine residues of thrombospondin type 1 repeats (TSR). The fucose transfer is catalysed by the enzyme protein O-fucosyltransferase 2 (POFUT2) and >40 human proteins contain the TSR consensus sequence for POFUT2-dependent fucosylation. To better understand O-fucosylation on TSR, we carried out a structural and functional analysis of human POFUT2 and its TSR substrate. Crystal structures of POFUT2 reveal a variation of the classical GT-B fold and identify sugar donor and TSR acceptor binding sites. Structural findings are correlated with steady-state kinetic measurements of wild-type and mutant POFUT2 and TSR and give insight into the catalytic mechanism and substrate specificity. By using an artificial mini-TSR substrate, we show that specificity is not primarily encoded in the TSR protein sequence but rather in the unusual 3D structure of a small part of the TSR. Our findings uncover that recognition of distinct conserved 3D fold motifs can be used as a mechanism to achieve substrate specificity by enzymes modifying completely folded proteins of very wide sequence diversity and biological function. PMID:22588082
In vitro synthesis of minus-strand RNA by an isolated cereal yellow dwarf virus RNA-dependent RNA polymerase requires VPg and a stem-loop structure at the 3' end of the virus RNA.

PubMed

Osman, Toba A M; Coutts, Robert H A; Buck, Kenneth W

2006-11-01

Cereal yellow dwarf virus (CYDV) RNA has a 5'-terminal genome-linked protein (VPg). We have expressed the VPg region of the CYDV genome in bacteria and used the purified protein (bVPg) to raise an antiserum which was able to detect free VPg in extracts of CYDV-infected oat plants. A template-dependent RNA-dependent RNA polymerase (RdRp) has been produced from a CYDV membrane-bound RNA polymerase by treatment with BAL 31 nuclease. The RdRp was template specific, being able to utilize templates from CYDV plus- and minus-strand RNAs but not those of three unrelated viruses, Red clover necrotic mosaic virus, Cucumber mosaic virus, and Tobacco mosaic virus. RNA synthesis catalyzed by the RdRp required a 3'-terminal GU sequence and the presence of bVPg. Additionally, synthesis of minus-strand RNA on a plus-strand RNA template required the presence of a putative stem-loop structure near the 3' terminus of CYDV RNA. The base-paired stem, a single-nucleotide (A) bulge in the stem, and the sequence of a tetraloop were all required for the template activity. Evidence was produced showing that minus-strand synthesis in vitro was initiated by priming by bVPg at the 3' end of the template. The data are consistent with a model in which the RdRp binds to the stem-loop structure which positions the active site to recognize the 3'-terminal GU sequence for initiation of RNA synthesis by the addition of an A residue to VPg.
Conformation-dependent epitopes recognized by prion protein antibodies probed using mutational scanning and deep sequencing.

PubMed

Doolan, Kyle M; Colby, David W

2015-01-30

Prion diseases are caused by a structural rearrangement of the cellular prion protein, PrP(C), into a disease-associated conformation, PrP(Sc), which may be distinguished from one another using conformation-specific antibodies. We used mutational scanning by cell-surface display to screen 1341 PrP single point mutants for attenuated interaction with four anti-PrP antibodies, including several with conformational specificity. Single-molecule real-time gene sequencing was used to quantify enrichment of mutants, returning 26,000 high-quality full-length reads for each screened population on average. Relative enrichment of mutants correlated to the magnitude of the change in binding affinity. Mutations that diminished binding of the antibody ICSM18 represented the core of contact residues in the published crystal structure of its complex. A similarly located binding site was identified for D18, comprising discontinuous residues in helix 1 of PrP, brought into close proximity to one another only when the alpha helix is intact. The specificity of these antibodies for the normal form of PrP likely arises from loss of this conformational feature after conversion to the disease-associated form. Intriguingly, 6H4 binding was found to depend on interaction with the same residues, among others, suggesting that its ability to recognize both forms of PrP depends on a structural rearrangement of the antigen. The application of mutational scanning and deep sequencing provides residue-level resolution of positions in the protein-protein interaction interface that are critical for binding, as well as a quantitative measure of the impact of mutations on binding affinity. Copyright © 2014 Elsevier Ltd. All rights reserved.
In Vitro Synthesis of Minus-Strand RNA by an Isolated Cereal Yellow Dwarf Virus RNA-Dependent RNA Polymerase Requires VPg and a Stem-Loop Structure at the 3′ End of the Virus RNA▿

PubMed Central

Osman, Toba A. M.; Coutts, Robert H. A.; Buck, Kenneth W.

2006-01-01

Cereal yellow dwarf virus (CYDV) RNA has a 5′-terminal genome-linked protein (VPg). We have expressed the VPg region of the CYDV genome in bacteria and used the purified protein (bVPg) to raise an antiserum which was able to detect free VPg in extracts of CYDV-infected oat plants. A template-dependent RNA-dependent RNA polymerase (RdRp) has been produced from a CYDV membrane-bound RNA polymerase by treatment with BAL 31 nuclease. The RdRp was template specific, being able to utilize templates from CYDV plus- and minus-strand RNAs but not those of three unrelated viruses, Red clover necrotic mosaic virus, Cucumber mosaic virus, and Tobacco mosaic virus. RNA synthesis catalyzed by the RdRp required a 3′-terminal GU sequence and the presence of bVPg. Additionally, synthesis of minus-strand RNA on a plus-strand RNA template required the presence of a putative stem-loop structure near the 3′ terminus of CYDV RNA. The base-paired stem, a single-nucleotide (A) bulge in the stem, and the sequence of a tetraloop were all required for the template activity. Evidence was produced showing that minus-strand synthesis in vitro was initiated by priming by bVPg at the 3′ end of the template. The data are consistent with a model in which the RdRp binds to the stem-loop structure which positions the active site to recognize the 3′-terminal GU sequence for initiation of RNA synthesis by the addition of an A residue to VPg. PMID:16928757
Characterization of host-dependent mutations of apple fruit crinkle viroid replicating in newly identified experimental hosts suggests maintenance of stem-loop structures in the left-hand half of the molecule is important for replication.

PubMed

Suzuki, Takahiro; Fujibayashi, Misato; Hataya, Tatsuji; Taneda, Akito; He, Ying-Hong; Tsushima, Taro; Duraisamy, Ganesh Selvaraj; Siglová, Kristyna; Matoušek, Jaroslav; Sano, Teruo

2017-03-01

Apple fruit crinkle viroid (AFCVd) is a tentative member of the genus Apscaviroid, family Pospiviroidae. AFCVd has a narrow host range and is known to infect apple, hop and persimmon as natural hosts. In this study, tomato, cucumber and wild hop have been identified as new experimental herbaceous hosts. Foliar symptoms were very mild or virtually undetectable, but fruits of infected tomato were small, cracked and distorted. These symptoms resemble those observed on some AFCVd-sensitive apple cultivars. After transfer to tomato, cucumber and wild hop, sequence changes were detected in a natural AFCVd isolate from hop, and major variants in tomato, cucumber and wild hop differed in 10, 8 or 2 nucleotides, respectively, from the predominant one in the inoculum. The major variants in tomato and cucumber were almost identical, and the one in wild hop was very similar to the one in cultivated hop. Detailed analyses of the host-dependent sequence changes that appear in a naturally occurring AFCVd isolate from hop after transfer to tomato using small RNA deep sequence data and infectivity studies with dimeric RNA transcripts followed by progeny analysis indicate that the major AFCVd variant in tomato emerged by selection of a minor variant present in the inoculum (i.e. hop) followed by one to two host-dependent de novo mutations. Comparison of the secondary structures of major variants in hop, tomato and persimmon after transfer to tomato suggested that maintenance of stem-loop structures in the left-hand half of the molecule is critical for infection.

Correlation of Local Effects of DNA Sequence and Position of Beta-Alanine Inserts with Polyamide-DNA Complex Binding Affinities and Kinetics

PubMed Central

Wang, Shuo; Nanjunda, Rupesh; Aston, Karl; Bashkin, James K.; Wilson, W. David

2012-01-01

In order to better understand the effects of β-alanine (β) substitution and the number of heterocycles on DNA binding affinity and selectivity, the interactions of an eight-ring hairpin polyamide (PA) and two β derivatives as well as a six-heterocycle analog have been investigated with their cognate DNA sequence, 5′-TGGCTT-3′. Binding selectivity and the effects of β have been investigated with the cognate and five mutant DNAs. A set of powerful and complementary methods have been employed for both energetic and structural evaluations: UV-melting, biosensor-surface plasmon resonance, isothermal titration calorimetry, circular dichroism and a DNA ligation ladder global structure assay. The reduced number of heterocycles in the six-ring PA weakens the binding affinity; however, the smaller PA aggregates significantly less than the larger PAs, and allows us to obtain the binding thermodynamics. The PA-DNA binding enthalpy is large and negative with a large negative ΔCp, and is the primary driving component of the Gibbs free energy. The complete SPR binding results clearly show that β substitutions can substantially weaken the binding affinity of hairpin PAs in a position-dependent manner. More importantly, the changes in PA binding to the mutant DNAs further confirm the position-dependent effects on PA-DNA interaction affinity. Comparison of mutant DNA sequences also shows a different effect in recognition of T•A versus A•T base pairs. The effects of DNA mutations on binding of a single PA as well as the effects of the position of β substitution on binding tell a clear and very important story about sequence dependent binding of PAs to DNA. PMID:23167504
Prediction of beta-turns and beta-turn types by a novel bidirectional Elman-type recurrent neural network with multiple output layers (MOLEBRNN).

PubMed

Kirschner, Andreas; Frishman, Dmitrij

2008-10-01

Prediction of beta-turns from amino acid sequences has long been recognized as an important problem in structural bioinformatics due to their frequent occurrence as well as their structural and functional significance. Because various structural features of proteins are intercorrelated, secondary structure information has been often employed as an additional input for machine learning algorithms while predicting beta-turns. Here we present a novel bidirectional Elman-type recurrent neural network with multiple output layers (MOLEBRNN) capable of predicting multiple mutually dependent structural motifs and demonstrate its efficiency in recognizing three aspects of protein structure: beta-turns, beta-turn types, and secondary structure. The advantage of our method compared to other predictors is that it does not require any external input except for sequence profiles because interdependencies between different structural features are taken into account implicitly during the learning process. In a sevenfold cross-validation experiment on a standard test dataset our method exhibits the total prediction accuracy of 77.9% and the Mathew's Correlation Coefficient of 0.45, the highest performance reported so far. It also outperforms other known methods in delineating individual turn types. We demonstrate how simultaneous prediction of multiple targets influences prediction performance on single targets. The MOLEBRNN presented here is a generic method applicable in a variety of research fields where multiple mutually depending target classes need to be predicted. http://webclu.bio.wzw.tum.de/predator-web/.
Complementary molecular information changes our perception of food web structure

PubMed Central

Wirta, Helena K.; Hebert, Paul D. N.; Kaartinen, Riikka; Prosser, Sean W.; Várkonyi, Gergely; Roslin, Tomas

2014-01-01

How networks of ecological interactions are structured has a major impact on their functioning. However, accurately resolving both the nodes of the webs and the links between them is fraught with difficulties. We ask whether the new resolution conferred by molecular information changes perceptions of network structure. To probe a network of antagonistic interactions in the High Arctic, we use two complementary sources of molecular data: parasitoid DNA sequenced from the tissues of their hosts and host DNA sequenced from the gut of adult parasitoids. The information added by molecular analysis radically changes the properties of interaction structure. Overall, three times as many interaction types were revealed by combining molecular information from parasitoids and hosts with rearing data, versus rearing data alone. At the species level, our results alter the perceived host specificity of parasitoids, the parasitoid load of host species, and the web-wide role of predators with a cryptic lifestyle. As the northernmost network of host–parasitoid interactions quantified, our data point exerts high leverage on global comparisons of food web structure. However, how we view its structure will depend on what information we use: compared with variation among networks quantified at other sites, the properties of our web vary as much or much more depending on the techniques used to reconstruct it. We thus urge ecologists to combine multiple pieces of evidence in assessing the structure of interaction webs, and suggest that current perceptions of interaction structure may be strongly affected by the methods used to construct them. PMID:24449902
G-quadruplex prediction in E. coli genome reveals a conserved putative G-quadruplex-Hairpin-Duplex switch.

PubMed

Kaplan, Oktay I; Berber, Burak; Hekim, Nezih; Doluca, Osman

2016-11-02

Many studies show that short non-coding sequences are widely conserved among regulatory elements. More and more conserved sequences are being discovered since the development of next generation sequencing technology. A common approach to identify conserved sequences with regulatory roles relies on topological changes such as hairpin formation at the DNA or RNA level. G-quadruplexes, non-canonical nucleic acid topologies with little established biological roles, are increasingly considered for conserved regulatory element discovery. Since the tertiary structure of G-quadruplexes is strongly dependent on the loop sequence which is disregarded by the generally accepted algorithm, we hypothesized that G-quadruplexes with similar topology and, indirectly, similar interaction patterns, can be determined using phylogenetic clustering based on differences in the loop sequences. Phylogenetic analysis of 52 G-quadruplex forming sequences in the Escherichia coli genome revealed two conserved G-quadruplex motifs with a potential regulatory role. Further analysis revealed that both motifs tend to form hairpins and G quadruplexes, as supported by circular dichroism studies. The phylogenetic analysis as described in this work can greatly improve the discovery of functional G-quadruplex structures and may explain unknown regulatory patterns. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
The production of Multiple Small Peptaibol Families by Single 14-Module Peptide Synthetases in Trichoderma/Hypocrea

DOE Office of Scientific and Technical Information (OSTI.GOV)

Degenkolb, Thomas; Aghchehb, Razieh Karimi; Dieckmann, Ralf

2012-03-01

The most common peptaibibiotic structures are 11-residue peptaibols found widely distributed in the genus Trichoderma/Hypocrea. Frequently associated are 14-residue peptaibols sharing partial sequence identity. Genome sequencing projects of 3 Trichoderma strains of the major clades reveal the presence of up to 3 types of nonribosomal peptide synthetases with 7, 14, or 18-20 amino acid adding modules. We here provide evidence that the 14-module NRPS type found in T. virens, T. reesei (teleomorph Hypocrea jecorina) and T. atroviride produces both 11- and 14- residue peptaibols based on the disruption of the respective NRPS gene of T. reesei, and bioinformatic analysis ofmore » their amino acid activating domains and modules. The structures of these peptides may be predicted from the gene structures and have been confirmed by analysis of families of 11- and 14-residue peptaibols from the strain 618, termed hypojecorins A (23 sequences determined, 4 new) and B (3 new sequences), and the recently established trichovirins A from T. virens. The distribution of 11- and 14-residue products is strain-specific and depends on growth conditions as well. Possible mechanisms of module skipping are discussed.« less
Inability of Prevotella bryantii to Form a Functional Shine-Dalgarno Interaction Reflects Unique Evolution of Ribosome Binding Sites in Bacteroidetes

PubMed Central

Accetto, Tomaž; Avguštin, Gorazd

2011-01-01

The Shine-Dalgarno (SD) sequence is a key element directing the translation to initiate at the authentic start codons and also enabling translation initiation to proceed in 5′ untranslated mRNA regions (5′-UTRs) containing moderately strong secondary structures. Bioinformatic analysis of almost forty genomes from the major bacterial phylum Bacteroidetes revealed, however, a general absence of SD sequence, drop in GC content and consequently reduced tendency to form secondary structures in 5′-UTRs. The experiments using the Prevotella bryantii TC1-1 expression system were in agreement with these findings: neither addition nor omission of SD sequence in the unstructured 5′-UTR affected the level of the reporter protein, non-specific nuclease NucB. Further, NucB level in P. bryantii TC1-1, contrary to hMGFP level in Escherichia coli, was five times lower when SD sequence formed part of the secondary structure with a folding energy -5,2 kcal/mol. Also, the extended SD sequences did not affect protein levels as in E. coli. It seems therefore that a functional SD interaction does not take place during the translation initiation in P. bryanttii TC1-1 and possibly other members of phylum Bacteroidetes although the anti SD sequence is present in 16S rRNA genes of their genomes. We thus propose that in the absence of the SD sequence interaction, the selection of genuine start codons in Bacteroidetes is accomplished by binding of ribosomal protein S1 to unstructured 5′-UTR as opposed to coding region which is inaccessible due to mRNA secondary structure. Additionally, we found that sequence logos of region preceding the start codons may be used as taxonomical markers. Depending on whether complete sequence logo or only part of it, such as information content and base proportion at specific positions, is used, bacterial genera or families and in some cases even bacterial phyla can be distinguished. PMID:21857964
Selection of homeotic proteins for binding to a human DNA replication origin.

PubMed

de Stanchina, E; Gabellini, D; Norio, P; Giacca, M; Peverali, F A; Riva, S; Falaschi, A; Biamonti, G

2000-06-09

We have previously shown that a cell cycle-dependent nucleoprotein complex assembles in vivo on a 74 bp sequence within the human DNA replication origin associated to the Lamin B2 gene. Here, we report the identification, using a one-hybrid screen in yeast, of three proteins interacting with the 74 bp sequence. All of them, namely HOXA13, HOXC10 and HOXC13, are orthologues of the Abdominal-B gene of Drosophila melanogaster and are members of the homeogene family of developmental regulators. We describe the complete open reading frame sequence of HOXC10 and HOXC13 along with the structure of the HoxC13 gene. The specificity of binding of these two proteins to the Lamin B2 origin is confirmed by both band-shift and in vitro footprinting assays. In addition, the ability of HOXC10 and HOXC13 to increase the activity of a promoter containing the 74 bp sequence, as assayed by CAT-assay experiments, demonstrates a direct interaction of these homeoproteins with the origin sequence in mammalian cells. We also show that HOXC10 expression is cell-type-dependent and positively correlates with cell proliferation. Copyright 2000 Academic Press.
Sequence features of viral and human Internal Ribosome Entry Sites predictive of their activity

PubMed Central

Elias-Kirma, Shani; Nir, Ronit; Segal, Eran

2017-01-01

Translation of mRNAs through Internal Ribosome Entry Sites (IRESs) has emerged as a prominent mechanism of cellular and viral initiation. It supports cap-independent translation of select cellular genes under normal conditions, and in conditions when cap-dependent translation is inhibited. IRES structure and sequence are believed to be involved in this process. However due to the small number of IRESs known, there have been no systematic investigations of the determinants of IRES activity. With the recent discovery of thousands of novel IRESs in human and viruses, the next challenge is to decipher the sequence determinants of IRES activity. We present the first in-depth computational analysis of a large body of IRESs, exploring RNA sequence features predictive of IRES activity. We identified predictive k-mer features resembling IRES trans-acting factor (ITAF) binding motifs across human and viral IRESs, and found that their effect on expression depends on their sequence, number and position. Our results also suggest that the architecture of retroviral IRESs differs from that of other viruses, presumably due to their exposure to the nuclear environment. Finally, we measured IRES activity of synthetically designed sequences to confirm our prediction of increasing activity as a function of the number of short IRES elements. PMID:28922394
High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.

PubMed

Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie

2015-06-17

High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal to noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements display unique size signatures. We show how cell-free fragments occur in clusters along the genome, localizing to nucleosomal arrays and are preferentially cleaved at linker regions by correlating the mapping locations of these fragments with ENCODE annotation of chromatin organization. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site show a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
History dependence in insect flight decisions during odor tracking.

PubMed

Pang, Rich; van Breugel, Floris; Dickinson, Michael; Riffell, Jeffrey A; Fairhall, Adrienne

2018-02-01

Natural decision-making often involves extended decision sequences in response to variable stimuli with complex structure. As an example, many animals follow odor plumes to locate food sources or mates, but turbulence breaks up the advected odor signal into intermittent filaments and puffs. This scenario provides an opportunity to ask how animals use sparse, instantaneous, and stochastic signal encounters to generate goal-oriented behavioral sequences. Here we examined the trajectories of flying fruit flies (Drosophila melanogaster) and mosquitoes (Aedes aegypti) navigating in controlled plumes of attractive odorants. While it is known that mean odor-triggered flight responses are dominated by upwind turns, individual responses are highly variable. We asked whether deviations from mean responses depended on specific features of odor encounters, and found that odor-triggered turns were slightly but significantly modulated by two features of odor encounters. First, encounters with higher concentrations triggered stronger upwind turns. Second, encounters occurring later in a sequence triggered weaker upwind turns. To contextualize the latter history dependence theoretically, we examined trajectories simulated from three normative tracking strategies. We found that neither a purely reactive strategy nor a strategy in which the tracker learned the plume centerline over time captured the observed history dependence. In contrast, "infotaxis", in which flight decisions maximized expected information gain about source location, exhibited a history dependence aligned in sign with the data, though much larger in magnitude. These findings suggest that while true plume tracking is dominated by a reactive odor response it might also involve a history-dependent modulation of responses consistent with the accumulation of information about a source over multi-encounter timescales. This suggests that short-term memory processes modulating decision sequences may play a role in natural plume tracking.
History dependence in insect flight decisions during odor tracking

PubMed Central

van Breugel, Floris; Dickinson, Michael; Riffell, Jeffrey A.; Fairhall, Adrienne

2018-01-01

Natural decision-making often involves extended decision sequences in response to variable stimuli with complex structure. As an example, many animals follow odor plumes to locate food sources or mates, but turbulence breaks up the advected odor signal into intermittent filaments and puffs. This scenario provides an opportunity to ask how animals use sparse, instantaneous, and stochastic signal encounters to generate goal-oriented behavioral sequences. Here we examined the trajectories of flying fruit flies (Drosophila melanogaster) and mosquitoes (Aedes aegypti) navigating in controlled plumes of attractive odorants. While it is known that mean odor-triggered flight responses are dominated by upwind turns, individual responses are highly variable. We asked whether deviations from mean responses depended on specific features of odor encounters, and found that odor-triggered turns were slightly but significantly modulated by two features of odor encounters. First, encounters with higher concentrations triggered stronger upwind turns. Second, encounters occurring later in a sequence triggered weaker upwind turns. To contextualize the latter history dependence theoretically, we examined trajectories simulated from three normative tracking strategies. We found that neither a purely reactive strategy nor a strategy in which the tracker learned the plume centerline over time captured the observed history dependence. In contrast, “infotaxis”, in which flight decisions maximized expected information gain about source location, exhibited a history dependence aligned in sign with the data, though much larger in magnitude. These findings suggest that while true plume tracking is dominated by a reactive odor response it might also involve a history-dependent modulation of responses consistent with the accumulation of information about a source over multi-encounter timescales. This suggests that short-term memory processes modulating decision sequences may play a role in natural plume tracking. PMID:29432454
Stepwise Mechanism of Temperature-Dependent Coacervation of the Elastin-like Peptide Analogue Dimer, (C(WPGVG)3)2.

PubMed

Tatsubo, Daiki; Suyama, Keitaro; Miyazaki, Masaya; Maeda, Iori; Nose, Takeru

2018-03-13

Elastin-like peptides (ELPs) are distinct, repetitive, hydrophobic sequences, such as (VPGVG) n , that exhibit coacervation, the property of reversible, temperature-dependent self-association and dissociation. ELPs can be found in elastin and have been developed as new scaffold biomaterials. However, the detailed relationship between their amino acid sequences and coacervation properties remains obscure because of the structural flexibility of ELPs. In this study, we synthesized a novel, dimeric ELP analogue (H-C(WPGVG) 3 -NH 2 ) 2 , henceforth abbreviated (CW3)2, and analyzed its self-assembly properties and structural factors as indicators of coacervation. Turbidity measurements showed that (CW3)2 demonstrated coacervation at a concentration much lower than that of its monomeric form and another ELP. In addition, the coacervate held water-soluble dye molecules. Thus, potent and distinct coacervation was obtained with a remarkably short sequence of (CW3)2. Furthermore, fluorescence microscopy, dynamic light scattering, and optical microscopy revealed that the coacervation of (CW3)2 was a stepwise process. The structural factors of (CW3)2 were analyzed by molecular dynamics simulations and circular dichroism spectroscopy. These measurements indicated that helical structures primarily consisting of proline and glycine became more disordered at high temperatures with concurrent, significant exposure of their hydrophobic surfaces. This extreme change in the hydrophobic surface contributes to the potent coacervation observed for (CW3)2. These results provide important insights into more efficient applications of ELPs and their analogues, as well as the coacervation mechanisms of ELP and elastin.
Insights into the Aggregation Mechanism of PolyQ Proteins with Different Glutamine Repeat Lengths.

PubMed

Yushchenko, Tetyana; Deuerling, Elke; Hauser, Karin

2018-04-24

Polyglutamine (polyQ) diseases, including Huntington's disease, result from the aggregation of an abnormally expanded polyQ repeat in the affected protein. The length of the polyQ repeat is essential for the disease's onset; however, the molecular mechanism of polyQ aggregation is still poorly understood. Controlled conditions and initiation of the aggregation process are prerequisites for the detection of transient intermediate states. We present an attenuated total reflection Fourier-transform infrared spectroscopic approach combined with protein immobilization to study polyQ aggregation dependent on the polyQ length. PolyQ proteins were engineered mimicking the mammalian N-terminus fragment of the Huntingtin protein and containing a polyQ sequence with the number of glutamines below (Q11), close to (Q38), and above (Q56) the disease threshold. A monolayer of the polyQ construct was chemically immobilized on the internal reflection element of the attenuated total reflection cell, and the aggregation was initiated via enzymatic cleavage. Structural changes of the polyQ sequence were monitored by time-resolved infrared difference spectroscopy. We observed faster aggregation kinetics for the longer sequences, and furthermore, we could distinguish β-structured intermediates for the different constructs, allowing us to propose aggregation mechanisms dependent on the repeat length. Q11 forms a β-structured aggregate by intermolecular interaction of stretched monomers, whereas Q38 and Q56 undergo conformational changes to various β-structured intermediates, including intramolecular β-sheets. Copyright © 2018 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Comparison of topological clustering within protein networks using edge metrics that evaluate full sequence, full structure, and active site microenvironment similarity.

PubMed

Leuthaeuser, Janelle B; Knutson, Stacy T; Kumar, Kiran; Babbitt, Patricia C; Fetrow, Jacquelyn S

2015-09-01

The development of accurate protein function annotation methods has emerged as a major unsolved biological problem. Protein similarity networks, one approach to function annotation via annotation transfer, group proteins into similarity-based clusters. An underlying assumption is that the edge metric used to identify such clusters correlates with functional information. In this contribution, this assumption is evaluated by observing topologies in similarity networks using three different edge metrics: sequence (BLAST), structure (TM-Align), and active site similarity (active site profiling, implemented in DASP). Network topologies for four well-studied protein superfamilies (enolase, peroxiredoxin (Prx), glutathione transferase (GST), and crotonase) were compared with curated functional hierarchies and structure. As expected, network topology differs, depending on edge metric; comparison of topologies provides valuable information on structure/function relationships. Subnetworks based on active site similarity correlate with known functional hierarchies at a single edge threshold more often than sequence- or structure-based networks. Sequence- and structure-based networks are useful for identifying sequence and domain similarities and differences; therefore, it is important to consider the clustering goal before deciding appropriate edge metric. Further, conserved active site residues identified in enolase and GST active site subnetworks correspond with published functionally important residues. Extension of this analysis yields predictions of functionally determinant residues for GST subgroups. These results support the hypothesis that active site similarity-based networks reveal clusters that share functional details and lay the foundation for capturing functionally relevant hierarchies using an approach that is both automatable and can deliver greater precision in function annotation than current similarity-based methods. © 2015 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Comparison of topological clustering within protein networks using edge metrics that evaluate full sequence, full structure, and active site microenvironment similarity

PubMed Central

Leuthaeuser, Janelle B; Knutson, Stacy T; Kumar, Kiran; Babbitt, Patricia C; Fetrow, Jacquelyn S

2015-01-01

The development of accurate protein function annotation methods has emerged as a major unsolved biological problem. Protein similarity networks, one approach to function annotation via annotation transfer, group proteins into similarity-based clusters. An underlying assumption is that the edge metric used to identify such clusters correlates with functional information. In this contribution, this assumption is evaluated by observing topologies in similarity networks using three different edge metrics: sequence (BLAST), structure (TM-Align), and active site similarity (active site profiling, implemented in DASP). Network topologies for four well-studied protein superfamilies (enolase, peroxiredoxin (Prx), glutathione transferase (GST), and crotonase) were compared with curated functional hierarchies and structure. As expected, network topology differs, depending on edge metric; comparison of topologies provides valuable information on structure/function relationships. Subnetworks based on active site similarity correlate with known functional hierarchies at a single edge threshold more often than sequence- or structure-based networks. Sequence- and structure-based networks are useful for identifying sequence and domain similarities and differences; therefore, it is important to consider the clustering goal before deciding appropriate edge metric. Further, conserved active site residues identified in enolase and GST active site subnetworks correspond with published functionally important residues. Extension of this analysis yields predictions of functionally determinant residues for GST subgroups. These results support the hypothesis that active site similarity-based networks reveal clusters that share functional details and lay the foundation for capturing functionally relevant hierarchies using an approach that is both automatable and can deliver greater precision in function annotation than current similarity-based methods. PMID:26073648
RNA Dependent RNA Polymerases: Insights from Structure, Function and Evolution.

PubMed

Venkataraman, Sangita; Prasad, Burra V L S; Selvarajan, Ramasamy

2018-02-10

RNA dependent RNA polymerase (RdRp) is one of the most versatile enzymes of RNA viruses that is indispensable for replicating the genome as well as for carrying out transcription. The core structural features of RdRps are conserved, despite the divergence in their sequences. The structure of RdRp resembles that of a cupped right hand and consists of fingers, palm and thumb subdomains. The catalysis involves the participation of conserved aspartates and divalent metal ions. Complexes of RdRps with substrates, inhibitors and metal ions provide a comprehensive view of their functional mechanism and offer valuable insights regarding the development of antivirals. In this article, we provide an overview of the structural aspects of RdRps and their complexes from the Group III, IV and V viruses and their structure-based phylogeny.
RNA Dependent RNA Polymerases: Insights from Structure, Function and Evolution

PubMed Central

Venkataraman, Sangita; Prasad, Burra V L S; Selvarajan, Ramasamy

2018-01-01

RNA dependent RNA polymerase (RdRp) is one of the most versatile enzymes of RNA viruses that is indispensable for replicating the genome as well as for carrying out transcription. The core structural features of RdRps are conserved, despite the divergence in their sequences. The structure of RdRp resembles that of a cupped right hand and consists of fingers, palm and thumb subdomains. The catalysis involves the participation of conserved aspartates and divalent metal ions. Complexes of RdRps with substrates, inhibitors and metal ions provide a comprehensive view of their functional mechanism and offer valuable insights regarding the development of antivirals. In this article, we provide an overview of the structural aspects of RdRps and their complexes from the Group III, IV and V viruses and their structure-based phylogeny. PMID:29439438
The Structure of the RNA-Dependent RNA Polymerase of a Permutotetravirus Suggests a Link between Primer-Dependent and Primer-Independent Polymerases

PubMed Central

Ferrero, Diego S.; Buxaderas, Mònica; Rodríguez, José F.; Verdaguer, Núria

2015-01-01

Thosea asigna virus (TaV), an insect virus belonging to the Permutatetraviridae family, has a positive-sense single-stranded RNA (ssRNA) genome with two overlapping open reading frames, encoding for the replicase and capsid proteins. The particular TaV replicase includes a structurally unique RNA-dependent RNA polymerase (RdRP) with a sequence permutation in the palm sub-domain, where the active site is anchored. This non-canonical arrangement of the RdRP palm is also found in double-stranded RNA viruses of the Birnaviridae family. Both virus families also share a conserved VPg sequence motif at the polymerase N-terminus which in birnaviruses appears to be used to covalently link a fraction of the replicase molecules to the 5’-end of the genomic segments. Birnavirus VPgs are presumed to be used as primers for replication initiation. Here we have solved the crystal structure of the TaV RdRP, the first non-canonical RdRP of a ssRNA virus, in its apo- form and bound to different substrates. The enzyme arranges as a stable dimer maintained by mutual interactions between the active site cleft of one molecule and the flexible N-terminal tail of the symmetrically related RdRP. The latter, partially mimicking the RNA template backbone, is involved in regulating the polymerization activity. As expected from previous sequence-based bioinformatics predictions, the overall architecture of the TaV enzyme shows important resemblances with birnavirus polymerases. In addition, structural comparisons and biochemical analyses reveal unexpected similarities between the TaV RdRP and those of Flaviviruses. In particular, a long loop protruding from the thumb domain towards the central enzyme cavity appears to act as a platform for de novo initiation of RNA replication. Our findings strongly suggest an unexpected evolutionary relationship between the RdRPs encoded by these distant ssRNA virus groups. PMID:26625123
A galaxy of folds.

PubMed

Alva, Vikram; Remmert, Michael; Biegert, Andreas; Lupas, Andrei N; Söding, Johannes

2010-01-01

Many protein classification systems capture homologous relationships by grouping domains into families and superfamilies on the basis of sequence similarity. Superfamilies with similar 3D structures are further grouped into folds. In the absence of discernable sequence similarity, these structural similarities were long thought to have originated independently, by convergent evolution. However, the growth of databases and advances in sequence comparison methods have led to the discovery of many distant evolutionary relationships that transcend the boundaries of superfamilies and folds. To investigate the contributions of convergent versus divergent evolution in the origin of protein folds, we clustered representative domains of known structure by their sequence similarity, treating them as point masses in a virtual 2D space which attract or repel each other depending on their pairwise sequence similarities. As expected, families in the same superfamily form tight clusters. But often, superfamilies of the same fold are linked with each other, suggesting that the entire fold evolved from an ancient prototype. Strikingly, some links connect superfamilies with different folds. They arise from modular peptide fragments of between 20 and 40 residues that co-occur in the connected folds in disparate structural contexts. These may be descendants of an ancestral pool of peptide modules that evolved as cofactors in the RNA world and from which the first folded proteins arose by amplification and recombination. Our galaxy of folds summarizes, in a single image, most known and many yet undescribed homologous relationships between protein superfamilies, providing new insights into the evolution of protein domains.
Nucleosome Positioning and Epigenetics

NASA Astrophysics Data System (ADS)

Schwab, David; Bruinsma, Robijn

2008-03-01

The role of chromatin structure in gene regulation has recently taken center stage in the field of epigenetics, phenomena that change the phenotype without changing the DNA sequence. Recent work has also shown that nucleosomes, a complex of DNA wrapped around a histone octamer, experience a sequence dependent energy landscape due to the variation in DNA bend stiffness with sequence composition. In this talk, we consider the role nucleosome positioning might play in the formation of heterochromatin, a compact form of DNA generically responsible for gene silencing. In particular, we discuss how different patterns of nucleosome positions, periodic or random, could either facilitate or suppress heterochromatin stability and formation.

Using the self-select paradigm to delineate the nature of speech motor programming.

PubMed

Wright, David L; Robin, Don A; Rhee, Jooyhun; Vaculin, Amber; Jacks, Adam; Guenther, Frank H; Fox, Peter T

2009-06-01

The authors examined the involvement of 2 speech motor programming processes identified by S. T. Klapp (1995, 2003) during the articulation of utterances differing in syllable and sequence complexity. According to S. T. Klapp, 1 process, INT, resolves the demands of the programmed unit, whereas a second process, SEQ, oversees the serial order demands of longer sequences. A modified reaction time paradigm was used to assess INT and SEQ demands. Specifically, syllable complexity was dependent on syllable structure, whereas sequence complexity involved either repeated or unique syllabi within an utterance. INT execution was slowed when articulating single syllables in the form CCCV compared to simpler CV syllables. Planning unique syllables within a multisyllabic utterance rather than repetitions of the same syllable slowed INT but not SEQ. The INT speech motor programming process, important for mental syllabary access, is sensitive to changes in both syllable structure and the number of unique syllables in an utterance.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Ma, Xiang; Zhang, Shuai; Jiao, Fang

Two-step nucleation pathways in which disordered, amorphous, or dense liquid states precede appearance of crystalline phases have been reported for a wide range of materials, but the dynamics of such pathways are poorly understood. Moreover, whether these pathways are general features of crystallizing systems or a consequence of system-specific structural details that select for direct vs two-step processes is unknown. Using atomic force microscopy to directly observe crystallization of sequence-defined polymers, we show that crystallization pathways are indeed sequence dependent. When a short hydrophobic region is added to a sequence that directly forms crystalline particles, crystallization instead follows a two-stepmore » pathway that begins with creation of disordered clusters of 10-20 molecules and is characterized by highly non-linear crystallization kinetics in which clusters transform into ordered structures that then enter the growth phase. The results shed new light on non-classical crystallization mechanisms and have implications for design of self-assembling polymer systems.« less
Laser-induced periodic surface structures on zinc oxide crystals upon two-colour femtosecond double-pulse irradiation

NASA Astrophysics Data System (ADS)

Höhm, S.; Rosenfeld, A.; Krüger, J.; Bonse, J.

2017-03-01

In order to study the temporally distributed energy deposition in the formation of laser-induced periodic surface structures (LIPSS) on single-crystalline zinc oxide (ZnO), two-colour double-fs-pulse experiments were performed. Parallel or cross-polarised double-pulse sequences at 400 and 800 nm wavelength were generated by a Mach-Zehnder interferometer, exhibiting inter-pulse delays up to a few picoseconds between the sub-ablation 50-fs-pulses. Twenty two-colour double-pulse sequences were collinearly focused by a spherical mirror to the sample surface. The resulting LIPSS periods and areas were analysed by scanning electron microscopy. The delay-dependence of these LIPSS characteristics shows a dissimilar behaviour when compared to the semiconductor silicon, the dielectric fused silica, or the metal titanium. A wavelength-dependent plasmonic mechanism is proposed to explain the delay-dependence of the LIPSS on ZnO when considering multi-photon excitation processes. Our results support the involvement of nonlinear processes for temporally overlapping pulses. These experiments extend previous two-colour studies on the indirect semiconductor silicon towards the direct wide band-gap semiconductor ZnO and further manifest the relevance of the ultrafast energy deposition for LIPSS formation.
The Shine-Dalgarno sequence of riboswitch-regulated single mRNAs shows ligand-dependent accessibility bursts

NASA Astrophysics Data System (ADS)

Rinaldi, Arlie J.; Lund, Paul E.; Blanco, Mario R.; Walter, Nils G.

2016-01-01

In response to intracellular signals in Gram-negative bacteria, translational riboswitches--commonly embedded in messenger RNAs (mRNAs)--regulate gene expression through inhibition of translation initiation. It is generally thought that this regulation originates from occlusion of the Shine-Dalgarno (SD) sequence upon ligand binding; however, little direct evidence exists. Here we develop Single Molecule Kinetic Analysis of RNA Transient Structure (SiM-KARTS) to investigate the ligand-dependent accessibility of the SD sequence of an mRNA hosting the 7-aminomethyl-7-deazaguanine (preQ1)-sensing riboswitch. Spike train analysis reveals that individual mRNA molecules alternate between two conformational states, distinguished by `bursts' of probe binding associated with increased SD sequence accessibility. Addition of preQ1 decreases the lifetime of the SD's high-accessibility (bursting) state and prolongs the time between bursts. In addition, ligand-jump experiments reveal imperfect riboswitching of single mRNA molecules. Such complex ligand sensing by individual mRNA molecules rationalizes the nuanced ligand response observed during bulk mRNA translation.
Using DNA mechanics to predict in vitro nucleosome positions and formation energies

PubMed Central

Morozov, Alexandre V.; Fortney, Karissa; Gaykalova, Daria A.; Studitsky, Vasily M.; Widom, Jonathan; Siggia, Eric D.

2009-01-01

In eukaryotic genomes, nucleosomes function to compact DNA and to regulate access to it both by simple physical occlusion and by providing the substrate for numerous covalent epigenetic tags. While competition with other DNA-binding factors and action of chromatin remodeling enzymes significantly affect nucleosome formation in vivo, nucleosome positions in vitro are determined by steric exclusion and sequence alone. We have developed a biophysical model, DNABEND, for the sequence dependence of DNA bending energies, and validated it against a collection of in vitro free energies of nucleosome formation and a set of in vitro nucleosome positions mapped at high resolution. We have also made a first ab initio prediction of nucleosomal DNA geometries, and checked its accuracy against the nucleosome crystal structure. We have used DNABEND to design both strong and weak histone- binding sequences, and measured the corresponding free energies of nucleosome formation. We find that DNABEND can successfully predict in vitro nucleosome positions and free energies, providing a physical explanation for the intrinsic sequence dependence of histone–DNA interactions. PMID:19509309
Robust analysis of semiparametric renewal process models

PubMed Central

Lin, Feng-Chang; Truong, Young K.; Fine, Jason P.

2013-01-01

Summary A rate model is proposed for a modulated renewal process comprising a single long sequence, where the covariate process may not capture the dependencies in the sequence as in standard intensity models. We consider partial likelihood-based inferences under a semiparametric multiplicative rate model, which has been widely studied in the context of independent and identical data. Under an intensity model, gap times in a single long sequence may be used naively in the partial likelihood with variance estimation utilizing the observed information matrix. Under a rate model, the gap times cannot be treated as independent and studying the partial likelihood is much more challenging. We employ a mixing condition in the application of limit theory for stationary sequences to obtain consistency and asymptotic normality. The estimator's variance is quite complicated owing to the unknown gap times dependence structure. We adapt block bootstrapping and cluster variance estimators to the partial likelihood. Simulation studies and an analysis of a semiparametric extension of a popular model for neural spike train data demonstrate the practical utility of the rate approach in comparison with the intensity approach. PMID:24550568
Peptide-Directed PdAu Nanoscale Surface Segregation: Toward Controlled Bimetallic Architecture for Catalytic Materials

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bedford, Nicholas M.; Showalter, Allison R.; Woehl, Taylor J.

Bimetallic nanoparticles are of immense scientific and technological interest given the synergistic properties observed when mixing two different metallic species at the nanoscale. This is particularly prevalent in catalysis, where bimetallic nanoparticles often exhibit improved catalytic activity and durability over their monometallic counterparts. Yet despite intense research efforts, little is understood regarding how to optimize bimetallic surface composition and structure synthetically using rational design principles. Recently, it has been demonstrated that peptide-enabled routes for nanoparticle synthesis result in materials with sequence-dependent catalytic properties, providing an opportunity for rational design through sequence manipulation. In this study, bimetallic PdAu nanoparticles are synthesizedmore » with a small set of peptides containing known Pd and Au binding motifs. The resulting nanoparticles were extensively characterized using high-resolution scanning transmission electron microscopy, X-ray absorption spectroscopy and high-energy X-ray diffraction coupled to atomic pair distribution function analysis. Structural information obtained from synchrotron radiation methods were then used to generate model nanoparticle configurations using reverse Monte Carlo simulations, which illustrate sequence-dependence in both surface structure and surface composition. Replica exchange solute tempering molecular dynamic simulations were also used to predict the modes of peptide binding on monometallic surfaces, indicating that different sequences bind to the metal interfaces via different mechanisms. As a testbed reaction, electrocatalytic methanol oxidation experiments were performed, wherein differences in catalytic activity are clearly observed in materials with identical bimetallic composition. Finally, taken together, this study indicates that peptides could be used to arrive at bimetallic surfaces with enhanced catalytic properties, which could be leveraged for rational bimetallic nanoparticle design using peptide-enabled approaches.« less
Peptide-Directed PdAu Nanoscale Surface Segregation: Toward Controlled Bimetallic Architecture for Catalytic Materials

DOE PAGES

Bedford, Nicholas M.; Showalter, Allison R.; Woehl, Taylor J.; ...

2016-09-01

Bimetallic nanoparticles are of immense scientific and technological interest given the synergistic properties observed when mixing two different metallic species at the nanoscale. This is particularly prevalent in catalysis, where bimetallic nanoparticles often exhibit improved catalytic activity and durability over their monometallic counterparts. Yet despite intense research efforts, little is understood regarding how to optimize bimetallic surface composition and structure synthetically using rational design principles. Recently, it has been demonstrated that peptide-enabled routes for nanoparticle synthesis result in materials with sequence-dependent catalytic properties, providing an opportunity for rational design through sequence manipulation. In this study, bimetallic PdAu nanoparticles are synthesizedmore » with a small set of peptides containing known Pd and Au binding motifs. The resulting nanoparticles were extensively characterized using high-resolution scanning transmission electron microscopy, X-ray absorption spectroscopy and high-energy X-ray diffraction coupled to atomic pair distribution function analysis. Structural information obtained from synchrotron radiation methods were then used to generate model nanoparticle configurations using reverse Monte Carlo simulations, which illustrate sequence-dependence in both surface structure and surface composition. Replica exchange solute tempering molecular dynamic simulations were also used to predict the modes of peptide binding on monometallic surfaces, indicating that different sequences bind to the metal interfaces via different mechanisms. As a testbed reaction, electrocatalytic methanol oxidation experiments were performed, wherein differences in catalytic activity are clearly observed in materials with identical bimetallic composition. Finally, taken together, this study indicates that peptides could be used to arrive at bimetallic surfaces with enhanced catalytic properties, which could be leveraged for rational bimetallic nanoparticle design using peptide-enabled approaches.« less
Chiral Differentiation of DNA Adducts Formed by Enantiomeric Analogues of Antitumor Cisplatin Is Sequence-Dependent

PubMed Central

Delalande, Olivier; Malina, Jaroslav; Brabec, Viktor; Kozelka, Jiří

2005-01-01

1,2-GG intrastrand cross-links formed in DNA by the enantiomeric complexes [PtCl2(R,R-2,3-diaminobutane (DAB))] and [PtCl2(S,S-DAB)] were studied by biophysical methods. Molecular modeling revealed that structure of the cross-links formed at the TGGT sequence was affected by repulsion between the 5′-directed methyl group of the DAB ligand and the methyl group of the 5′-thymine of the TGGT fragment. Molecular dynamics simulations of the solvated platinated duplexes and our recent structural data indicated that the adduct of [PtCl2(R,R-DAB)] alleviated this repulsion by unwinding the TpG step, whereas the adduct of [PtCl2(S,S-DAB)] avoided the unfavorable methyl-methyl interaction by decreasing the kink angle. Electrophoretic retardation measurements on DNA duplexes containing 1,2-GG intrastrand cross-links of Pt(R,R-DAB)2+ or Pt(S,S-DAB)2+ at a CGGA site showed that in this sequence both enantiomers distorted the double helix to the identical extent similar to that found previously for the same sequence containing the cross-links of the parent antitumor \\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} \\begin{equation*}cis-{\\mathrm{Pt}}({\\mathrm{NH}}_{3})_{2}^{2+}\\end{equation*}\\end{document} (cisplatin). In addition, the adducts showed similar affinities toward the high-mobility-group box 1 proteins. Hence, whereas the structural perturbation induced in DNA by 1,2-GG intrastrand cross-links of cisplatin does not depend largely on the bases flanking the cross-links, the perturbation related to GG cross-linking by bulkier platinum diamine derivatives does. PMID:15805172
microRNA-122 target sites in the hepatitis C virus RNA NS5B coding region and 3' untranslated region: function in replication and influence of RNA secondary structure.

PubMed

Gerresheim, Gesche K; Dünnes, Nadia; Nieder-Röhrmann, Anika; Shalamova, Lyudmila A; Fricke, Markus; Hofacker, Ivo; Höner Zu Siederdissen, Christian; Marz, Manja; Niepmann, Michael

2017-02-01

We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.
50 years of DNA ‘Breathing’: Reflections on Old and New Approaches

PubMed Central

von Hippel, Peter H.; Johnson, Neil P.; Marcus, Andrew H.

2015-01-01

Summary The coding sequences for genes, and much other regulatory information involved in genome expression, are located ‘inside’ the DNA duplex. Thus the ‘macromolecular machines’ that read-out this information from the base sequence of the DNA must somehow access the DNA ‘interior’. Double-stranded (ds) DNA is a highly structured and cooperatively stabilized system at physiological temperatures, but is also only marginally stable and undergoes a cooperative ‘melting phase transition’ at temperatures not far above physiological. Furthermore, due to its length and heterogeneous sequence, with AT-rich segments being less stable than GC-rich segments, the DNA genome ‘melts’ in a multistate fashion. Therefore the DNA genome must also manifest thermally driven structural (‘breathing’) fluctuations at physiological temperatures that should reflect the heterogeneity of the dsDNA stability near the melting temperature. Thus many of the breathing fluctuations of dsDNA are likely also to be sequence dependent, and could well contain information that should be ‘readable’ and useable by regulatory proteins and protein complexes in site-specific binding reactions involving dsDNA ‘opening’. Our laboratory has been involved in studying the breathing fluctuations of duplex DNA for about 50 years. In this ‘Reflections’ article we present a relatively chronological overview of these studies, starting with the use of simple chemical probes (such as hydrogen exchange, formaldehyde and simple DNA ‘melting’ proteins) to examine the local stability of the dsDNA structure, and culminating in sophisticated spectroscopic approaches that can be used to monitor the breathing-dependent interactions of regulatory complexes with their duplex DNA targets in ‘real time’. PMID:23840028
Propensities of peptides containing the Asn-Gly segment to form β-turn and β-hairpin structures.

PubMed

Kang, Young Kee; Yoo, In Kee

2016-09-01

The propensities of peptides that contain the Asn-Gly segment to form β-turn and β-hairpin structures were explored using the density functional methods and the implicit solvation model in CH2 Cl2 and water. The populations of preferred β-turn structures varied depending on the sequence and solvent polarity. In solution, β-hairpin structures with βI' turn motifs were most preferred for the heptapeptides containing the Asn-Gly segment regardless of the sequence of the strands. These preferences in solution are consistent with the corresponding X-ray structures. The sequence, H-bond strengths, solvent polarity, and conformational flexibility appeared to interact to determine the preferred β-hairpin structure of each heptapeptide, although the β-turn segments played a role in promoting the formation of β-hairpin structures and the β-hairpin propensity varied. In the heptapeptides containing the Asn-Gly segment, the β-hairpin formation was enthalpically favored and entropically disfavored at 25°C in water. The calculated results for β-turns and β-hairpins containing the Asn-Gly segment imply that these structural preferences may be useful for the design of bioactive macrocyclic peptides containing β-hairpin mimics and the design of binding epitopes for protein-protein and protein-nucleic acid recognitions. © 2016 Wiley Periodicals, Inc. Biopolymers 105: 653-664, 2016. © 2016 Wiley Periodicals, Inc.
Ultrafast Pulse Sequencing for Fast Projective Measurements of Atomic Hyperfine Qubits

NASA Astrophysics Data System (ADS)

Ip, Michael; Ransford, Anthony; Campbell, Wesley

2015-05-01

Projective readout of quantum information stored in atomic hyperfine structure typically uses state-dependent CW laser-induced fluorescence. This method requires an often sophisticated imaging system to spatially filter out the background CW laser light. We present an alternative approach that instead uses simple pulse sequences from a mode-locked laser to affect the same state-dependent excitations in less than 1 ns. The resulting atomic fluorescence occurs in the dark, allowing the placement of non-imaging detectors right next to the atom to improve the qubit state detection efficiency and speed. We also discuss methods of Doppler cooling with mode-locked lasers for trapped ions, where the creation of the necessary UV light is often difficult with CW lasers.
Segmented-memory recurrent neural networks.

PubMed

Chen, Jinmiao; Chaudhari, Narendra S

2009-08-01

Conventional recurrent neural networks (RNNs) have difficulties in learning long-term dependencies. To tackle this problem, we propose an architecture called segmented-memory recurrent neural network (SMRNN). A symbolic sequence is broken into segments and then presented as inputs to the SMRNN one symbol per cycle. The SMRNN uses separate internal states to store symbol-level context, as well as segment-level context. The symbol-level context is updated for each symbol presented for input. The segment-level context is updated after each segment. The SMRNN is trained using an extended real-time recurrent learning algorithm. We test the performance of SMRNN on the information latching problem, the "two-sequence problem" and the problem of protein secondary structure (PSS) prediction. Our implementation results indicate that SMRNN performs better on long-term dependency problems than conventional RNNs. Besides, we also theoretically analyze how the segmented memory of SMRNN helps learning long-term temporal dependencies and study the impact of the segment length.
Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization.

PubMed

Girard, Laurie D; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G

2015-02-07

The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have high complexity and cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotide sequence-dependent segment and a unique "target sequence-independent" 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets.
Segmenting Dynamic Human Action via Statistical Structure

ERIC Educational Resources Information Center

Baldwin, Dare; Andersson, Annika; Saffran, Jenny; Meyer, Meredith

2008-01-01

Human social, cognitive, and linguistic functioning depends on skills for rapidly processing action. Identifying distinct acts within the dynamic motion flow is one basic component of action processing; for example, skill at segmenting action is foundational to action categorization, verb learning, and comprehension of novel action sequences. Yet…
DNA G-Wire Formation Using an Artificial Peptide is Controlled by Protease Activity.

PubMed

Usui, Kenji; Okada, Arisa; Sakashita, Shungo; Shimooka, Masayuki; Tsuruoka, Takaaki; Nakano, Shu-Ichi; Miyoshi, Daisuke; Mashima, Tsukasa; Katahira, Masato; Hamada, Yoshio

2017-11-16

The development of a switching system for guanine nanowire (G-wire) formation by external signals is important for nanobiotechnological applications. Here, we demonstrate a DNA nanostructural switch (G-wire <--> particles) using a designed peptide and a protease. The peptide consists of a PNA sequence for inducing DNA to form DNA-PNA hybrid G-quadruplex structures, and a protease substrate sequence acting as a switching module that is dependent on the activity of a particular protease. Micro-scale analyses via TEM and AFM showed that G-rich DNA alone forms G-wires in the presence of Ca 2+ , and that the peptide disrupted this formation, resulting in the formation of particles. The addition of the protease and digestion of the peptide regenerated the G-wires. Macro-scale analyses by DLS, zeta potential, CD, and gel filtration were in agreement with the microscopic observations. These results imply that the secondary structure change (DNA G-quadruplex <--> DNA/PNA hybrid structure) induces a change in the well-formed nanostructure (G-wire <--> particles). Our findings demonstrate a control system for forming DNA G-wire structures dependent on protease activity using designed peptides. Such systems hold promise for regulating the formation of nanowire for various applications, including electronic circuits for use in nanobiotechnologies.
Coevolutionary modeling of protein sequences: Predicting structure, function, and mutational landscapes

NASA Astrophysics Data System (ADS)

Weigt, Martin

Over the last years, biological research has been revolutionized by experimental high-throughput techniques, in particular by next-generation sequencing technology. Unprecedented amounts of data are accumulating, and there is a growing request for computational methods unveiling the information hidden in raw data, thereby increasing our understanding of complex biological systems. Statistical-physics models based on the maximum-entropy principle have, in the last few years, played an important role in this context. To give a specific example, proteins and many non-coding RNA show a remarkable degree of structural and functional conservation in the course of evolution, despite a large variability in amino acid sequences. We have developed a statistical-mechanics inspired inference approach - called Direct-Coupling Analysis - to link this sequence variability (easy to observe in sequence alignments, which are available in public sequence databases) to bio-molecular structure and function. In my presentation I will show, how this methodology can be used (i) to infer contacts between residues and thus to guide tertiary and quaternary protein structure prediction and RNA structure prediction, (ii) to discriminate interacting from non-interacting protein families, and thus to infer conserved protein-protein interaction networks, and (iii) to reconstruct mutational landscapes and thus to predict the phenotypic effect of mutations. References [1] M. Figliuzzi, H. Jacquier, A. Schug, O. Tenaillon and M. Weigt ''Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1'', Mol. Biol. Evol. (2015), doi: 10.1093/molbev/msv211 [2] E. De Leonardis, B. Lutz, S. Ratz, S. Cocco, R. Monasson, A. Schug, M. Weigt ''Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction'', Nucleic Acids Research (2015), doi: 10.1093/nar/gkv932 [3] F. Morcos, A. Pagnani, B. Lunt, A. Bertolino, D. Marks, C. Sander, R. Zecchina, J.N. Onuchic, T. Hwa, M. Weigt, ''Direct-coupling analysis of residue co-evolution captures native contacts across many protein families'', Proc. Natl. Acad. Sci. 108, E1293-E1301 (2011).
PreSSAPro: a software for the prediction of secondary structure by amino acid properties.

PubMed

Costantini, Susan; Colonna, Giovanni; Facchiano, Angelo M

2007-10-01

PreSSAPro is a software, available to the scientific community as a free web service designed to provide predictions of secondary structures starting from the amino acid sequence of a given protein. Predictions are based on our recently published work on the amino acid propensities for secondary structures in either large but not homogeneous protein data sets, as well as in smaller but homogeneous data sets corresponding to protein structural classes, i.e. all-alpha, all-beta, or alpha-beta proteins. Predictions result improved by the use of propensities evaluated for the right protein class. PreSSAPro predicts the secondary structure according to the right protein class, if known, or gives a multiple prediction with reference to the different structural classes. The comparison of these predictions represents a novel tool to evaluate what sequence regions can assume different secondary structures depending on the structural class assignment, in the perspective of identifying proteins able to fold in different conformations. The service is available at the URL http://bioinformatica.isa.cnr.it/PRESSAPRO/.
Structural requirements for recognition of the HLA-Dw14 class II epitope: A key HLA determinant associated with rheumatoid arthritis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hiraiwa, Akikazu; Yamanaka, Katsuo; Kwok, W.W.

Although HLA genes have been shown to be associated with certain diseases, the basis for this association is unknown. Recent studies, however, have documented patterns of nucleotide sequence variation among some HLA genes associated with a particular disease. For rheumatoid arthritis, HLA genes in most patients have a shared nucleotide sequence encoding a key structural element of an HLA class II polypeptide; this sequence element is critical for the interaction of the HLA molecule with antigenic peptides and with responding T cells, suggestive of a direct role for this sequence element in disease susceptibility. The authors describe the serological andmore » cellular immunologic characteristics encoded by this rheumatoid arthritis-associated sequence element. Site-directed mutagenesis of the DRB1 gene was used to define amino acids critical for antibody and T-cell recognition of this structural element, focusing on residues that distinguish the rheumatoid arthritis-associated alleles Dw4 and Dw14 from a closely related allele, Dw10, not associated with disease. Both the gain and loss of rheumatoid arthritis-associated epitopes were highly dependent on three residues within a discrete domain of the HLA-DR molecule. Recognition was most strongly influenced by the following amino acids (in order): 70 > 71 > 67. Some alloreactive T-cell clones were also influenced by amino acid variation in portions of the DR molecule lying outside the shared sequence element.« less

Temporal and Motor Representation of Rhythm in Fronto-Parietal Cortical Areas: An fMRI Study

PubMed Central

Konoike, Naho; Kotozaki, Yuka; Jeong, Hyeonjeong; Miyazaki, Atsuko; Sakaki, Kohei; Shinada, Takamitsu; Sugiura, Motoaki; Kawashima, Ryuta; Nakamura, Katsuki

2015-01-01

When sounds occur with temporally structured patterns, we can feel a rhythm. To memorize a rhythm, perception of its temporal patterns and organization of them into a hierarchically structured sequence are necessary. On the other hand, rhythm perception can often cause unintentional body movements. Thus, we hypothesized that rhythm information can be manifested in two different ways; temporal and motor representations. The motor representation depends on effectors, such as the finger or foot, whereas the temporal representation is effector-independent. We tested our hypothesis with a working memory paradigm to elucidate neuronal correlates of temporal or motor representation of rhythm and to reveal the neural networks associated with these representations. We measured brain activity by fMRI while participants memorized rhythms and reproduced them by tapping with the right finger, left finger, or foot, or by articulation. The right inferior frontal gyrus and the inferior parietal lobule exhibited significant effector-independent activations during encoding and retrieval of rhythm information, whereas the left inferior parietal lobule and supplementary motor area (SMA) showed effector-dependent activations during retrieval. These results suggest that temporal sequences of rhythm are probably represented in the right fronto-parietal network, whereas motor sequences of rhythm can be represented in the SMA-parietal network. PMID:26076024
Complete genomic sequence of an infectious pancreatic necrosis virus isolated from rainbow trout (Oncorhynchus mykiss) in China.

PubMed

Ji, Feng; Zhao, Jing-Zhuang; Liu, Miao; Lu, Tong-Yan; Liu, Hong-Bai; Yin, Jiasheng; Xu, Li-Ming

2017-04-01

Infectious pancreatic necrosis (IPN) is a significant disease of farmed salmonids resulting in direct economic losses due to high mortality in China. However, no gene sequence of any Chinese infectious pancreatic necrosis virus (IPNV) isolates was available. In the study, moribund rainbow trout fry samples were collected during an outbreak of IPN in Yunnan province of southwest China in 2013. An IPNV was isolated and tentatively named ChRtm213. We determined the full genome sequence of the IPNV ChRtm213 and compared it with previously identified IPNV sequences worldwide. The sequences of different structural and non-structural protein genes were compared to those of other aquatic birnaviruses sequenced to date. The results indicated that the complete genome sequence of ChRtm213 strain contains a segment A (3099 nucleotides) coding a polyprotein VP2-VP4-VP3, and a segment B (2789 nucleotides) coding a RNA-dependent RNA polymerase VP1. The phylogenetic analyses showed that ChRtm213 strain fell within genogroup 1, serotype A9 (Jasper), having similarities of 96.3% (segment A) and 97.3% (segment B) with the IPNV strain AM98 from Japan. The results suggest that the Chinese IPNV isolate has relative closer relationship with Japanese IPNV strains. The sequence of ChRtm213 was the first gene sequence of IPNV isolates in China. This study provided a robust reference for diagnosis and/or control of IPNV prevalent in China.
SIMBAD : a sequence-independent molecular-replacement pipeline

DOE PAGES

Simpkin, Adam J.; Simkovic, Felix; Thomas, Jens M. H.; ...

2018-06-08

The conventional approach to finding structurally similar search models for use in molecular replacement (MR) is to use the sequence of the target to search against those of a set of known structures. Sequence similarity often correlates with structure similarity. Given sufficient similarity, a known structure correctly positioned in the target cell by the MR process can provide an approximation to the unknown phases of the target. An alternative approach to identifying homologous structures suitable for MR is to exploit the measured data directly, comparing the lattice parameters or the experimentally derived structure-factor amplitudes with those of known structures. Here,more » SIMBAD , a new sequence-independent MR pipeline which implements these approaches, is presented. SIMBAD can identify cases of contaminant crystallization and other mishaps such as mistaken identity (swapped crystallization trays), as well as solving unsequenced targets and providing a brute-force approach where sequence-dependent search-model identification may be nontrivial, for example because of conformational diversity among identifiable homologues. The program implements a three-step pipeline to efficiently identify a suitable search model in a database of known structures. The first step performs a lattice-parameter search against the entire Protein Data Bank (PDB), rapidly determining whether or not a homologue exists in the same crystal form. The second step is designed to screen the target data for the presence of a crystallized contaminant, a not uncommon occurrence in macromolecular crystallography. Solving structures with MR in such cases can remain problematic for many years, since the search models, which are assumed to be similar to the structure of interest, are not necessarily related to the structures that have actually crystallized. To cater for this eventuality, SIMBAD rapidly screens the data against a database of known contaminant structures. Where the first two steps fail to yield a solution, a final step in SIMBAD can be invoked to perform a brute-force search of a nonredundant PDB database provided by the MoRDa MR software. Through early-access usage of SIMBAD , this approach has solved novel cases that have otherwise proved difficult to solve.« less
SIMBAD : a sequence-independent molecular-replacement pipeline

DOE Office of Scientific and Technical Information (OSTI.GOV)

Simpkin, Adam J.; Simkovic, Felix; Thomas, Jens M. H.

The conventional approach to finding structurally similar search models for use in molecular replacement (MR) is to use the sequence of the target to search against those of a set of known structures. Sequence similarity often correlates with structure similarity. Given sufficient similarity, a known structure correctly positioned in the target cell by the MR process can provide an approximation to the unknown phases of the target. An alternative approach to identifying homologous structures suitable for MR is to exploit the measured data directly, comparing the lattice parameters or the experimentally derived structure-factor amplitudes with those of known structures. Here,more » SIMBAD , a new sequence-independent MR pipeline which implements these approaches, is presented. SIMBAD can identify cases of contaminant crystallization and other mishaps such as mistaken identity (swapped crystallization trays), as well as solving unsequenced targets and providing a brute-force approach where sequence-dependent search-model identification may be nontrivial, for example because of conformational diversity among identifiable homologues. The program implements a three-step pipeline to efficiently identify a suitable search model in a database of known structures. The first step performs a lattice-parameter search against the entire Protein Data Bank (PDB), rapidly determining whether or not a homologue exists in the same crystal form. The second step is designed to screen the target data for the presence of a crystallized contaminant, a not uncommon occurrence in macromolecular crystallography. Solving structures with MR in such cases can remain problematic for many years, since the search models, which are assumed to be similar to the structure of interest, are not necessarily related to the structures that have actually crystallized. To cater for this eventuality, SIMBAD rapidly screens the data against a database of known contaminant structures. Where the first two steps fail to yield a solution, a final step in SIMBAD can be invoked to perform a brute-force search of a nonredundant PDB database provided by the MoRDa MR software. Through early-access usage of SIMBAD , this approach has solved novel cases that have otherwise proved difficult to solve.« less
ModeRNA: a tool for comparative modeling of RNA 3D structure

PubMed Central

Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.

2011-01-01

RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639
Do pattern recognition skills transfer across sports? A preliminary analysis.

PubMed

Smeeton, Nicholas J; Ward, Paul; Williams, A Mark

2004-02-01

The ability to recognize patterns of play is fundamental to performance in team sports. While typically assumed to be domain-specific, pattern recognition skills may transfer from one sport to another if similarities exist in the perceptual features and their relations and/or the strategies used to encode and retrieve relevant information. A transfer paradigm was employed to compare skilled and less skilled soccer, field hockey and volleyball players' pattern recognition skills. Participants viewed structured and unstructured action sequences from each sport, half of which were randomly represented with clips not previously seen. The task was to identify previously viewed action sequences quickly and accurately. Transfer of pattern recognition skill was dependent on the participant's skill, sport practised, nature of the task and degree of structure. The skilled soccer and hockey players were quicker than the skilled volleyball players at recognizing structured soccer and hockey action sequences. Performance differences were not observed on the structured volleyball trials between the skilled soccer, field hockey and volleyball players. The skilled field hockey and soccer players were able to transfer perceptual information or strategies between their respective sports. The less skilled participants' results were less clear. Implications for domain-specific expertise, transfer and diversity across domains are discussed.
The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

PubMed

Mir, Rafia; Jallu, Shais; Singh, T P

2015-06-01

The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.
Structure, replication efficiency and fragility of yeast ARS elements.

PubMed

Dhar, Manoj K; Sehgal, Shelly; Kaul, Sanjana

2012-05-01

DNA replication in eukaryotes initiates at specific sites known as origins of replication, or replicators. These replication origins occur throughout the genome, though the propensity of their occurrence depends on the type of organism. In eukaryotes, zones of initiation of replication spanning from about 100 to 50,000 base pairs have been reported. The characteristics of eukaryotic replication origins are best understood in the budding yeast Saccharomyces cerevisiae, where some autonomously replicating sequences, or ARS elements, confer origin activity. ARS elements are short DNA sequences of a few hundred base pairs, identified by their efficiency at initiating a replication event when cloned in a plasmid. ARS elements, although structurally diverse, maintain a basic structure composed of three domains, A, B and C. Domain A is comprised of a consensus sequence designated ACS (ARS consensus sequence), while the B domain has the DNA unwinding element and the C domain is important for DNA-protein interactions. Although there are ∼400 ARS elements in the yeast genome, not all of them are active origins of replication. Different groups within the genus Saccharomyces have ARS elements as components of replication origin. The present paper provides a comprehensive review of various aspects of ARSs, starting from their structural conservation to sequence thermodynamics. All significant and conserved functional sequence motifs within different types of ARS elements have been extensively described. Issues like silencing at ARSs, their inherent fragility and factors governing their replication efficiency have also been addressed. Progress in understanding crucial components associated with the replication machinery and timing at these ARS elements is discussed in the section entitled "The replicon revisited". Copyright © 2012 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
G4RNA: an RNA G-quadruplex database

PubMed Central

Garant, Jean-Michel; Luce, Mikael J.; Scott, Michelle S.

2015-01-01

Abstract G-quadruplexes (G4) are tetrahelical structures formed from planar arrangement of guanines in nucleic acids. A simple, regular motif was originally proposed to describe G4-forming sequences. More recently, however, formation of G4 was discovered to depend, at least in part, on the contextual backdrop of neighboring sequences. Prediction of G4 folding is thus becoming more challenging as G4 outlier structures, not described by the originally proposed motif, are increasingly reported. Recent observations thus call for a comprehensive tool, capable of consolidating the expanding information on tested G4s, in order to conduct systematic comparative analyses of G4-promoting sequences. The G4RNA Database we propose was designed to help meet the need for easily-retrievable data on known RNA G4s. A user-friendly, flexible query system allows for data retrieval on experimentally tested sequences, from many separate genes, to assess G4-folding potential. Query output sorts data according to sequence position, G4 likelihood, experimental outcomes and associated bibliographical references. G4RNA also provides an ideal foundation to collect and store additional sequence and experimental data, considering the growing interest G4s currently generate. Database URL: scottgroup.med.usherbrooke.ca/G4RNA PMID:26200754
Import of a major mitochondrial enzyme depends on synergy between two distinct helices of its presequence

USDA-ARS?s Scientific Manuscript database

The human mitochondrial glutamate dehydrogenase isozymes (hGDH1 and 2) are abundant matrix-localized proteins encoded by nuclear genes. The proteins are synthesized in the cytoplasm, with an atypically long N-terminal mitochondrial targeting sequence (MTS). The results of secondary structure predi...
The yeast Pif1 helicase prevents genomic instability caused by G-quadruplex-forming CEB1 sequences in vivo.

PubMed

Ribeyre, Cyril; Lopes, Judith; Boulé, Jean-Baptiste; Piazza, Aurèle; Guédin, Aurore; Zakian, Virginia A; Mergny, Jean-Louis; Nicolas, Alain

2009-05-01

In budding yeast, the Pif1 DNA helicase is involved in the maintenance of both nuclear and mitochondrial genomes, but its role in these processes is still poorly understood. Here, we provide evidence for a new Pif1 function by demonstrating that its absence promotes genetic instability of alleles of the G-rich human minisatellite CEB1 inserted in the Saccharomyces cerevisiae genome, but not of other tandem repeats. Inactivation of other DNA helicases, including Sgs1, had no effect on CEB1 stability. In vitro, we show that CEB1 repeats formed stable G-quadruplex (G4) secondary structures and the Pif1 protein unwinds these structures more efficiently than regular B-DNA. Finally, synthetic CEB1 arrays in which we mutated the potential G4-forming sequences were no longer destabilized in pif1Delta cells. Hence, we conclude that CEB1 instability in pif1Delta cells depends on the potential to form G-quadruplex structures, suggesting that Pif1 could play a role in the metabolism of G4-forming sequences.
MultiSeq: unifying sequence and structure data for evolutionary analysis

PubMed Central

Roberts, Elijah; Eargle, John; Wright, Dan; Luthey-Schulten, Zaida

2006-01-01

Background Since the publication of the first draft of the human genome in 2000, bioinformatic data have been accumulating at an overwhelming pace. Currently, more than 3 million sequences and 35 thousand structures of proteins and nucleic acids are available in public databases. Finding correlations in and between these data to answer critical research questions is extremely challenging. This problem needs to be approached from several directions: information science to organize and search the data; information visualization to assist in recognizing correlations; mathematics to formulate statistical inferences; and biology to analyze chemical and physical properties in terms of sequence and structure changes. Results Here we present MultiSeq, a unified bioinformatics analysis environment that allows one to organize, display, align and analyze both sequence and structure data for proteins and nucleic acids. While special emphasis is placed on analyzing the data within the framework of evolutionary biology, the environment is also flexible enough to accommodate other usage patterns. The evolutionary approach is supported by the use of predefined metadata, adherence to standard ontological mappings, and the ability for the user to adjust these classifications using an electronic notebook. MultiSeq contains a new algorithm to generate complete evolutionary profiles that represent the topology of the molecular phylogenetic tree of a homologous group of distantly related proteins. The method, based on the multidimensional QR factorization of multiple sequence and structure alignments, removes redundancy from the alignments and orders the protein sequences by increasing linear dependence, resulting in the identification of a minimal basis set of sequences that spans the evolutionary space of the homologous group of proteins. Conclusion MultiSeq is a major extension of the Multiple Alignment tool that is provided as part of VMD, a structural visualization program for analyzing molecular dynamics simulations. Both are freely distributed by the NIH Resource for Macromolecular Modeling and Bioinformatics and MultiSeq is included with VMD starting with version 1.8.5. The MultiSeq website has details on how to download and use the software: PMID:16914055
Influence of a heptad repeat stutter on the pH-dependent conformational behavior of the central coiled-coil from influenza hemagglutinin HA2.

PubMed

Higgins, Chelsea D; Malashkevich, Vladimir N; Almo, Steven C; Lai, Jonathan R

2014-09-01

The coiled-coil is one of the most common protein structural motifs. Amino acid sequences of regions that participate in coiled-coils contain a heptad repeat in which every third then forth residue is occupied by a hydrophobic residue. Here we examine the consequences of a "stutter," a deviation of the idealized heptad repeat that is found in the central coiled-coil of influenza hemagluttinin HA2. Characterization of a peptide containing the native stutter-containing HA2 sequence, as well as several variants in which the stutter was engineered out to restore an idealized heptad repeat pattern, revealed that the stutter is important for allowing coiled-coil formation in the WT HA2 at both neutral and low pH (7.1 and 4.5). By contrast, all variants that contained idealized heptad repeats exhibited marked pH-dependent coiled-coil formation with structures forming much more stably at low pH. A crystal structure of one variant containing an idealized heptad repeat, and comparison to the WT HA2 structure, suggest that the stutter distorts the optimal interhelical core packing arrangement, resulting in unwinding of the coiled-coil superhelix. Interactions between acidic side chains, in particular E69 and E74 (present in all peptides studied), are suggested to play a role in mediating these pH-dependent conformational effects. This conclusion is partially supported by studies on HA2 variant peptides in which these positions were altered to aspartic acid. These results provide new insight into the structural role of the heptad repeat stutter in HA2. © 2014 Wiley Periodicals, Inc.
G-Quadruplex Induction by the Hairpin Pyrrole-Imidazole Polyamide Dimer.

PubMed

Obata, Shunsuke; Asamitsu, Sefan; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

2018-02-06

The G-quadruplex (G4) is one type of higher-order structure of nucleic acids and is thought to play important roles in various biological events such as regulation of transcription and inhibition of DNA replication. Pyrrole-imidazole polyamides (PIPs) are programmable small molecules that can sequence-specifically bind with high affinity to the minor groove of double-stranded DNA (dsDNA). Herein, we designed head-to-head hairpin PIP dimers and their target dsDNA in a model G4-forming sequence. Using an electrophoresis mobility shift assay and transcription arrest assay, we found that PIP dimers could induce the structural change to G4 DNA from dsDNA through the recognition by one PIP dimer molecule of two duplex-binding sites flanking both ends of the G4-forming sequence. This induction ability was dependent on linker length. This is the first study to induce G4 formation using PIPs, which are known to be dsDNA binders. The results reported here suggest that selective G4 induction in native sequences may be achieved with PIP dimers by applying the same design strategy.
Simulation studies of DNA at the nanoscale: Interactions with proteins, polycations, and surfaces

NASA Astrophysics Data System (ADS)

Elder, Robert M.

Understanding the nanoscale interactions of DNA, a multifunctional biopolymer with sequence-dependent properties, with other biological and synthetic substrates and molecules is essential to advancing these technologies. This doctoral thesis research is aimed at understanding the thermodynamics and molecular-level structure when DNA interacts with proteins, polycations, and functionalized surfaces. First, we investigate the ability of a DNA damage recognition protein (HMGB1a) to bind to anti-cancer drug-induced DNA damage, seeking to explain how HMGB1a differentiates between the drugs in vivo. Using atomistic molecular dynamics simulations, we show that the structure of the drug-DNA molecule exhibits drug- and base sequence-dependence that explains some of the experimentally observed differential recognition of the drugs in various sequence contexts. Then, we show how steric hindrance from the drug decreases the deformability of the drug-DNA molecule, which decreases recognition by the protein, a concept that can be applied to rational drug design. Second, we study how polycation architecture and chemistry affect polycation-DNA binding so as to design optimal polycations for high efficiency gene (DNA) delivery. Using a multiscale computational approach involving atomistic and coarse-grained simulations, we examine how rearranging polylysine from a linear to a grafted architecture, and several aspects of the grafted architecture, affect polycation-DNA binding and the structure of polycation-DNA complexes. Next, going beyond lysine we examine how oligopeptide chemistry and sequence in the grafted architecture affects polycation-DNA binding and find that strategic placement of hydrophobic peptides might be used to tailor binding strength. Third, we study the adsorption and conformations of single-stranded DNA (an amphiphilic biopolymer) on model hydrophilic and hydrophobic surfaces. Short ssDNA oligomers adsorb to both surfaces with similar strength, with the strength of adsorption to the hydrophobic surface depending on the composition of the DNA strands, i.e. purine or pyrimidine bases. Additionally, DNA-surface and DNA-water interactions near the surfaces govern the adsorption. For longer ssDNA oligomers, the effects of surface chemistry and temperature on ssDNA conformations are rather small, but either the hydrophilic surface or increased temperature favor slightly more compact conformations due to energetic and entropic effects, respectively.
Probing the electrostatics and pharmacologic modulation of sequence-specific binding by the DNA-binding domain of the ETS-family transcription factor PU.1: a binding affinity and kinetics investigation

PubMed Central

Munde, Manoj; Poon, Gregory M. K.; Wilson, W. David

2013-01-01

Members of the ETS family of transcription factors regulate a functionally diverse array of genes. All ETS proteins share a structurally-conserved but sequence-divergent DNA-binding domain, known as the ETS domain. Although the structure and thermodynamics of the ETS-DNA complexes are well known, little is known about the kinetics of sequence recognition, a facet that offers potential insight into its molecular mechanism. We have characterized DNA binding by the ETS domain of PU.1 by biosensor-surface plasmon resonance (SPR). SPR analysis revealed a striking kinetic profile for DNA binding by the PU.1 ETS domain. At low salt concentrations, it binds high-affinity cognate DNA with a very slow association rate constant (≤105 M−1 s−1), compensated by a correspondingly small dissociation rate constant. The kinetics are strongly salt-dependent but mutually balance to produce a relatively weak dependence in the equilibrium constant. This profile contrasts sharply with reported data for other ETS domains (e.g., Ets-1, TEL) for which high-affinity binding is driven by rapid association (>107 M−1 s−1). We interpret this difference in terms of the hydration properties of ETS-DNA binding and propose that at least two mechanisms of sequence recognition are employed by this family of DNA-binding domain. Additionally, we use SPR to demonstrate the potential for pharmacological inhibition of sequence-specific ETS-DNA binding, using the minor groove-binding distamycin as a model compound. Our work establishes SPR as a valuable technique for extending our understanding of the molecular mechanisms of ETS-DNA interactions as well as developing potential small-molecule agents for biotechnological and therapeutic purposes. PMID:23416556
Sequence-structure correlations in silk: Poly-Ala repeat of N. clavipes MaSp1 is naturally optimized at a critical length scale.

PubMed

Bratzel, Graham; Buehler, Markus J

2012-03-01

Spider silk is a self-assembling biopolymer that outperforms many known materials in terms of its mechanical performance despite being constructed from simple and inferior building blocks. While experimental studies have shown that the molecular structure of silk has a direct influence on the stiffness, toughness, and failure strength of silk, few molecular-level analyses of the nanostructure of silk assemblies in particular under variations of genetic sequences have been reported. Here we report atomistic-level structures of the MaSp1 protein from the Nephila Clavipes spider dragline silk sequence, obtained using an in silico approach based on replica exchange molecular dynamics (REMD) and explicit water molecular dynamics. We apply this method to study the effects of a systematic variation of the poly-alanine repeat lengths, a parameter controlled by the genetic makeup of silk, on the resulting molecular structure of silk at the nanoscale. Confirming earlier experimental and computational work, a structural analysis reveals that poly-alanine regions in silk predominantly form distinct and orderly β-sheet crystal domains while disorderly regions are formed by glycine-rich repeats that consist of 3(10)-helix type structures and β-turns. Our predictions are directly validated against experimental data based on dihedral angle pair calculations presented in Ramachandran plots combined with an analysis of the secondary structure content. The key result of our study is our finding of a strong dependence of the resulting silk nanostructure depending on the poly-alanine length. We observe that the wildtype poly-alanine repeat length of six residues defines a critical minimum length that consistently results in clearly defined β-sheet nanocrystals. For poly-alanine lengths below six, the β-sheet nanocrystals are not well-defined or not visible at all, while for poly-alanine lengths at and above six, the characteristic nanocomposite structure of silk emerges with no significant improvement of the quality of the β-sheet nanocrystal geometry. We present a simple biophysical model that explains these computational observations based on the mechanistic insight gained from the molecular simulations. Our findings set the stage for understanding how variations in the spidroin sequence can be used to engineer the structure and thereby functional properties of this biological superfiber, and present a design strategy for the genetic optimization of spidroins for enhanced mechanical properties. The approach used here may also find application in the design of other self-assembled molecular structures and fibers and in particular biologically inspired or completely synthetic systems. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
Nullomers and High Order Nullomers in Genomic Sequences

PubMed Central

Vergni, Davide; Santoni, Daniele

2016-01-01

A nullomer is an oligomer that does not occur as a subsequence in a given DNA sequence, i.e. it is an absent word of that sequence. The importance of nullomers in several applications, from drug discovery to forensic practice, is now debated in the literature. Here, we investigated the nature of nullomers, whether their absence in genomes has just a statistical explanation or it is a peculiar feature of genomic sequences. We introduced an extension of the notion of nullomer, namely high order nullomers, which are nullomers whose mutated sequences are still nullomers. We studied different aspects of them: comparison with nullomers of random sequences, CpG distribution and mean helical rise. In agreement with previous results we found that the number of nullomers in the human genome is much larger than expected by chance. Nevertheless antithetical results were found when considering a random DNA sequence preserving dinucleotide frequencies. The analysis of CpG frequencies in nullomers and high order nullomers revealed, as expected, a high CpG content but it also highlighted a strong dependence of CpG frequencies on the dinucleotide position, suggesting that nullomers have their own peculiar structure and are not simply sequences whose CpG frequency is biased. Furthermore, phylogenetic trees were built on eleven species based on both the similarities between the dinucleotide frequencies and the number of nullomers two species share, showing that nullomers are fairly conserved among close species. Finally the study of mean helical rise of nullomers sequences revealed significantly high mean rise values, reinforcing the hypothesis that those sequences have some peculiar structural features. The obtained results show that nullomers are the consequence of the peculiar structure of DNA (also including biased CpG frequency and CpGs islands), so that the hypermutability model, also taking into account CpG islands, seems to be not sufficient to explain nullomer phenomenon. Finally, high order nullomers could emphasize those features that already make simple nullomers useful in several applications. PMID:27906971
SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction.

PubMed

Boniecki, Michal J; Lach, Grzegorz; Dawson, Wayne K; Tomala, Konrad; Lukasz, Pawel; Soltysinski, Tomasz; Rother, Kristian M; Bujnicki, Janusz M

2016-04-20

RNA molecules play fundamental roles in cellular processes. Their function and interactions with other biomolecules are dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. Here, we present SimRNA: a new method for computational RNA 3D structure prediction, which uses a coarse-grained representation, relies on the Monte Carlo method for sampling the conformational space, and employs a statistical potential to approximate the energy and identify conformations that correspond to biologically relevant structures. SimRNA can fold RNA molecules using only sequence information, and, on established test sequences, it recapitulates secondary structure with high accuracy, including correct prediction of pseudoknots. For modeling of complex 3D structures, it can use additional restraints, derived from experimental or computational analyses, including information about secondary structure and/or long-range contacts. SimRNA also can be used to analyze conformational landscapes and identify potential alternative structures. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
SIBIS: a Bayesian model for inconsistent protein sequence estimation.

PubMed

Khenoussi, Walyd; Vanhoutrève, Renaud; Poch, Olivier; Thompson, Julie D

2014-09-01

The prediction of protein coding genes is a major challenge that depends on the quality of genome sequencing, the accuracy of the model used to elucidate the exonic structure of the genes and the complexity of the gene splicing process leading to different protein variants. As a consequence, today's protein databases contain a huge amount of inconsistency, due to both natural variants and sequence prediction errors. We have developed a new method, called SIBIS, to detect such inconsistencies based on the evolutionary information in multiple sequence alignments. A Bayesian framework, combined with Dirichlet mixture models, is used to estimate the probability of observing specific amino acids and to detect inconsistent or erroneous sequence segments. We evaluated the performance of SIBIS on a reference set of protein sequences with experimentally validated errors and showed that the sensitivity is significantly higher than previous methods, with only a small loss of specificity. We also assessed a large set of human sequences from the UniProt database and found evidence of inconsistency in 48% of the previously uncharacterized sequences. We conclude that the integration of quality control methods like SIBIS in automatic analysis pipelines will be critical for the robust inference of structural, functional and phylogenetic information from these sequences. Source code, implemented in C on a linux system, and the datasets of protein sequences are freely available for download at http://www.lbgi.fr/∼julie/SIBIS. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Conserved Features in the Structure, Mechanism, and Biogenesis of the Inverse Autotransporter Protein Family

PubMed Central

Heinz, Eva; Stubenrauch, Christopher J.; Grinter, Rhys; Croft, Nathan P.; Purcell, Anthony W.; Strugnell, Richard A.; Dougan, Gordon; Lithgow, Trevor

2016-01-01

The bacterial cell surface proteins intimin and invasin are virulence factors that share a common domain structure and bind selectively to host cell receptors in the course of bacterial pathogenesis. The β-barrel domains of intimin and invasin show significant sequence and structural similarities. Conversely, a variety of proteins with sometimes limited sequence similarity have also been annotated as “intimin-like” and “invasin” in genome datasets, while other recent work on apparently unrelated virulence-associated proteins ultimately revealed similarities to intimin and invasin. Here we characterize the sequence and structural relationships across this complex protein family. Surprisingly, intimins and invasins represent a very small minority of the sequence diversity in what has been previously the “intimin/invasin protein family”. Analysis of the assembly pathway for expression of the classic intimin, EaeA, and a characteristic example of the most prevalent members of the group, FdeC, revealed a dependence on the translocation and assembly module as a common feature for both these proteins. While the majority of the sequences in the grouping are most similar to FdeC, a further and widespread group is two-partner secretion systems that use the β-barrel domain as the delivery device for secretion of a variety of virulence factors. This comprehensive analysis supports the adoption of the “inverse autotransporter protein family” as the most accurate nomenclature for the family and, in turn, has important consequences for our overall understanding of the Type V secretion systems of bacterial pathogens. PMID:27190006
Identification of sequence-structure RNA binding motifs for SELEX-derived aptamers.

PubMed

Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E; Przytycka, Teresa M

2012-06-15

Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. To close this gap we developed, Aptamotif, a computational method for the identification of sequence-structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the author's findings to a high degree, thus proving the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process.
The influence of sequence context and length on the kinetics of DNA duplex formation from complementary hairpins possessing (CNG) repeats.

PubMed

Paiva, Anthony M; Sheardy, Richard D

2005-04-20

The formation of unusual structures during DNA replication has been invoked for gene expansion in genomes possessing triplet repeat sequences, CNG, where N = A, C, G, or T. In particular, it has been suggested that the daughter strand of the leading strand partially dissociates from the parent strand and forms a hairpin. The equilibrium between the fully duplexed parent:daugter species and the parent:hairpin species is dependent upon their relative stabilities and the rates of reannealing of the daughter strand back to the parent. These stabilities and rates are ultimately influenced by the sequence context of the DNA and its length. Previous work has demonstrated that longer strands are more stable than shorter strands and that the identity of N also influences the thermal stability [Paiva, A. M.; Sheardy, R. D. Biochemistry 2004, 43, 14218-14227]. Here, we show that the rate of duplex formation from complementary hairpins is also sequence context and length dependent. In particular, longer duplexes have higher activation energies than shorter duplexes of the same sequence context. Further, [(CCG):(GGC)] duplexes have lower activation energies than corresponding [(CAG):(GTC)] duplexes of the same length. Hence, hairpins formed from long CNG sequences are more thermodynamically stable and have slower kinetics for reannealing to their complement than shorter analogues. Gene expansion can now be explained in terms of thermodynamics and kinetics.
General approach to reversing ketol-acid reductoisomerase cofactor dependence from NADPH to NADH

DOE PAGES

Brinkmann-Chen, Sabine; Flock, Tilman; Cahn, Jackson K. B.; ...

2013-06-17

To date, efforts to switch the cofactor specificity of oxidoreductases from nicotinamide adenine dinucleotide phosphate (NADPH) to nicotinamide adenine dinucleotide (NADH) have been made on a case-by-case basis with varying degrees of success. Here we present a straightforward recipe for altering the cofactor specificity of a class of NADPH-dependent oxidoreductases, the ketol-acid reductoisomerases (KARIs). Combining previous results for an engineered NADH-dependent variant of Escherichia coli KARI with available KARI crystal structures and a comprehensive KARI-sequence alignment, we identified key cofactor specificity determinants and used this information to construct five KARIs with reversed cofactor preference. Additional directed evolution generated two enzymesmore » having NADH-dependent catalytic efficiencies that are greater than the wild-type enzymes with NADPH. As a result, high-resolution structures of a wild-type/variant pair reveal the molecular basis of the cofactor switch.« less
Evolutionary profiles derived from the QR factorization of multiple structural alignments gives an economy of information.

PubMed

O'Donoghue, Patrick; Luthey-Schulten, Zaida

2005-02-25

We present a new algorithm, based on the multidimensional QR factorization, to remove redundancy from a multiple structural alignment by choosing representative protein structures that best preserve the phylogenetic tree topology of the homologous group. The classical QR factorization with pivoting, developed as a fast numerical solution to eigenvalue and linear least-squares problems of the form Ax=b, was designed to re-order the columns of A by increasing linear dependence. Removing the most linear dependent columns from A leads to the formation of a minimal basis set which well spans the phase space of the problem at hand. By recasting the problem of redundancy in multiple structural alignments into this framework, in which the matrix A now describes the multiple alignment, we adapted the QR factorization to produce a minimal basis set of protein structures which best spans the evolutionary (phase) space. The non-redundant and representative profiles obtained from this procedure, termed evolutionary profiles, are shown in initial results to outperform well-tested profiles in homology detection searches over a large sequence database. A measure of structural similarity between homologous proteins, Q(H), is presented. By properly accounting for the effect and presence of gaps, a phylogenetic tree computed using this metric is shown to be congruent with the maximum-likelihood sequence-based phylogeny. The results indicate that evolutionary information is indeed recoverable from the comparative analysis of protein structure alone. Applications of the QR ordering and this structural similarity metric to analyze the evolution of structure among key, universally distributed proteins involved in translation, and to the selection of representatives from an ensemble of NMR structures are also discussed.
A systematic molecular dynamics study of nearest-neighbor effects on base pair and base pair step conformations and fluctuations in B-DNA

PubMed Central

Lavery, Richard; Zakrzewska, Krystyna; Beveridge, David; Bishop, Thomas C.; Case, David A.; Cheatham, Thomas; Dixit, Surjit; Jayaram, B.; Lankas, Filip; Laughton, Charles; Maddocks, John H.; Michon, Alexis; Osman, Roman; Orozco, Modesto; Perez, Alberto; Singh, Tanya; Spackova, Nada; Sponer, Jiri

2010-01-01

It is well recognized that base sequence exerts a significant influence on the properties of DNA and plays a significant role in protein–DNA interactions vital for cellular processes. Understanding and predicting base sequence effects requires an extensive structural and dynamic dataset which is currently unavailable from experiment. A consortium of laboratories was consequently formed to obtain this information using molecular simulations. This article describes results providing information not only on all 10 unique base pair steps, but also on all possible nearest-neighbor effects on these steps. These results are derived from simulations of 50–100 ns on 39 different DNA oligomers in explicit solvent and using a physiological salt concentration. We demonstrate that the simulations are converged in terms of helical and backbone parameters. The results show that nearest-neighbor effects on base pair steps are very significant, implying that dinucleotide models are insufficient for predicting sequence-dependent behavior. Flanking base sequences can notably lead to base pair step parameters in dynamic equilibrium between two conformational sub-states. Although this study only provides limited data on next-nearest-neighbor effects, we suggest that such effects should be analyzed before attempting to predict the sequence-dependent behavior of DNA. PMID:19850719
Local energetic frustration affects the dependence of green fluorescent protein folding on the chaperonin GroEL.

PubMed

Bandyopadhyay, Boudhayan; Goldenzweig, Adi; Unger, Tamar; Adato, Orit; Fleishman, Sarel J; Unger, Ron; Horovitz, Amnon

2017-12-15

The GroE chaperonin system in Escherichia coli comprises GroEL and GroES and facilitates ATP-dependent protein folding in vivo and in vitro Proteins with very similar sequences and structures can differ in their dependence on GroEL for efficient folding. One potential but unverified source for GroEL dependence is frustration, wherein not all interactions in the native state are optimized energetically, thereby potentiating slow folding and misfolding. Here, we chose enhanced green fluorescent protein as a model system and subjected it to random mutagenesis, followed by screening for variants whose in vivo folding displays increased or decreased GroEL dependence. We confirmed the altered GroEL dependence of these variants with in vitro folding assays. Strikingly, mutations at positions predicted to be highly frustrated were found to correlate with decreased GroEL dependence. Conversely, mutations at positions with low frustration were found to correlate with increased GroEL dependence. Further support for this finding was obtained by showing that folding of an enhanced green fluorescent protein variant designed computationally to have reduced frustration is indeed less GroEL-dependent. Our results indicate that changes in local frustration also affect partitioning in vivo between spontaneous and chaperonin-mediated folding. Hence, the design of minimally frustrated sequences can reduce chaperonin dependence and improve protein expression levels. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development.

PubMed

Bandyopadhyay, Deepak; Huan, Jun; Prins, Jan; Snoeyink, Jack; Wang, Wei; Tropsha, Alexander

2009-11-01

Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name used as a vertex label) are connected by geometrical proximity edges. The approach employs two steps. First, it uses a fast subgraph mining algorithm to find all occurrences of family-specific labeled subgraphs for all well characterized protein structural and functional families. Second, it queries a new structure for occurrences of a set of motifs characteristic of a known family, using a graph index to speed up Ullman's subgraph isomorphism algorithm. The confidence of function inference from structure depends on the number of family-specific motifs found in the query structure compared with their distribution in a large non-redundant database of proteins. This method can assign a new structure to a specific functional family in cases where sequence alignments, sequence patterns, structural superposition and active site templates fail to provide accurate annotation.
3DNALandscapes: a database for exploring the conformational features of DNA.

PubMed

Zheng, Guohui; Colasanti, Andrew V; Lu, Xiang-Jun; Olson, Wilma K

2010-01-01

3DNALandscapes, located at: http://3DNAscapes.rutgers.edu, is a new database for exploring the conformational features of DNA. In contrast to most structural databases, which archive the Cartesian coordinates and/or derived parameters and images for individual structures, 3DNALandscapes enables searches of conformational information across multiple structures. The database contains a wide variety of structural parameters and molecular images, computed with the 3DNA software package and known to be useful for characterizing and understanding the sequence-dependent spatial arrangements of the DNA sugar-phosphate backbone, sugar-base side groups, base pairs, base-pair steps, groove structure, etc. The data comprise all DNA-containing structures--both free and bound to proteins, drugs and other ligands--currently available in the Protein Data Bank. The web interface allows the user to link, report, plot and analyze this information from numerous perspectives and thereby gain insight into DNA conformation, deformability and interactions in different sequence and structural contexts. The data accumulated from known, well-resolved DNA structures can serve as useful benchmarks for the analysis and simulation of new structures. The collective data can also help to understand how DNA deforms in response to proteins and other molecules and undergoes conformational rearrangements.
Tools to evaluate the conformation of protein products.

PubMed

Manta, Bruno; Obal, Gonzalo; Ricciardi, Alejandro; Pritsch, Otto; Denicola, Ana

2011-06-01

Production of recombinant proteins is a process intensively used in the research laboratory. In addition, the main biotechnology market products are recombinant proteins and monoclonal antibodies. The biological (and clinical) properties of the protein product strongly depend on the conformation of the polypeptide. Therefore, assessment of the correct conformation of the produced protein is crucial. There is no single method to assess every aspect of protein structure or function. Depending on the protein, the methods of choice vary. There are general methods to evaluate not only mass and primary sequence of the protein, but also higher-order structure. This review outlines the principal techniques for determining the conformation of a protein from structural (biophysical methods) to functional (in vitro binding assays) analyses. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Xiong, J.-P.; Stehle, T.; Zhang, R.

The structural basis for the divalent cation-dependent binding of heterodimeric alpha beta integrins to their ligands, which contain the prototypical Arg-Gly-Asp sequence, is unknown. Interaction with ligands triggers tertiary and quaternary structural rearrangements in integrins that are needed for cell signaling. Here we report the crystal structure of the extracellular segment of integrin alpha Vbeta 3 in complex with a cyclic peptide presenting the Arg-Gly-Asp sequence. The ligand binds at the major interface between the alpha V and beta 3 subunits and makes extensive contacts with both. Both tertiary and quaternary changes are observed in the presence of ligand. Themore » tertiary rearrangements take place in beta A, the ligand-binding domain of beta 3; in the complex, beta A acquires two cations, one of which contacts the ligand Asp directly and the other stabilizes the ligand-binding surface. Ligand binding induces small changes in the orientation of alpha V relative to beta 3.« less
From protein sequence to dynamics and disorder with DynaMine.

PubMed

Cilia, Elisa; Pancsa, Rita; Tompa, Peter; Lenaerts, Tom; Vranken, Wim F

2013-01-01

Protein function and dynamics are closely related; however, accurate dynamics information is difficult to obtain. Here based on a carefully assembled data set derived from experimental data for proteins in solution, we quantify backbone dynamics properties on the amino-acid level and develop DynaMine--a fast, high-quality predictor of protein backbone dynamics. DynaMine uses only protein sequence information as input and shows great potential in distinguishing regions of different structural organization, such as folded domains, disordered linkers, molten globules and pre-structured binding motifs of different sizes. It also identifies disordered regions within proteins with an accuracy comparable to the most sophisticated existing predictors, without depending on prior disorder knowledge or three-dimensional structural information. DynaMine provides molecular biologists with an important new method that grasps the dynamical characteristics of any protein of interest, as we show here for human p53 and E1A from human adenovirus 5.
Antifreeze glycopeptide analogues: microwave-enhanced synthesis and functional studies.

PubMed

Heggemann, Carolin; Budke, Carsten; Schomburg, Benjamin; Majer, Zsuzsa; Wissbrock, Marco; Koop, Thomas; Sewald, Norbert

2010-01-01

Antifreeze glycoproteins enable life at temperatures below the freezing point of physiological solutions. They usually consist of the repetitive tripeptide unit (-Ala-Ala-Thr-) with the disaccharide alpha-D-galactosyl-(1-3)-beta-N-acetyl-D-galactosamine attached to each hydroxyl group of threonine. Monoglycosylated analogues have been synthesized from the corresponding monoglycosylated threonine building block by microwave-assisted solid phase peptide synthesis. This method allows the preparation of analogues containing sequence variations which are not accessible by other synthetic methods. As antifreeze glycoproteins consist of numerous isoforms they are difficult to obtain in pure form from natural sources. The synthetic peptides have been structurally analyzed by CD and NMR spectroscopy in proton exchange experiments revealing a structure as flexible as reported for the native peptides. Microphysical recrystallization tests show an ice structuring influence and ice growth inhibition depending on the concentration, chain length and sequence of the peptides.
Structure and function of small heat shock/alpha-crystallin proteins: established concepts and emerging ideas.

PubMed

MacRae, T H

2000-06-01

Small heat shock/alpha-crystallin proteins are defined by conserved sequence of approximately 90 amino acid residues, termed the alpha-crystallin domain, which is bounded by variable amino- and carboxy-terminal extensions. These proteins form oligomers, most of uncertain quaternary structure, and oligomerization is prerequisite to their function as molecular chaperones. Sequence modelling and physical analyses show that the secondary structure of small heat shock/alpha-crystallin proteins is predominately beta-pleated sheet. Crystallography, site-directed spin-labelling and yeast two-hybrid selection demonstrate regions of secondary structure within the alpha-crystallin domain that interact during oligomer assembly, a process also dependent on the amino terminus. Oligomers are dynamic, exhibiting subunit exchange and organizational plasticity, perhaps leading to functional diversity. Exposure of hydrophobic residues by structural modification facilitates chaperoning where denaturing proteins in the molten globule state associate with oligomers. The flexible carboxy-terminal extension contributes to chaperone activity by enhancing the solubility of small heat shock/alpha-crystallin proteins. Site-directed mutagenesis has yielded proteins where the effect of the change on structure and function depends upon the residue modified, the organism under study and the analytical techniques used. Most revealing, substitution of a conserved arginine residue within the alpha-crystallin domain has a major impact on quaternary structure and chaperone action probably through realignment of beta-sheets. These mutations are linked to inherited diseases. Oligomer size is regulated by a stress-responsive cascade including MAPKAP kinase 2/3 and p38. Phosphorylation of small heat shock/alpha-crystallin proteins has important consequences within stressed cells, especially for microfilaments.
Algebraic multigrid methods applied to problems in computational structural mechanics

NASA Technical Reports Server (NTRS)

Mccormick, Steve; Ruge, John

1989-01-01

The development of algebraic multigrid (AMG) methods and their application to certain problems in structural mechanics are described with emphasis on two- and three-dimensional linear elasticity equations and the 'jacket problems' (three-dimensional beam structures). Various possible extensions of AMG are also described. The basic idea of AMG is to develop the discretization sequence based on the target matrix and not the differential equation. Therefore, the matrix is analyzed for certain dependencies that permit the proper construction of coarser matrices and attendant transfer operators. In this manner, AMG appears to be adaptable to structural analysis applications.
Length and sequence dependence in the association of Huntingtin protein with lipid membranes

NASA Astrophysics Data System (ADS)

Jawahery, Sudi; Nagarajan, Anu; Matysiak, Silvina

2013-03-01

There is a fundamental gap in our understanding of how aggregates of mutant Huntingtin protein (htt) with overextended polyglutamine (polyQ) sequences gain the toxic properties that cause Huntington's disease (HD). Experimental studies have shown that the most important step associated with toxicity is the binding of mutant htt aggregates to lipid membranes. Studies have also shown that flanking amino acid sequences around the polyQ sequence directly affect interactions with the lipid bilayer, and that polyQ sequences of greater than 35 glutamine repeats in htt are a characteristic of HD. The key steps that determine how flanking sequences and polyQ length affect the structure of lipid bilayers remain unknown. In this study, we use atomistic molecular dynamics simulations to study the interactions between lipid membranes of varying compositions and polyQ peptides of varying lengths and flanking sequences. We find that overextended polyQ interactions do cause deformation in model membranes, and that the flanking sequences do play a role in intensifying this deformation by altering the shape of the affected regions.
Have the temperature time series a structural change after 1998?

NASA Astrophysics Data System (ADS)

Werner, Rolf; Valev, Dimitare; Danov, Dimitar

2012-07-01

The global and hemisphere temperature GISS and Hadcrut3 time series were analysed for structural changes. We postulate the continuity of the preceding temperature function depending from the time. The slopes are calculated for a sequence of segments limited by time thresholds. We used a standard method, the restricted linear regression with dummy variables. We performed the calculations and tests for different number of thresholds. The thresholds are searched continuously in determined time intervals. The F-statistic is used to obtain the time points of the structural changes.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp.

PubMed

Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong

2015-03-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica , which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp

PubMed Central

DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG

2015-01-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
Novel cis-acting element within the capsid-coding region enhances flavivirus viral-RNA replication by regulating genome cyclization.

PubMed

Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De; Qin, Cheng-Feng

2013-06-01

cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5' cyclization sequence (5'CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5'CS, and the presence of DCS-PK facilitates the formation of 5'-3' RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution.

Novel cis-Acting Element within the Capsid-Coding Region Enhances Flavivirus Viral-RNA Replication by Regulating Genome Cyclization

PubMed Central

Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De

2013-01-01

cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5′ cyclization sequence (5′CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5′CS, and the presence of DCS-PK facilitates the formation of 5′-3′ RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution. PMID:23576500
TIA-1 RRM23 binding and recognition of target oligonucleotides

PubMed Central

Waris, Saboora; García-Mauriño, Sofía M.; Sivakumaran, Andrew; Beckham, Simone A.; Loughlin, Fionna E.; Gorospe, Myriam; Díaz-Moreno, Irene; Wilce, Matthew C.J.

2017-01-01

Abstract TIA-1 (T-cell restricted intracellular antigen-1) is an RNA-binding protein involved in splicing and translational repression. It mainly interacts with RNA via its second and third RNA recognition motifs (RRMs), with specificity for U-rich sequences directed by RRM2. It has recently been shown that RRM3 also contributes to binding, with preferential binding for C-rich sequences. Here we designed UC-rich and CU-rich 10-nt sequences for engagement of both RRM2 and RRM3 and demonstrated that the TIA-1 RRM23 construct preferentially binds the UC-rich RNA ligand (5΄-UUUUUACUCC-3΄). Interestingly, this binding depends on the presence of Lys274 that is C-terminal to RRM3 and binding to equivalent DNA sequences occurs with similar affinity. Small-angle X-ray scattering was used to demonstrate that, upon complex formation with target RNA or DNA, TIA-1 RRM23 adopts a compact structure, showing that both RRMs engage with the target 10-nt sequences to form the complex. We also report the crystal structure of TIA-1 RRM2 in complex with DNA to 2.3 Å resolution providing the first atomic resolution structure of any TIA protein RRM in complex with oligonucleotide. Together our data support a specific mode of TIA-1 RRM23 interaction with target oligonucleotides consistent with the role of TIA-1 in binding RNA to regulate gene expression. PMID:28184449
TIA-1 RRM23 binding and recognition of target oligonucleotides.

PubMed

Waris, Saboora; García-Mauriño, Sofía M; Sivakumaran, Andrew; Beckham, Simone A; Loughlin, Fionna E; Gorospe, Myriam; Díaz-Moreno, Irene; Wilce, Matthew C J; Wilce, Jacqueline A

2017-05-05

TIA-1 (T-cell restricted intracellular antigen-1) is an RNA-binding protein involved in splicing and translational repression. It mainly interacts with RNA via its second and third RNA recognition motifs (RRMs), with specificity for U-rich sequences directed by RRM2. It has recently been shown that RRM3 also contributes to binding, with preferential binding for C-rich sequences. Here we designed UC-rich and CU-rich 10-nt sequences for engagement of both RRM2 and RRM3 and demonstrated that the TIA-1 RRM23 construct preferentially binds the UC-rich RNA ligand (5΄-UUUUUACUCC-3΄). Interestingly, this binding depends on the presence of Lys274 that is C-terminal to RRM3 and binding to equivalent DNA sequences occurs with similar affinity. Small-angle X-ray scattering was used to demonstrate that, upon complex formation with target RNA or DNA, TIA-1 RRM23 adopts a compact structure, showing that both RRMs engage with the target 10-nt sequences to form the complex. We also report the crystal structure of TIA-1 RRM2 in complex with DNA to 2.3 Å resolution providing the first atomic resolution structure of any TIA protein RRM in complex with oligonucleotide. Together our data support a specific mode of TIA-1 RRM23 interaction with target oligonucleotides consistent with the role of TIA-1 in binding RNA to regulate gene expression. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
DNA minor groove electrostatic potential: influence of sequence-specific transitions of the torsion angle gamma and deoxyribose conformations.

PubMed

Zhitnikova, M Y; Shestopalova, A V

2017-11-01

The structural adjustments of the sugar-phosphate DNA backbone (switching of the γ angle (O5'-C5'-C4'-C3') from canonical to alternative conformations and/or C2'-endo → C3'-endo transition of deoxyribose) lead to the sequence-specific changes in accessible surface area of both polar and non-polar atoms of the grooves and the polar/hydrophobic profile of the latter ones. The distribution of the minor groove electrostatic potential is likely to be changing as a result of such conformational rearrangements in sugar-phosphate DNA backbone. Our analysis of the crystal structures of the short free DNA fragments and calculation of their electrostatic potentials allowed us to determine: (1) the number of classical and alternative γ angle conformations in the free B-DNA; (2) changes in the minor groove electrostatic potential, depending on the conformation of the sugar-phosphate DNA backbone; (3) the effect of the DNA sequence on the minor groove electrostatic potential. We have demonstrated that the structural adjustments of the DNA double helix (the conformations of the sugar-phosphate backbone and the minor groove dimensions) induce changes in the distribution of the minor groove electrostatic potential and are sequence-specific. Therefore, these features of the minor groove sizes and distribution of minor groove electrostatic potential can be used as a signal for recognition of the target DNA sequence by protein in the implementation of the indirect readout mechanism.
Genomewide analysis of Drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation

PubMed Central

Westholm, Jakub O.; Miura, Pedro; Olson, Sara; Shenker, Sol; Joseph, Brian; Sanfilippo, Piero; Celniker, Susan E.; Graveley, Brenton R.; Lai, Eric C.

2014-01-01

Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues and cultured cells, to rigorously annotate >2500 fruitfly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs, and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase dramatically relative to linear isoforms during CNS aging, and constitute a novel aging biomarker. PMID:25544350
Chromatin accessibility and guide sequence secondary structure affect CRISPR-Cas9 gene editing efficiency.

PubMed

Jensen, Kristopher Torp; Fløe, Lasse; Petersen, Trine Skov; Huang, Jinrong; Xu, Fengping; Bolund, Lars; Luo, Yonglun; Lin, Lin

2017-07-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated protein 9 (CRISPR-Cas9) systems have emerged as the method of choice for genome editing, but large variations in on-target efficiencies continue to limit their applicability. Here, we investigate the effect of chromatin accessibility on Cas9-mediated gene editing efficiency for 20 gRNAs targeting 10 genomic loci in HEK293T cells using both SpCas9 and the eSpCas9(1.1) variant. Our study indicates that gene editing is more efficient in euchromatin than in heterochromatin, and we validate this finding in HeLa cells and in human fibroblasts. Furthermore, we investigate the gRNA sequence determinants of CRISPR-Cas9 activity using a surrogate reporter system and find that the efficiency of Cas9-mediated gene editing is dependent on guide sequence secondary structure formation. This knowledge can aid in the further improvement of tools for gRNA design. © 2017 Federation of European Biochemical Societies.
Quantitative analysis of RNA-protein interactions on a massively parallel array for mapping biophysical and evolutionary landscapes

PubMed Central

Buenrostro, Jason D.; Chircus, Lauren M.; Araya, Carlos L.; Layton, Curtis J.; Chang, Howard Y.; Snyder, Michael P.; Greenleaf, William J.

2015-01-01

RNA-protein interactions drive fundamental biological processes and are targets for molecular engineering, yet quantitative and comprehensive understanding of the sequence determinants of affinity remains limited. Here we repurpose a high-throughput sequencing instrument to quantitatively measure binding and dissociation of MS2 coat protein to >107 RNA targets generated on a flow-cell surface by in situ transcription and inter-molecular tethering of RNA to DNA. We decompose the binding energy contributions from primary and secondary RNA structure, finding that differences in affinity are often driven by sequence-specific changes in association rates. By analyzing the biophysical constraints and modeling mutational paths describing the molecular evolution of MS2 from low- to high-affinity hairpins, we quantify widespread molecular epistasis, and a long-hypothesized structure-dependent preference for G:U base pairs over C:A intermediates in evolutionary trajectories. Our results suggest that quantitative analysis of RNA on a massively parallel array (RNAMaP) relationships across molecular variants. PMID:24727714
Using the Self-Select Paradigm to Delineate the Nature of Speech Motor Programming

PubMed Central

Wright, David L.; Robin, Don A.; Rhee, Jooyhun; Vaculin, Amber; Jacks, Adam; Guenther, Frank H.; Fox, Peter T.

2015-01-01

Purpose The authors examined the involvement of 2 speech motor programming processes identified by S. T. Klapp (1995, 2003) during the articulation of utterances differing in syllable and sequence complexity. According to S. T. Klapp, 1 process, INT, resolves the demands of the programmed unit, whereas a second process, SEQ, oversees the serial order demands of longer sequences. Method A modified reaction time paradigm was used to assess INT and SEQ demands. Specifically, syllable complexity was dependent on syllable structure, whereas sequence complexity involved either repeated or unique syllabi within an utterance. Results INT execution was slowed when articulating single syllables in the form CCCV compared to simpler CV syllables. Planning unique syllables within a multisyllabic utterance rather than repetitions of the same syllable slowed INT but not SEQ. Conclusions The INT speech motor programming process, important for mental syllabary access, is sensitive to changes in both syllable structure and the number of unique syllables in an utterance. PMID:19474396
Genome-wide Analysis of Drosophila Circular RNAs Reveals Their Structural and Sequence Properties and Age-Dependent Neural Accumulation

DOE PAGES

Westholm, Jakub O.; Miura, Pedro; Olson, Sara; ...

2014-11-26

Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and codingmore » conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.« less
Genome-wide Analysis of Drosophila Circular RNAs Reveals Their Structural and Sequence Properties and Age-Dependent Neural Accumulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Westholm, Jakub O.; Miura, Pedro; Olson, Sara

Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and codingmore » conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.« less
Temporal fractals in seabird foraging behaviour: diving through the scales of time

PubMed Central

MacIntosh, Andrew J. J.; Pelletier, Laure; Chiaradia, Andre; Kato, Akiko; Ropert-Coudert, Yan

2013-01-01

Animal behaviour exhibits fractal structure in space and time. Fractal properties in animal space-use have been explored extensively under the Lévy flight foraging hypothesis, but studies of behaviour change itself through time are rarer, have typically used shorter sequences generated in the laboratory, and generally lack critical assessment of their results. We thus performed an in-depth analysis of fractal time in binary dive sequences collected via bio-logging from free-ranging little penguins (Eudyptula minor) across full-day foraging trips (216 data points; 4 orders of temporal magnitude). Results from 4 fractal methods show that dive sequences are long-range dependent and persistent across ca. 2 orders of magnitude. This fractal structure correlated with trip length and time spent underwater, but individual traits had little effect. Fractal time is a fundamental characteristic of penguin foraging behaviour, and its investigation is thus a promising avenue for research on interactions between animals and their environments. PMID:23703258
Spectroscopic insights into quadruplexes of five-repeat telomere DNA sequences upon G-block damage.

PubMed

Dvořáková, Zuzana; Vorlíčková, Michaela; Renčiuk, Daniel

2017-11-01

The DNA lesions, resulting from oxidative damage, were shown to destabilize human telomere four-repeat quadruplex and to alter its structure. Long telomere DNA, as a repetitive sequence, offers, however, other mechanisms of dealing with the lesion: extrusion of the damaged repeat into loop or shifting the quadruplex position by one repeat. Using circular dichroism and UV absorption spectroscopy and polyacrylamide electrophoresis, we studied consequences of lesions at different positions of the model five-repeat human telomere DNA sequences on the structure and stability of their quadruplexes in sodium and in potassium. The repeats affected by lesion are preferentially positioned as terminal overhangs of the core quadruplex structurally similar to the four-repeat one. Forced affecting of the inner repeats leads to presence of variety of more parallel folds in potassium. In sodium the designed models form mixture of two dominant antiparallel quadruplexes whose population varies with the position of the affected repeat. The shapes of quadruplex CD spectra, namely the height of dominant peaks, significantly correlate with melting temperatures. Lesion in one guanine tract of a more than four repeats long human telomere DNA sequence may cause re-positioning of its quadruplex arrangement associated with a shift of the structure to less common quadruplex conformations. The type of the quadruplex depends on the loop position and external conditions. The telomere DNA quadruplexes are quite resistant to the effect of point mutations due to the telomere DNA repetitive nature, although their structure and, consequently, function might be altered. Copyright © 2017. Published by Elsevier B.V.
Anomalous diffusion in neutral evolution of model proteins.

PubMed

Nelson, Erik D; Grishin, Nick V

2015-06-01

Protein evolution is frequently explored using minimalist polymer models, however, little attention has been given to the problem of structural drift, or diffusion. Here, we study neutral evolution of small protein motifs using an off-lattice heteropolymer model in which individual monomers interact as low-resolution amino acids. In contrast to most earlier models, both the length and folded structure of the polymers are permitted to change. To describe structural change, we compute the mean-square distance (MSD) between monomers in homologous folds separated by n neutral mutations. We find that structural change is episodic, and, averaged over lineages (for example, those extending from a single sequence), exhibits a power-law dependence on n. We show that this exponent depends on the alignment method used, and we analyze the distribution of waiting times between neutral mutations. The latter are more disperse than for models required to maintain a specific fold, but exhibit a similar power-law tail.
Anomalous diffusion in neutral evolution of model proteins

NASA Astrophysics Data System (ADS)

Nelson, Erik D.; Grishin, Nick V.

2015-06-01

Protein evolution is frequently explored using minimalist polymer models, however, little attention has been given to the problem of structural drift, or diffusion. Here, we study neutral evolution of small protein motifs using an off-lattice heteropolymer model in which individual monomers interact as low-resolution amino acids. In contrast to most earlier models, both the length and folded structure of the polymers are permitted to change. To describe structural change, we compute the mean-square distance (MSD) between monomers in homologous folds separated by n neutral mutations. We find that structural change is episodic, and, averaged over lineages (for example, those extending from a single sequence), exhibits a power-law dependence on n . We show that this exponent depends on the alignment method used, and we analyze the distribution of waiting times between neutral mutations. The latter are more disperse than for models required to maintain a specific fold, but exhibit a similar power-law tail.
Biologically important conformational features of DNA as interpreted by quantum mechanics and molecular mechanics computations of its simple fragments.

PubMed

Poltev, V; Anisimov, V M; Dominguez, V; Gonzalez, E; Deriabina, A; Garcia, D; Rivas, F; Polteva, N A

2018-02-01

Deciphering the mechanism of functioning of DNA as the carrier of genetic information requires identifying inherent factors determining its structure and function. Following this path, our previous DFT studies attributed the origin of unique conformational characteristics of right-handed Watson-Crick duplexes (WCDs) to the conformational profile of deoxydinucleoside monophosphates (dDMPs) serving as the minimal repeating units of DNA strand. According to those findings, the directionality of the sugar-phosphate chain and the characteristic ranges of dihedral angles of energy minima combined with the geometric differences between purines and pyrimidines determine the dependence on base sequence of the three-dimensional (3D) structure of WCDs. This work extends our computational study to complementary deoxydinucleotide-monophosphates (cdDMPs) of non-standard conformation, including those of Z-family, Hoogsteen duplexes, parallel-stranded structures, and duplexes with mispaired bases. For most of these systems, except Z-conformation, computations closely reproduce experimental data within the tolerance of characteristic limits of dihedral parameters for each conformation family. Computation of cdDMPs with Z-conformation reveals that their experimental structures do not correspond to the internal energy minimum. This finding establishes the leading role of external factors in formation of the Z-conformation. Energy minima of cdDMPs of non-Watson-Crick duplexes demonstrate different sequence-dependence features than those known for WCDs. The obtained results provide evidence that the biologically important regularities of 3D structure distinguish WCDs from duplexes having non-Watson-Crick nucleotide pairing.
On a new class of completely integrable nonlinear wave equations. II. Multi-Hamiltonian structure

NASA Astrophysics Data System (ADS)

Nutku, Y.

1987-11-01

The multi-Hamiltonian structure of a class of nonlinear wave equations governing the propagation of finite amplitude waves is discussed. Infinitely many conservation laws had earlier been obtained for these equations. Starting from a (primary) Hamiltonian formulation of these equations the necessary and sufficient conditions for the existence of bi-Hamiltonian structure are obtained and it is shown that the second Hamiltonian operator can be constructed solely through a knowledge of the first Hamiltonian function. The recursion operator which first appears at the level of bi-Hamiltonian structure gives rise to an infinite sequence of conserved Hamiltonians. It is found that in general there exist two different infinite sequences of conserved quantities for these equations. The recursion relation defining higher Hamiltonian structures enables one to obtain the necessary and sufficient conditions for the existence of the (k+1)st Hamiltonian operator which depends on the kth Hamiltonian function. The infinite sequence of conserved Hamiltonians are common to all the higher Hamiltonian structures. The equations of gas dynamics are discussed as an illustration of this formalism and it is shown that in general they admit tri-Hamiltonian structure with two distinct infinite sets of conserved quantities. The isothermal case of γ=1 is an exceptional one that requires separate treatment. This corresponds to a specialization of the equations governing the expansion of plasma into vacuum which will be shown to be equivalent to Poisson's equation in nonlinear acoustics.
Diversified Structural Basis of a Conserved Molecular Mechanism for pH-Dependent Dimerization in Spider Silk N-Terminal Domains.

PubMed

Otikovs, Martins; Chen, Gefei; Nordling, Kerstin; Landreh, Michael; Meng, Qing; Jörnvall, Hans; Kronqvist, Nina; Rising, Anna; Johansson, Jan; Jaudzems, Kristaps

2015-08-17

Conversion of spider silk proteins from soluble dope to insoluble fibers involves pH-dependent dimerization of the N-terminal domain (NT). This conversion is tightly regulated to prevent premature precipitation and enable rapid silk formation at the end of the duct. Three glutamic acid residues that mediate this process in the NT from Euprosthenops australis major ampullate spidroin 1 are well conserved among spidroins. However, NTs of minor ampullate spidroins from several species, including Araneus ventricosus ((Av)MiSp NT), lack one of the glutamic acids. Here we investigate the pH-dependent structural changes of (Av)MiSp NT, revealing that it uses the same mechanism but involves a non-conserved glutamic acid residue instead. Homology modeling of the structures of other MiSp NTs suggests that these harbor different compensatory residues. This indicates that, despite sequence variations, the molecular mechanism underlying pH-dependent dimerization of NT is conserved among different silk types. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The effects of DNA supercoiling on G-quadruplex formation.

PubMed

Sekibo, Doreen A T; Fox, Keith R

2017-12-01

Guanine-rich DNAs can fold into four-stranded structures that contain stacks of G-quartets. Bioinformatics studies have revealed that G-rich sequences with the potential to adopt these structures are unevenly distributed throughout genomes, and are especially found in gene promoter regions. With the exception of the single-stranded telomeric DNA, all genomic G-rich sequences will always be present along with their C-rich complements, and quadruplex formation will be in competition with the corresponding Watson-Crick duplex. Quadruplex formation must therefore first require local dissociation (melting) of the duplex strands. Since negative supercoiling is known to facilitate the formation of alternative DNA structures, we have investigated G-quadruplex formation within negatively supercoiled DNA plasmids. Plasmids containing multiple copies of (G3T)n and (G3T4)n repeats, were probed with dimethylsulphate, potassium permanganate and S1 nuclease. While dimethylsulphate footprinting revealed some evidence for G-quadruplex formation in (G3T)n sequences, this was not affected by supercoiling, and permanganate failed to detect exposed thymines in the loop regions. (G3T4)n sequences were not protected from DMS and showed no reaction with permanganate. Similarly, both S1 nuclease and 2D gel electrophoresis of DNA topoisomers did not detect any supercoil-dependent structural transitions. These results suggest that negative supercoiling alone is not sufficient to drive G-quadruplex formation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
The mechanism and control of DNA transfer by the conjugative relaxase of resistance plasmid pCU1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nash, Rebekah Potts; Habibi, Sohrab; Cheng, Yuan

2010-11-15

Bacteria expand their genetic diversity, spread antibiotic resistance genes, and obtain virulence factors through the highly coordinated process of conjugative plasmid transfer (CPT). A plasmid-encoded relaxase enzyme initiates and terminates CPT by nicking and religating the transferred plasmid in a sequence-specific manner. We solved the 2.3 {angstrom} crystal structure of the relaxase responsible for the spread of the resistance plasmid pCU1 and determined its DNA binding and nicking capabilities. The overall fold of the pCU1 relaxase is similar to that of the F plasmid and plasmid R388 relaxases. However, in the pCU1 structure, the conserved tyrosine residues (Y18,19,26,27) that aremore » required for DNA nicking and religation were displaced up to 14 {angstrom} out of the relaxase active site, revealing a high degree of mobility in this region of the enzyme. In spite of this flexibility, the tyrosines still cleaved the nic site of the plasmid's origin of transfer, and did so in a sequence-specific, metal-dependent manner. Unexpectedly, the pCU1 relaxase lacked the sequence-specific DNA binding previously reported for the homologous F and R388 relaxase enzymes, despite its high sequence and structural similarity with both proteins. In summary, our work outlines novel structural and functional aspects of the relaxase-mediated conjugative transfer of plasmid pCU1.« less
Enrichment and identification of cellulolytic bacteria from the gastrointestinal tract of Giant African snail, Achatina fulica.

PubMed

Pawar, Kiran D; Dar, Mudasir A; Rajput, Bharati P; Kulkarni, Girish J

2015-02-01

The cellulolytic bacterial community structure in gastrointestinal (GI) tract of Achatina fulica was studied using culture-independent and -dependent methods by enrichment in carboxymethyl cellulose (CMC). Culture-dependent method indicated that GI tract of snail was dominated by Enterobacteriaceae members. When tested for cellulase activities, all isolates obtained by culture-dependent method showed both or either of CMCase or avicelase activity. Isolate identified as Citrobacter freundii showed highest CMCase and medium avicelase activity. Sequencing of clones from the 16S rRNA gene clone library identified ten operational taxonomic units (OTUs), which were affiliated to Enterobacteriaceae of phylum Gammaproteobacteria. Of these ten OTUs, eight OTUs closely matched with Enterobacter and Klebsiella genera. The most abundant OTU allied to Klebsiella oxytoca accounted for 70 % of the total sequences. The members of Klebsiella and Enterobacter were observed by both methods indicating their dominance among the cellulolytic bacterial community in the GI tract of the snail.

Accelerating calculations of RNA secondary structure partition functions using GPUs

PubMed Central

2013-01-01

Background RNA performs many diverse functions in the cell in addition to its role as a messenger of genetic information. These functions depend on its ability to fold to a unique three-dimensional structure determined by the sequence. The conformation of RNA is in part determined by its secondary structure, or the particular set of contacts between pairs of complementary bases. Prediction of the secondary structure of RNA from its sequence is therefore of great interest, but can be computationally expensive. In this work we accelerate computations of base-pair probababilities using parallel graphics processing units (GPUs). Results Calculation of the probabilities of base pairs in RNA secondary structures using nearest-neighbor standard free energy change parameters has been implemented using CUDA to run on hardware with multiprocessor GPUs. A modified set of recursions was introduced, which reduces memory usage by about 25%. GPUs are fastest in single precision, and for some hardware, restricted to single precision. This may introduce significant roundoff error. However, deviations in base-pair probabilities calculated using single precision were found to be negligible compared to those resulting from shifting the nearest-neighbor parameters by a random amount of magnitude similar to their experimental uncertainties. For large sequences running on our particular hardware, the GPU implementation reduces execution time by a factor of close to 60 compared with an optimized serial implementation, and by a factor of 116 compared with the original code. Conclusions Using GPUs can greatly accelerate computation of RNA secondary structure partition functions, allowing calculation of base-pair probabilities for large sequences in a reasonable amount of time, with a negligible compromise in accuracy due to working in single precision. The source code is integrated into the RNAstructure software package and available for download at http://rna.urmc.rochester.edu. PMID:24180434
Enzymatic Synthesis of Self-assembled Dicer Substrate RNA Nanostructures for Programmable Gene Silencing.

PubMed

Jang, Bora; Kim, Boyoung; Kim, Hyunsook; Kwon, Hyokyoung; Kim, Minjeong; Seo, Yunmi; Colas, Marion; Jeong, Hansaem; Jeong, Eun Hye; Lee, Kyuri; Lee, Hyukjin

2018-06-08

Enzymatic synthesis of RNA nanostructures is achieved by isothermal rolling circle transcription (RCT). Each arm of RNA nanostructures provides a functional role of Dicer substrate RNA inducing sequence specific RNA interference (RNAi). Three different RNAi sequences (GFP, RFP, and BFP) are incorporated within the three-arm junction RNA nanostructures (Y-RNA). The template and helper DNA strands are designed for the large-scale in vitro synthesis of RNA strands to prepare self-assembled Y-RNA. Interestingly, Dicer processing of Y-RNA is highly influenced by its physical structure and different gene silencing activity is achieved depending on its arm length and overhang. In addition, enzymatic synthesis allows the preparation of various Y-RNA structures using a single DNA template offering on demand regulation of multiple target genes.
Barcoded NS31/AML2 primers for sequencing of arbuscular mycorrhizal communities in environmental samples1

PubMed Central

Morgan, Benjamin S. T.; Egerton-Warburton, Louise M.

2017-01-01

Premise of the study: Arbuscular mycorrhizal fungi (AMF) are globally important root symbioses that enhance plant growth and nutrition and influence ecosystem structure and function. To better characterize levels of AMF diversity relevant to ecosystem function, deeper sequencing depth in environmental samples is needed. In this study, Illumina barcoded primers and a bioinformatics pipeline were developed and applied to study AMF diversity and community structure in environmental samples. Methods: Libraries of small subunit ribosomal RNA fragment amplicons were amplified from environmental DNA using a single-step PCR reaction with barcoded NS31/AML2 primers. Amplicons were sequenced on an Illumina MiSeq sequencer using version 2, 2 × 250-bp paired-end chemistry, and analyzed using QIIME and RDP Classifier. Results: Sequencing captured 196 to 6416 operational taxonomic units (OTUs; depending on clustering parameters) representing nine AMF genera. Regardless of clustering parameters, ∼20 OTUs dominated AMF communities (78–87% reads) with the remaining reads distributed among other OTUs. Analyses also showed significant biogeographic differences in AMF communities and that community composition could be linked to specific edaphic factors. Discussion: Barcoded NS31/AML2 primers and Illumina MiSeq sequencing provide a powerful approach to address AMF diversity and variations in fungal assemblages across host plants, ecosystems, and responses to environmental drivers including global change. PMID:28924511
Unveiling Stability Criteria of DNA-Carbon Nanotubes Constructs by Scanning Tunneling Microscopy and Computational Modeling

DOE PAGES

Kilina, Svetlana; Yarotski, Dzmitry A.; Talin, A. Alec; ...

2011-01-01

We present a combined approach that relies on computational simulations and scanning tunneling microscopy (STM) measurements to reveal morphological properties and stability criteria of carbon nanotube-DNA (CNT-DNA) constructs. Application of STM allows direct observation of very stable CNT-DNA hybrid structures with the well-defined DNA wrapping angle of 63.4 ° and a coiling period of 3.3 nm. Using force field simulations, we determine how the DNA-CNT binding energy depends on the sequence and binding geometry of a single strand DNA. This dependence allows us to quantitatively characterize the stability of a hybrid structure with an optimal π-stacking between DNA nucleotides and themore » tube surface and better interpret STM data. Our simulations clearly demonstrate the existence of a very stable DNA binding geometry for (6,5) CNT as evidenced by the presence of a well-defined minimum in the binding energy as a function of an angle between DNA strand and the nanotube chiral vector. This novel approach demonstrates the feasibility of CNT-DNA geometry studies with subnanometer resolution and paves the way towards complete characterization of the structural and electronic properties of drug-delivering systems based on DNA-CNT hybrids as a function of DNA sequence and a nanotube chirality.« less
Characterization of Aftershock Sequences from Large Strike-Slip Earthquakes Along Geometrically Complex Faults

NASA Astrophysics Data System (ADS)

Sexton, E.; Thomas, A.; Delbridge, B. G.

2017-12-01

Large earthquakes often exhibit complex slip distributions and occur along non-planar fault geometries, resulting in variable stress changes throughout the region of the fault hosting aftershocks. To better discern the role of geometric discontinuities on aftershock sequences, we compare areas of enhanced and reduced Coulomb failure stress and mean stress for systematic differences in the time dependence and productivity of these aftershock sequences. In strike-slip faults, releasing structures, including stepovers and bends, experience an increase in both Coulomb failure stress and mean stress during an earthquake, promoting fluid diffusion into the region and further failure. Conversely, Coulomb failure stress and mean stress decrease in restraining bends and stepovers in strike-slip faults, and fluids diffuse away from these areas, discouraging failure. We examine spatial differences in seismicity patterns along structurally complex strike-slip faults which have hosted large earthquakes, such as the 1992 Mw 7.3 Landers, the 2010 Mw 7.2 El-Mayor Cucapah, the 2014 Mw 6.0 South Napa, and the 2016 Mw 7.0 Kumamoto events. We characterize the behavior of these aftershock sequences with the Epidemic Type Aftershock-Sequence Model (ETAS). In this statistical model, the total occurrence rate of aftershocks induced by an earthquake is λ(t) = λ_0 + \\sum_{i:t_i
SGP-1: Prediction and Validation of Homologous Genes Based on Sequence Alignments

PubMed Central

Wiehe, Thomas; Gebauer-Jung, Steffi; Mitchell-Olds, Thomas; Guigó, Roderic

2001-01-01

Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors. PMID:11544202
Physics and evolution of thermophilic adaptation.

PubMed

Berezovsky, Igor N; Shakhnovich, Eugene I

2005-09-06

Analysis of structures and sequences of several hyperthermostable proteins from various sources reveals two major physical mechanisms of their thermostabilization. The first mechanism is "structure-based," whereby some hyperthermostable proteins are significantly more compact than their mesophilic homologues, while no particular interaction type appears to cause stabilization; rather, a sheer number of interactions is responsible for thermostability. Other hyperthermostable proteins employ an alternative, "sequence-based" mechanism of their thermal stabilization. They do not show pronounced structural differences from mesophilic homologues. Rather, a small number of apparently strong interactions is responsible for high thermal stability of these proteins. High-throughput comparative analysis of structures and complete genomes of several hyperthermophilic archaea and bacteria revealed that organisms develop diverse strategies of thermophilic adaptation by using, to a varying degree, two fundamental physical mechanisms of thermostability. The choice of a particular strategy depends on the evolutionary history of an organism. Proteins from organisms that originated in an extreme environment, such as hyperthermophilic archaea (Pyrococcus furiosus), are significantly more compact and more hydrophobic than their mesophilic counterparts. Alternatively, organisms that evolved as mesophiles but later recolonized a hot environment (Thermotoga maritima) relied in their evolutionary strategy of thermophilic adaptation on "sequence-based" mechanism of thermostability. We propose an evolutionary explanation of these differences based on physical concepts of protein designability.
Pms2 Suppresses Large Expansions of the (GAA·TTC)n Sequence in Neuronal Tissues

PubMed Central

Bourn, Rebecka L.; De Biase, Irene; Pinto, Ricardo Mouro; Sandi, Chiranjeevi; Al-Mahdawi, Sahar; Pook, Mark A.; Bidichandani, Sanjay I.

2012-01-01

Expanded trinucleotide repeat sequences are the cause of several inherited neurodegenerative diseases. Disease pathogenesis is correlated with several features of somatic instability of these sequences, including further large expansions in postmitotic tissues. The presence of somatic expansions in postmitotic tissues is consistent with DNA repair being a major determinant of somatic instability. Indeed, proteins in the mismatch repair (MMR) pathway are required for instability of the expanded (CAG·CTG)n sequence, likely via recognition of intrastrand hairpins by MutSβ. It is not clear if or how MMR would affect instability of disease-causing expanded trinucleotide repeat sequences that adopt secondary structures other than hairpins, such as the triplex/R-loop forming (GAA·TTC)n sequence that causes Friedreich ataxia. We analyzed somatic instability in transgenic mice that carry an expanded (GAA·TTC)n sequence in the context of the human FXN locus and lack the individual MMR proteins Msh2, Msh6 or Pms2. The absence of Msh2 or Msh6 resulted in a dramatic reduction in somatic mutations, indicating that mammalian MMR promotes instability of the (GAA·TTC)n sequence via MutSα. The absence of Pms2 resulted in increased accumulation of large expansions in the nervous system (cerebellum, cerebrum, and dorsal root ganglia) but not in non-neuronal tissues (heart and kidney), without affecting the prevalence of contractions. Pms2 suppressed large expansions specifically in tissues showing MutSα-dependent somatic instability, suggesting that they may act on the same lesion or structure associated with the expanded (GAA·TTC)n sequence. We conclude that Pms2 specifically suppresses large expansions of a pathogenic trinucleotide repeat sequence in neuronal tissues, possibly acting independently of the canonical MMR pathway. PMID:23071719
Pms2 suppresses large expansions of the (GAA·TTC)n sequence in neuronal tissues.

PubMed

Bourn, Rebecka L; De Biase, Irene; Pinto, Ricardo Mouro; Sandi, Chiranjeevi; Al-Mahdawi, Sahar; Pook, Mark A; Bidichandani, Sanjay I

2012-01-01

Expanded trinucleotide repeat sequences are the cause of several inherited neurodegenerative diseases. Disease pathogenesis is correlated with several features of somatic instability of these sequences, including further large expansions in postmitotic tissues. The presence of somatic expansions in postmitotic tissues is consistent with DNA repair being a major determinant of somatic instability. Indeed, proteins in the mismatch repair (MMR) pathway are required for instability of the expanded (CAG·CTG)(n) sequence, likely via recognition of intrastrand hairpins by MutSβ. It is not clear if or how MMR would affect instability of disease-causing expanded trinucleotide repeat sequences that adopt secondary structures other than hairpins, such as the triplex/R-loop forming (GAA·TTC)(n) sequence that causes Friedreich ataxia. We analyzed somatic instability in transgenic mice that carry an expanded (GAA·TTC)(n) sequence in the context of the human FXN locus and lack the individual MMR proteins Msh2, Msh6 or Pms2. The absence of Msh2 or Msh6 resulted in a dramatic reduction in somatic mutations, indicating that mammalian MMR promotes instability of the (GAA·TTC)(n) sequence via MutSα. The absence of Pms2 resulted in increased accumulation of large expansions in the nervous system (cerebellum, cerebrum, and dorsal root ganglia) but not in non-neuronal tissues (heart and kidney), without affecting the prevalence of contractions. Pms2 suppressed large expansions specifically in tissues showing MutSα-dependent somatic instability, suggesting that they may act on the same lesion or structure associated with the expanded (GAA·TTC)(n) sequence. We conclude that Pms2 specifically suppresses large expansions of a pathogenic trinucleotide repeat sequence in neuronal tissues, possibly acting independently of the canonical MMR pathway.
Thermodynamics of RNA structures by Wang–Landau sampling

PubMed Central

Lou, Feng; Clote, Peter

2010-01-01

Motivation: Thermodynamics-based dynamic programming RNA secondary structure algorithms have been of immense importance in molecular biology, where applications range from the detection of novel selenoproteins using expressed sequence tag (EST) data, to the determination of microRNA genes and their targets. Dynamic programming algorithms have been developed to compute the minimum free energy secondary structure and partition function of a given RNA sequence, the minimum free-energy and partition function for the hybridization of two RNA molecules, etc. However, the applicability of dynamic programming methods depends on disallowing certain types of interactions (pseudoknots, zig-zags, etc.), as their inclusion renders structure prediction an nondeterministic polynomial time (NP)-complete problem. Nevertheless, such interactions have been observed in X-ray structures. Results: A non-Boltzmannian Monte Carlo algorithm was designed by Wang and Landau to estimate the density of states for complex systems, such as the Ising model, that exhibit a phase transition. In this article, we apply the Wang-Landau (WL) method to compute the density of states for secondary structures of a given RNA sequence, and for hybridizations of two RNA sequences. Our method is shown to be much faster than existent software, such as RNAsubopt. From density of states, we compute the partition function over all secondary structures and over all pseudoknot-free hybridizations. The advantage of the WL method is that by adding a function to evaluate the free energy of arbitary pseudoknotted structures and of arbitrary hybridizations, we can estimate thermodynamic parameters for situations known to be NP-complete. This extension to pseudoknots will be made in the sequel to this article; in contrast, the current article describes the WL algorithm applied to pseudoknot-free secondary structures and hybridizations. Availability: The WL RNA hybridization web server is under construction at http://bioinformatics.bc.edu/clotelab/. Contact: clote@bc.edu PMID:20529917
The contribution of alu elements to mutagenic DNA double-strand break repair.

PubMed

Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L

2015-03-01

Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both the rate and nature of DNA repair events.
2016 update on APBioNet's annual international conference on bioinformatics (InCoB).

PubMed

Schönbach, Christian; Verma, Chandra; Wee, Lawrence Jin Kiat; Bond, Peter John; Ranganathan, Shoba

2016-12-22

InCoB became since its inception in 2002 one of the largest annual bioinformatics conferences in the Asia-Pacific region with attendance ranging between 150 and 250 delegates depending on the venue location. InCoB 2016 in Singapore was attended by almost 220 delegates. This year, sessions on structural bioinformatics, sequence and sequencing, and next-generation sequencing fielded the highest number of oral presentation. Forty-four out 96 oral presentations were associated with an accepted manuscript in supplemental issues of BMC Bioinformatics, BMC Genomics, BMC Medical Genomics or BMC Systems Biology. Articles with a genomics focus are reviewed in this editorial. Next year's InCoB will be held in Shenzen, China from September 20 to 22, 2017.
Molecular cloning, sequence and structural analysis of dehairing Mn(2+) dependent alkaline serine protease (MASPT) of Bacillus pumilus TMS55.

PubMed

Ibrahim, Kalibulla Syed; Muniyandi, Jeyaraj; Pandian, Shunmugiah Karutha

2011-10-01

Leather industries release a large amount of pollution-causing chemicals which creates one of the major industrial pollutions. The development of enzyme based processes as a potent alternative to pollution-causing chemicals is useful to overcome this issue. Proteases are enzymes which have extensive applications in leather processing and in several bioremediation processes due to their high alkaline protease activity and dehairing efficacy. In the present study, we report cloning, characterization of a Mn2+ dependent alkaline serine protease gene (MASPT) of Bacillus pumilus TMS55. The gene encoding the protease from B. pumilus TMS55 was cloned and its nucleotide sequence was determined. This gene has an open reading frame (ORF) of 1,149 bp that encodes a polypeptide of 383 amino acid residues. Our analysis showed that this polypeptide is composed of 29 residues N-terminal signal peptide, a propeptide of 79 residues and a mature protein of 275 amino acids. We performed bioinformatics analysis to compare MASPT enzyme with other proteases. Homology modeling was employed to model three dimensional structure for MASPT. Structural analysis showed that MASPT structure is composed of nine α-helices and nine β-strands. It has 3 catalytic residues and 14 metal binding residues. Docking analysis showed that residues S223, A260, N263, T328 and S329 interact with Mn2+. This study allows initial inferences about the structure of the protease and will allow the rational design of its derivatives for structure-function studies and also for further improvement of the enzyme.
The Influence of Primary and Secondary DNA Structure in Deletion and Duplication between Direct Repeats in Escherichia Coli

PubMed Central

Trinh, T. Q.; Sinden, R. R.

1993-01-01

We describe a system to measure the frequency of both deletions and duplications between direct repeats. Short 17- and 18-bp palindromic and nonpalindromic DNA sequences were cloned into the EcoRI site within the chloramphenicol acetyltransferase gene of plasmids pBR325 and pJT7. This creates an insert between direct repeated EcoRI sites and results in a chloramphenicol-sensitive phenotype. Selection for chloramphenicol resistance was utilized to select chloramphenicol resistant revertants that included those with precise deletion of the insert from plasmid pBR325 and duplication of the insert in plasmid pJT7. The frequency of deletion or duplication varied more than 500-fold depending on the sequence of the short sequence inserted into the EcoRI site. For the nonpalindromic inserts, multiple internal direct repeats and the length of the direct repeats appear to influence the frequency of deletion. Certain palindromic DNA sequences with the potential to form DNA hairpin structures that might stabilize the misalignment of direct repeats had a high frequency of deletion. Other DNA sequences with the potential to form structures that might destabilize misalignment of direct repeats had a very low frequency of deletion. Duplication mutations occurred at the highest frequency when the DNA between the direct repeats contained no direct or inverted repeats. The presence of inverted repeats dramatically reduced the frequency of duplications. The results support the slippage-misalignment model, suggesting that misalignment occurring during DNA replication leads to deletion and duplication mutations. The results also support the idea that the formation of DNA secondary structures during DNA replication can facilitate and direct specific mutagenic events. PMID:8325478
Resolution of model Holliday junctions by yeast endonuclease: effect of DNA structure and sequence.

PubMed Central

Parsons, C A; Murchie, A I; Lilley, D M; West, S C

1989-01-01

The resolution of Holliday junctions in DNA involves specific cleavage at or close to the site of the junction. A nuclease from Saccharomyces cerevisiae cleaves model Holliday junctions in vitro by the introduction of nicks in regions of duplex DNA adjacent to the crossover point. In previous studies [Parsons and West (1988) Cell, 52, 621-629] it was shown that cleavage occurred within homologous arm sequences with precise symmetry across the junction. In contrast, junctions with heterologous arm sequences were cleaved asymmetrically. In this work, we have studied the effect of sequence changes and base modification upon the site of cleavage. It is shown that the specificity of cleavage is unchanged providing that perfect homology is maintained between opposing arm sequences. However, in the absence of homology, cleavage depends upon sequence context and is affected by minor changes such as base modification. These data support the proposed mechanism for cleavage of a Holliday junction, which requires homologous alignment of arm sequences in an enzyme--DNA complex as a prerequisite for symmetrical cleavage by the yeast endonuclease. Images PMID:2653810
RNAmutants: a web server to explore the mutational landscape of RNA secondary structures

PubMed Central

Waldispühl, Jerome; Devadas, Srinivas; Berger, Bonnie; Clote, Peter

2009-01-01

The history and mechanism of molecular evolution in DNA have been greatly elucidated by contributions from genetics, probability theory and bioinformatics—indeed, mathematical developments such as Kimura's neutral theory, Kingman's coalescent theory and efficient software such as BLAST, ClustalW, Phylip, etc., provide the foundation for modern population genetics. In contrast to DNA, the function of most noncoding RNA depends on tertiary structure, experimentally known to be largely determined by secondary structure, for which dynamic programming can efficiently compute the minimum free energy secondary structure. For this reason, understanding the effect of pointwise mutations in RNA secondary structure could reveal fundamental properties of structural RNA molecules and improve our understanding of molecular evolution of RNA. The web server RNAmutants provides several efficient tools to compute the ensemble of low-energy secondary structures for all k-mutants of a given RNA sequence, where k is bounded by a user-specified upper bound. As we have previously shown, these tools can be used to predict putative deleterious mutations and to analyze regulatory sequences from the hepatitis C and human immunodeficiency genomes. Web server is available at http://bioinformatics.bc.edu/clotelab/RNAmutants/, and downloadable binaries at http://rnamutants.csail.mit.edu/. PMID:19531740
Molecular Recognition and Structural Influences on Function in Bio-nanosystems of Nucleic Acids and Proteins

NASA Astrophysics Data System (ADS)

Sethaphong, Latsavongsakda

This work examines smart material properties of rational self-assembly and molecular recognition found in nano-biosystems. Exploiting the sequence and structural information encoded within nucleic acids and proteins will permit programmed synthesis of nanomaterials and help create molecular machines that may carry out new roles involving chemical catalysis and bioenergy. Responsive to different ionic environments thru self-reorgnization, nucleic acids (NA) are nature's signature smart material; organisms such as viruses and bacteria use features of NAs to react to their environment and orchestrate their lifecycle. Furthermore, nucleic acid systems (both RNA and DNA) are currently exploited as scaffolds; recent applications have been showcased to build bioelectronics and biotemplated nanostructures via directed assembly of multidimensional nanoelectronic devices 1. Since the most stable and rudimentary structure of nucleic acids is the helical duplex, these were modeled in order to examine the influence of the microenvironment, sequence, and cation-dependent perturbations of their canonical forms. Due to their negatively charged phosphate backbone, NA's rely on counterions to overcome the inherent repulsive forces that arise from the assembly of two complementary strands. As a realistic model system, we chose the HIV-TAR helix (PDB ID: 397D) to study specific sequence motifs on cation sequestration. At physiologically relevant concentrations of sodium and potassium ions, we observed sequence based effects where purine stretches were adept in retaining high residency cations. The transitional space between adenine and guanosine nucleotides (ApG step) in a sequence proved the most favorable. This work was the first to directly show these subtle interactions of sequence based cationic sequestration and may be useful for controlling metallization of nucleic acids in conductive nanowires. Extending the study further, we explored the degree to which the structure of NA duplexes alone interacted with cations distinct from a specific sequence. Under physiologically relevant conditions, a duplex of RNA polyguanine-polycitidine was highly responsive and able to sequester cations to the middle of the purine stretches. The least responsive structure was a DNA polyadenine-polythymine duplex. A random sequence DNA duplex contorted into an RNA-like helix resulted in cationic dynamics similar to RNA systems. These studies showed that cation diffusive binding events in nucleic acid duplex structures are sequence specific and heavily influenced by structural aspects helical forms to account for much of the differences observed. Although structural information in nucleic acids is encoded within their sequence, linking amino acid sequence to protein structure is murkier; the structural information within proteins is encoded by the folding process itself: a complex phenomenon driven toward the equilibrium state of the active conformation. Upwards of two thirds of a protein's sequence can be substituted with similar amino acids without significantly perturbing its function; conserved residues of about 10% seem to be vital; since evolutionary selection pressure in proteins operates 3-dimenionally, a linear sequence is partially informative. We explored this problem by folding de-novo the cytosolic portion of the membrane protein, cellulose synthase, CESA1 from upland cotton, Gossypium hirsutum (Ghcesa1). The cytoplasmic region was generated by homology modeling and refined with molecular dynamics. These mutations impair local structural flexibility which likely results in cellulose that is produced at a lower rate and is less crystalline. Additional modeling of fragments of cellulose synthases from the model plant, Arabidopsis thaliana, offered novel insights into the function of conserved cytosolic domains within plant cellulose synthases. Transport mechanisms related to the transmembrane region revealed significant differences between plants and a bacterial complex. These studies generated possible mutations that may allow for the creation of new synthases and identified other avenues of research in order to develop technologies that may alter the crystallinity and other useful properties of cellulose. 1. Karplus, K., SAM-T08, HMM-based protein structure prediction. Nucleic Acids Research, 2009. 37: p. W492-W497.
Simulation of gene evolution under directional mutational pressure

NASA Astrophysics Data System (ADS)

Dudkiewicz, Małgorzata; Mackiewicz, Paweł; Kowalczuk, Maria; Mackiewicz, Dorota; Nowicka, Aleksandra; Polak, Natalia; Smolarczyk, Kamila; Banaszak, Joanna; R. Dudek, Mirosław; Cebrat, Stanisław

2004-05-01

The two main mechanisms generating the genetic diversity, mutation and recombination, have random character but they are biased which has an effect on the generation of asymmetry in the bacterial chromosome structure and in the protein coding sequences. Thus, like in a case of two chiral molecules-the two possible orientations of a gene in relation to the topology of a chromosome are not equivalent. Assuming that the sequence of a gene may oscillate only between certain limits of its structural composition means that the gene could be forced out of these limits by the directional mutation pressure, in the course of evolution. The probability of the event depends on the time the gene stays under the same mutation pressure. Inversion of the gene changes the directional mutational pressure to the reciprocal one and hence it changes the distance of the gene to its lower and upper bound of the structural tolerance. Using Monte Carlo methods we were able to simulate the evolution of genes under experimentally found mutational pressure, assuming simple mechanisms of selection. We found that the mutation and recombination should work in accordance to lower their negative effects on the function of the products of coding sequences.
Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization

PubMed Central

Girard, Laurie D.; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G.

2014-01-01

The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have a high complexity cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotidesequence-dependent segment and a unique “target sequence-independent” 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets. PMID:25489607
Coevolution analysis of Hepatitis C virus genome to identify the structural and functional dependency network of viral proteins

NASA Astrophysics Data System (ADS)

Champeimont, Raphaël; Laine, Elodie; Hu, Shuang-Wei; Penin, Francois; Carbone, Alessandra

2016-05-01

A novel computational approach of coevolution analysis allowed us to reconstruct the protein-protein interaction network of the Hepatitis C Virus (HCV) at the residue resolution. For the first time, coevolution analysis of an entire viral genome was realized, based on a limited set of protein sequences with high sequence identity within genotypes. The identified coevolving residues constitute highly relevant predictions of protein-protein interactions for further experimental identification of HCV protein complexes. The method can be used to analyse other viral genomes and to predict the associated protein interaction networks.

Favorable 2'-substitution in the loop region of a thrombin-binding DNA aptamer.

PubMed

Awachat, Ragini; Wagh, Atish A; Aher, Manisha; Fernandes, Moneesha; Kumar, Vaijayanti A

2018-06-01

Simple 2'-OMe-chemical modification in the loop region of the 15mer G-rich DNA sequence GGTTGGTGTGGTTGG is reported. The G-quadruplex structure of this thrombin-binding aptamer (TBA), is stabilized by single modifications (T → 2'-OMe-U), depending on the position of the modification. The structural stability also renders significantly increased inhibition of thrombin-induced fibrin polymerization, a process closely associated with blood-clotting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Salt bridges: geometrically specific, designable interactions.

PubMed

Donald, Jason E; Kulp, Daniel W; DeGrado, William F

2011-03-01

Salt bridges occur frequently in proteins, providing conformational specificity and contributing to molecular recognition and catalysis. We present a comprehensive analysis of these interactions in protein structures by surveying a large database of protein structures. Salt bridges between Asp or Glu and His, Arg, or Lys display extremely well-defined geometric preferences. Several previously observed preferences are confirmed, and others that were previously unrecognized are discovered. Salt bridges are explored for their preferences for different separations in sequence and in space, geometric preferences within proteins and at protein-protein interfaces, co-operativity in networked salt bridges, inclusion within metal-binding sites, preference for acidic electrons, apparent conformational side chain entropy reduction on formation, and degree of burial. Salt bridges occur far more frequently between residues at close than distant sequence separations, but, at close distances, there remain strong preferences for salt bridges at specific separations. Specific types of complex salt bridges, involving three or more members, are also discovered. As we observe a strong relationship between the propensity to form a salt bridge and the placement of salt-bridging residues in protein sequences, we discuss the role that salt bridges might play in kinetically influencing protein folding and thermodynamically stabilizing the native conformation. We also develop a quantitative method to select appropriate crystal structure resolution and B-factor cutoffs. Detailed knowledge of these geometric and sequence dependences should aid de novo design and prediction algorithms. Copyright © 2010 Wiley-Liss, Inc.
Structural brain aging and speech production: a surface-based brain morphometry study.

PubMed

Tremblay, Pascale; Deschamps, Isabelle

2016-07-01

While there has been a growing number of studies examining the neurofunctional correlates of speech production over the past decade, the neurostructural correlates of this immensely important human behaviour remain less well understood, despite the fact that previous studies have established links between brain structure and behaviour, including speech and language. In the present study, we thus examined, for the first time, the relationship between surface-based cortical thickness (CT) and three different behavioural indexes of sublexical speech production: response duration, reaction times and articulatory accuracy, in healthy young and older adults during the production of simple and complex meaningless sequences of syllables (e.g., /pa-pa-pa/ vs. /pa-ta-ka/). The results show that each behavioural speech measure was sensitive to the complexity of the sequences, as indicated by slower reaction times, longer response durations and decreased articulatory accuracy in both groups for the complex sequences. Older adults produced longer speech responses, particularly during the production of complex sequence. Unique age-independent and age-dependent relationships between brain structure and each of these behavioural measures were found in several cortical and subcortical regions known for their involvement in speech production, including the bilateral anterior insula, the left primary motor area, the rostral supramarginal gyrus, the right inferior frontal sulcus, the bilateral putamen and caudate, and in some region less typically associated with speech production, such as the posterior cingulate cortex.
Molecular dynamics study of some non-hydrogen-bonding base pair DNA strands

NASA Astrophysics Data System (ADS)

Tiwari, Rakesh K.; Ojha, Rajendra P.; Tiwari, Gargi; Pandey, Vishnudatt; Mall, Vijaysree

2018-05-01

In order to elucidate the structural activity of hydrophobic modified DNA, the DMMO2-D5SICS, base pair is introduced as a constituent in different set of 12-mer and 14-mer DNA sequences for the molecular dynamics (MD) simulation in explicit water solvent. AMBER 14 force field was employed for each set of duplex during the 200ns production-dynamics simulation in orthogonal-box-water solvent by the Particle-Mesh-Ewald (PME) method in infinite periodic boundary conditions (PBC) to determine conformational parameters of the complex. The force-field parameters of modified base-pair were calculated by Gaussian-code using Hartree-Fock /ab-initio methodology. RMSD Results reveal that the conformation of the duplex is sequence dependent and the binding energy of the complex depends on the position of the modified base-pair in the nucleic acid strand. We found that non-bonding energy had a significant contribution to stabilising such type of duplex in comparison to electrostatic energy. The distortion produced within strands by such type of base-pair was local and destabilised the duplex integrity near to substitution, moreover the binding energy of duplex depends on the position of substitution of hydrophobic base-pair and the DNA sequence and strongly supports the corresponding experimental study.
[Analysis of Conformational Features of Watson-Crick Duplex Fragments by Molecular Mechanics and Quantum Mechanics Methods].

PubMed

Poltev, V I; Anisimov, V M; Sanchez, C; Deriabina, A; Gonzalez, E; Garcia, D; Rivas, F; Polteva, N A

2016-01-01

It is generally accepted that the important characteristic features of the Watson-Crick duplex originate from the molecular structure of its subunits. However, it still remains to elucidate what properties of each subunit are responsible for the significant characteristic features of the DNA structure. The computations of desoxydinucleoside monophosphates complexes with Na-ions using density functional theory revealed a pivotal role of DNA conformational properties of single-chain minimal fragments in the development of unique features of the Watson-Crick duplex. We found that directionality of the sugar-phosphate backbone and the preferable ranges of its torsion angles, combined with the difference between purines and pyrimidines. in ring bases, define the dependence of three-dimensional structure of the Watson-Crick duplex on nucleotide base sequence. In this work, we extended these density functional theory computations to the minimal' fragments of DNA duplex, complementary desoxydinucleoside monophosphates complexes with Na-ions. Using several computational methods and various functionals, we performed a search for energy minima of BI-conformation for complementary desoxydinucleoside monophosphates complexes with different nucleoside sequences. Two sequences are optimized using ab initio method at the MP2/6-31++G** level of theory. The analysis of torsion angles, sugar ring puckering and mutual base positions of optimized structures demonstrates that the conformational characteristic features of complementary desoxydinucleoside monophosphates complexes with Na-ions remain within BI ranges and become closer to the corresponding characteristic features of the Watson-Crick duplex crystals. Qualitatively, the main characteristic features of each studied complementary desoxydinucleoside monophosphates complex remain invariant when different computational methods are used, although the quantitative values of some conformational parameters could vary lying within the limits typical for the corresponding family. We observe that popular functionals in density functional theory calculations lead to the overestimated distances between base pairs, while MP2 computations and the newer complex functionals produce the structures that have too close atom-atom contacts. A detailed study of some complementary desoxydinucleoside monophosphate complexes with Na-ions highlights the existence of several energy minima corresponding to BI-conformations, in other words, the complexity of the relief pattern of the potential energy surface of complementary desoxydinucleoside monophosphate complexes. This accounts for variability of conformational parameters of duplex fragments with the same base sequence. Popular molecular mechanics force fields AMBER and CHARMM reproduce most of the conformational characteristics of desoxydinucleoside monophosphates and their complementary complexes with Na-ions but fail to reproduce some details of the dependence of the Watson-Crick duplex conformation on the nucleotide sequence.
An artificial intelligence approach fit for tRNA gene studies in the era of big sequence data.

PubMed

Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi

2017-09-12

Unsupervised data mining capable of extracting a wide range of knowledge from big data without prior knowledge or particular models is a timely application in the era of big sequence data accumulation in genome research. By handling oligonucleotide compositions as high-dimensional data, we have previously modified the conventional self-organizing map (SOM) for genome informatics and established BLSOM, which can analyze more than ten million sequences simultaneously. Here, we develop BLSOM specialized for tRNA genes (tDNAs) that can cluster (self-organize) more than one million microbial tDNAs according to their cognate amino acid solely depending on tetra- and pentanucleotide compositions. This unsupervised clustering can reveal combinatorial oligonucleotide motifs that are responsible for the amino acid-dependent clustering, as well as other functionally and structurally important consensus motifs, which have been evolutionarily conserved. BLSOM is also useful for identifying tDNAs as phylogenetic markers for special phylotypes. When we constructed BLSOM with 'species-unknown' tDNAs from metagenomic sequences plus 'species-known' microbial tDNAs, a large portion of metagenomic tDNAs self-organized with species-known tDNAs, yielding information on microbial communities in environmental samples. BLSOM can also enhance accuracy in the tDNA database obtained from big sequence data. This unsupervised data mining should become important for studying numerous functionally unclear RNAs obtained from a wide range of organisms.
Phylogenetically Structured Differences in rRNA Gene Sequence Variation among Species of Arbuscular Mycorrhizal Fungi and Their Implications for Sequence Clustering

PubMed Central

Ekanayake, Saliya; Ruan, Yang; Schütte, Ursel M. E.; Kaonongbua, Wittaya; Fox, Geoffrey; Ye, Yuzhen; Bever, James D.

2016-01-01

ABSTRACT Arbuscular mycorrhizal (AM) fungi form mutualisms with plant roots that increase plant growth and shape plant communities. Each AM fungal cell contains a large amount of genetic diversity, but it is unclear if this diversity varies across evolutionary lineages. We found that sequence variation in the nuclear large-subunit (LSU) rRNA gene from 29 isolates representing 21 AM fungal species generally assorted into genus- and species-level clades, with the exception of species of the genera Claroideoglomus and Entrophospora. However, there were significant differences in the levels of sequence variation across the phylogeny and between genera, indicating that it is an evolutionarily constrained trait in AM fungi. These consistent patterns of sequence variation across both phylogenetic and taxonomic groups pose challenges to interpreting operational taxonomic units (OTUs) as approximations of species-level groups of AM fungi. We demonstrate that the OTUs produced by five sequence clustering methods using 97% or equivalent sequence similarity thresholds failed to match the expected species of AM fungi, although OTUs from AbundantOTU, CD-HIT-OTU, and CROP corresponded better to species than did OTUs from mothur or UPARSE. This lack of OTU-to-species correspondence resulted both from sequences of one species being split into multiple OTUs and from sequences of multiple species being lumped into the same OTU. The OTU richness therefore will not reliably correspond to the AM fungal species richness in environmental samples. Conservatively, this error can overestimate species richness by 4-fold or underestimate richness by one-half, and the direction of this error will depend on the genera represented in the sample. IMPORTANCE Arbuscular mycorrhizal (AM) fungi form important mutualisms with the roots of most plant species. Individual AM fungi are genetically diverse, but it is unclear whether the level of this diversity differs among evolutionary lineages. We found that the amount of sequence variation in an rRNA gene that is commonly used to identify AM fungal species varied significantly between evolutionary groups that correspond to different genera, with the exception of two genera that are genetically indistinguishable from each other. When we clustered groups of similar sequences into operational taxonomic units (OTUs) using five different clustering methods, these patterns of sequence variation caused the number of OTUs to either over- or underestimate the actual number of AM fungal species, depending on the genus. Our results indicate that OTU-based inferences about AM fungal species composition from environmental sequences can be improved if they take these taxonomically structured patterns of sequence variation into account. PMID:27260357
Heterogeneous Suppression of Sequential Effects in Random Sequence Generation, but Not in Operant Learning.

PubMed

Shteingart, Hanan; Loewenstein, Yonatan

2016-01-01

There is a long history of experiments in which participants are instructed to generate a long sequence of binary random numbers. The scope of this line of research has shifted over the years from identifying the basic psychological principles and/or the heuristics that lead to deviations from randomness, to one of predicting future choices. In this paper, we used generalized linear regression and the framework of Reinforcement Learning in order to address both points. In particular, we used logistic regression analysis in order to characterize the temporal sequence of participants' choices. Surprisingly, a population analysis indicated that the contribution of the most recent trial has only a weak effect on behavior, compared to more preceding trials, a result that seems irreconcilable with standard sequential effects that decay monotonously with the delay. However, when considering each participant separately, we found that the magnitudes of the sequential effect are a monotonous decreasing function of the delay, yet these individual sequential effects are largely averaged out in a population analysis because of heterogeneity. The substantial behavioral heterogeneity in this task is further demonstrated quantitatively by considering the predictive power of the model. We show that a heterogeneous model of sequential dependencies captures the structure available in random sequence generation. Finally, we show that the results of the logistic regression analysis can be interpreted in the framework of reinforcement learning, allowing us to compare the sequential effects in the random sequence generation task to those in an operant learning task. We show that in contrast to the random sequence generation task, sequential effects in operant learning are far more homogenous across the population. These results suggest that in the random sequence generation task, different participants adopt different cognitive strategies to suppress sequential dependencies when generating the "random" sequences.
Personalized Oncology Through Integrative High-Throughput Sequencing: A Pilot Study

PubMed Central

Roychowdhury, Sameek; Iyer, Matthew K.; Robinson, Dan R.; Lonigro, Robert J.; Wu, Yi-Mi; Cao, Xuhong; Kalyana-Sundaram, Shanker; Sam, Lee; Balbin, O. Alejandro; Quist, Michael J.; Barrette, Terrence; Everett, Jessica; Siddiqui, Javed; Kunju, Lakshmi P.; Navone, Nora; Araujo, John C.; Troncoso, Patricia; Logothetis, Christopher J.; Innis, Jeffrey W.; Smith, David C.; Lao, Christopher D.; Kim, Scott Y.; Roberts, J. Scott; Gruber, Stephen B.; Pienta, Kenneth J.; Talpaz, Moshe; Chinnaiyan, Arul M.

2012-01-01

Individual cancers harbor a set of genetic aberrations that can be informative for identifying rational therapies currently available or in clinical trials. We implemented a pilot study to explore the practical challenges of applying high-throughput sequencing in clinical oncology. We enrolled patients with advanced or refractory cancer who were eligible for clinical trials. For each patient, we performed whole-genome sequencing of the tumor, targeted whole-exome sequencing of tumor and normal DNA, and transcriptome sequencing (RNA-Seq) of the tumor to identify potentially informative mutations in a clinically relevant time frame of 3 to 4 weeks. With this approach, we detected several classes of cancer mutations including structural rearrangements, copy number alterations, point mutations, and gene expression alterations. A multidisciplinary Sequencing Tumor Board (STB) deliberated on the clinical interpretation of the sequencing results obtained. We tested our sequencing strategy on human prostate cancer xenografts. Next, we enrolled two patients into the clinical protocol and were able to review the results at our STB within 24 days of biopsy. The first patient had metastatic colorectal cancer in which we identified somatic point mutations in NRAS, TP53, AURKA, FAS, and MYH11, plus amplification and overexpression of cyclin-dependent kinase 8 (CDK8). The second patient had malignant melanoma, in which we identified a somatic point mutation in HRAS and a structural rearrangement affecting CDKN2C. The STB identified the CDK8 amplification and Ras mutation as providing a rationale for clinical trials with CDK inhibitors or MEK (mitogenactivated or extracellular signal–regulated protein kinase kinase) and PI3K (phosphatidylinositol 3-kinase) inhibitors, respectively. Integrative high-throughput sequencing of patients with advanced cancer generates a comprehensive, individual mutational landscape to facilitate biomarker-driven clinical trials in oncology. PMID:22133722
Restricted N-glycan conformational space in the PDB and its implication in glycan structure modeling.

PubMed

Jo, Sunhwan; Lee, Hui Sun; Skolnick, Jeffrey; Im, Wonpil

2013-01-01

Understanding glycan structure and dynamics is central to understanding protein-carbohydrate recognition and its role in protein-protein interactions. Given the difficulties in obtaining the glycan's crystal structure in glycoconjugates due to its flexibility and heterogeneity, computational modeling could play an important role in providing glycosylated protein structure models. To address if glycan structures available in the PDB can be used as templates or fragments for glycan modeling, we present a survey of the N-glycan structures of 35 different sequences in the PDB. Our statistical analysis shows that the N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background, suggesting that N-glycan chains can be confidently modeled with template glycan structures whose parent glycoproteins share sequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins do not show significant global structural similarity. Nonetheless, the internal substructures of these N-glycans, particularly, the substructures that are closer to the protein, show significantly similar structures, suggesting that such substructures can be used as fragments in glycan modeling. Increased interactions with protein might be responsible for the restricted conformational space of N-glycan chains. Our results suggest that structure prediction/modeling of N-glycans of glycoconjugates using structure database could be effective and different modeling approaches would be needed depending on the availability of template structures.
Restricted N-glycan Conformational Space in the PDB and Its Implication in Glycan Structure Modeling

PubMed Central

Jo, Sunhwan; Lee, Hui Sun; Skolnick, Jeffrey; Im, Wonpil

2013-01-01

Understanding glycan structure and dynamics is central to understanding protein-carbohydrate recognition and its role in protein-protein interactions. Given the difficulties in obtaining the glycan's crystal structure in glycoconjugates due to its flexibility and heterogeneity, computational modeling could play an important role in providing glycosylated protein structure models. To address if glycan structures available in the PDB can be used as templates or fragments for glycan modeling, we present a survey of the N-glycan structures of 35 different sequences in the PDB. Our statistical analysis shows that the N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background, suggesting that N-glycan chains can be confidently modeled with template glycan structures whose parent glycoproteins share sequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins do not show significant global structural similarity. Nonetheless, the internal substructures of these N-glycans, particularly, the substructures that are closer to the protein, show significantly similar structures, suggesting that such substructures can be used as fragments in glycan modeling. Increased interactions with protein might be responsible for the restricted conformational space of N-glycan chains. Our results suggest that structure prediction/modeling of N-glycans of glycoconjugates using structure database could be effective and different modeling approaches would be needed depending on the availability of template structures. PMID:23516343
Probing Xist RNA Structure in Cells Using Targeted Structure-Seq

PubMed Central

Rutenberg-Schoenberg, Michael; Simon, Matthew D.

2015-01-01

The long non-coding RNA (lncRNA) Xist is a master regulator of X-chromosome inactivation in mammalian cells. Models for how Xist and other lncRNAs function depend on thermodynamically stable secondary and higher-order structures that RNAs can form in the context of a cell. Probing accessible RNA bases can provide data to build models of RNA conformation that provide insight into RNA function, molecular evolution, and modularity. To study the structure of Xist in cells, we built upon recent advances in RNA secondary structure mapping and modeling to develop Targeted Structure-Seq, which combines chemical probing of RNA structure in cells with target-specific massively parallel sequencing. By enriching for signals from the RNA of interest, Targeted Structure-Seq achieves high coverage of the target RNA with relatively few sequencing reads, thus providing a targeted and scalable approach to analyze RNA conformation in cells. We use this approach to probe the full-length Xist lncRNA to develop new models for functional elements within Xist, including the repeat A element in the 5’-end of Xist. This analysis also identified new structural elements in Xist that are evolutionarily conserved, including a new element proximal to the C repeats that is important for Xist function. PMID:26646615
Cube - an online tool for comparison and contrasting of protein sequences.

PubMed

Zhang, Zong Hong; Khoo, Aik Aun; Mihalek, Ivana

2013-01-01

When comparing sequences of similar proteins, two kinds of questions can be asked, and the related two kinds of inference made. First, one may ask to what degree they are similar, and then, how they differ. In the first case one may tentatively conclude that the conserved elements common to all sequences are of central and common importance to the protein's function. In the latter case the regions of specialization may be discriminative of the function or binding partners across subfamilies of related proteins. Experimental efforts - mutagenesis or pharmacological intervention - can then be pointed in either direction, depending on the context of the study. Cube simplifies this process for users that already have their favorite sets of sequences, and helps them collate the information by visualization of the conservation and specialization scores on the sequence and on the structure, and by spreadsheet tabulation. All information can be visualized on the spot, or downloaded for reference and later inspection. http://eopsf.org/cube.
Two distinct DNA sequences recognized by transcription factors represent enthalpy and entropy optima

PubMed Central

Yin, Yimeng; Das, Pratyush K; Jolma, Arttu; Zhu, Fangjie; Popov, Alexander; Xu, You; Nilsson, Lennart

2018-01-01

Most transcription factors (TFs) can bind to a population of sequences closely related to a single optimal site. However, some TFs can bind to two distinct sequences that represent two local optima in the Gibbs free energy of binding (ΔG). To determine the molecular mechanism behind this effect, we solved the structures of human HOXB13 and CDX2 bound to their two optimal DNA sequences, CAATAAA and TCGTAAA. Thermodynamic analyses by isothermal titration calorimetry revealed that both sites were bound with similar ΔG. However, the interaction with the CAA sequence was driven by change in enthalpy (ΔH), whereas the TCG site was bound with similar affinity due to smaller loss of entropy (ΔS). This thermodynamic mechanism that leads to at least two local optima likely affects many macromolecular interactions, as ΔG depends on two partially independent variables ΔH and ΔS according to the central equation of thermodynamics, ΔG = ΔH - TΔS. PMID:29638214
Peptide-dependent Conformational Fluctuation Determines the Stability of the Human Leukocyte Antigen Class I Complex*

PubMed Central

Yanaka, Saeko; Ueno, Takamasa; Shi, Yi; Qi, Jianxun; Gao, George F.; Tsumoto, Kouhei; Sugase, Kenji

2014-01-01

In immune-mediated control of pathogens, human leukocyte antigen (HLA) class I presents various antigenic peptides to CD8+ T-cells. Long-lived peptide presentation is important for efficient antigen-specific T-cell activation. Presentation time depends on the peptide sequence and the stability of the peptide-HLA complex (pHLA). However, the determinant of peptide-dependent pHLA stability remains elusive. Here, to reveal the pHLA stabilization mechanism, we examined the crystal structures of an HLA class I allomorph in complex with HIV-derived peptides and evaluated site-specific conformational fluctuations using NMR. Although the crystal structures of various pHLAs were almost identical independent of the peptides, fluctuation analyses identified a peptide-dependent minor state that would be more tightly packed toward the peptide. The minor population correlated well with the thermostability and cell surface presentation of pHLA, indicating that this newly identified minor state is important for stabilizing the pHLA and facilitating T-cell recognition. PMID:25028510
The unfoldomics decade: an update on intrinsically disordered proteins.

PubMed

Dunker, A Keith; Oldfield, Christopher J; Meng, Jingwei; Romero, Pedro; Yang, Jack Y; Chen, Jessica Walton; Vacic, Vladimir; Obradovic, Zoran; Uversky, Vladimir N

2008-09-16

Our first predictor of protein disorder was published just over a decade ago in the Proceedings of the IEEE International Conference on Neural Networks (Romero P, Obradovic Z, Kissinger C, Villafranca JE, Dunker AK (1997) Identifying disordered regions in proteins from amino acid sequence. Proceedings of the IEEE International Conference on Neural Networks, 1: 90-95). By now more than twenty other laboratory groups have joined the efforts to improve the prediction of protein disorder. While the various prediction methodologies used for protein intrinsic disorder resemble those methodologies used for secondary structure prediction, the two types of structures are entirely different. For example, the two structural classes have very different dynamic properties, with the irregular secondary structure class being much less mobile than the disorder class. The prediction of secondary structure has been useful. On the other hand, the prediction of intrinsic disorder has been revolutionary, leading to major modifications of the more than 100 year-old views relating protein structure and function. Experimentalists have been providing evidence over many decades that some proteins lack fixed structure or are disordered (or unfolded) under physiological conditions. In addition, experimentalists are also showing that, for many proteins, their functions depend on the unstructured rather than structured state; such results are in marked contrast to the greater than hundred year old views such as the lock and key hypothesis. Despite extensive data on many important examples, including disease-associated proteins, the importance of disorder for protein function has been largely ignored. Indeed, to our knowledge, current biochemistry books don't present even one acknowledged example of a disorder-dependent function, even though some reports of disorder-dependent functions are more than 50 years old. The results from genome-wide predictions of intrinsic disorder and the results from other bioinformatics studies of intrinsic disorder are demanding attention for these proteins. Disorder prediction has been important for showing that the relatively few experimentally characterized examples are members of a very large collection of related disordered proteins that are wide-spread over all three domains of life. Many significant biological functions are now known to depend directly on, or are importantly associated with, the unfolded or partially folded state. Here our goal is to review the key discoveries and to weave these discoveries together to support novel approaches for understanding sequence-function relationships. Intrinsically disordered protein is common across the three domains of life, but especially common among the eukaryotic proteomes. Signaling sequences and sites of posttranslational modifications are frequently, or very likely most often, located within regions of intrinsic disorder. Disorder-to-order transitions are coupled with the adoption of different structures with different partners. Also, the flexibility of intrinsic disorder helps different disordered regions to bind to a common binding site on a common partner. Such capacity for binding diversity plays important roles in both protein-protein interaction networks and likely also in gene regulation networks. Such disorder-based signaling is further modulated in multicellular eukaryotes by alternative splicing, for which such splicing events map to regions of disorder much more often than to regions of structure. Associating alternative splicing with disorder rather than structure alleviates theoretical and experimentally observed problems associated with the folding of different length, isomeric amino acid sequences. The combination of disorder and alternative splicing is proposed to provide a mechanism for easily "trying out" different signaling pathways, thereby providing the mechanism for generating signaling diversity and enabling the evolution of cell differentiation and multicellularity. Finally, several recent small molecules of interest as potential drugs have been shown to act by blocking protein-protein interactions based on intrinsic disorder of one of the partners. Study of these examples has led to a new approach for drug discovery, and bioinformatics analysis of the human proteome suggests that various disease-associated proteins are very rich in such disorder-based drug discovery targets.
Electronic fingerprints of DNA bases on graphene.

PubMed

Ahmed, Towfiq; Kilina, Svetlana; Das, Tanmoy; Haraldsen, Jason T; Rehr, John J; Balatsky, Alexander V

2012-02-08

We calculate the electronic local density of states (LDOS) of DNA nucleotide bases (A,C,G,T), deposited on graphene. We observe significant base-dependent features in the LDOS in an energy range within a few electronvolts of the Fermi level. These features can serve as electronic fingerprints for the identification of individual bases in scanning tunneling spectroscopy (STS) experiments that perform image and site dependent spectroscopy on biomolecules. Thus the fingerprints of DNA-graphene hybrid structures may provide an alternative route to DNA sequencing using STS. © 2012 American Chemical Society
Charge-induced geometrical reorganization of DNA oligonucleotides studied by tandem mass spectrometry and ion mobility.

PubMed

Ickert, Stefanie; Hofmann, Johanna; Riedel, Jens; Beck, Sebastian; Pagel, Kevin; Linscheid, Michael W

2018-04-01

Mass spectrometry is applied as a tool for the elucidation of molecular structures. This premises that gas-phase structures reflect the original geometry of the analytes, while it requires a thorough understanding and investigation of the forces controlling and affecting the gas-phase structures. However, only little is known about conformational changes of oligonucleotides in the gas phase. In this study, a series of multiply charged DNA oligonucleotides (n = 15-40) has been subjected to a comprehensive tandem mass spectrometric study to unravel transitions between different ionic gas-phase structures. The nucleobase sequence and the chain length were varied to gain insights into their influence on the geometrical oligonucleotide organization. Altogether, 23 oligonucleotides were analyzed using collision-induced fragmentation. All sequences showed comparable correlation regarding the characteristic collision energy. This value that is also a measure for stability, strongly correlates with the net charge density of the precursor ions. With decreasing charge of the oligonucleotides, an increase in the fragmentation energy was observed. At a distinct charge density, a deviation from linearity was observed for all studied species, indicating a structural reorganization. To corroborate the proposed geometrical change, collisional cross-sections of the oligonucleotides at different charge states were determined using ion mobility-mass spectrometry. The results clearly indicate that an increase in charge density and thus Coulomb repulsion results in the transition from a folded, compact form to elongated structures of the precursor ions. Our data show this structural transition to depend mainly on the charge density, whereas sequence and size do not have an influence.
Critical Determinants of Substrate Recognition by Cyclin-Dependent Kinase-like 5 (CDKL5).

PubMed

Katayama, Syouichi; Sueyoshi, Noriyuki; Kameshita, Isamu

2015-05-19

Cyclin-dependent kinase-like 5 (CDKL5) is a Ser/Thr protein kinase known to be associated with X-linked neurodevelopmental disorders. In a previous study, we identified amphiphysin 1 (Amph1) as a potential substrate for CDKL5 and identified a single phosphorylation site at Ser-293. In this study, we investigated the molecular mechanisms of substrate recognition by CDKL5 using Amph1 as a model substrate. Amph1 served as an efficient CDKL5 substrate, whereas Amph2, a structurally related homologue of Amph1, was not phosphorylated by CDKL5. The sequence around the Amph1 phosphorylation site is RPR(293)SPSQ, while the corresponding sequence in Amph2 is IPK(332)SPSQ. To define the amino acid sequence specificity of the substrate, various point mutants of Amph1 and Amph2 were prepared and phosphorylated by CDKL5. Both Amph2(I329R) and Amph1 served as efficient CDKL5 substrates, but Amph1(R290I) did not, indicating that the arginyl residue at the P -3 position is critical for substrate recognition. With regard to prolyl residues around the phosphorylation site of Amph1, Pro-291 at the P -2 position, but not Pro-294 at the P +1 position, is indispensable for phosphorylation by CDKL5. Phosphorylation experiments using various deletion mutants of Amph1 revealed that the proline-rich domain (PRD) (amino acids 247-315) alone was not phosphorylated by CDKL5. In contrast, Amph1(247-385), which comprised the PRD and CLAP domains, served as an efficient CDKL5 substrate. These results, taken together, suggest that both the phosphorylation site sequence (RPXSX) and the CLAP domain structure in Amph1 play crucial roles in recognition and phosphorylation by CDKL5.
Proximity to AGCT sequences dictates MMR-independent versus MMR-dependent mechanisms for AID-induced mutation via UNG2

PubMed Central

Thientosapol, Eddy Sanchai; Sharbeen, George; Lau, K.K. Edwin; Bosnjak, Daniel; Durack, Timothy; Stevanovski, Igor; Weninger, Wolfgang

2017-01-01

Abstract AID deaminates C to U in either strand of Ig genes, exclusively producing C:G/G:C to T:A/A:T transition mutations if U is left unrepaired. Error-prone processing by UNG2 or mismatch repair diversifies mutation, predominantly at C:G or A:T base pairs, respectively. Here, we show that transversions at C:G base pairs occur by two distinct processing pathways that are dictated by sequence context. Within and near AGCT mutation hotspots, transversion mutation at C:G was driven by UNG2 without requirement for mismatch repair. Deaminations in AGCT were refractive both to processing by UNG2 and to high-fidelity base excision repair (BER) downstream of UNG2, regardless of mismatch repair activity. We propose that AGCT sequences resist faithful BER because they bind BER-inhibitory protein(s) and/or because hemi-deaminated AGCT motifs innately form a BER-resistant DNA structure. Distal to AGCT sequences, transversions at G were largely co-dependent on UNG2 and mismatch repair. We propose that AGCT-distal transversions are produced when apyrimidinic sites are exposed in mismatch excision patches, because completion of mismatch repair would require bypass of these sites. PMID:28039326

An Amino Acid Code for β-sheet Packing Structure

PubMed Central

Joo, Hyun; Tsai, Jerry

2014-01-01

To understand the relationship between protein sequence and structure, this work extends the knob-socket model in an investigation of β-sheet packing. Over a comprehensive set of β-sheet folds, the contacts between residues were used to identify packing cliques: sets of residues that all contact each other. These packing cliques were then classified based on size and contact order. From this analysis, the 2 types of 4 residue packing cliques necessary to describe β-sheet packing were characterized. Both occur between 2 adjacent hydrogen bonded β-strands. First, defining the secondary structure packing within β-sheets, the combined socket or XY:HG pocket consists of 4 residues i,i+2 on one strand and j,j+2 on the other. Second, characterizing the tertiary packing between β-sheets, the knob-socket XY:H+B consists of a 3 residue XY:H socket (i,i+2 on one strand and j on the other) packed against a knob B residue (residue k distant in sequence). Depending on the packing depth of the knob B residue, 2 types of knob-sockets are found: side-chain and main-chain sockets. The amino acid composition of the pockets and knob-sockets reveal the sequence specificity of β-sheet packing. For β-sheet formation, the XY:HG pocket clearly shows sequence specificity of amino acids. For tertiary packing, the XY:H+B side-chain and main-chain sockets exhibit distinct amino acid preferences at each position. These relationships define an amino acid code for β-sheet structure and provide an intuitive topological mapping of β-sheet packing. PMID:24668690
Structural flexibility and protein adaptation to temperature: Molecular dynamics analysis of malate dehydrogenases of marine molluscs.

PubMed

Dong, Yun-Wei; Liao, Ming-Ling; Meng, Xian-Liang; Somero, George N

2018-02-06

Orthologous proteins of species adapted to different temperatures exhibit differences in stability and function that are interpreted to reflect adaptive variation in structural "flexibility." However, quantifying flexibility and comparing flexibility across proteins has remained a challenge. To address this issue, we examined temperature effects on cytosolic malate dehydrogenase (cMDH) orthologs from differently thermally adapted congeners of five genera of marine molluscs whose field body temperatures span a range of ∼60 °C. We describe consistent patterns of convergent evolution in adaptation of function [temperature effects on K M of cofactor (NADH)] and structural stability (rate of heat denaturation of activity). To determine how these differences depend on flexibilities of overall structure and of regions known to be important in binding and catalysis, we performed molecular dynamics simulation (MDS) analyses. MDS analyses revealed a significant negative correlation between adaptation temperature and heat-induced increase of backbone atom movements [root mean square deviation (rmsd) of main-chain atoms]. Root mean square fluctuations (RMSFs) of movement by individual amino acid residues varied across the sequence in a qualitatively similar pattern among orthologs. Regions of sequence involved in ligand binding and catalysis-termed mobile regions 1 and 2 (MR1 and MR2), respectively-showed the largest values for RMSF. Heat-induced changes in RMSF values across the sequence and, importantly, in MR1 and MR2 were greatest in cold-adapted species. MDS methods are shown to provide powerful tools for examining adaptation of enzymes by providing a quantitative index of protein flexibility and identifying sequence regions where adaptive change in flexibility occurs.
Schematic representation of residue-based protein context-dependent data: an application to transmembrane proteins.

PubMed

Campagne, F; Weinstein, H

1999-01-01

An algorithmic method for drawing residue-based schematic diagrams of proteins on a 2D page is presented and illustrated. The method allows the creation of rendering engines dedicated to a given family of sequences, or fold. The initial implementation provides an engine that can produce a 2D diagram representing secondary structure for any transmembrane protein sequence. We present the details of the strategy for automating the drawing of these diagrams. The most important part of this strategy is the development of an algorithm for laying out residues of a loop that connects to arbitrary points of a 2D plane. As implemented, this algorithm is suitable for real-time modification of the loop layout. This work is of interest for the representation and analysis of data from (1) protein databases, (2) mutagenesis results, or (3) various kinds of protein context-dependent annotations or data.
Temporal Integration of Auditory Information Is Invariant to Temporal Grouping Cues

PubMed

Liu, Andrew S K; Tsunada, Joji; Gold, Joshua I; Cohen, Yale E

2015-01-01

Auditory perception depends on the temporal structure of incoming acoustic stimuli. Here, we examined whether a temporal manipulation that affects the perceptual grouping also affects the time dependence of decisions regarding those stimuli. We designed a novel discrimination task that required human listeners to decide whether a sequence of tone bursts was increasing or decreasing in frequency. We manipulated temporal perceptual-grouping cues by changing the time interval between the tone bursts, which led to listeners hearing the sequences as a single sound for short intervals or discrete sounds for longer intervals. Despite these strong perceptual differences, this manipulation did not affect the efficiency of how auditory information was integrated over time to form a decision. Instead, the grouping manipulation affected subjects' speed-accuracy trade-offs. These results indicate that the temporal dynamics of evidence accumulation for auditory perceptual decisions can be invariant to manipulations that affect the perceptual grouping of the evidence.
A search for structurally similar cellular internal ribosome entry sites

PubMed Central

Baird, Stephen D.; Lewis, Stephen M.; Turcotte, Marcel; Holcik, Martin

2007-01-01

Internal ribosome entry sites (IRES) allow ribosomes to be recruited to mRNA in a cap-independent manner. Some viruses that impair cap-dependent translation initiation utilize IRES to ensure that the viral RNA will efficiently compete for the translation machinery. IRES are also employed for the translation of a subset of cellular messages during conditions that inhibit cap-dependent translation initiation. IRES from viruses like Hepatitis C and Classical Swine Fever virus share a similar structure/function without sharing primary sequence similarity. Of the cellular IRES structures derived so far, none were shown to share an overall structural similarity. Therefore, we undertook a genome-wide search of human 5′UTRs (untranslated regions) with an empirically derived structure of the IRES from the key inhibitor of apoptosis, X-linked inhibitor of apoptosis protein (XIAP), to identify novel IRES that share structure/function similarity. Three of the top matches identified by this search that exhibit IRES activity are the 5′UTRs of Aquaporin 4, ELG1 and NF-kappaB repressing factor (NRF). The structures of AQP4 and ELG1 IRES have limited similarity to the XIAP IRES; however, they share trans-acting factors that bind the XIAP IRES. We therefore propose that cellular IRES are not defined by overall structure, as viral IRES, but are instead dependent upon short motifs and trans-acting factors for their function. PMID:17591613
Sequential Self-Folding Structures by 3D Printed Digital Shape Memory Polymers

NASA Astrophysics Data System (ADS)

Mao, Yiqi; Yu, Kai; Isakov, Michael S.; Wu, Jiangtao; Dunn, Martin L.; Jerry Qi, H.

2015-09-01

Folding is ubiquitous in nature with examples ranging from the formation of cellular components to winged insects. It finds technological applications including packaging of solar cells and space structures, deployable biomedical devices, and self-assembling robots and airbags. Here we demonstrate sequential self-folding structures realized by thermal activation of spatially-variable patterns that are 3D printed with digital shape memory polymers, which are digital materials with different shape memory behaviors. The time-dependent behavior of each polymer allows the temporal sequencing of activation when the structure is subjected to a uniform temperature. This is demonstrated via a series of 3D printed structures that respond rapidly to a thermal stimulus, and self-fold to specified shapes in controlled shape changing sequences. Measurements of the spatial and temporal nature of self-folding structures are in good agreement with the companion finite element simulations. A simplified reduced-order model is also developed to rapidly and accurately describe the self-folding physics. An important aspect of self-folding is the management of self-collisions, where different portions of the folding structure contact and then block further folding. A metric is developed to predict collisions and is used together with the reduced-order model to design self-folding structures that lock themselves into stable desired configurations.
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

PubMed Central

Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

2014-01-01

As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Use of Limited Proteolysis and Mutagenesis To Identify Folding Domains and Sequence Motifs Critical for Wax Ester Synthase/Acyl Coenzyme A:Diacylglycerol Acyltransferase Activity

PubMed Central

Villa, Juan A.; Cabezas, Matilde; de la Cruz, Fernando

2014-01-01

Triacylglycerols and wax esters are synthesized as energy storage molecules by some proteobacteria and actinobacteria under stress. The enzyme responsible for neutral lipid accumulation is the bifunctional wax ester synthase/acyl-coenzyme A (CoA):diacylglycerol acyltransferase (WS/DGAT). Structural modeling of WS/DGAT suggests that it can adopt an acyl-CoA-dependent acyltransferase fold with the N-terminal and C-terminal domains connected by a helical linker, an architecture demonstrated experimentally by limited proteolysis. Moreover, we found that both domains form an active complex when coexpressed as independent polypeptides. The structural prediction and sequence alignment of different WS/DGAT proteins indicated catalytically important motifs in the enzyme. Their role was probed by measuring the activities of a series of alanine scanning mutants. Our study underscores the structural understanding of this protein family and paves the way for their modification to improve the production of neutral lipids. PMID:24296496
Single-molecule investigation of G-quadruplex folds of the human telomere sequence in a protein nanocavity

PubMed Central

An, Na; Fleming, Aaron M.; Middleton, Eric G.; Burrows, Cynthia J.

2014-01-01

Human telomeric DNA consists of tandem repeats of the sequence 5′-TTAGGG-3′ that can fold into various G-quadruplexes, including the hybrid, basket, and propeller folds. In this report, we demonstrate use of the α-hemolysin ion channel to analyze these subtle topological changes at a nanometer scale by providing structure-dependent electrical signatures through DNA–protein interactions. Whereas the dimensions of hybrid and basket folds allowed them to enter the protein vestibule, the propeller fold exceeds the size of the latch region, producing only brief collisions. After attaching a 25-mer poly-2′-deoxyadenosine extension to these structures, unraveling kinetics also were evaluated. Both the locations where the unfolding processes occur and the molecular shapes of the G-quadruplexes play important roles in determining their unfolding profiles. These results provide insights into the application of α-hemolysin as a molecular sieve to differentiate nanostructures as well as the potential technical hurdles DNA secondary structures may present to nanopore technology. PMID:25225404
Backbone hydration determines the folding signature of amino acid residues.

PubMed

Bignucolo, Olivier; Leung, Hoi Tik Alvin; Grzesiek, Stephan; Bernèche, Simon

2015-04-08

The relation between the sequence of a protein and its three-dimensional structure remains largely unknown. A lasting dream is to elucidate the side-chain-dependent driving forces that govern the folding process. Different structural data suggest that aromatic amino acids play a particular role in the stabilization of protein structures. To better understand the underlying mechanism, we studied peptides of the sequence EGAAXAASS (X = Gly, Ile, Tyr, Trp) through comparison of molecular dynamics (MD) trajectories and NMR residual dipolar coupling (RDC) measurements. The RDC data for aromatic substitutions provide evidence for a kink in the peptide backbone. Analysis of the MD simulations shows that the formation of internal hydrogen bonds underlying a helical turn is key to reproduce the experimental RDC values. The simulations further reveal that the driving force leading to such helical-turn conformations arises from the lack of hydration of the peptide chain on either side of the bulky aromatic side chain, which can potentially act as a nucleation point initiating the folding process.
The Integrity of Anticipatory Coarticulation in Fluent and Non-Fluent Tokens of Adults Who Stutter

ERIC Educational Resources Information Center

Sussman, Harvey M.; Byrd, Courtney T.; Guitar, Barry

2011-01-01

This article analysed the acoustic structure of voiced stop ++ vowel sequences in a group of persons who stutter (PWS). This phonetic unit was chosen because successful production is highly dependent on the differential tweaking of right-to-left anticipatory coarticulation as a function of stop place. Thus, essential elements of both speech motor…
Enhanced Modeling of First-Order Plant Equations of Motion for Aeroelastic and Aeroservoelastic Applications

NASA Technical Reports Server (NTRS)

Pototzky, Anthony S.

2010-01-01

A methodology is described for generating first-order plant equations of motion for aeroelastic and aeroservoelastic applications. The description begins with the process of generating data files representing specialized mode-shapes, such as rigid-body and control surface modes, using both PATRAN and NASTRAN analysis. NASTRAN executes the 146 solution sequence using numerous Direct Matrix Abstraction Program (DMAP) calls to import the mode-shape files and to perform the aeroelastic response analysis. The aeroelastic response analysis calculates and extracts structural frequencies, generalized masses, frequency-dependent generalized aerodynamic force (GAF) coefficients, sensor deflections and load coefficients data as text-formatted data files. The data files are then re-sequenced and re-formatted using a custom written FORTRAN program. The text-formatted data files are stored and coefficients for s-plane equations are fitted to the frequency-dependent GAF coefficients using two Interactions of Structures, Aerodynamics and Controls (ISAC) programs. With tabular files from stored data created by ISAC, MATLAB generates the first-order aeroservoelastic plant equations of motion. These equations include control-surface actuator, turbulence, sensor and load modeling. Altitude varying root-locus plot and PSD plot results for a model of the F-18 aircraft are presented to demonstrate the capability.
Self-expressive Dictionary Learning for Dynamic 3D Reconstruction.

PubMed

Zheng, Enliang; Ji, Dinghuang; Dunn, Enrique; Frahm, Jan-Michael

2017-08-22

We target the problem of sparse 3D reconstruction of dynamic objects observed by multiple unsynchronized video cameras with unknown temporal overlap. To this end, we develop a framework to recover the unknown structure without sequencing information across video sequences. Our proposed compressed sensing framework poses the estimation of 3D structure as the problem of dictionary learning, where the dictionary is defined as an aggregation of the temporally varying 3D structures. Given the smooth motion of dynamic objects, we observe any element in the dictionary can be well approximated by a sparse linear combination of other elements in the same dictionary (i.e. self-expression). Our formulation optimizes a biconvex cost function that leverages a compressed sensing formulation and enforces both structural dependency coherence across video streams, as well as motion smoothness across estimates from common video sources. We further analyze the reconstructability of our approach under different capture scenarios, and its comparison and relation to existing methods. Experimental results on large amounts of synthetic data as well as real imagery demonstrate the effectiveness of our approach.
Methylene blue binding to DNA with alternating AT base sequence: minor groove binding is favored over intercalation.

PubMed

Rohs, Remo; Sklenar, Heinz

2004-04-01

The results presented in this paper on methylene blue (MB) binding to DNA with AT alternating base sequence complement the data obtained in two former modeling studies of MB binding to GC alternating DNA. In the light of the large amount of experimental data for both systems, this theoretical study is focused on a detailed energetic analysis and comparison in order to understand their different behavior. Since experimental high-resolution structures of the complexes are not available, the analysis is based on energy minimized structural models of the complexes in different binding modes. For both sequences, four different intercalation structures and two models for MB binding in the minor and major groove have been proposed. Solvent electrostatic effects were included in the energetic analysis by using electrostatic continuum theory, and the dependence of MB binding on salt concentration was investigated by solving the non-linear Poisson-Boltzmann equation. We find that the relative stability of the different complexes is similar for the two sequences, in agreement with the interpretation of spectroscopic data. Subtle differences, however, are seen in energy decompositions and can be attributed to the change from symmetric 5'-YpR-3' intercalation to minor groove binding with increasing salt concentration, which is experimentally observed for the AT sequence at lower salt concentration than for the GC sequence. According to our results, this difference is due to the significantly lower non-electrostatic energy for the minor groove complex with AT alternating DNA, whereas the slightly lower binding energy to this sequence is caused by a higher deformation energy of DNA. The energetic data are in agreement with the conclusions derived from different spectroscopic studies and can also be structurally interpreted on the basis of the modeled complexes. The simple static modeling technique and the neglect of entropy terms and of non-electrostatic solute-solvent interactions, which are assumed to be nearly constant for the compared complexes of MB with DNA, seem to be justified by the results.
Why double-stranded RNA resists condensation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tolokh, Igor S.; Pabit, Suzette; Katz, Andrea M.

2014-09-15

The addition of small amounts of multivalent cations to solutions containing double-stranded DNA leads to attraction between the negatively charged helices and eventually to condensation. Surprisingly, this effect is suppressed in double-stranded RNA, which carries the same charge as the DNA, but assumes a different double helical form. However, additional characterization of short (25 base-pairs) nucleic acid (NA) duplex structures by circular dichroism shows that measured differences in condensation are not solely determined by duplex helical geometry. Here we combine experiment, theory, and atomistic simulations to propose a mechanism that connects the observed variations in condensation of short NA duplexesmore » with the spatial variation of cobalt hexammine (CoHex) binding at the NA duplex surface. The atomistic picture that emerged showed that CoHex distributions around the NA reveals two major NA-CoHex binding modes -- internal and external -- distinguished by the proximity of bound CoHex to the helical axis. Decreasing trends in experimentally observed condensation propensity of the four studied NA duplexes (from B-like form of homopolymeric DNA, to mixed sequence DNA, to DNA:RNA hybrid, to A-like RNA) are explained by the progressive decrease of a single quantity: the fraction of CoHex ions in the external binding mode. Thus, while NA condensation depends on a complex interplay between various structural and sequence features, our coupled experimental and theoretical results suggest a new model in which a single parameter connects the NA condensation propensity with geometry and sequence dependence of CoHex binding.« less
Unfolding and melting of DNA (RNA) hairpins: the concept of structure-specific 2D dynamic landscapes.

PubMed

Lin, Milo M; Meinhold, Lars; Shorokhov, Dmitry; Zewail, Ahmed H

2008-08-07

A 2D free-energy landscape model is presented to describe the (un)folding transition of DNA/RNA hairpins, together with molecular dynamics simulations and experimental findings. The dependence of the (un)folding transition on the stem sequence and the loop length is shown in the enthalpic and entropic contributions to the free energy. Intermediate structures are well defined by the two coordinates of the landscape during (un)zipping. Both the free-energy landscape model and the extensive molecular dynamics simulations totaling over 10 mus predict the existence of temperature-dependent kinetic intermediate states during hairpin (un)zipping and provide the theoretical description of recent ultrafast temperature-jump studies which indicate that hairpin (un)zipping is, in general, not a two-state process. The model allows for lucid prediction of the collapsed state(s) in simple 2D space and we term it the kinetic intermediate structure (KIS) model.
Electrostatic interactions guide the active site face of a structure-specific ribonuclease to its RNA substrate.

PubMed

Plantinga, Matthew J; Korennykh, Alexei V; Piccirilli, Joseph A; Correll, Carl C

2008-08-26

Restrictocin, a member of the alpha-sarcin family of site-specific endoribonucleases, uses electrostatic interactions to bind to the ribosome and to RNA oligonucleotides, including the minimal specific substrate, the sarcin/ricin loop (SRL) of 23S-28S rRNA. Restrictocin binds to the SRL by forming a ground-state E:S complex that is stabilized predominantly by Coulomb interactions and depends on neither the sequence nor structure of the RNA, suggesting a nonspecific complex. The 22 cationic residues of restrictocin are dispersed throughout this protein surface, complicating a priori identification of a Coulomb interacting surface. Structural studies have identified an enzyme-substrate interface, which is expected to overlap with the electrostatic E:S interface. Here, we identified restrictocin residues that contribute to binding in the E:S complex by determining the salt dependence [partial differential log(k 2/ K 1/2)/ partial differential log[KCl
Molecular structures of centromeric heterochromatin and karyotypic evolution in the Siamese crocodile (Crocodylus siamensis) (Crocodylidae, Crocodylia).

PubMed

Kawagoshi, Taiki; Nishida, Chizuko; Ota, Hidetoshi; Kumazawa, Yoshinori; Endo, Hideki; Matsuda, Yoichi

2008-01-01

Crocodilians have several unique karyotypic features, such as small diploid chromosome numbers (30-42) and the absence of dot-shaped microchromosomes. Of the extant crocodilian species, the Siamese crocodile (Crocodylus siamensis) has no more than 2n = 30, comprising mostly bi-armed chromosomes with large centromeric heterochromatin blocks. To investigate the molecular structures of C-heterochromatin and genomic compartmentalization in the karyotype, characterized by the disappearance of tiny microchromosomes and reduced chromosome number, we performed molecular cloning of centromeric repetitive sequences and chromosome mapping of the 18S-28S rDNA and telomeric (TTAGGG)( n ) sequences. The centromeric heterochromatin was composed mainly of two repetitive sequence families whose characteristics were quite different. Two types of GC-rich CSI-HindIII family sequences, the 305 bp CSI-HindIII-S (G+C content, 61.3%) and 424 bp CSI-HindIII-M (63.1%), were localized to the intensely PI-stained centric regions of all chromosomes, except for chromosome 2 with PI-negative heterochromatin. The 94 bp CSI-DraI (G+C content, 48.9%) was tandem-arrayed satellite DNA and localized to chromosome 2 and four pairs of small-sized chromosomes. The chromosomal size-dependent genomic compartmentalization that is supposedly unique to the Archosauromorpha was probably lost in the crocodilian lineage with the disappearance of microchromosomes followed by the homogenization of centromeric repetitive sequences between chromosomes, except for chromosome 2.
Understanding the mechanisms of protein-DNA interactions

NASA Astrophysics Data System (ADS)

Lavery, Richard

2004-03-01

Structural, biochemical and thermodynamic data on protein-DNA interactions show that specific recognition cannot be reduced to a simple set of binary interactions between the partners (such as hydrogen bonds, ion pairs or steric contacts). The mechanical properties of the partners also play a role and, in the case of DNA, variations in both conformation and flexibility as a function of base sequence can be a significant factor in guiding a protein to the correct binding site. All-atom molecular modeling offers a means of analyzing the role of different binding mechanisms within protein-DNA complexes of known structure. This however requires estimating the binding strengths for the full range of sequences with which a given protein can interact. Since this number grows exponentially with the length of the binding site it is necessary to find a method to accelerate the calculations. We have achieved this by using a multi-copy approach (ADAPT) which allows us to build a DNA fragment with a variable base sequence. The results obtained with this method correlate well with experimental consensus binding sequences. They enable us to show that indirect recognition mechanisms involving the sequence dependent properties of DNA play a significant role in many complexes. This approach also offers a means of predicting protein binding sites on the basis of binding energies, which is complementary to conventional lexical techniques.
A Sequence-Independent, Unstructured Internal Ribosome Entry Site Is Responsible for Internal Expression of the Coat Protein of Turnip Crinkle Virus

PubMed Central

May, Jared; Johnson, Philip; Saleem, Huma

2017-01-01

ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526

Modification-dependent restriction endonuclease, MspJI, flips 5-methylcytosine out of the DNA helix

DOE PAGES

Horton, J. R.; Wang, H.; Mabuchi, M. Y.; ...

2014-09-27

MspJI belongs to a family of restriction enzymes that cleave DNA containing 5-methylcytosine (5mC) or 5-hydroxymethylcytosine (5hmC). MspJI is specific for the sequence 5(h)mC-N-N-G or A and cleaves with some variability 9/13 nucleotides downstream. Earlier, we reported the crystal structure of MspJI without DNA and proposed how it might recognize this sequence and catalyze cleavage. Here we report its co-crystal structure with a 27-base pair oligonucleotide containing 5mC. This structure confirms that MspJI acts as a homotetramer and that the modified cytosine is flipped from the DNA helix into an SRA-like-binding pocket. We expected the structure to reveal two DNAmore » molecules bound specifically to the tetramer and engaged with the enzyme's two DNA-cleavage sites. A coincidence of crystal packing precluded this organization, however. We found that each DNA molecule interacted with two adjacent tetramers, binding one specifically and the other non-specifically. The latter interaction, which prevented cleavage-site engagement, also involved base flipping and might represent the sequence-interrogation phase that precedes specific recognition. MspJI is unusual in that DNA molecules are recognized and cleaved by different subunits. Such interchange of function might explain how other complex multimeric restriction enzymes act.« less
Analyses of Genotypic Diversity among North, South, and Central American Isolates of Sugarcane Yellow Leaf Virus: Evidence for Colombian Origins and for Intraspecific Spatial Phylogenetic Variation

PubMed Central

Moonan, Francis; Mirkov, T. Erik

2002-01-01

We have analyzed the genotypic diversity of Sugarcane yellow leaf virus (SCYLV) collected from North, South, and Central America by fingerprinting assays and selective cDNA cloning and sequencing. One group of isolates from Colombia, designated the C-population, has been identified as residing at the root node between a separable superpopulation structure of SCYLV and other members of the family Luteoviridae, indicating that the progenitor viruses of the North, South, and Central American isolates of the SCYLV superpopulation most likely arose from a C-population structure. From a model of intrafamilial evolution (F. Moonan et al., Virology 269:156–171, 2000), a prediction could be made that within the SCYLV species, the capacity of genomic sequence divergence would range from lowest in the capsid protein open reading frame 3 (ORF 3) to highest in a region spanning across the carboxy-terminal end of the RNA-dependent RNA polymerase ORF. We have demonstrated the validity and applicability of this intrafamilial model for the prediction of intraspecies SCYLV diversity. Analysis of spatial phylogenetic variation (SPV) within the SCYLV isolates could not be assessed by application of a “partial likelihoods assessed through optimization” (PLATO)-derived intraspecies model alone. However, application of a PLATO-derived intrafamilial model with the intraspecies-derived model allowed distinction of three forms of SPV. Two of the SPV forms identified correspond to the extremes in a continuum of sequence evolution displayed in a SCYLV superpopulation structure, and the third form was diagnostic of a C-population structure. The application of these types of models has value in terms of predicting the types of SCYLV intraspecies diversity that may exist worldwide, and in general, may be useful in application for more informed design of transgenes for use in the elicitation of homology-dependent virus resistance mechanisms in transgenic plants. PMID:11773408
Sequence heterogeneities of genes encoding 16S rRNAs in Paenibacillus polymyxa detected by temperature gradient gel electrophoresis.

PubMed Central

Nübel, U; Engelen, B; Felske, A; Snaidr, J; Wieshuber, A; Amann, R I; Ludwig, W; Backhaus, H

1996-01-01

Sequence heterogeneities in 16S rRNA genes from individual strains of Paenibacillus polymyxa were detected by sequence-dependent separation of PCR products by temperature gradient gel electrophoresis (TGGE). A fragment of the 16S rRNA genes, comprising variable regions V6 to V8, was used as a target sequence for amplifications. PCR products from P. polymyxa (type strain) emerged as a well-defined pattern of bands in the gradient gel. Six plasmids with different inserts, individually demonstrating the migration characteristics of single bands of the pattern, were obtained by cloning the PCR products. Their sequences were analyzed as a representative sample of the total heterogeneity. An amount of 10 variant nucleotide positions in the fragment of 347 bp was observed, with all substitutions conserving the relevant secondary structures of the V6 and V8 regions in the RNA molecules. Hybridizations with specifically designed probes demonstrated different chromosomal locations of the respective rRNA genes. Amplifications of reverse-transcribed rRNA from ribosome preparations, as well as whole-cell hybridizations, revealed a predominant representation of particular sequences in ribosomes of exponentially growing laboratory cultures. Different strains of P. polymyxa showed not only remarkably differing patterns of PCR products in TGGE analysis but also discriminative whole-cell labeling with the designed oligonucleotide probes, indicating the different representation of individual sequences in active ribosomes. Our results demonstrate the usefulness of TGGE for the structural analysis of heterogeneous rRNA genes together with their expression, stress problems of the generation of meaningful data for 16S rRNA sequences and probe designs, and might have consequences for evolutionary concepts. PMID:8824607
Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.

PubMed Central

Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F

1984-01-01

The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019
Two intermediate states of the conformational switch in dual specificity phosphatase 13a.

PubMed

Wei, Chun Hwa; Min, Hee Gyeong; Kim, Myeongbin; Kim, Gwan Hee; Chun, Ha-Jung; Ryu, Seong Eon

2018-02-01

Dual specificity phosphatases (DUSPs) include MAP kinase phosphatases and atypical dual specificity phosphatases and mediate cell growth and differentiation, brain function, and immune responses. They serve as targets for drug development against cancers, diabetes and depression. Several DUSPs have non-canonical conformation of the central β-sheet and active site loops, suggesting that they may have conformational switch that is related to the regulation of enzyme activity. Here, we determined the crystal structure of DUSP13a, and identified two different structures that represent intermediates of the postulated conformational switch. Amino acid sequence of DUSP13a is not significantly homologous to DUSPs with conformational switch, indicating that the conformational switch is not sequence-dependent, but rather determined by ligand interaction. The sequence-independency suggests that other DUSPs with canonical conformation may have the conformational switch during specific cellular regulation. The conformational switch leads to significant changes in the protein surface, including a hydrophobic surface and pockets, which can be exploited for development of allosteric modulators of drug target DUSPs. Copyright © 2017 Elsevier Ltd. All rights reserved.
CNNdel: Calling Structural Variations on Low Coverage Data Based on Convolutional Neural Networks

PubMed Central

2017-01-01

Many structural variations (SVs) detection methods have been proposed due to the popularization of next-generation sequencing (NGS). These SV calling methods use different SV-property-dependent features; however, they all suffer from poor accuracy when running on low coverage sequences. The union of results from these tools achieves fairly high sensitivity but still produces low accuracy on low coverage sequence data. That is, these methods contain many false positives. In this paper, we present CNNdel, an approach for calling deletions from paired-end reads. CNNdel gathers SV candidates reported by multiple tools and then extracts features from aligned BAM files at the positions of candidates. With labeled feature-expressed candidates as a training set, CNNdel trains convolutional neural networks (CNNs) to distinguish true unlabeled candidates from false ones. Results show that CNNdel works well with NGS reads from 26 low coverage genomes of the 1000 Genomes Project. The paper demonstrates that convolutional neural networks can automatically assign the priority of SV features and reduce the false positives efficaciously. PMID:28630866
A functional U-statistic method for association analysis of sequencing data.

PubMed

Jadhav, Sneha; Tong, Xiaoran; Lu, Qing

2017-11-01

Although sequencing studies hold great promise for uncovering novel variants predisposing to human diseases, the high dimensionality of the sequencing data brings tremendous challenges to data analysis. Moreover, for many complex diseases (e.g., psychiatric disorders) multiple related phenotypes are collected. These phenotypes can be different measurements of an underlying disease, or measurements characterizing multiple related diseases for studying common genetic mechanism. Although jointly analyzing these phenotypes could potentially increase the power of identifying disease-associated genes, the different types of phenotypes pose challenges for association analysis. To address these challenges, we propose a nonparametric method, functional U-statistic method (FU), for multivariate analysis of sequencing data. It first constructs smooth functions from individuals' sequencing data, and then tests the association of these functions with multiple phenotypes by using a U-statistic. The method provides a general framework for analyzing various types of phenotypes (e.g., binary and continuous phenotypes) with unknown distributions. Fitting the genetic variants within a gene using a smoothing function also allows us to capture complexities of gene structure (e.g., linkage disequilibrium, LD), which could potentially increase the power of association analysis. Through simulations, we compared our method to the multivariate outcome score test (MOST), and found that our test attained better performance than MOST. In a real data application, we apply our method to the sequencing data from Minnesota Twin Study (MTS) and found potential associations of several nicotine receptor subunit (CHRN) genes, including CHRNB3, associated with nicotine dependence and/or alcohol dependence. © 2017 WILEY PERIODICALS, INC.
Sequence dependency of canonical base pair opening in the DNA double helix

PubMed Central

Villa, Alessandra

2017-01-01

The flipping-out of a DNA base from the double helical structure is a key step of many cellular processes, such as DNA replication, modification and repair. Base pair opening is the first step of base flipping and the exact mechanism is still not well understood. We investigate sequence effects on base pair opening using extensive classical molecular dynamics simulations targeting the opening of 11 different canonical base pairs in two DNA sequences. Two popular biomolecular force fields are applied. To enhance sampling and calculate free energies, we bias the simulation along a simple distance coordinate using a newly developed adaptive sampling algorithm. The simulation is guided back and forth along the coordinate, allowing for multiple opening pathways. We compare the calculated free energies with those from an NMR study and check assumptions of the model used for interpreting the NMR data. Our results further show that the neighboring sequence is an important factor for the opening free energy, but also indicates that other sequence effects may play a role. All base pairs are observed to have a propensity for opening toward the major groove. The preferred opening base is cytosine for GC base pairs, while for AT there is sequence dependent competition between the two bases. For AT opening, we identify two non-canonical base pair interactions contributing to a local minimum in the free energy profile. For both AT and CG we observe long-lived interactions with water and with sodium ions at specific sites on the open base pair. PMID:28369121
Equation Chapter 1 Section 1Sequence-To-Conformation Relationships of Disordered Regions Tethered to Folded Domains of Proteins.

PubMed

Mittal, Anuradha; Holehouse, Alex S; Cohan, Megan C; Pappu, Rohit V

2018-05-12

Intrinsically disordered proteins and regions (IDPs / IDRs) are characterized by well-defined sequence-to-conformation relationships (SCRs). These relationships refer to the sequence-specific preferences for average sizes, shapes, residue-specific secondary structure propensities, and amplitudes of multiscale conformational fluctuations. SCRs are discerned from the sequence-specific conformational ensembles of IDPs. A vast majority of IDPs are actually tethered to folded domains (FDs). This raises the question of whether or not SCRs inferred for IDPs are applicable to IDRs tethered to folded domains. Here, we use atomistic simulations based on a well-established forcefield paradigm and an enhanced sampling method to obtain comparative assessments of SCRs for thirteen archetypal IDRs modeled as autonomous units, as C-terminal tails connected to folded domains, and as linkers between pairs of folded domains. Our studies uncover a set of general observations regarding context-independent versus context-dependent SCRs of IDRs. SCRs are minimally perturbed upon tethering to folded domains if the IDRs are deficient in charged residues and for polyampholytic IDRs where the oppositely charged residues within the sequence of the IDR are separated into distinct blocks. In contrast, the interplay between IDRs and tethered folded domains has a significant modulatory effect on SCRs if the IDRs have intermediate fractions of charged residues or if they have sequence-intrinsic conformational preferences for canonical random coils. Our findings suggest that IDRs with context-independent SCRs might be independent evolutionary modules whereas IDRs with context-dependent intrinsic SCRs might co-evolve with the FDs to which they are tethered. Copyright © 2018. Published by Elsevier Ltd.
Structural features based genome-wide characterization and prediction of nucleosome organization

PubMed Central

2012-01-01

Background Nucleosome distribution along chromatin dictates genomic DNA accessibility and thus profoundly influences gene expression. However, the underlying mechanism of nucleosome formation remains elusive. Here, taking a structural perspective, we systematically explored nucleosome formation potential of genomic sequences and the effect on chromatin organization and gene expression in S. cerevisiae. Results We analyzed twelve structural features related to flexibility, curvature and energy of DNA sequences. The results showed that some structural features such as DNA denaturation, DNA-bending stiffness, Stacking energy, Z-DNA, Propeller twist and free energy, were highly correlated with in vitro and in vivo nucleosome occupancy. Specifically, they can be classified into two classes, one positively and the other negatively correlated with nucleosome occupancy. These two kinds of structural features facilitated nucleosome binding in centromere regions and repressed nucleosome formation in the promoter regions of protein-coding genes to mediate transcriptional regulation. Based on these analyses, we integrated all twelve structural features in a model to predict more accurately nucleosome occupancy in vivo than the existing methods that mainly depend on sequence compositional features. Furthermore, we developed a novel approach, named DLaNe, that located nucleosomes by detecting peaks of structural profiles, and built a meta predictor to integrate information from different structural features. As a comparison, we also constructed a hidden Markov model (HMM) to locate nucleosomes based on the profiles of these structural features. The result showed that the meta DLaNe and HMM-based method performed better than the existing methods, demonstrating the power of these structural features in predicting nucleosome positions. Conclusions Our analysis revealed that DNA structures significantly contribute to nucleosome organization and influence chromatin structure and gene expression regulation. The results indicated that our proposed methods are effective in predicting nucleosome occupancy and positions and that these structural features are highly predictive of nucleosome organization. The implementation of our DLaNe method based on structural features is available online. PMID:22449207
Association mining of dependency between time series

NASA Astrophysics Data System (ADS)

Hafez, Alaaeldin

2001-03-01

Time series analysis is considered as a crucial component of strategic control over a broad variety of disciplines in business, science and engineering. Time series data is a sequence of observations collected over intervals of time. Each time series describes a phenomenon as a function of time. Analysis on time series data includes discovering trends (or patterns) in a time series sequence. In the last few years, data mining has emerged and been recognized as a new technology for data analysis. Data Mining is the process of discovering potentially valuable patterns, associations, trends, sequences and dependencies in data. Data mining techniques can discover information that many traditional business analysis and statistical techniques fail to deliver. In this paper, we adapt and innovate data mining techniques to analyze time series data. By using data mining techniques, maximal frequent patterns are discovered and used in predicting future sequences or trends, where trends describe the behavior of a sequence. In order to include different types of time series (e.g. irregular and non- systematic), we consider past frequent patterns of the same time sequences (local patterns) and of other dependent time sequences (global patterns). We use the word 'dependent' instead of the word 'similar' for emphasis on real life time series where two time series sequences could be completely different (in values, shapes, etc.), but they still react to the same conditions in a dependent way. In this paper, we propose the Dependence Mining Technique that could be used in predicting time series sequences. The proposed technique consists of three phases: (a) for all time series sequences, generate their trend sequences, (b) discover maximal frequent trend patterns, generate pattern vectors (to keep information of frequent trend patterns), use trend pattern vectors to predict future time series sequences.
Transfer RNAs with novel cloverleaf structures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mukai, Takahito; Vargas-Rodriguez, Oscar; Englert, Markus

We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter Tstem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function asmore » missense and nonsense suppressor tRNAs and/or regulatory noncod ing RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species inEscherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.« less
Transfer RNAs with novel cloverleaf structures

DOE PAGES

Mukai, Takahito; Vargas-Rodriguez, Oscar; Englert, Markus; ...

2016-10-05

We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter Tstem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function asmore » missense and nonsense suppressor tRNAs and/or regulatory noncod ing RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species inEscherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.« less
From Ramachandran Maps to Tertiary Structures of Proteins.

PubMed

DasGupta, Debarati; Kaushik, Rahul; Jayaram, B

2015-08-27

Sequence to structure of proteins is an unsolved problem. A possible coarse grained resolution to this entails specification of all the torsional (Φ, Ψ) angles along the backbone of the polypeptide chain. The Ramachandran map quite elegantly depicts the allowed conformational (Φ, Ψ) space of proteins which is still very large for the purposes of accurate structure generation. We have divided the allowed (Φ, Ψ) space in Ramachandran maps into 27 distinct conformations sufficient to regenerate a structure to within 5 Å from the native, at least for small proteins, thus reducing the structure prediction problem to a specification of an alphanumeric string, i.e., the amino acid sequence together with one of the 27 conformations preferred by each amino acid residue. This still theoretically results in 27(n) conformations for a protein comprising "n" amino acids. We then investigated the spatial correlations at the two-residue (dipeptide) and three-residue (tripeptide) levels in what may be described as higher order Ramachandran maps, with the premise that the allowed conformational space starts to shrink as we introduce neighborhood effects. We found, for instance, for a tripeptide which potentially can exist in any of the 27(3) "allowed" conformations, three-fourths of these conformations are redundant to the 95% confidence level, suggesting sequence context dependent preferred conformations. We then created a look-up table of preferred conformations at the tripeptide level and correlated them with energetically favorable conformations. We found in particular that Boltzmann probabilities calculated from van der Waals energies for each conformation of tripeptides correlate well with the observed populations in the structural database (the average correlation coefficient is ∼0.8). An alpha-numeric string and hence the tertiary structure can be generated for any sequence from the look-up table within minutes on a single processor and to a higher level of accuracy if secondary structure can be specified. We tested the methodology on 100 small proteins, and in 90% of the cases, a structure within 5 Å is recovered. We thus believe that the method presented here provides the missing link between Ramachandran maps and tertiary structures of proteins. A Web server to convert a tertiary structure to an alphanumeric string and to predict the tertiary structure from the sequence of a protein using the above methodology is created and made freely accessible at http://www.scfbio-iitd.res.in/software/proteomics/rm2ts.jsp.
Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

PubMed Central

Kinjo, Akira R.; Nakamura, Haruki

2012-01-01

Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478
[A turning point in the knowledge of the structure-function-activity relations of elastin].

PubMed

Alix, A J

2001-01-01

In this review are presented the last new results of our research group dealing with the molecular structures (atomic level) of tropoelastin, elastin and elastin derived peptides studied by using essentially methods of bioinformatics (theoretical predictions and molecular modelling) linked to experimental circular dichroism spectroscopic studies. We already had characterized both the local secondary structure and some parts of the tertiary structure of the tropoelastin and elastin molecules (human, bovine...), by using either theoretical predictions (local secondary structure, linear epitopes...) and/or experimental data (optical spectroscopic methods: Raman scattering, infrared absorption, circular dichroism). Except the cross-linking regions which are in helical conformations, the whole tropoelastin structure displays a lot of beta-reverse turns which usually belong to irregular structures in proteins. These turns play a key role in other regularly structures orientation (alpha-helix, beta-strand), thus they are very important in the native protein 3D architecture. It is particularly true for human tropoelastin, because its sequence is rich in glycines and prolines, and these residues are frequently met in beta-turns (a beta-turn is made of four consecutive residues which are stabilized by an hydrogen bond). Several types of beta-turns can be defined with the dihedral angles values phi and psi of the two central residues. Thus, by using a very recent updated set of propensities for the amino acid residues to belong to given types of reverse beta-turns (extracted from a reference set of known 3-D structures of globular proteins), we have determined, (by using our home made software COUDES), for all possible tetrapeptides of the human tropoelastin sequence, the distribution and the characterization of the possible type of turns. Thus, it is shown that the locations and/or the types of these reverse beta-turns reveal a regularity and are not all random. This confirms our hypothesis that intra-molecular elasticity of tropoelastin could be explained by the possibility of transitions between conformations involving short beta-strands and beta-turns. This result is of great interest in the construction (by using molecular biology) of elastic biomaterials derived from the elastin sequence (particularly, the elastin derived peptides corresponding to the sequence exon 21--(exon 24--exon 24...). Our study permit also to predict the conformations of specific elastin derived peptides which could have interesting biological activity. Peptides resulting from the degradation of elastin, the insoluble polymer of tropoelastin and responsible for the elasticity of vertebrate tissues, can induce biological effects and notably the regulation of matrix metalloproteinases (MMP-s) activity. Recently, it was proposed that some elastin derived hexapeptides resulting from circular permutations of VGVAPG (a three fold repetition sequence in exon 24 of human tropoelastin) possess MMP-1 production and activation regulation properties. This effect depends on the presence of the tropoelastin specific membraneous receptor 67 KDa EBP (Elastin Binding Protein). Our results obtained by using both circular dichroism spectroscopy and linear predictions confirmed the hypothesis of a structure dependent mechanism with a possibly occurring type VIII beta-turn on the first four residues of the GXXPG sequence consensus which is only present among all active peptides. Thus, we have performed extensive molecular dynamics studies, in both implicit and explicit solvent, on these active and inactive elastin derived hexapeptides. Using our own analysis method of pattern recognition of the types of the beta-reverse-turns followed during the molecular dynamics trajectory, we found that active and inactive peptides effectively form two well distinct conformational groups in which active peptides preferentially adopt conformation close to type VIII GXXP (beta-reverse-turn. The structural role of the C terminal G residue could also be explained. Additional molecular simulations on (VGVAPG)2 and (VGVAPG)3 show the formation of two or three GXXP tetrapeptides adopting a structure close to type VIII beta-reverse-turn, suggesting a local conformational preference for this motif. This observation of a specific structural single and/or repeated motif is in agreement with the circular dichroism spectra of the involved (VGVAPG)1, (VGVAPG)2 and (VGVAPG)3 peptides and then it can be proposed that their biological activities have to be linear. The final aim of this type of work is to understand more about the sequence/structure/function/activity relationships of those structured peptides in order to propose specific sequences (corresponding to specific structures) for best biological activity results.
RExPrimer: an integrated primer designing tool increases PCR effectiveness by avoiding 3' SNP-in-primer and mis-priming from structural variation

PubMed Central

2009-01-01

Background Polymerase chain reaction (PCR) is very useful in many areas of molecular biology research. It is commonly observed that PCR success is critically dependent on design of an effective primer pair. Current tools for primer design do not adequately address the problem of PCR failure due to mis-priming on target-related sequences and structural variations in the genome. Methods We have developed an integrated graphical web-based application for primer design, called RExPrimer, which was written in Python language. The software uses Primer3 as the primer designing core algorithm. Locally stored sequence information and genomic variant information were hosted on MySQLv5.0 and were incorporated into RExPrimer. Results RExPrimer provides many functionalities for improved PCR primer design. Several databases, namely annotated human SNP databases, insertion/deletion (indel) polymorphisms database, pseudogene database, and structural genomic variation databases were integrated into RExPrimer, enabling an effective without-leaving-the-website validation of the resulting primers. By incorporating these databases, the primers reported by RExPrimer avoid mis-priming to related sequences (e.g. pseudogene, segmental duplication) as well as possible PCR failure because of structural polymorphisms (SNP, indel, and copy number variation (CNV)). To prevent mismatching caused by unexpected SNPs in the designed primers, in particular the 3' end (SNP-in-Primer), several SNP databases covering the broad range of population-specific SNP information are utilized to report SNPs present in the primer sequences. Population-specific SNP information also helps customize primer design for a specific population. Furthermore, RExPrimer offers a graphical user-friendly interface through the use of scalable vector graphic image that intuitively presents resulting primers along with the corresponding gene structure. In this study, we demonstrated the program effectiveness in successfully generating primers for strong homologous sequences. Conclusion The improvements for primer design incorporated into RExPrimer were demonstrated to be effective in designing primers for challenging PCR experiments. Integration of SNP and structural variation databases allows for robust primer design for a variety of PCR applications, irrespective of the sequence complexity in the region of interest. This software is freely available at http://www4a.biotec.or.th/rexprimer. PMID:19958502
Identification of Functionally Related Enzymes by Learning-to-Rank Methods.

PubMed

Stock, Michiel; Fober, Thomas; Hüllermeier, Eyke; Glinca, Serghei; Klebe, Gerhard; Pahikkala, Tapio; Airola, Antti; De Baets, Bernard; Waegeman, Willem

2014-01-01

Enzyme sequences and structures are routinely used in the biological sciences as queries to search for functionally related enzymes in online databases. To this end, one usually departs from some notion of similarity, comparing two enzymes by looking for correspondences in their sequences, structures or surfaces. For a given query, the search operation results in a ranking of the enzymes in the database, from very similar to dissimilar enzymes, while information about the biological function of annotated database enzymes is ignored. In this work, we show that rankings of that kind can be substantially improved by applying kernel-based learning algorithms. This approach enables the detection of statistical dependencies between similarities of the active cleft and the biological function of annotated enzymes. This is in contrast to search-based approaches, which do not take annotated training data into account. Similarity measures based on the active cleft are known to outperform sequence-based or structure-based measures under certain conditions. We consider the Enzyme Commission (EC) classification hierarchy for obtaining annotated enzymes during the training phase. The results of a set of sizeable experiments indicate a consistent and significant improvement for a set of similarity measures that exploit information about small cavities in the surface of enzymes.
A parallel strategy for predicting the secondary structure of polycistronic microRNAs.

PubMed

Han, Dianwei; Tang, Guiliang; Zhang, Jun

2013-01-01

The biogenesis of a functional microRNA is largely dependent on the secondary structure of the microRNA precursor (pre-miRNA). Recently, it has been shown that microRNAs are present in the genome as the form of polycistronic transcriptional units in plants and animals. It will be important to design efficient computational methods to predict such structures for microRNA discovery and its applications in gene silencing. In this paper, we propose a parallel algorithm based on the master-slave architecture to predict the secondary structure from an input sequence. We conducted some experiments to verify the effectiveness of our parallel algorithm. The experimental results show that our algorithm is able to produce the optimal secondary structure of polycistronic microRNAs.
(Pea)nuts and bolts of visual narrative: Structure and meaning in sequential image comprehension

PubMed Central

Cohn, Neil; Paczynski, Martin; Jackendoff, Ray; Holcomb, Phillip J.; Kuperberg, Gina R.

2012-01-01

Just as syntax differentiates coherent sentences from scrambled word strings, the comprehension of sequential images must also use a cognitive system to distinguish coherent narrative sequences from random strings of images. We conducted experiments analogous to two classic studies of language processing to examine the contributions of narrative structure and semantic relatedness to processing sequential images. We compared four types of comic strips: 1) Normal sequences with both structure and meaning, 2) Semantic Only sequences (in which the panels were related to a common semantic theme, but had no narrative structure), 3) Structural Only sequences (narrative structure but no semantic relatedness), and 4) Scrambled sequences of randomly-ordered panels. In Experiment 1, participants monitored for target panels in sequences presented panel-by-panel. Reaction times were slowest to panels in Scrambled sequences, intermediate in both Structural Only and Semantic Only sequences, and fastest in Normal sequences. This suggests that both semantic relatedness and narrative structure offer advantages to processing. Experiment 2 measured ERPs to all panels across the whole sequence. The N300/N400 was largest to panels in both the Scrambled and Structural Only sequences, intermediate in Semantic Only sequences and smallest in the Normal sequences. This implies that a combination of narrative structure and semantic relatedness can facilitate semantic processing of upcoming panels (as reflected by the N300/N400). Also, panels in the Scrambled sequences evoked a larger left-lateralized anterior negativity than panels in the Structural Only sequences. This localized effect was distinct from the N300/N400, and appeared despite the fact that these two sequence types were matched on local semantic relatedness between individual panels. These findings suggest that sequential image comprehension uses a narrative structure that may be independent of semantic relatedness. Altogether, we argue that the comprehension of visual narrative is guided by an interaction between structure and meaning. PMID:22387723

Implications of the dependence of the elastic properties of DNA on nucleotide sequence.

PubMed

Olson, Wilma K; Swigon, David; Coleman, Bernard D

2004-07-15

Recent advances in structural biochemistry have provided evidence that not only the geometric properties but also the elastic moduli of duplex DNA are strongly dependent on nucleotide sequence in a way that is not accounted for by classical rod models of the Kirchhoff type. A theory of sequence-dependent DNA elasticity is employed here to calculate the dependence of the equilibrium configurations of circular DNA on the binding of ligands that can induce changes in intrinsic twist at a single base-pair step. Calculations are presented of the influence on configurations of the assumed values and distribution along the DNA of intrinsic roll and twist and a modulus coupling roll to twist. Among the results obtained are the following. For minicircles formed from intrinsically straight DNA, the distribution of roll-twist coupling strongly affects the dependence of the total elastic energy Psi on the amount alpha of imposed untwisting, and that dependence can be far from quadratic. (In fact, for a periodic distribution of roll-twist coupling with a period equal to the intrinsic helical repeat length, Psi can be essentially independent of alpha for -90 degrees < alpha <90 degrees.) When the minicircle is homogeneous and without roll-twist coupling, but with uniform positive intrinsic roll, the point at which Psi attains its minimum value shifts towards negative values of alpha. It is remarked that there are cases in which one can relate graphs of Psi versus alpha to the 'effective values' of bending and twisting moduli and helical repeat length obtained from measurements of equilibrium distributions of topoisomers and probabilities of ring closure. For a minicircle formed from DNA that has an 'S' shape when stress-free, the graphs of Psi versus alpha have maxima at alpha = 0. As the binding of a twisting agent to such a minicircle results in a net decrease in Psi, the affinity of the twisting agent for binding to the minicircle is greater than its affinity for binding to unconstrained DNA with the same sequence.
Mapping Structurally Defined Guanine Oxidation Products along DNA Duplexes: Influence of Local Sequence Context and Endogenous Cytosine Methylation

PubMed Central

2015-01-01

DNA oxidation by reactive oxygen species is nonrandom, potentially leading to accumulation of nucleobase damage and mutations at specific sites within the genome. We now present the first quantitative data for sequence-dependent formation of structurally defined oxidative nucleobase adducts along p53 gene-derived DNA duplexes using a novel isotope labeling-based approach. Our results reveal that local nucleobase sequence context differentially alters the yields of 2,2,4-triamino-2H-oxal-5-one (Z) and 8-oxo-7,8-dihydro-2′-deoxyguanosine (OG) in double stranded DNA. While both lesions are overproduced within endogenously methylated MeCG dinucleotides and at 5′ Gs in runs of several guanines, the formation of Z (but not OG) is strongly preferred at solvent-exposed guanine nucleobases at duplex ends. Targeted oxidation of MeCG sequences may be caused by a lowered ionization potential of guanine bases paired with MeC and the preferential intercalation of riboflavin photosensitizer adjacent to MeC:G base pairs. Importantly, some of the most frequently oxidized positions coincide with the known p53 lung cancer mutational “hotspots” at codons 245 (GGC), 248 (CGG), and 158 (CGC) respectively, supporting a possible role of oxidative degradation of DNA in the initiation of lung cancer. PMID:24571128
A guide to enterotypes across the human body: meta-analysis of microbial community structures in human microbiome datasets.

PubMed

Koren, Omry; Knights, Dan; Gonzalez, Antonio; Waldron, Levi; Segata, Nicola; Knight, Rob; Huttenhower, Curtis; Ley, Ruth E

2013-01-01

Recent analyses of human-associated bacterial diversity have categorized individuals into 'enterotypes' or clusters based on the abundances of key bacterial genera in the gut microbiota. There is a lack of consensus, however, on the analytical basis for enterotypes and on the interpretation of these results. We tested how the following factors influenced the detection of enterotypes: clustering methodology, distance metrics, OTU-picking approaches, sequencing depth, data type (whole genome shotgun (WGS) vs.16S rRNA gene sequence data), and 16S rRNA region. We included 16S rRNA gene sequences from the Human Microbiome Project (HMP) and from 16 additional studies and WGS sequences from the HMP and MetaHIT. In most body sites, we observed smooth abundance gradients of key genera without discrete clustering of samples. Some body habitats displayed bimodal (e.g., gut) or multimodal (e.g., vagina) distributions of sample abundances, but not all clustering methods and workflows accurately highlight such clusters. Because identifying enterotypes in datasets depends not only on the structure of the data but is also sensitive to the methods applied to identifying clustering strength, we recommend that multiple approaches be used and compared when testing for enterotypes.
Sequencing BPS spectra

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gukov, Sergei; Nawata, Satoshi; Saberi, Ingmar

In this article, we provide both a detailed study of color-dependence of link homologies, as realized in physics as certain spaces of BPS states, and a broad study of the behavior of BPS states in general. We consider how the spectrum of BPS states varies as continuous parameters of a theory are perturbed. This question can be posed in a wide variety of physical contexts, and we answer it by proposing that the relationship between unperturbed and perturbed BPS spectra is described by a spectral sequence. These general considerations unify previous applications of spectral sequence techniques to physics, and explainmore » from a physical standpoint the appearance of many spectral sequences relating various link homology theories to one another. We also study structural properties of colored HOMFLY homology for links and evaluate Poincar e polynomials in numerous examples. Among these structural properties is a novel "sliding" property, which can be explained by using (re fined) modular S-matrix. This leads to the identi fication of modular transformations in Chern-Simons theory and 3d N = 2 theory via the 3d/3d correspondence. In conclusion, we introduce the notion of associated varieties as classical limits of recursion relations of colored superpolynomials of links, and study their properties.« less
A Guide to Enterotypes across the Human Body: Meta-Analysis of Microbial Community Structures in Human Microbiome Datasets

PubMed Central

Waldron, Levi; Segata, Nicola; Knight, Rob; Huttenhower, Curtis; Ley, Ruth E.

2013-01-01

Recent analyses of human-associated bacterial diversity have categorized individuals into ‘enterotypes’ or clusters based on the abundances of key bacterial genera in the gut microbiota. There is a lack of consensus, however, on the analytical basis for enterotypes and on the interpretation of these results. We tested how the following factors influenced the detection of enterotypes: clustering methodology, distance metrics, OTU-picking approaches, sequencing depth, data type (whole genome shotgun (WGS) vs.16S rRNA gene sequence data), and 16S rRNA region. We included 16S rRNA gene sequences from the Human Microbiome Project (HMP) and from 16 additional studies and WGS sequences from the HMP and MetaHIT. In most body sites, we observed smooth abundance gradients of key genera without discrete clustering of samples. Some body habitats displayed bimodal (e.g., gut) or multimodal (e.g., vagina) distributions of sample abundances, but not all clustering methods and workflows accurately highlight such clusters. Because identifying enterotypes in datasets depends not only on the structure of the data but is also sensitive to the methods applied to identifying clustering strength, we recommend that multiple approaches be used and compared when testing for enterotypes. PMID:23326225
Sequencing BPS spectra

DOE PAGES

Gukov, Sergei; Nawata, Satoshi; Saberi, Ingmar; ...

2016-03-02

In this article, we provide both a detailed study of color-dependence of link homologies, as realized in physics as certain spaces of BPS states, and a broad study of the behavior of BPS states in general. We consider how the spectrum of BPS states varies as continuous parameters of a theory are perturbed. This question can be posed in a wide variety of physical contexts, and we answer it by proposing that the relationship between unperturbed and perturbed BPS spectra is described by a spectral sequence. These general considerations unify previous applications of spectral sequence techniques to physics, and explainmore » from a physical standpoint the appearance of many spectral sequences relating various link homology theories to one another. We also study structural properties of colored HOMFLY homology for links and evaluate Poincar e polynomials in numerous examples. Among these structural properties is a novel "sliding" property, which can be explained by using (re fined) modular S-matrix. This leads to the identi fication of modular transformations in Chern-Simons theory and 3d N = 2 theory via the 3d/3d correspondence. In conclusion, we introduce the notion of associated varieties as classical limits of recursion relations of colored superpolynomials of links, and study their properties.« less
A Comparative Study of Human Saposins.

PubMed

Garrido-Arandia, María; Cuevas-Zuviría, Bruno; Díaz-Perales, Araceli; Pacios, Luis F

2018-02-14

Saposins are small proteins implicated in trafficking and loading of lipids onto Cluster of Differentiation 1 (CD1) receptor proteins that in turn present lipid antigens to T cells and a variety of T-cell receptors, thus playing a crucial role in innate and adaptive immune responses in humans. Despite their low sequence identity, the four types of human saposins share a similar folding pattern consisting of four helices linked by three conserved disulfide bridges. However, their lipid-binding abilities as well as their activities in extracting, transporting and loading onto CD1 molecules a variety of sphingo- and phospholipids in biological membranes display two striking characteristics: a strong pH-dependence and a structural change between a compact, closed conformation and an open conformation. In this work, we present a comparative computational study of structural, electrostatic, and dynamic features of human saposins based upon their available experimental structures. By means of structural alignments, surface analyses, calculation of pH-dependent protonation states, Poisson-Boltzmann electrostatic potentials, and molecular dynamics simulations at three pH values representative of biological media where saposins fulfill their function, our results shed light into their intrinsic features. The similarities and differences in this class of proteins depend on tiny variations of local structural details that allow saposins to be key players in triggering responses in the human immune system.
Rapid Identification of Sequences for Orphan Enzymes to Power Accurate Protein Annotation

PubMed Central

Ojha, Sunil; Watson, Douglas S.; Bomar, Martha G.; Galande, Amit K.; Shearer, Alexander G.

2013-01-01

The power of genome sequencing depends on the ability to understand what those genes and their proteins products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the “back catalog” of enzymology – “orphan enzymes,” those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme “back catalog” is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology’s “back catalog” another powerful tool to drive accurate genome annotation. PMID:24386392
Rapid identification of sequences for orphan enzymes to power accurate protein annotation.

PubMed

Ramkissoon, Kevin R; Miller, Jennifer K; Ojha, Sunil; Watson, Douglas S; Bomar, Martha G; Galande, Amit K; Shearer, Alexander G

2013-01-01

The power of genome sequencing depends on the ability to understand what those genes and their proteins products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the "back catalog" of enzymology--"orphan enzymes," those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme "back catalog" is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology's "back catalog" another powerful tool to drive accurate genome annotation.
Plasmon-polaritonic bands in sequential doped graphene superlattices

NASA Astrophysics Data System (ADS)

Ramos-Mendieta, Felipe; Palomino-Ovando, Martha; Hernández-López, Alejandro; Fuentecilla-Cárcamo, Iván

Doped graphene has the extraordinary quality of supporting two types of surface excitations that involve electric charges (the transverse magnetic surface plasmons) or electric currents (the transverse electric modes). We have studied numerically the collective modes that result from the coupling of surface plasmons in doped graphene multilayers. By use of structured supercells with fixed dielectric background and inter layer separation, we found a series of plasmon-polaritonic bands of structure dependent on the doping sequence chosen for the graphene sheets. Periodic and quasiperiodic sequences for the graphene chemical potential have been studied. Our results show that transverse magnetic bands exist only in the low frequency regime but transverse electric bands arise within specific ranges of higher frequencies. Our calculations are valid for THz frequencies and graphene sheets with doping levels between 0.1 eV and 1.2 eV have been considered. AHL and IFC aknowledge fellowship support from CONACYT México.
Predicting RNA pseudoknot folding thermodynamics

PubMed Central

Cao, Song; Chen, Shi-Jie

2006-01-01

Based on the experimentally determined atomic coordinates for RNA helices and the self-avoiding walks of the P (phosphate) and C4 (carbon) atoms in the diamond lattice for the polynucleotide loop conformations, we derive a set of conformational entropy parameters for RNA pseudoknots. Based on the entropy parameters, we develop a folding thermodynamics model that enables us to compute the sequence-specific RNA pseudoknot folding free energy landscape and thermodynamics. The model is validated through extensive experimental tests both for the native structures and for the folding thermodynamics. The model predicts strong sequence-dependent helix-loop competitions in the pseudoknot stability and the resultant conformational switches between different hairpin and pseudoknot structures. For instance, for the pseudoknot domain of human telomerase RNA, a native-like and a misfolded hairpin intermediates are found to coexist on the (equilibrium) folding pathways, and the interplay between the stabilities of these intermediates causes the conformational switch that may underlie a human telomerase disease. PMID:16709732
SplicePlot: a utility for visualizing splicing quantitative trait loci.

PubMed

Wu, Eric; Nance, Tracy; Montgomery, Stephen B

2014-04-01

RNA sequencing has provided unprecedented resolution of alternative splicing and splicing quantitative trait loci (sQTL). However, there are few tools available for visualizing the genotype-dependent effects of splicing at a population level. SplicePlot is a simple command line utility that produces intuitive visualization of sQTLs and their effects. SplicePlot takes mapped RNA sequencing reads in BAM format and genotype data in VCF format as input and outputs publication-quality Sashimi plots, hive plots and structure plots, enabling better investigation and understanding of the role of genetics on alternative splicing and transcript structure. Source code and detailed documentation are available at http://montgomerylab.stanford.edu/spliceplot/index.html under Resources and at Github. SplicePlot is implemented in Python and is supported on Linux and Mac OS. A VirtualBox virtual machine running Ubuntu with SplicePlot already installed is also available.
Stabilization Effect of Amino Acid Side Chains in Peptide Assemblies on Graphite Studied by Scanning Tunneling Microscopy.

PubMed

Guo, Yuanyuan; Hou, Jingfei; Zhang, Xuemei; Yang, Yanlian; Wang, Chen

2017-04-19

An analysis is presented of the effects of amino acid side chains on peptide assemblies in ambient conditions on a graphite surface. The molecularly resolved assemblies of binary peptides are examined with scanning tunneling microscopy. A comparative analysis of the assembly structures reveals that the lamellae width has an appreciable dependence on the peptide sequence, which could be considered as a manifestation of a stabilizing effect of side-chain moieties of amino acids with high (phenylalanine) and low (alanine, asparagine, histidine and aspartic acid) propensities for aggregation. These amino acids are representative for the chemical structures involving the side chains of charged (histidine and aspartic acid), aromatic (phenylalanine), hydrophobic (alanine), and hydrophilic (asparagine) amino acids. These results might provide useful insight for understanding the effects of sequence on the assembly of surface-bound peptides. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
The interaction between the iron-responsive element binding protein and its cognate RNA is highly dependent upon both RNA sequence and structure.

PubMed

Jaffrey, S R; Haile, D J; Klausner, R D; Harford, J B

1993-09-25

To assess the influence of RNA sequence/structure on the interaction RNAs with the iron-responsive element binding protein (IRE-BP), twenty eight altered RNAs were tested as competitors for an RNA corresponding to the ferritin H chain IRE. All changes in the loop of the predicted IRE hairpin and in the unpaired cytosine residue characteristically found in IRE stems significantly decreased the apparent affinity of the RNA for the IRE-BP. Similarly, alteration in the spacing and/or orientation of the loop and the unpaired cytosine of the stem by either increasing or decreasing the number of base pairs separating them significantly reduced efficacy as a competitor. It is inferred that the IRE-BP forms multiple contacts with its cognate RNA, and that these contacts, acting in concert, provide the basis for the high affinity of this interaction.
A sequence-dependent rigid-base model of DNA

NASA Astrophysics Data System (ADS)

Gonzalez, O.; Petkevičiutė, D.; Maddocks, J. H.

2013-02-01

A novel hierarchy of coarse-grain, sequence-dependent, rigid-base models of B-form DNA in solution is introduced. The hierarchy depends on both the assumed range of energetic couplings, and the extent of sequence dependence of the model parameters. A significant feature of the models is that they exhibit the phenomenon of frustration: each base cannot simultaneously minimize the energy of all of its interactions. As a consequence, an arbitrary DNA oligomer has an intrinsic or pre-existing stress, with the level of this frustration dependent on the particular sequence of the oligomer. Attention is focussed on the particular model in the hierarchy that has nearest-neighbor interactions and dimer sequence dependence of the model parameters. For a Gaussian version of this model, a complete coarse-grain parameter set is estimated. The parameterized model allows, for an oligomer of arbitrary length and sequence, a simple and explicit construction of an approximation to the configuration-space equilibrium probability density function for the oligomer in solution. The training set leading to the coarse-grain parameter set is itself extracted from a recent and extensive database of a large number of independent, atomic-resolution molecular dynamics (MD) simulations of short DNA oligomers immersed in explicit solvent. The Kullback-Leibler divergence between probability density functions is used to make several quantitative assessments of our nearest-neighbor, dimer-dependent model, which is compared against others in the hierarchy to assess various assumptions pertaining both to the locality of the energetic couplings and to the level of sequence dependence of its parameters. It is also compared directly against all-atom MD simulation to assess its predictive capabilities. The results show that the nearest-neighbor, dimer-dependent model can successfully resolve sequence effects both within and between oligomers. For example, due to the presence of frustration, the model can successfully predict the nonlocal changes in the minimum energy configuration of an oligomer that are consequent upon a local change of sequence at the level of a single point mutation.
A sequence-dependent rigid-base model of DNA.

PubMed

Gonzalez, O; Petkevičiūtė, D; Maddocks, J H

2013-02-07

A novel hierarchy of coarse-grain, sequence-dependent, rigid-base models of B-form DNA in solution is introduced. The hierarchy depends on both the assumed range of energetic couplings, and the extent of sequence dependence of the model parameters. A significant feature of the models is that they exhibit the phenomenon of frustration: each base cannot simultaneously minimize the energy of all of its interactions. As a consequence, an arbitrary DNA oligomer has an intrinsic or pre-existing stress, with the level of this frustration dependent on the particular sequence of the oligomer. Attention is focussed on the particular model in the hierarchy that has nearest-neighbor interactions and dimer sequence dependence of the model parameters. For a Gaussian version of this model, a complete coarse-grain parameter set is estimated. The parameterized model allows, for an oligomer of arbitrary length and sequence, a simple and explicit construction of an approximation to the configuration-space equilibrium probability density function for the oligomer in solution. The training set leading to the coarse-grain parameter set is itself extracted from a recent and extensive database of a large number of independent, atomic-resolution molecular dynamics (MD) simulations of short DNA oligomers immersed in explicit solvent. The Kullback-Leibler divergence between probability density functions is used to make several quantitative assessments of our nearest-neighbor, dimer-dependent model, which is compared against others in the hierarchy to assess various assumptions pertaining both to the locality of the energetic couplings and to the level of sequence dependence of its parameters. It is also compared directly against all-atom MD simulation to assess its predictive capabilities. The results show that the nearest-neighbor, dimer-dependent model can successfully resolve sequence effects both within and between oligomers. For example, due to the presence of frustration, the model can successfully predict the nonlocal changes in the minimum energy configuration of an oligomer that are consequent upon a local change of sequence at the level of a single point mutation.
The conserved CAAGAAAGA spacer sequence is an essential element for the formation of 3' termini of the sea urchin H3 histone mRNA by RNA processing.

PubMed Central

Georgiev, O; Birnstiel, M L

1985-01-01

Analysis of cDNA sequences obtained from the small nuclear RNA U7 has previously suggested specific contacts, by base pairing, between the conserved stem-loop structure and CAAGAAAGA sequence of the histone pre-mRNA and the 5'-terminal sequence of the U7 RNA during RNA processing. In order to test some aspects of the model we have created a series of linker scan, deletion and insertion mutants of the 3' terminus of a sea urchin H3 histone gene and have injected mutant DNAs or in vitro synthesized precursors into frog oocyte nuclei for interpretation. We find that, in addition to the stem-loop structure of the mRNA, the CAAGAAAGA spacer transcript within the histone pre-mRNA is required absolutely for RNA processing, as predicted from our model. Spacer sequences immediately downstream of the CAAGAAAGA motif are not complementary to U7 RNA. Nevertheless, they are necessary for obtaining a maximal rate of RNA processing, as is the ACCA sequence coding for the 3' terminus of the mature mRNA. An increase of distance between the mRNA palindrome and the CAAGAAAGA by as little as six nucleotides abolishes all processing. It may, therefore, be useful to regard both these sequence motifs as part of one and the same RNA processing signal with narrowly defined topologies. Interestingly, U7 RNA-dependent 3' processing of histone pre-mRNA can occur in RNA injection experiments only when the in vitro synthesized pre-mRNA contains sequence extensions well beyond the region of sequence complementarities to the U7 RNA. In addition to directing 3' processing the terminal mRNA sequences may have a role in histone mRNA stabilization in the cytoplasmic compartment. Images Fig. 3. Fig. 4. Fig. 5. Fig. 6. Fig. 7. PMID:2410259
Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features.

PubMed

Sun, Ming-An; Zhang, Qing; Wang, Yejun; Ge, Wei; Guo, Dianjing

2016-08-24

Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical roles for redox regulation and signaling. Recently, several computational tools have been developed for predicting redox-sensitive cysteines; however, those methods either only focus on catalytic redox-sensitive cysteines in thiol oxidoreductases, or heavily depend on protein structural data, thus cannot be widely used. In this study, we analyzed various sequence-based features potentially related to cysteine redox-sensitivity, and identified three types of features for efficient computational prediction of redox-sensitive cysteines. These features are: sequential distance to the nearby cysteines, PSSM profile and predicted secondary structure of flanking residues. After further feature selection using SVM-RFE, we developed Redox-Sensitive Cysteine Predictor (RSCP), a SVM based classifier for redox-sensitive cysteine prediction using primary sequence only. Using 10-fold cross-validation on RSC758 dataset, the accuracy, sensitivity, specificity, MCC and AUC were estimated as 0.679, 0.602, 0.756, 0.362 and 0.727, respectively. When evaluated using 10-fold cross-validation with BALOSCTdb dataset which has structure information, the model achieved performance comparable to current structure-based method. Further validation using an independent dataset indicates it is robust and of relatively better accuracy for predicting redox-sensitive cysteines from non-enzyme proteins. In this study, we developed a sequence-based classifier for predicting redox-sensitive cysteines. The major advantage of this method is that it does not rely on protein structure data, which ensures more extensive application compared to other current implementations. Accurate prediction of redox-sensitive cysteines not only enhances our understanding about the redox sensitivity of cysteine, it may also complement the proteomics approach and facilitate further experimental investigation of important redox-sensitive cysteines.
Impact of target mRNA structure on siRNA silencing efficiency: A large-scale study.

PubMed

Gredell, Joseph A; Berger, Angela K; Walton, S Patrick

2008-07-01

The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5'- and 3'-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5'-end or 3'-end were silenced, on average, approximately 10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. (c) 2008 Wiley Periodicals, Inc.
Impact of target mRNA structure on siRNA silencing efficiency: a large-scale study

PubMed Central

Gredell, Joseph A.; Berger, Angela K.; Walton, S. Patrick

2009-01-01

The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5’- and 3’-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5’-end or 3’-end were silenced, on average, ~10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. PMID:18306428

Free energy determinants of secondary structure formation: III. beta-turns and their role in protein folding.

PubMed

Yang, A S; Hitz, B; Honig, B

1996-06-21

The stability of beta-turns is calculated as a function of sequence and turn type with a Monte Carlo sampling technique. The conformational energy of four internal hydrogen-bonded turn types, I, I', II and II', is obtained by evaluating their gas phase energy with the CHARMM force field and accounting for solvation effects with the Finite Difference Poisson-Boltzmann (FDPB) method. All four turn types are found to be less stable than the coil state, independent of the sequence in the turn. The free-energy penalties associated with turn formation vary between 1.6 kcal/mol and 7.7 kcal/mol, depending on the sequence and turn type. Differences in turn stability arise mainly from intraresidue interactions within the two central residues of the turn. For each combination of the two central residues, except for -Gly-Gly-, the most stable beta-turn type is always found to occur most commonly in native proteins. The fact that a model based on local interactions accounts for the observed preference of specific sequences suggests that long-range tertiary interactions tend to play a secondary role in determining turn conformation. In contrast, for beta-hairpins, long-range interactions appear to dominate. Specifically, due to the right-handed twist of beta-strands, type I' turns for -Gly-Gly- are found to occur with high frequency, even when local energetics would dictate otherwise. The fact that any combination of two residues is found able to adopt a relatively low-energy turn structure explains why the amino acid sequence in turns is highly variable. The calculated free-energy cost of turn formation, when combined with related numbers obtained for alpha-helices and beta-sheets, suggests a model for the initiation of protein folding based on metastable fragments of secondary structure.
MS/MS-Assisted Design of Sequence-Controlled Synthetic Polymers for Improved Reading of Encoded Information

NASA Astrophysics Data System (ADS)

Charles, Laurence; Cavallo, Gianni; Monnier, Valérie; Oswald, Laurence; Szweda, Roza; Lutz, Jean-François

2017-06-01

In order to improve their MS/MS sequencing, structure of sequence-controlled synthetic polymers can be optimized based on considerations regarding their fragmentation behavior in collision-induced dissociation conditions, as demonstrated here for two digitally encoded polymer families. In poly(triazole amide)s, the main dissociation route proceeded via cleavage of the amide bond in each monomer, hence allowing the chains to be safely sequenced. However, a competitive cleavage of an ether bond in a tri(ethylene glycol) spacer placed between each coding moiety complicated MS/MS spectra while not bringing new structural information. Changing the tri(ethylene glycol) spacer to an alkyl group of the same size allowed this unwanted fragmentation pathway to be avoided, hence greatly simplifying the MS/MS reading step for such undecyl-based poly(triazole amide)s. In poly(alkoxyamine phosphodiester)s, a single dissociation pathway was achieved with repeating units containing an alkoxyamine linkage, which, by very low dissociation energy, made any other chemical bonds MS/MS-silent. Structure of these polymers was further tailored to enhance the stability of those precursor ions with a negatively charged phosphate group per monomer in order to improve their MS/MS readability. Increasing the size of both the alkyl coding moiety and the nitroxide spacer allowed sufficient distance between phosphate groups for all of them to be deprotonated simultaneously. Because the charge state of product ions increased with their polymerization degree, MS/MS spectra typically exhibited groups of fragments at one or the other side of the precursor ion depending on the original α or ω end-group they contain, allowing sequence reconstruction in a straightforward manner. [Figure not available: see fulltext.
Variations in Nuclear Localization Strategies Among Pol X Family Enzymes.

PubMed

Kirby, Thomas W; Pedersen, Lars C; Gabel, Scott A; Gassman, Natalie R; London, Robert E

2018-06-22

Despite the essential roles of pol X family enzymes in DNA repair, information about the structural basis of their nuclear import is limited. Recent studies revealed the unexpected presence of a functional NLS in DNA polymerase β, indicating the importance of active nuclear targeting, even for enzymes likely to leak into and out of the nucleus. The current studies further explore the active nuclear transport of these enzymes by identifying and structurally characterizing the functional NLS sequences in the three remaining human pol X enzymes: terminal deoxynucleotidyl transferase (TdT), DNA polymerase μ (pol μ), and DNA polymerase λ (pol λ). NLS identifications are based on Importin α (Impα) binding affinity determined by fluorescence polarization of fluorescein-labeled NLS peptides, X-ray crystallographic analysis of the Impα∆IBB•NLS complexes, and fluorescence-based subcellular localization studies. All three polymerases use NLS sequences located near their N-terminus; TdT and pol μ utilize monopartite NLS sequences, while pol λ utilizes a bipartite sequence, unique among the pol X family members. The pol μ NLS has relatively weak measured affinity for Impα, due in part to its proximity to the N-terminus that limits non-specific interactions of flanking residues preceding the NLS. However, this effect is partially mitigated by an N-terminal sequence unsupportive of Met1 removal by methionine aminopeptidase, leading to a 3-fold increase in affinity when the N-terminal methionine is present. Nuclear targeting is unique to each pol X family enzyme with variations dependent on the structure and unique functional role of each polymerase. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
DNA-Templated Polymerization of Side-Chain-Functionalized Peptide Nucleic Acid Aldehydes

PubMed Central

Kleiner, Ralph E.; Brudno, Yevgeny; Birnbaum, Michael E.; Liu, David R.

2009-01-01

The DNA-templated polymerization of synthetic building blocks provides a potential route to the laboratory evolution of sequence-defined polymers with structures and properties not necessarily limited to those of natural biopolymers. We previously reported the efficient and sequence-specific DNA-templated polymerization of peptide nucleic acid (PNA) aldehydes. Here, we report the enzyme-free, DNA-templated polymerization of side-chain-functionalized PNA tetramer and pentamer aldehydes. We observed that the polymerization of tetramer and pentamer PNA building blocks with a single lysine-based side chain at various positions in the building block could proceed efficiently and sequence-specifically. In addition, DNA-templated polymerization also proceeded efficiently and in a sequence-specific manner with pentamer PNA aldehydes containing two or three lysine side chains in a single building block to generate more densely functionalized polymers. To further our understanding of side-chain compatibility and expand the capabilities of this system, we also examined the polymerization efficiencies of 20 pentamer building blocks each containing one of five different side-chain groups and four different side-chain regio- and stereochemistries. Polymerization reactions were efficient for all five different side-chain groups and for three of the four combinations of side-chain regio- and stereochemistries. Differences in the efficiency and initial rate of polymerization correlate with the apparent melting temperature of each building block, which is dependent on side-chain regio- and stereochemistry, but relatively insensitive to side-chain structure among the substrates tested. Our findings represent a significant step towards the evolution of sequence-defined synthetic polymers and also demonstrate that enzyme-free nucleic acid-templated polymerization can occur efficiently using substrates with a wide range of side-chain structures, functionalization positions within each building block, and functionalization densities. PMID:18341334
Sequentially distant but structurally similar proteins exhibit fold specific patterns based on their biophysical properties.

PubMed

Rajendran, Senthilnathan; Jothi, Arunachalam

2018-05-16

The Three-dimensional structure of a protein depends on the interaction between their amino acid residues. These interactions are in turn influenced by various biophysical properties of the amino acids. There are several examples of proteins that share the same fold but are very dissimilar at the sequence level. For proteins to share a common fold some crucial interactions should be maintained despite insignificant sequence similarity. Since the interactions are because of the biophysical properties of the amino acids, we should be able to detect descriptive patterns for folds at such a property level. In this line, the main focus of our research is to analyze such proteins and to characterize them in terms of their biophysical properties. Protein structures with sequence similarity lesser than 40% were selected for ten different subfolds from three different mainfolds (according to CATH classification) and were used for this analysis. We used the normalized values of the 49 physio-chemical, energetic and conformational properties of amino acids. We characterize the folds based on the average biophysical property values. We also observed a fold specific correlational behavior of biophysical properties despite a very low sequence similarity in our data. We further trained three different binary classification models (Naive Bayes-NB, Support Vector Machines-SVM and Bayesian Generalized Linear Model-BGLM) which could discriminate mainfold based on the biophysical properties. We also show that among the three generated models, the BGLM classifier model was able to discriminate protein sequences coming under all beta category with 81.43% accuracy and all alpha, alpha-beta proteins with 83.37% accuracy. Copyright © 2018 Elsevier Ltd. All rights reserved.
Structural Requirement in Clostridium perfringens Collagenase mRNA 5′ Leader Sequence for Translational Induction through Small RNA-mRNA Base Pairing

PubMed Central

Nomura, Nobuhiko; Nakamura, Kouji

2013-01-01

The Gram-positive anaerobic bacterium Clostridium perfringens is pathogenic to humans and animals, and the production of its toxins is strictly regulated during the exponential phase. We recently found that the 5′ leader sequence of the colA transcript encoding collagenase, which is a major toxin of this organism, is processed and stabilized in the presence of the small RNA VR-RNA. The primary colA 5′-untranslated region (5′UTR) forms a long stem-loop structure containing an internal bulge and masks its own ribosomal binding site. Here we found that VR-RNA directly regulates colA expression through base pairing with colA mRNA in vivo. However, when the internal bulge structure was closed by point mutations in colA mRNA, translation ceased despite the presence of VR-RNA. In addition, a mutation disrupting the colA stem-loop structure induced mRNA processing and ColA-FLAG translational activation in the absence of VR-RNA, indicating that the stem-loop and internal bulge structure of the colA 5′ leader sequence is important for regulation by VR-RNA. On the other hand, processing was required for maximal ColA expression but was not essential for VR-RNA-dependent colA regulation. Finally, colA processing and translational activation were induced at a high temperature without VR-RNA. These results suggest that inhibition of the colA 5′ leader structure through base pairing is the primary role of VR-RNA in colA regulation and that the colA 5′ leader structure is a possible thermosensor. PMID:23585542
The VMC Survey. XXVII. Young Stellar Structures in the LMC’s Bar Star-forming Complex

NASA Astrophysics Data System (ADS)

Sun, Ning-Chen; de Grijs, Richard; Subramanian, Smitha; Bekki, Kenji; Bell, Cameron P. M.; Cioni, Maria-Rosa L.; Ivanov, Valentin D.; Marconi, Marcella; Oliveira, Joana M.; Piatti, Andrés E.; Ripepi, Vincenzo; Rubele, Stefano; Tatton, Ben L.; van Loon, Jacco Th.

2017-11-01

Star formation is a hierarchical process, forming young stellar structures of star clusters, associations, and complexes over a wide range of scales. The star-forming complex in the bar region of the Large Magellanic Cloud is investigated with upper main-sequence stars observed by the VISTA Survey of the Magellanic Clouds. The upper main-sequence stars exhibit highly nonuniform distributions. Young stellar structures inside the complex are identified from the stellar density map as density enhancements of different significance levels. We find that these structures are hierarchically organized such that larger, lower-density structures contain one or several smaller, higher-density ones. They follow power-law size and mass distributions, as well as a lognormal surface density distribution. All these results support a scenario of hierarchical star formation regulated by turbulence. The temporal evolution of young stellar structures is explored by using subsamples of upper main-sequence stars with different magnitude and age ranges. While the youngest subsample, with a median age of log(τ/yr) = 7.2, contains the most substructure, progressively older ones are less and less substructured. The oldest subsample, with a median age of log(τ/yr) = 8.0, is almost indistinguishable from a uniform distribution on spatial scales of 30-300 pc, suggesting that the young stellar structures are completely dispersed on a timescale of ˜100 Myr. These results are consistent with the characteristics of the 30 Doradus complex and the entire Large Magellanic Cloud, suggesting no significant environmental effects. We further point out that the fractal dimension may be method dependent for stellar samples with significant age spreads.
The zero age main sequence of WIMP burners

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fairbairn, Malcolm; Scott, Pat; Edsjoe, Joakim

2008-02-15

We modify a stellar structure code to estimate the effect upon the main sequence of the accretion of weakly-interacting dark matter onto stars and its subsequent annihilation. The effect upon the stars depends upon whether the energy generation rate from dark matter annihilation is large enough to shut off the nuclear burning in the star. Main sequence weakly-interacting massive particles (WIMP) burners look much like proto-stars moving on the Hayashi track, although they are in principle completely stable. We make some brief comments about where such stars could be found, how they might be observed and more detailed simulations whichmore » are currently in progress. Finally we comment on whether or not it is possible to link the paradoxically hot, young stars found at the galactic center with WIMP burners.« less
ESTIMATING THE RADIUS OF THE CONVECTIVE CORE OF MAIN-SEQUENCE STARS FROM OBSERVED OSCILLATION FREQUENCIES

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yang, Wuming, E-mail: yangwuming@bnu.edu.cn, E-mail: yangwuming@ynao.ac.cn

The determination of the size of the convective core of main-sequence stars is usually dependent on the construction of models of stars. Here we introduce a method to estimate the radius of the convective core of main-sequence stars with masses between about 1.1 and 1.5 M {sub ⊙} from observed frequencies of low-degree p -modes. A formula is proposed to achieve the estimation. The values of the radius of the convective core of four known stars are successfully estimated by the formula. The radius of the convective core of KIC 9812850 estimated by the formula is 0.140 ± 0.028 Rmore » {sub ⊙}. In order to confirm this prediction, a grid of evolutionary models was computed. The value of the convective-core radius of the best-fit model of KIC 9812850 is 0.149 R {sub ⊙}, which is in good agreement with that estimated by the formula from observed frequencies. The formula aids in understanding the interior structure of stars directly from observed frequencies. The understanding is not dependent on the construction of models.« less
siRNA and innate immunity.

PubMed

Robbins, Marjorie; Judge, Adam; MacLachlan, Ian

2009-06-01

Canonical small interfering RNA (siRNA) duplexes are potent activators of the mammalian innate immune system. The induction of innate immunity by siRNA is dependent on siRNA structure and sequence, method of delivery, and cell type. Synthetic siRNA in delivery vehicles that facilitate cellular uptake can induce high levels of inflammatory cytokines and interferons after systemic administration in mammals and in primary human blood cell cultures. This activation is predominantly mediated by immune cells, normally via a Toll-like receptor (TLR) pathway. The siRNA sequence dependency of these pathways varies with the type and location of the TLR involved. Alternatively nonimmune cell activation may also occur, typically resulting from siRNA interaction with cytoplasmic RNA sensors such as RIG1. As immune activation by siRNA-based drugs represents an undesirable side effect due to the considerable toxicities associated with excessive cytokine release in humans, understanding and abrogating this activity will be a critical component in the development of safe and effective therapeutics. This review describes the intracellular mechanisms of innate immune activation by siRNA, the design of appropriate sequences and chemical modification approaches, and suitable experimental methods for studying their effects, with a view toward reducing siRNA-mediated off-target effects.
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data

PubMed Central

Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo

2018-01-01

RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Yanfeng; Zheng, Yi; Qin, Ling

Beta-hydroxyacid dehydrogenase (β-HAD) genes have been identified in all sequenced genomes of eukaryotes and prokaryotes. Their gene products catalyze the NAD+- or NADP+-dependent oxidation of various β-hydroxy acid substrates into their corresponding semialdehyde. In many fungal and bacterial genomes, multiple β-HAD genes are observed leading to the hypothesis that these gene products may have unique, uncharacterized metabolic roles specific to their species. The genomes of Geobacter sulfurreducens and Geobacter metallireducens each contain two potential β-HAD genes. The protein sequences of one pair of these genes, Gs-βHAD (Q74DE4) and Gm-βHAD (Q39R98), have 65% sequence identity and 77% sequence similarity with eachmore » other. Both proteins reduce succinic semialdehyde, a metabolite of the GABA shunt. To further explore the structural and functional characteristics of these two β-HADs with a potentially unique substrate specificity, crystal structures for Gs-βHAD and Gm-βHAD in complex with NADP+ were determined to a resolution of 1.89 Å and 2.07 Å, respectively. The structure of both proteins are similar, composed of 14 α-helices and nine β-strands organized into two domains. Domain One (1-165) adopts a typical Rossmann fold composed of two α/β units: a six-strand parallel β-sheet surrounded by six α-helices (α1 – α6) followed by a mixed three-strand β-sheet surrounded by two α-helices (α7 and α8). Domain Two (166-287) is composed of a bundle of seven α-helices (α9 – α14). Four functional regions conserved in all β-HADs are spatially located near each other at the interdomain cleft in both Gs-βHAD and Gm-βHAD with a buried molecule of NADP+. The structural features of Gs-βHAD and Gm-βHAD are described in relation to the four conserved consensus sequences characteristic of β-HADs and the potential biochemical importance of these enzymes as an alternative pathway for the degradation of succinic semialdehyde.« less
FMRI investigation of cross-modal interactions in beat perception: Audition primes vision, but not vice versa

PubMed Central

Grahn, Jessica A.; Henry, Molly J.; McAuley, J. Devin

2011-01-01

How we measure time and integrate temporal cues from different sensory modalities are fundamental questions in neuroscience. Sensitivity to a “beat” (such as that routinely perceived in music) differs substantially between auditory and visual modalities. Here we examined beat sensitivity in each modality, and examined cross-modal influences, using functional magnetic resonance imaging (fMRI) to characterize brain activity during perception of auditory and visual rhythms. In separate fMRI sessions, participants listened to auditory sequences or watched visual sequences. The order of auditory and visual sequence presentation was counterbalanced so that cross-modal order effects could be investigated. Participants judged whether sequences were speeding up or slowing down, and the pattern of tempo judgments was used to derive a measure of sensitivity to an implied beat. As expected, participants were less sensitive to an implied beat in visual sequences than in auditory sequences. However, visual sequences produced a stronger sense of beat when preceded by auditory sequences with identical temporal structure. Moreover, increases in brain activity were observed in the bilateral putamen for visual sequences preceded by auditory sequences when compared to visual sequences without prior auditory exposure. No such order-dependent differences (behavioral or neural) were found for the auditory sequences. The results provide further evidence for the role of the basal ganglia in internal generation of the beat and suggest that an internal auditory rhythm representation may be activated during visual rhythm perception. PMID:20858544
Sequence-structure mapping errors in the PDB: OB-fold domains

PubMed Central

Venclovas, Česlovas; Ginalski, Krzysztof; Kang, Chulhee

2004-01-01

The Protein Data Bank (PDB) is the single most important repository of structural data for proteins and other biologically relevant molecules. Therefore, it is critically important to keep the PDB data, as much as possible, error-free. In this study, we have analyzed PDB crystal structures possessing oligonucleotide/oligosaccharide binding (OB)-fold, one of the highly populated folds, for the presence of sequence-structure mapping errors. Using energy-based structure quality assessment coupled with sequence analyses, we have found that there are at least five OB-structures in the PDB that have regions where sequences have been incorrectly mapped onto the structure. We have demonstrated that the combination of these computation techniques is effective not only in detecting sequence-structure mapping errors, but also in providing guidance to correct them. Namely, we have used results of computational analysis to direct a revision of X-ray data for one of the PDB entries containing a fairly inconspicuous sequence-structure mapping error. The revised structure has been deposited with the PDB. We suggest use of computational energy assessment and sequence analysis techniques to facilitate structure determination when homologs having known structure are available to use as a reference. Such computational analysis may be useful in either guiding the sequence-structure assignment process or verifying the sequence mapping within poorly defined regions. PMID:15133161
Modeling Of Object- And Scene-Prototypes With Hierarchically Structured Classes

NASA Astrophysics Data System (ADS)

Ren, Z.; Jensch, P.; Ameling, W.

1989-03-01

The success of knowledge-based image analysis methodology and implementation tools depends largely on an appropriately and efficiently built model wherein the domain-specific context information about and the inherent structure of the observed image scene have been encoded. For identifying an object in an application environment a computer vision system needs to know firstly the description of the object to be found in an image or in an image sequence, secondly the corresponding relationships between object descriptions within the image sequence. This paper presents models of image objects scenes by means of hierarchically structured classes. Using the topovisual formalism of graph and higraph, we are currently studying principally the relational aspect and data abstraction of the modeling in order to visualize the structural nature resident in image objects and scenes, and to formalize. their descriptions. The goal is to expose the structure of image scene and the correspondence of image objects in the low level image interpretation. process. The object-based system design approach has been applied to build the model base. We utilize the object-oriented programming language C + + for designing, testing and implementing the abstracted entity classes and the operation structures which have been modeled topovisually. The reference images used for modeling prototypes of objects and scenes are from industrial environments as'well as medical applications.
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE PAGES

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...

2016-03-09

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less
The red-sequence of 72 WINGS local galaxy clusters

NASA Astrophysics Data System (ADS)

Valentinuzzi, T.; Poggianti, B. M.; Fasano, G.; D'Onofrio, M.; Moretti, A.; Ramella, M.; Biviano, A.; Fritz, J.; Varela, J.; Bettoni, D.; Vulcani, B.; Moles, M.; Couch, W. J.; Dressler, A.; Kjærgaard, P.; Omizzolo, A.; Cava, A.

2011-12-01

We study the color - magnitude red sequence and blue fraction of 72 X-ray selected galaxy clusters at z = 0.04-0.07 from the WINGS survey, searching for correlations between the characteristics of the red sequence (RS) and the environment. We consider the slope and scatter of the red sequence, the number ratio of red luminous-to-faint galaxies, the blue fraction, and the fractions of ellipticals, S0s, and spirals that compose the RS. None of these quantities correlate with the cluster velocity dispersion, X-ray luminosity, number of cluster substructures, BCG prevalence over next brightest galaxies, and the spatial concentration of ellipticals. The properties of the RS, instead, depend strongly on local galaxy density. Higher density regions have a smaller RS scatter, a higher luminous-to-faint ratio, a lower blue fraction, and a lower spiral fraction on the RS. Our results clearly illustrate the prominent effect of the local density in setting the epoch when galaxies become passive and join the red sequence, as opposed to the mass of the galaxy host structure.
Modeling Structure-Function Relationships in Synthetic DNA Sequences using Attribute Grammars

PubMed Central

Cai, Yizhi; Lux, Matthew W.; Adam, Laura; Peccoud, Jean

2009-01-01

Recognizing that certain biological functions can be associated with specific DNA sequences has led various fields of biology to adopt the notion of the genetic part. This concept provides a finer level of granularity than the traditional notion of the gene. However, a method of formally relating how a set of parts relates to a function has not yet emerged. Synthetic biology both demands such a formalism and provides an ideal setting for testing hypotheses about relationships between DNA sequences and phenotypes beyond the gene-centric methods used in genetics. Attribute grammars are used in computer science to translate the text of a program source code into the computational operations it represents. By associating attributes with parts, modifying the value of these attributes using rules that describe the structure of DNA sequences, and using a multi-pass compilation process, it is possible to translate DNA sequences into molecular interaction network models. These capabilities are illustrated by simple example grammars expressing how gene expression rates are dependent upon single or multiple parts. The translation process is validated by systematically generating, translating, and simulating the phenotype of all the sequences in the design space generated by a small library of genetic parts. Attribute grammars represent a flexible framework connecting parts with models of biological function. They will be instrumental for building mathematical models of libraries of genetic constructs synthesized to characterize the function of genetic parts. This formalism is also expected to provide a solid foundation for the development of computer assisted design applications for synthetic biology. PMID:19816554
Theoretical Insights into the Biophysics of Protein Bi-stability and Evolutionary Switches

PubMed Central

Krobath, Heinrich; Chan, Hue Sun

2016-01-01

Deciphering the effects of nonsynonymous mutations on protein structure is central to many areas of biomedical research and is of fundamental importance to the study of molecular evolution. Much of the investigation of protein evolution has focused on mutations that leave a protein’s folded structure essentially unchanged. However, to evolve novel folds of proteins, mutations that lead to large conformational modifications have to be involved. Unraveling the basic biophysics of such mutations is a challenge to theory, especially when only one or two amino acid substitutions cause a large-scale conformational switch. Among the few such mutational switches identified experimentally, the one between the GA all-α and GB α+β folds is extensively characterized; but all-atom simulations using fully transferrable potentials have not been able to account for this striking switching behavior. Here we introduce an explicit-chain model that combines structure-based native biases for multiple alternative structures with a general physical atomic force field, and apply this construct to twelve mutants spanning the sequence variation between GA and GB. In agreement with experiment, we observe conformational switching from GA to GB upon a single L45Y substitution in the GA98 mutant. In line with the latent evolutionary potential concept, our model shows a gradual sequence-dependent change in fold preference in the mutants before this switch. Our analysis also indicates that a sharp GA/GB switch may arise from the orientation dependence of aromatic π-interactions. These findings provide physical insights toward rationalizing, predicting and designing evolutionary conformational switches. PMID:27253392

Analyses of the radiation of birnaviruses from diverse host phyla and of their evolutionary affinities with other double-stranded RNA and positive strand RNA viruses using robust structure-based multiple sequence alignments and advanced phylogenetic methods

PubMed Central

2013-01-01

Background Birnaviruses form a distinct family of double-stranded RNA viruses infecting animals as different as vertebrates, mollusks, insects and rotifers. With such a wide host range, they constitute a good model for studying the adaptation to the host. Additionally, several lines of evidence link birnaviruses to positive strand RNA viruses and suggest that phylogenetic analyses may provide clues about transition. Results We characterized the genome of a birnavirus from the rotifer Branchionus plicalitis. We used X-ray structures of RNA-dependent RNA polymerases and capsid proteins to obtain multiple structure alignments that allowed us to obtain reliable multiple sequence alignments and we employed “advanced” phylogenetic methods to study the evolutionary relationships between some positive strand and double-stranded RNA viruses. We showed that the rotifer birnavirus genome exhibited an organization remarkably similar to other birnaviruses. As this host was phylogenetically very distant from the other known species targeted by birnaviruses, we revisited the evolutionary pathways within the Birnaviridae family using phylogenetic reconstruction methods. We also applied a number of phylogenetic approaches based on structurally conserved domains/regions of the capsid and RNA-dependent RNA polymerase proteins to study the evolutionary relationships between birnaviruses, other double-stranded RNA viruses and positive strand RNA viruses. Conclusions We show that there is a good correlation between the phylogeny of the birnaviruses and that of their hosts at the phylum level using the RNA-dependent RNA polymerase (genomic segment B) on the one hand and a concatenation of the capsid protein, protease and ribonucleoprotein (genomic segment A) on the other hand. This correlation tends to vanish within phyla. The use of advanced phylogenetic methods and robust structure-based multiple sequence alignments allowed us to obtain a more accurate picture (in terms of probability of the tree topologies) of the evolutionary affinities between double-stranded RNA and positive strand RNA viruses. In particular, we were able to show that there exists a good statistical support for the claims that dsRNA viruses are not monophyletic and that viruses with permuted RdRps belong to a common evolution lineage as previously proposed by other groups. We also propose a tree topology with a good statistical support describing the evolutionary relationships between the Picornaviridae, Caliciviridae, Flaviviridae families and a group including the Alphatetraviridae, Nodaviridae, Permutotretraviridae, Birnaviridae, and Cystoviridae families. PMID:23865988
Aggregation of peptides in the tube model with correlated sidechain orientations

NASA Astrophysics Data System (ADS)

Hung, Nguyen Ba; Hoang, Trinh Xuan

2015-06-01

The ability of proteins and peptides to aggregate and form toxic amyloid fibrils is associated with a range of diseases including BSE (or mad cow), Alzheimer's and Parkinson's Diseases. In this study, we investigate the the role of amino acid sequence in the aggregation propensity by using a modified tube model with a new procedure for hydrophobic interaction. In this model, the amino acid sidechains are not considered explicitly, but their orientations are taken into account in the formation of hydrophobic contact. Extensive Monte Carlo simulations for systems of short peptides are carried out with the use of parallel tempering technique. Our results show that the propensity to form and the structures of the aggregates strongly depend on the amino acid sequence and the number of peptides. Some sequences may not aggregate at all at a presumable physiological temperature while other can easily form fibril-like, β-sheet struture. Our study provides an insight into the principles of how the formation of amyloid can be governed by amino acid sequence.
Nanopore Kinetic Proofreading of DNA Sequences

NASA Astrophysics Data System (ADS)

Ling, Xinsheng Sean

The concept of DNA sequencing using the time dependence of the nanopore ionic current was proposed in 1996 by Kasianowicz, Brandin, Branton, and Deamer (KBBD). The KBBD concept has generated tremendous amount interests in recent decade. In this talk, I will review the current understanding of the DNA ``translocation'' dynamics and how it can be described by Schrodinger's 1915 paper on first-passage-time distribution function. Schrodinger's distribution function can be used to give a rigorous criterion for achieving nanopore DNA sequencing which turns out to be identical to that of gel electrophoresis used by Sanger in the first-generation Sanger method. A nanopore DNA sequencing technology also requires discrimination of bases with high accuracies. I will describe a solid-state nanopore sandwich structure that can function as a proofreading device capable of discriminating between correct and incorrect hybridization probes with an accuracy rivaling that of high-fidelity DNA polymerases. The latest results from Nanjing will be presented. This work is supported by China 1000-Talent Program at Southeast University, Nanjing, China.
A Parvovirus B19 synthetic genome: sequence features and functional competence.

PubMed

Manaresi, Elisabetta; Conti, Ilaria; Bua, Gloria; Bonvicini, Francesca; Gallinella, Giorgio

2017-08-01

Central to genetic studies for Parvovirus B19 (B19V) is the availability of genomic clones that may possess functional competence and ability to generate infectious virus. In our study, we established a new model genetic system for Parvovirus B19. A synthetic approach was followed, by design of a reference genome sequence, by generation of a corresponding artificial construct and its molecular cloning in a complete and functional form, and by setup of an efficient strategy to generate infectious virus, via transfection in UT7/EpoS1 cells and amplification in erythroid progenitor cells. The synthetic genome was able to generate virus with biological properties paralleling those of native virus, its infectious activity being dependent on the preservation of self-complementarity and sequence heterogeneity within the terminal regions. A virus of defined genome sequence, obtained from controlled cell culture conditions, can constitute a reference tool for investigation of the structural and functional characteristics of the virus. Copyright © 2017 Elsevier Inc. All rights reserved.
A corticostriatal deficit promotes temporal distortion of automatic action in ageing

PubMed Central

Matamales, Miriam; Skrbis, Zala; Bailey, Matthew R; Balsam, Peter D; Balleine, Bernard W; Götz, Jürgen

2017-01-01

The acquisition of motor skills involves implementing action sequences that increase task efficiency while reducing cognitive loads. This learning capacity depends on specific cortico-basal ganglia circuits that are affected by normal ageing. Here, combining a series of novel behavioural tasks with extensive neuronal mapping and targeted cell manipulations in mice, we explored how ageing of cortico-basal ganglia networks alters the microstructure of action throughout sequence learning. We found that, after extended training, aged mice produced shorter actions and displayed squeezed automatic behaviours characterised by ultrafast oligomeric action chunks that correlated with deficient reorganisation of corticostriatal activity. Chemogenetic disruption of a striatal subcircuit in young mice reproduced age-related within-sequence features, and the introduction of an action-related feedback cue temporarily restored normal sequence structure in aged mice. Our results reveal static properties of aged cortico-basal ganglia networks that introduce temporal limits to action automaticity, something that can compromise procedural learning in ageing. PMID:29058672
Amino acid sequence analysis of the annexin super-gene family of proteins.

PubMed

Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J

1991-06-15

The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of the predictions and shows the power of techniques for the determination of tertiary structural information from the amino acid sequences of an aligned protein family.
Effects of syllable structure in aphasic errors: implications for a new model of speech production.

PubMed

Romani, Cristina; Galluzzi, Claudia; Bureca, Ivana; Olson, Andrew

2011-03-01

Current models of word production assume that words are stored as linear sequences of phonemes which are structured into syllables only at the moment of production. This is because syllable structure is always recoverable from the sequence of phonemes. In contrast, we present theoretical and empirical evidence that syllable structure is lexically represented. Storing syllable structure would have the advantage of making representations more stable and resistant to damage. On the other hand, re-syllabifications affect only a minimal part of phonological representations and occur only in some languages and depending on speech register. Evidence for these claims comes from analyses of aphasic errors which not only respect phonotactic constraints, but also avoid transformations which move the syllabic structure of the word further away from the original structure, even when equating for segmental complexity. This is true across tasks, types of errors, and, crucially, types of patients. The same syllabic effects are shown by apraxic patients and by phonological patients who have more central difficulties in retrieving phonological representations. If syllable structure was only computed after phoneme retrieval, it would have no way to influence the errors of phonological patients. Our results have implications for psycholinguistic and computational models of language as well as for clinical and educational practices. Copyright © 2010 Elsevier Inc. All rights reserved.
CombAlign: a code for generating a one-to-many sequence alignment from a set of pairwise structure-based sequence alignments.

PubMed

Zhou, Carol L Ecale

2015-01-01

In order to better define regions of similarity among related protein structures, it is useful to identify the residue-residue correspondences among proteins. Few codes exist for constructing a one-to-many multiple sequence alignment derived from a set of structure or sequence alignments, and a need was evident for creating such a tool for combining pairwise structure alignments that would allow for insertion of gaps in the reference structure. This report describes a new Python code, CombAlign, which takes as input a set of pairwise sequence alignments (which may be structure based) and generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA). The use and utility of CombAlign was demonstrated by generating gapped MSSAs using sets of pairwise structure-based sequence alignments between structure models of the matrix protein (VP40) and pre-small/secreted glycoprotein (sGP) of Reston Ebolavirus and the corresponding proteins of several other filoviruses. The gapped MSSAs revealed structure-based residue-residue correspondences, which enabled identification of structurally similar versus differing regions in the Reston proteins compared to each of the other corresponding proteins. CombAlign is a new Python code that generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA) given a set of pairwise sequence alignments (which may be structure based). CombAlign has utility in assisting the user in distinguishing structurally conserved versus divergent regions on a reference protein structure relative to other closely related proteins. CombAlign was developed in Python 2.6, and the source code is available for download from the GitHub code repository.
Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures.

PubMed

Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi

2014-09-18

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
Can We Improve Structured Sequence Processing? Exploring the Direct and Indirect Effects of Computerized Training Using a Mediational Model

PubMed Central

Smith, Gretchen N. L.; Conway, Christopher M.; Bauernschmidt, Althea; Pisoni, David B.

2015-01-01

Recent research suggests that language acquisition may rely on domain-general learning abilities, such as structured sequence processing, which is the ability to extract, encode, and represent structured patterns in a temporal sequence. If structured sequence processing supports language, then it may be possible to improve language function by enhancing this foundational learning ability. The goal of the present study was to use a novel computerized training task as a means to better understand the relationship between structured sequence processing and language function. Participants first were assessed on pre-training tasks to provide baseline behavioral measures of structured sequence processing and language abilities. Participants were then quasi-randomly assigned to either a treatment group involving adaptive structured visuospatial sequence training, a treatment group involving adaptive non-structured visuospatial sequence training, or a control group. Following four days of sequence training, all participants were assessed with the same pre-training measures. Overall comparison of the post-training means revealed no group differences. However, in order to examine the potential relations between sequence training, structured sequence processing, and language ability, we used a mediation analysis that showed two competing effects. In the indirect effect, adaptive sequence training with structural regularities had a positive impact on structured sequence processing performance, which in turn had a positive impact on language processing. This finding not only identifies a potential novel intervention to treat language impairments but also may be the first demonstration that structured sequence processing can be improved and that this, in turn, has an impact on language processing. However, in the direct effect, adaptive sequence training with structural regularities had a direct negative impact on language processing. This unexpected finding suggests that adaptive training with structural regularities might potentially interfere with language processing. Taken together, these findings underscore the importance of pursuing designs that promote a better understanding of the mechanisms underlying training-related changes, so that regimens can be developed that help reduce these types of negative effects while simultaneously maximizing the benefits to outcome measures of interest. PMID:25946222
Can we improve structured sequence processing? Exploring the direct and indirect effects of computerized training using a mediational model.

PubMed

Smith, Gretchen N L; Conway, Christopher M; Bauernschmidt, Althea; Pisoni, David B

2015-01-01

Recent research suggests that language acquisition may rely on domain-general learning abilities, such as structured sequence processing, which is the ability to extract, encode, and represent structured patterns in a temporal sequence. If structured sequence processing supports language, then it may be possible to improve language function by enhancing this foundational learning ability. The goal of the present study was to use a novel computerized training task as a means to better understand the relationship between structured sequence processing and language function. Participants first were assessed on pre-training tasks to provide baseline behavioral measures of structured sequence processing and language abilities. Participants were then quasi-randomly assigned to either a treatment group involving adaptive structured visuospatial sequence training, a treatment group involving adaptive non-structured visuospatial sequence training, or a control group. Following four days of sequence training, all participants were assessed with the same pre-training measures. Overall comparison of the post-training means revealed no group differences. However, in order to examine the potential relations between sequence training, structured sequence processing, and language ability, we used a mediation analysis that showed two competing effects. In the indirect effect, adaptive sequence training with structural regularities had a positive impact on structured sequence processing performance, which in turn had a positive impact on language processing. This finding not only identifies a potential novel intervention to treat language impairments but also may be the first demonstration that structured sequence processing can be improved and that this, in turn, has an impact on language processing. However, in the direct effect, adaptive sequence training with structural regularities had a direct negative impact on language processing. This unexpected finding suggests that adaptive training with structural regularities might potentially interfere with language processing. Taken together, these findings underscore the importance of pursuing designs that promote a better understanding of the mechanisms underlying training-related changes, so that regimens can be developed that help reduce these types of negative effects while simultaneously maximizing the benefits to outcome measures of interest.
Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes

PubMed Central

Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney

2012-01-01

RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
Fine-tuning structural RNA alignments in the twilight zone.

PubMed

Bremges, Andreas; Schirmer, Stefanie; Giegerich, Robert

2010-04-30

A widely used method to find conserved secondary structure in RNA is to first construct a multiple sequence alignment, and then fold the alignment, optimizing a score based on thermodynamics and covariance. This method works best around 75% sequence similarity. However, in a "twilight zone" below 55% similarity, the sequence alignment tends to obscure the covariance signal used in the second phase. Therefore, while the overall shape of the consensus structure may still be found, the degree of conservation cannot be estimated reliably. Based on a combination of available methods, we present a method named planACstar for improving structure conservation in structural alignments in the twilight zone. After constructing a consensus structure by alignment folding, planACstar abandons the original sequence alignment, refolds the sequences individually, but consistent with the consensus, aligns the structures, irrespective of sequence, by a pure structure alignment method, and derives an improved sequence alignment from the alignment of structures, to be re-submitted to alignment folding, etc.. This circle may be iterated as long as structural conservation improves, but normally, one step suffices. Employing the tools ClustalW, RNAalifold, and RNAforester, we find that for sequences with 30-55% sequence identity, structural conservation can be improved by 10% on average, with a large variation, measured in terms of RNAalifold's own criterion, the structure conservation index.
Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them.

PubMed

Leppek, Kathrin; Das, Rhiju; Barna, Maria

2018-03-01

RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5' untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5' UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.
Transient effects in π-pulse sequences in MAS solid-state NMR

NASA Astrophysics Data System (ADS)

Hellwagner, Johannes; Wili, Nino; Ibáñez, Luis Fábregas; Wittmann, Johannes J.; Meier, Beat H.; Ernst, Matthias

2018-02-01

Dipolar recoupling techniques that use isolated rotor-synchronized π pulses are commonly used in solid-state NMR spectroscopy to gain insight into the structure of biological molecules. These sequences excel through their simplicity, stability towards radio-frequency (rf) inhomogeneity, and low rf requirements. For a theoretical understanding of such sequences, we present a Floquet treatment based on an interaction-frame transformation including the chemical-shift offset dependence. This approach is applied to the homonuclear dipolar-recoupling sequence Radio-Frequency Driven Recoupling (RFDR) and the heteronuclear recoupling sequence Rotational Echo Double Resonance (REDOR). Based on the Floquet approach, we show the influence of effective fields caused by pulse transients and discuss the advantages of pulse-transient compensation. We demonstrate experimentally that the transfer efficiency for homonuclear recoupling can be doubled in some cases in model compounds as well as in simple peptides if pulse-transient compensation is applied to the π pulses. Additionally, we discuss the influence of various phase cycles on the recoupling efficiency in order to reduce the magnitude of effective fields. Based on the findings from RFDR, we are able to explain why the REDOR sequence does not suffer in the recoupling efficiency despite the presence of effective fields.
Computational studies of sequence-specific driving forces in peptide self-assembly

NASA Astrophysics Data System (ADS)

Jeon, Joohyun

Peptides are biopolymers made from various sequences of twenty different types of amino acids, connected by peptide bonds. There are practically an infinite number of possible sequences and tremendous possible combinations of peptide-peptide interactions. Recently, an increasing number of studies have shown a stark variety of peptide self-assembled nanomaterials whose detailed structures depend on their sequences and environmental factors; these have end uses in medical and bio-electronic applications, for example. To understand the underlying physics of complex peptide self-assembly processes and to delineate sequence specific effects, in this study, I use various simulation tools spanning all-atom molecular dynamics to simple lattice models and quantify the balance of interactions in the peptide self-assembly processes. In contrast to the existing view that peptides' aggregation propensities are proportional to the net sequence hydrophobicity and inversely proportional to the net charge, I show the more nuanced effects of electrostatic interactions, including the cooperative effects between hydrophobic and electrostatic interactions. Notably, I suggest rather unexpected, yet important roles of entropies in the small scale oligomerization processes. Overall, this study broadens our understanding of the role of thermodynamic driving forces in peptide self-assembly.
Tandem Repeat Proteins Inspired By Squid Ring Teeth

NASA Astrophysics Data System (ADS)

Pena-Francesch, Abdon

Proteins are large biomolecules consisting of long chains of amino acids that hierarchically assemble into complex structures, and provide a variety of building blocks for biological materials. The repetition of structural building blocks is a natural evolutionary strategy for increasing the complexity and stability of protein structures. However, the relationship between amino acid sequence, structure, and material properties of protein systems remains unclear due to the lack of control over the protein sequence and the intricacies of the assembly process. In order to investigate the repetition of protein building blocks, a recently discovered protein from squids is examined as an ideal protein system. Squid ring teeth are predatory appendages located inside the suction cups that provide a strong grasp of prey, and are solely composed of a group of proteins with tandem repetition of building blocks. The objective of this thesis is the understanding of sequence, structure and property relationship in repetitive protein materials inspired in squid ring teeth for the first time. Specifically, this work focuses on squid-inspired structural proteins with tandem repeat units in their sequence (i.e., repetition of alternating building blocks) that are physically cross-linked via beta-sheet structures. The research work presented here tests the hypothesis that, in these systems, increasing the number of building blocks in the polypeptide chain decreases the protein network defects and improves the material properties. Hence, the sequence, nanostructure, and properties (thermal, mechanical, and conducting) of tandem repeat squid-inspired protein materials are examined. Spectroscopic structural analysis, advanced materials characterization, and entropic elasticity theory are combined to elucidate the structure and material properties of these repetitive proteins. This approach is applied not only to native squid proteins but also to squid-inspired synthetic polypeptides that allow for a fine control of the sequence and network morphology. The results provided in this work establish a clear dependence between the repetitive building blocks, the network morphology, and the properties of squid-inspired repetitive protein materials. Increasing the number of tandem repeat units in SRT-inspired proteins led to more effective protein networks with superior properties. Through increasing tandem repetition and optimization of network morphology, highly efficient protein materials capable of withstanding deformations up to 400% of their original length, with MPa-GPa modulus, high energy absorption (50 MJ m-3), peak proton conductivity of 3.7 mS cm-1 (at pH 7, highest reported to date for biological materials), and peak thermal conductivity of 1.4 W m-1 K -1 (which exceeds that of most polymer materials) were developed. These findings introduce new design rules in the engineering of proteins based on tandem repetition and morphology control, and provide a novel framework for tailoring and optimizing the properties of protein-based materials.
Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

PubMed

Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

2016-05-01

Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Online incidental statistical learning of audiovisual word sequences in adults: a registered report.

PubMed

Kuppuraj, Sengottuvel; Duta, Mihaela; Thompson, Paul; Bishop, Dorothy

2018-02-01

Statistical learning has been proposed as a key mechanism in language learning. Our main goal was to examine whether adults are capable of simultaneously extracting statistical dependencies in a task where stimuli include a range of structures amenable to statistical learning within a single paradigm. We devised an online statistical learning task using real word auditory-picture sequences that vary in two dimensions: (i) predictability and (ii) adjacency of dependent elements. This task was followed by an offline recall task to probe learning of each sequence type. We registered three hypotheses with specific predictions. First, adults would extract regular patterns from continuous stream (effect of grammaticality). Second, within grammatical conditions, they would show differential speeding up for each condition as a factor of statistical complexity of the condition and exposure. Third, our novel approach to measure online statistical learning would be reliable in showing individual differences in statistical learning ability. Further, we explored the relation between statistical learning and a measure of verbal short-term memory (STM). Forty-two participants were tested and retested after an interval of at least 3 days on our novel statistical learning task. We analysed the reaction time data using a novel regression discontinuity approach. Consistent with prediction, participants showed a grammaticality effect, agreeing with the predicted order of difficulty for learning different statistical structures. Furthermore, a learning index from the task showed acceptable test-retest reliability ( r = 0.67). However, STM did not correlate with statistical learning. We discuss the findings noting the benefits of online measures in tracking the learning process.
Online incidental statistical learning of audiovisual word sequences in adults: a registered report

PubMed Central

Duta, Mihaela; Thompson, Paul

2018-01-01

Statistical learning has been proposed as a key mechanism in language learning. Our main goal was to examine whether adults are capable of simultaneously extracting statistical dependencies in a task where stimuli include a range of structures amenable to statistical learning within a single paradigm. We devised an online statistical learning task using real word auditory–picture sequences that vary in two dimensions: (i) predictability and (ii) adjacency of dependent elements. This task was followed by an offline recall task to probe learning of each sequence type. We registered three hypotheses with specific predictions. First, adults would extract regular patterns from continuous stream (effect of grammaticality). Second, within grammatical conditions, they would show differential speeding up for each condition as a factor of statistical complexity of the condition and exposure. Third, our novel approach to measure online statistical learning would be reliable in showing individual differences in statistical learning ability. Further, we explored the relation between statistical learning and a measure of verbal short-term memory (STM). Forty-two participants were tested and retested after an interval of at least 3 days on our novel statistical learning task. We analysed the reaction time data using a novel regression discontinuity approach. Consistent with prediction, participants showed a grammaticality effect, agreeing with the predicted order of difficulty for learning different statistical structures. Furthermore, a learning index from the task showed acceptable test–retest reliability (r = 0.67). However, STM did not correlate with statistical learning. We discuss the findings noting the benefits of online measures in tracking the learning process. PMID:29515876

Packaging of Mason-Pfizer monkey virus (MPMV) genomic RNA depends upon conserved long-range interactions (LRIs) between U5 and gag sequences

PubMed Central

Kalloush, Rawan M.; Vivet-Boudou, Valérie; Ali, Lizna M.; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A.

2016-01-01

MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) are of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2′hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to the defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5′ region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. PMID:27095024
From Globular Clusters to Tidal Dwarfs: Structure Formation in Tidal Tails

NASA Astrophysics Data System (ADS)

Knierman, K.; Hunsberger, S.; Gallagher, S.; Charlton, J.; Whitmore, B.; Hibbard, J.; Kundu, A.; Zaritsky, D.

1999-12-01

Galaxy interactions trigger star formation in tidal debris. How does this star formation depend on the local and global physical conditions? Using WFPC2/HST images, we investigate the range of structure within tidal tails of four classic ``Toomre Sequence'' mergers: NGC 4038/9 (``Antennae''), NGC 7252 (``Atoms for Peace''), NGC 3921, and NGC 3256. These tails contain a variety of stellar associations with sizes from globular clusters up to dwarf Irregulars. We explore whether there is a continuum between the two extremes. Our eight fields sample seven tidal tails at a variety of stages in the evolutionary sequence. Some of these tails are rich in HI while others are HI poor. Large tidal dwarfs are embedded in three of the tails. Using V and I WFPC2 images, we measure luminosities and colors of substructures within the tidal tails. The properties of globular cluster candidates in the tails will be contrasted with those of the hundreds of young clusters in the central regions of these mergers. We address whether globular clusters form and survive in the tidal tails and whether tidal dwarfs are composed of only young stars. By comparing the properties of structures in the tails of the four mergers with different ages, we examine systematic evolution of structure along the evolutionary sequence and as a function of HI content. We acknowledge support from NASA through STScI, and from NSF for an REU supplement for Karen Knierman.
Sequence and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency

PubMed Central

Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.

2013-01-01

Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
DNA Barcode Sequence Identification Incorporating Taxonomic Hierarchy and within Taxon Variability

PubMed Central

Little, Damon P.

2011-01-01

For DNA barcoding to succeed as a scientific endeavor an accurate and expeditious query sequence identification method is needed. Although a global multiple–sequence alignment can be generated for some barcoding markers (e.g. COI, rbcL), not all barcoding markers are as structurally conserved (e.g. matK). Thus, algorithms that depend on global multiple–sequence alignments are not universally applicable. Some sequence identification methods that use local pairwise alignments (e.g. BLAST) are unable to accurately differentiate between highly similar sequences and are not designed to cope with hierarchic phylogenetic relationships or within taxon variability. Here, I present a novel alignment–free sequence identification algorithm–BRONX–that accounts for observed within taxon variability and hierarchic relationships among taxa. BRONX identifies short variable segments and corresponding invariant flanking regions in reference sequences. These flanking regions are used to score variable regions in the query sequence without the production of a global multiple–sequence alignment. By incorporating observed within taxon variability into the scoring procedure, misidentifications arising from shared alleles/haplotypes are minimized. An explicit treatment of more inclusive terminals allows for separate identifications to be made for each taxonomic level and/or for user–defined terminals. BRONX performs better than all other methods when there is imperfect overlap between query and reference sequences (e.g. mini–barcode queries against a full–length barcode database). BRONX consistently produced better identifications at the genus–level for all query types. PMID:21857897
Domain structure sequence in ferroelectric Pb(Zr0.2Ti0.8)O3 thin film on MgO

NASA Astrophysics Data System (ADS)

Janolin, Pierre-Eymeric; Fraisse, Bernard; Dkhil, Brahim; Le Marrec, Françoise; Ringgaard, Erling

2007-04-01

The structural evolution of a polydomain ferroelectric Pb(Zr0.2Ti0.8)O3 film was studied by temperature-dependent x-ray diffraction. Two critical temperatures were evidenced: T*=740K, corresponding to a change in the domain structure (a /c/a/c to a1/a2/a1/a2), and TCfilm=825K, where the film undergoes a ferroelectric-paraelectric phase transition. The film remains tetragonal on the whole range of temperature investigated. The evolutions of the domain structure and lattice parameters were found to be in very good agreement with the calculated domain stability map and theoretical temperature-misfit strain phase diagram, respectively.
In Situ Hi-C Library Preparation for Plants to Study Their Three-Dimensional Chromatin Interactions on a Genome-Wide Scale.

PubMed

Liu, Chang

2017-01-01

The spatial organization of the genome in the nucleus is critical for many cellular processes. It has been broadly accepted that the packing of chromatin inside the nucleus is not random, but structured at several hierarchical levels. The Hi-C method combines Chromatin Conformation Capture and high-throughput sequencing, which allows interrogating genome-wide chromatin interactions. Depending on the sequencing depth, chromatin packing patterns derived from Hi-C experiments can be viewed on a chromosomal scale or at a local genic level. Here, I describe a protocol of plant in situ Hi-C library preparation, which covers procedures starting from tissue fixation to library amplification.
Accuracy of Reaction Cross Section for Exotic Nuclei in Glauber Model Based on MCMC Diagnostics

NASA Astrophysics Data System (ADS)

Rueter, Keiti; Novikov, Ivan

2017-01-01

Parameters of a nuclear density distribution for an exotic nuclei with halo or skin structures can be determined from the experimentally measured reaction cross-section. In the presented work, to extract parameters such as nuclear size information for a halo and core, we compare experimental data on reaction cross-sections with values obtained using expressions of the Glauber Model. These calculations are performed using a Markov Chain Monte Carlo algorithm. We discuss the accuracy of the Monte Carlo approach and its dependence on k*, the power law turnover point in the discreet power spectrum of the random number sequence and on the lag-1 autocorrelation time of the random number sequence.
Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling.

PubMed

Schudoma, Christian; May, Patrick; Nikiforova, Viktoria; Walther, Dirk

2010-01-01

The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence-structure relationships in loops. Loops differing by <25% in sequence identity fold into very similar structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.
Role for cis-acting RNA sequences in the temperature-dependent expression of the multiadhesive lig proteins in Leptospira interrogans.

PubMed

Matsunaga, James; Schlax, Paula J; Haake, David A

2013-11-01

The spirochete Leptospira interrogans causes a systemic infection that provokes a febrile illness. The putative lipoproteins LigA and LigB promote adhesion of Leptospira to host proteins, interfere with coagulation, and capture complement regulators. In this study, we demonstrate that the expression level of the LigA and LigB proteins was substantially higher when L. interrogans proliferated at 37°C instead of the standard culture temperature of 30°C. The RNA comprising the 175-nucleotide 5' untranslated region (UTR) and first six lig codons, whose sequence is identical in ligA and ligB, is predicted to fold into two distinct stem-loop structures separated by a single-stranded region. The ribosome-binding site is partially sequestered in double-stranded RNA within the second structure. Toeprint analysis revealed that in vitro formation of a 30S-tRNA(fMet)-mRNA ternary complex was inhibited unless a 5' deletion mutation disrupted the second stem-loop structure. To determine whether the lig sequence could mediate temperature-regulated gene expression in vivo, the 5' UTR and the first six codons were inserted between the Escherichia coli l-arabinose promoter and bgaB (β-galactosidase from Bacillus stearothermophilus) to create a translational fusion. The lig fragment successfully conferred thermoregulation upon the β-galactosidase reporter in E. coli. The second stem-loop structure was sufficient to confer thermoregulation on the reporter, while sequences further upstream in the 5' UTR slightly diminished expression at each temperature tested. Finally, the expression level of β-galactosidase was significantly higher when point mutations predicted to disrupt base pairs in the second structure were introduced into the stem. Compensatory mutations that maintained base pairing of the stem without restoring the wild-type sequence reinstated the inhibitory effect of the 5' UTR on expression. These results indicate that ligA and ligB expression is limited by double-stranded RNA that occludes the ribosome-binding site. At elevated temperatures, the ribosome-binding site is exposed to promote translation initiation.
The crystal structure of choline kinase reveals a eukaryotic protein kinase fold

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peisach, D.; Gee, P.; Kent, K.

2010-03-08

Choline kinase catalyzes the ATP-dependent phosphorylation of choline, the first committed step in the CDP-choline pathway for the biosynthesis of phosphatidylcholine. The 2.0 {angstrom} crystal structure of a choline kinase from C. elegans (CKA-2) reveals that the enzyme is a homodimeric protein with each monomer organized into a two-domain fold. The structure is remarkably similar to those of protein kinases and aminoglycoside phosphotransferases, despite no significant similarity in amino acid sequence. Comparisons to the structures of other kinases suggest that ATP binds to CKA-2 in a pocket formed by highly conserved and catalytically important residues. In addition, a choline bindingmore » site is proposed to be near the ATP binding pocket and formed by several structurally flexible loops.« less
Quantum Mechanical Modeling of Ballistic MOSFETs

NASA Technical Reports Server (NTRS)

Svizhenko, Alexei; Anantram, M. P.; Govindan, T. R.; Biegel, Bryan (Technical Monitor)

2001-01-01

The objective of this project was to develop theory, approximations, and computer code to model quasi 1D structures such as nanotubes, DNA, and MOSFETs: (1) Nanotubes: Influence of defects on ballistic transport, electro-mechanical properties, and metal-nanotube coupling; (2) DNA: Model electron transfer (biochemistry) and transport experiments, and sequence dependence of conductance; and (3) MOSFETs: 2D doping profiles, polysilicon depletion, source to drain and gate tunneling, understand ballistic limit.
[Physiotherapeutic care marketing research: current state-of-the art].

PubMed

Babaskin, D V

2011-01-01

Successful introduction of modern technologies into the national health care systems strongly depends on the current pharmaceutical market situation. The present article is focused on the peculiarities of marketing research with special reference to physiotherapeutic services and commodities. Analysis of the structure and sequence of marketing research processes is described along with the methods applied for the purpose including their support by the use of Internet resources and technologies.
Biophysics of protein-DNA interactions and chromosome organization

PubMed Central

Marko, John F.

2014-01-01

The function of DNA in cells depends on its interactions with protein molecules, which recognize and act on base sequence patterns along the double helix. These notes aim to introduce basic polymer physics of DNA molecules, biophysics of protein-DNA interactions and their study in single-DNA experiments, and some aspects of large-scale chromosome structure. Mechanisms for control of chromosome topology will also be discussed. PMID:25419039
Processing of Archaebacterial Intron-Containing tRNA Gene Transcripts.

DTIC Science & Technology

1987-07-31

1{ 1. Project Goals: A. To determine the mechanism of tRNA intron processing in the halophilic archaebacteria. B. Characterize and compare the...enzyme(s) responsible for the removal of 5’-flanking sequences from halophilic and sulfur-dependent tRNA gene transcripts. C. Examine the structure and...distribution of tRNA introns in the halophilic archaebacteria. 2. Accomplishments: A. Intron processing mechanism We have succeeded in our primary
Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

PubMed Central

König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

2013-01-01

G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations do destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141
Experimental mapping of DNA duplex shape enabled by global lineshape analyses of a nucleotide-independent nitroxide probe

PubMed Central

Ding, Yuan; Zhang, Xiaojun; Tham, Kenneth W.; Qin, Peter Z.

2014-01-01

Sequence-dependent variation in structure and dynamics of a DNA duplex, collectively referred to as ‘DNA shape’, critically impacts interactions between DNA and proteins. Here, a method based on the technique of site-directed spin labeling was developed to experimentally map shapes of two DNA duplexes that contain response elements of the p53 tumor suppressor. An R5a nitroxide spin label, which was covalently attached at a specific phosphate group, was scanned consecutively through the DNA duplex. X-band continuous-wave electron paramagnetic resonance spectroscopy was used to monitor rotational motions of R5a, which report on DNA structure and dynamics at the labeling site. An approach based on Pearson's coefficient analysis was developed to collectively examine the degree of similarity among the ensemble of R5a spectra. The resulting Pearson's coefficients were used to generate maps representing variation of R5a mobility along the DNA duplex. The R5a mobility maps were found to correlate with maps of certain DNA helical parameters, and were capable of revealing similarity and deviation in the shape of the two closely related DNA duplexes. Collectively, the R5a probe and the Pearson's coefficient-based lineshape analysis scheme yielded a generalizable method for examining sequence-dependent DNA shapes. PMID:25092920
A conformational switch is responsible for the reversal of the 6S RNA-dependent RNA polymerase inhibition in Escherichia coli.

PubMed

Steuten, Benedikt; Wagner, Rolf

2012-12-01

6S RNA is a bacterial transcriptional regulator,which accumulates during stationary phase and inhibits transcription from many promoters due to stable association with σ 70 -containing RNA polymerase. This inhibitory RNA polymerase ∼ 6S RNA complex dissociates during nutritional upshift, when cells undergo outgrowth from stationary phase, releasing active RNA polymerase ready for transcription. The release reaction depends on a characteristic property of 6S RNAs, namely to act as template for the de novo synthesis of small RNAs, termed pRNAs.Here, we used limited hydrolysis with structure-specific RNases and in-line probing of isolated 6S RNA and 6SRNA ∼ pRNA complexes to investigate the molecular details leading to the release reaction. Our results indicate that pRNA transcription induces the refolding of the 6S RNA secondary structure by disrupting part of the closing stem(conserved sequence regions CRI and CRIV) and formation of a new hairpin (conserved sequence regions CRIII and CRIV). Comparison of the dimethylsulfate modification pattern of 6S RNA in living cells at stationary growth and during outgrowth confirmed the conformational change observed in vitro. Based on our results, a model describing the individual steps of the release reaction is presented.
The crystal structure of the calcium-bound con-G[Q6A] peptide reveals a novel metal-dependent helical trimer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cnudde, Sara E.; Prorok, Mary; Jia, Xaofei

2012-02-15

The ability to form and control both secondary structure and oligomerization in short peptides has proven to be challenging owing to the structural instability of such peptides. The conantokin peptides are a family of {gamma}-carboxyglutamic acid containing peptides produced in the venoms of predatory sea snails of the Conus family. They are examples of short peptides that form stable helical structures, especially in the presence of divalent cations. Both monomeric and dimeric conantokin peptides have been identified and represent a new mechanism of helix association, 'the metallozipper motif' that is devoid of a hydrophobic interface between monomers. In the presentmore » study, a parallel/antiparallel three-helix bundle was identified and its crystal structure determined at high resolution. The three helices are almost perfectly parallel and represent a novel helix-helix association. The trimer interface is dominated by metal chelation between the three helices, and contains no interfacial hydrophobic interactions. It is now possible to produce stable monomeric, dimeric, or trimeric metallozippers depending on the peptide sequence and metal ion. Such structures have important applications in protein design.« less
Taxonomic structure and stability of the bacterial community in belgian sourdough ecosystems as assessed by culture and population fingerprinting.

PubMed

Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Vancanneyt, Marc; De Vuyst, Luc; Vandamme, Peter; Huys, Geert

2008-04-01

A total of 39 traditional sourdoughs were sampled at 11 bakeries located throughout Belgium which were visited twice with a 1-year interval. The taxonomic structure and stability of the bacterial communities occurring in these traditional sourdoughs were assessed using both culture-dependent and culture-independent methods. A total of 1,194 potential lactic acid bacterium (LAB) isolates were tentatively grouped and identified by repetitive element sequence-based PCR, followed by sequence-based identification using 16S rRNA and pheS genes from a selection of genotypically unique LAB isolates. In parallel, all samples were analyzed by denaturing gradient gel electrophoresis (DGGE) of V3-16S rRNA gene amplicons. In addition, extensive metabolite target analysis of more than 100 different compounds was performed. Both culturing and DGGE analysis showed that the species Lactobacillus sanfranciscensis, Lactobacillus paralimentarius, Lactobacillus plantarum, and Lactobacillus pontis dominated the LAB population of Belgian type I sourdoughs. In addition, DGGE band sequence analysis demonstrated the presence of Acetobacter sp. and a member of the Erwinia/Enterobacter/Pantoea group in some samples. Overall, the culture-dependent and culture-independent approaches each exhibited intrinsic limitations in assessing bacterial LAB diversity in Belgian sourdoughs. Irrespective of the LAB biodiversity, a large majority of the sugar and amino acid metabolites were detected in all sourdough samples. Principal component-based analysis of biodiversity and metabolic data revealed only little variation among the two samples of the sourdoughs produced at the same bakery. The rare cases of instability observed could generally be linked with variations in technological parameters or differences in detection capacity between culture-dependent and culture-independent approaches. Within a sampling interval of 1 year, this study reinforces previous observations that the bakery environment rather than the type or batch of flour largely determines the development of a stable LAB population in sourdoughs.
Taxonomic Structure and Stability of the Bacterial Community in Belgian Sourdough Ecosystems as Assessed by Culture and Population Fingerprinting▿ †

PubMed Central

Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Vancanneyt, Marc; De Vuyst, Luc; Vandamme, Peter; Huys, Geert

2008-01-01

A total of 39 traditional sourdoughs were sampled at 11 bakeries located throughout Belgium which were visited twice with a 1-year interval. The taxonomic structure and stability of the bacterial communities occurring in these traditional sourdoughs were assessed using both culture-dependent and culture-independent methods. A total of 1,194 potential lactic acid bacterium (LAB) isolates were tentatively grouped and identified by repetitive element sequence-based PCR, followed by sequence-based identification using 16S rRNA and pheS genes from a selection of genotypically unique LAB isolates. In parallel, all samples were analyzed by denaturing gradient gel electrophoresis (DGGE) of V3-16S rRNA gene amplicons. In addition, extensive metabolite target analysis of more than 100 different compounds was performed. Both culturing and DGGE analysis showed that the species Lactobacillus sanfranciscensis, Lactobacillus paralimentarius, Lactobacillus plantarum, and Lactobacillus pontis dominated the LAB population of Belgian type I sourdoughs. In addition, DGGE band sequence analysis demonstrated the presence of Acetobacter sp. and a member of the Erwinia/Enterobacter/Pantoea group in some samples. Overall, the culture-dependent and culture-independent approaches each exhibited intrinsic limitations in assessing bacterial LAB diversity in Belgian sourdoughs. Irrespective of the LAB biodiversity, a large majority of the sugar and amino acid metabolites were detected in all sourdough samples. Principal component-based analysis of biodiversity and metabolic data revealed only little variation among the two samples of the sourdoughs produced at the same bakery. The rare cases of instability observed could generally be linked with variations in technological parameters or differences in detection capacity between culture-dependent and culture-independent approaches. Within a sampling interval of 1 year, this study reinforces previous observations that the bakery environment rather than the type or batch of flour largely determines the development of a stable LAB population in sourdoughs. PMID:18310426

RNA-Mediated Thermoregulation of Iron-Acquisition Genes in Shigella dysenteriae and Pathogenic Escherichia coli

PubMed Central

Kouse, Andrew B.; Righetti, Francesco; Kortmann, Jens; Narberhaus, Franz; Murphy, Erin R.

2013-01-01

The initiation, progression and transmission of most bacterial infections is dependent upon the ability of the invading pathogen to acquire iron from each of the varied environments encountered during the course of a natural infection. In total, 95% of iron within the human body is complexed within heme, making heme a potentially rich source of host-associated nutrient iron for invading bacteria. As heme is encountered only within the host, pathogenic bacteria often regulate synthesis of heme utilization factors such that production is maximal under host-associated environmental conditions. This study examines the regulated production of ShuA, an outer-membrane receptor required for the utilization of heme as a source of nutrient iron by Shigella dysenteriae, a pathogenic bacterium that causes severe diarrheal diseases in humans. Specifically, the impact of the distinct environmental temperatures encountered during infection within a host (37°C) and transmission between hosts (25°C) on shuA expression is investigated. We show that shuA expression is subject to temperature-dependent post-transcriptional regulation resulting in increased ShuA production at 37°C. The observed thermoregulation is mediated by nucleic acid sequences within the 5′ untranslated region. In addition, we have identified similar nucleotide sequences within the 5′ untranslated region of the orthologous chuA transcript of enteropathogenic E. coli and have demonstrated that it also functions to confer temperature-dependent post-transcriptional regulation. In both function and predicted structure, the regulatory element within the shuA and chuA 5′ untranslated regions closely resembles a FourU RNA thermometer, a zipper-like RNA structure that occludes the Shine-Dalgarno sequence at low temperatures. Increased production of ShuA and ChuA in response to the host body temperature allows for maximal production of these heme acquisition factors within the environment where S. dysenteriae and pathogenic E. coli strains would encounter heme, a host-specific iron source. PMID:23704938
A method for probing the mutational landscape of amyloid structure.

PubMed

O'Donnell, Charles W; Waldispühl, Jérôme; Lis, Mieszko; Halfmann, Randal; Devadas, Srinivas; Lindquist, Susan; Berger, Bonnie

2011-07-01

Proteins of all kinds can self-assemble into highly ordered β-sheet aggregates known as amyloid fibrils, important both biologically and clinically. However, the specific molecular structure of a fibril can vary dramatically depending on sequence and environmental conditions, and mutations can drastically alter amyloid function and pathogenicity. Experimental structure determination has proven extremely difficult with only a handful of NMR-based models proposed, suggesting a need for computational methods. We present AmyloidMutants, a statistical mechanics approach for de novo prediction and analysis of wild-type and mutant amyloid structures. Based on the premise of protein mutational landscapes, AmyloidMutants energetically quantifies the effects of sequence mutation on fibril conformation and stability. Tested on non-mutant, full-length amyloid structures with known chemical shift data, AmyloidMutants offers roughly 2-fold improvement in prediction accuracy over existing tools. Moreover, AmyloidMutants is the only method to predict complete super-secondary structures, enabling accurate discrimination of topologically dissimilar amyloid conformations that correspond to the same sequence locations. Applied to mutant prediction, AmyloidMutants identifies a global conformational switch between Aβ and its highly-toxic 'Iowa' mutant in agreement with a recent experimental model based on partial chemical shift data. Predictions on mutant, yeast-toxic strains of HET-s suggest similar alternate folds. When applied to HET-s and a HET-s mutant with core asparagines replaced by glutamines (both highly amyloidogenic chemically similar residues abundant in many amyloids), AmyloidMutants surprisingly predicts a greatly reduced capacity of the glutamine mutant to form amyloid. We confirm this finding by conducting mutagenesis experiments. Our tool is publically available on the web at http://amyloid.csail.mit.edu/. lindquist_admin@wi.mit.edu; bab@csail.mit.edu.
Folding thermodynamics of pseudoknotted chain conformations

PubMed Central

Kopeikin, Zoia; Chen, Shi-Jie

2008-01-01

We develop a statistical mechanical framework for the folding thermodynamics of pseudoknotted structures. As applications of the theory, we investigate the folding stability and the free energy landscapes for both the thermal and the mechanical unfolding of pseudoknotted chains. For the mechanical unfolding process, we predict the force-extension curves, from which we can obtain the information about structural transitions in the unfolding process. In general, a pseudoknotted structure unfolds through multiple structural transitions. The interplay between the helix stems and the loops plays an important role in the folding stability of pseudoknots. For instance, variations in loop sizes can lead to the destabilization of some intermediate states and change the (equilibrium) folding pathways (e.g., two helix stems unfold either cooperatively or sequentially). In both thermal and mechanical unfolding, depending on the nucleotide sequence, misfolded intermediate states can emerge in the folding process. In addition, thermal and mechanical unfoldings often have different (equilibrium) pathways. For example, for certain sequences, the misfolded intermediates, which generally have longer tails, can fold, unfold, and refold again in the pulling process, which means that these intermediates can switch between two different average end-end extensions. PMID:16674261
Metal Cations in G-Quadruplex Folding and Stability

NASA Astrophysics Data System (ADS)

Bhattacharyya, Debmalya; Mirihana Arachchilage, Gayan; Basu, Soumitra

2016-09-01

This review is focused on the structural and physico-chemical aspects of metal cation coordination to G-Quadruplexes (GQ) and their effects on GQ stability and conformation. G-Quadruplex structures are non-canonical secondary structures formed by both DNA and RNA. G-quadruplexes regulate a wide range of important biochemical processes. Besides the sequence requirements, the coordination of monovalent cations in the GQ is essential for its formation and determines the stability and polymorphism of GQ structures. The nature, location and dynamics of the cation coordination and their impact on the overall GQ stability are dependent on several factors such as the ionic radii, hydration energy and the bonding strength to the O6 of guanines. The intracellular monovalent cation concentration and the localized ion concentrations determine the formation of GQs and can potentially dictate their regulatory roles. A wide range of biochemical and biophysical studies on an array of GQ enabling sequences have generated at a minimum the knowledge base that allows us to often predict the stability of GQs in presence of the physiologically relevant metal ions, however, prediction of conformation of such GQs is still out of the realm.
Sequence Alignment to Predict Across Species Susceptibility ...

EPA Pesticide Factsheets

Conservation of a molecular target across species can be used as a line-of-evidence to predict the likelihood of chemical susceptibility. The web-based Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) tool was developed to simplify, streamline, and quantitatively assess protein sequence/structural similarity across taxonomic groups as a means to predict relative intrinsic susceptibility. The intent of the tool is to allow for evaluation of any potential protein target, so it is amenable to variable degrees of protein characterization, depending on available information about the chemical/protein interaction and the molecular target itself. To allow for flexibility in the analysis, a layered strategy was adopted for the tool. The first level of the SeqAPASS analysis compares primary amino acid sequences to a query sequence, calculating a metric for sequence similarity (including detection of candidate orthologs), the second level evaluates sequence similarity within selected domains (e.g., ligand-binding domain, DNA binding domain), and the third level of analysis compares individual amino acid residue positions identified as being of importance for protein conformation and/or ligand binding upon chemical perturbation. Each level of the SeqAPASS analysis provides increasing evidence to apply toward rapid, screening-level assessments of probable cross species susceptibility. Such analyses can support prioritization of chemicals for further ev
Transcription blockage by homopurine DNA sequences: role of sequence composition and single-strand breaks

PubMed Central

Belotserkovskii, Boris P.; Neil, Alexander J.; Saleh, Syed Shayon; Shin, Jane Hae Soo; Mirkin, Sergei M.; Hanawalt, Philip C.

2013-01-01

The ability of DNA to adopt non-canonical structures can affect transcription and has broad implications for genome functioning. We have recently reported that guanine-rich (G-rich) homopurine-homopyrimidine sequences cause significant blockage of transcription in vitro in a strictly orientation-dependent manner: when the G-rich strand serves as the non-template strand [Belotserkovskii et al. (2010) Mechanisms and implications of transcription blockage by guanine-rich DNA sequences., Proc. Natl Acad. Sci. USA, 107, 12816–12821]. We have now systematically studied the effect of the sequence composition and single-stranded breaks on this blockage. Although substitution of guanine by any other base reduced the blockage, cytosine and thymine reduced the blockage more significantly than adenine substitutions, affirming the importance of both G-richness and the homopurine-homopyrimidine character of the sequence for this effect. A single-strand break in the non-template strand adjacent to the G-rich stretch dramatically increased the blockage. Breaks in the non-template strand result in much weaker blockage signals extending downstream from the break even in the absence of the G-rich stretch. Our combined data support the notion that transcription blockage at homopurine-homopyrimidine sequences is caused by R-loop formation. PMID:23275544
Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

PubMed

Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

2006-10-15

The 16,937-nuceotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scypozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities; transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.
Consolidating the effects of waking and sleep on motor-sequence learning.

PubMed

Brawn, Timothy P; Fenn, Kimberly M; Nusbaum, Howard C; Margoliash, Daniel

2010-10-20

Sleep is widely believed to play a critical role in memory consolidation. Sleep-dependent consolidation has been studied extensively in humans using an explicit motor-sequence learning paradigm. In this task, performance has been reported to remain stable across wakefulness and improve significantly after sleep, making motor-sequence learning the definitive example of sleep-dependent enhancement. Recent work, however, has shown that enhancement disappears when the task is modified to reduce task-related inhibition that develops over a training session, thus questioning whether sleep actively consolidates motor learning. Here we use the same motor-sequence task to demonstrate sleep-dependent consolidation for motor-sequence learning and explain the discrepancies in results across studies. We show that when training begins in the morning, motor-sequence performance deteriorates across wakefulness and recovers after sleep, whereas performance remains stable across both sleep and subsequent waking with evening training. This pattern of results challenges an influential model of memory consolidation defined by a time-dependent stabilization phase and a sleep-dependent enhancement phase. Moreover, the present results support a new account of the behavioral effects of waking and sleep on explicit motor-sequence learning that is consistent across a wide range of tasks. These observations indicate that current theories of memory consolidation that have been formulated to explain sleep-dependent performance enhancements are insufficient to explain the range of behavioral changes associated with sleep.
Computational modeling of RNA 3D structures, with the aid of experimental restraints

PubMed Central

Magnus, Marcin; Matelska, Dorota; Łach, Grzegorz; Chojnowski, Grzegorz; Boniecki, Michal J; Purta, Elzbieta; Dawson, Wayne; Dunin-Horkawicz, Stanislaw; Bujnicki, Janusz M

2014-01-01

In addition to mRNAs whose primary function is transmission of genetic information from DNA to proteins, numerous other classes of RNA molecules exist, which are involved in a variety of functions, such as catalyzing biochemical reactions or performing regulatory roles. In analogy to proteins, the function of RNAs depends on their structure and dynamics, which are largely determined by the ribonucleotide sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that simulate either the physical process of RNA structure formation (“Greek science” approach) or utilize information derived from known structures of other RNA molecules (“Babylonian science” approach). All computational methods suffer from various limitations that make them generally unreliable for structure prediction of long RNA sequences. However, in many cases, the limitations of computational and experimental methods can be overcome by combining these two complementary approaches with each other. In this work, we review computational approaches for RNA structure prediction, with emphasis on implementations (particular programs) that can utilize restraints derived from experimental analyses. We also list experimental approaches, whose results can be relatively easily used by computational methods. Finally, we describe case studies where computational and experimental analyses were successfully combined to determine RNA structures that would remain out of reach for each of these approaches applied separately. PMID:24785264
Latent Ice Recrystallization Inhibition Activity in Nonantifreeze Proteins: Ca2+-Activated Plant Lectins and Cation-Activated Antimicrobial Peptides.

PubMed

Mitchell, Daniel E; Gibson, Matthew I

2015-10-12

Organisms living in polar regions have evolved a series of antifreeze (glyco) proteins (AFGPs) to enable them to survive by modulating the structure of ice. These proteins have huge potential for use in cellular cryopreservation, ice-resistant surfaces, frozen food, and cryosurgery, but they are limited by their relatively low availability and questions regarding their mode of action. This has triggered the search for biomimetic materials capable of reproducing this function. The identification of new structures and sequences capable of inhibiting ice growth is crucial to aid our understanding of these proteins. Here, we show that plant c-type lectins, which have similar biological function to human c-type lectins (glycan recognition) but no sequence homology to AFPs, display calcium-dependent ice recrystallization inhibition (IRI) activity. This IRI activity can be switched on/off by changing the Ca2+ concentration. To show that more (nonantifreeze) proteins may exist with the potential to display IRI, a second motif was considered, amphipathicity. All known AFPs have defined hydrophobic/hydrophilic domains, rationalizing this choice. The cheap, and widely used, antimicrobial Nisin was found to have cation-dependent IRI activity, controlled by either acid or addition of histidine-binding ions such as zinc or nickel, which promote its amphipathic structure. These results demonstrate a new approach in the identification of antifreeze protein mimetic macromolecules and may help in the development of synthetic mimics of AFPs.
Latent Ice Recrystallization Inhibition Activity in Nonantifreeze Proteins: Ca2+-Activated Plant Lectins and Cation-Activated Antimicrobial Peptides

PubMed Central

2015-01-01

Organisms living in polar regions have evolved a series of antifreeze (glyco) proteins (AFGPs) to enable them to survive by modulating the structure of ice. These proteins have huge potential for use in cellular cryopreservation, ice-resistant surfaces, frozen food, and cryosurgery, but they are limited by their relatively low availability and questions regarding their mode of action. This has triggered the search for biomimetic materials capable of reproducing this function. The identification of new structures and sequences capable of inhibiting ice growth is crucial to aid our understanding of these proteins. Here, we show that plant c-type lectins, which have similar biological function to human c-type lectins (glycan recognition) but no sequence homology to AFPs, display calcium-dependent ice recrystallization inhibition (IRI) activity. This IRI activity can be switched on/off by changing the Ca2+ concentration. To show that more (nonantifreeze) proteins may exist with the potential to display IRI, a second motif was considered, amphipathicity. All known AFPs have defined hydrophobic/hydrophilic domains, rationalizing this choice. The cheap, and widely used, antimicrobial Nisin was found to have cation-dependent IRI activity, controlled by either acid or addition of histidine-binding ions such as zinc or nickel, which promote its amphipathic structure. These results demonstrate a new approach in the identification of antifreeze protein mimetic macromolecules and may help in the development of synthetic mimics of AFPs. PMID:26407233
Submolecular Structure and Orientation of Oligonucleotide Duplexes Tethered to Gold Electrodes Probed by Infrared Reflection Absorption Spectroscopy: Effect of the Electrode Potentials.

PubMed

Kékedy-Nagy, László; Ferapontova, Elena E; Brand, Izabella

2017-02-23

Unique electronic and ligand recognition properties of the DNA double helix provide basis for DNA applications in biomolecular electronic and biosensor devices. However, the relation between the structure of DNA at electrified interfaces and its electronic properties is still not well understood. Here, potential-driven changes in the submolecular structure of DNA double helices composed of either adenine-thymine (dAdT) 25 or cytosine-guanine (dGdC) 20 base pairs tethered to the gold electrodes are for the first time analyzed by in situ polarization modulation infrared reflection absorption spectroscopy (PM IRRAS) performed under the electrochemical control. It is shown that the conformation of the DNA duplexes tethered to gold electrodes via the C 6 alkanethiol linker strongly depends on the nucleic acid sequence composition. The tilt of purine and pyrimidine rings of the complementary base pairs (dAdT and dGdC) depends on the potential applied to the electrode. By contrast, neither the conformation nor orientation of the ionic in character phosphate-sugar backbone is affected by the electrode potentials. At potentials more positive than the potential of zero charge (pzc), a gradual tilting of the double helix is observed. In this tilted orientation, the planes of the complementary purine and pyrimidine rings lie ideally parallel to each other. These potentials do not affect the integral stability of the DNA double helix at the charged interface. At potentials more negative than the pzc, DNA helices adopt a vertical to the gold surface orientation. Tilt of the purine and pyrimidine rings depends on the composition of the double helix. In monolayers composed of (dAdT) 25 molecules the rings of the complementary base pairs lie parallel to each other. By contrast, the tilt of purine and pyrimidine rings in (dGdC) 20 helices depends on the potential applied to the electrode. Such potential-induced mobility of the complementary base pairs can destabilize the helix structure at a submolecular level. These pioneer results on the potential-driven changes in the submolecular structure of double stranded DNA adsorbed on conductive supports contribute to further understanding of the potential-driven sequence-specific electronic properties of surface-tethered oligonucleotides.
Evolutionary and molecular foundations of multiple contemporary functions of the nitroreductase superfamily

PubMed Central

Akiva, Eyal; Copp, Janine N.; Tokuriki, Nobuhiko; Babbitt, Patricia C.

2017-01-01

Insight regarding how diverse enzymatic functions and reactions have evolved from ancestral scaffolds is fundamental to understanding chemical and evolutionary biology, and for the exploitation of enzymes for biotechnology. We undertook an extensive computational analysis using a unique and comprehensive combination of tools that include large-scale phylogenetic reconstruction to determine the sequence, structural, and functional relationships of the functionally diverse flavin mononucleotide-dependent nitroreductase (NTR) superfamily (>24,000 sequences from all domains of life, 54 structures, and >10 enzymatic functions). Our results suggest an evolutionary model in which contemporary subgroups of the superfamily have diverged in a radial manner from a minimal flavin-binding scaffold. We identified the structural design principle for this divergence: Insertions at key positions in the minimal scaffold that, combined with the fixation of key residues, have led to functional specialization. These results will aid future efforts to delineate the emergence of functional diversity in enzyme superfamilies, provide clues for functional inference for superfamily members of unknown function, and facilitate rational redesign of the NTR scaffold. PMID:29078300
A Dual-Specific Targeting Approach Based on the Simultaneous Recognition of Duplex and Quadruplex Motifs.

PubMed

Nguyen, Thi Quynh Ngoc; Lim, Kah Wai; Phan, Anh Tuân

2017-09-20

Small-molecule ligands targeting nucleic acids have been explored as potential therapeutic agents. Duplex groove-binding ligands have been shown to recognize DNA in a sequence-specific manner. On the other hand, quadruplex-binding ligands exhibit high selectivity between quadruplex and duplex, but show limited discrimination between different quadruplex structures. Here we propose a dual-specific approach through the simultaneous application of duplex- and quadruplex-binders. We demonstrated that a quadruplex-specific ligand and a duplex-specific ligand can simultaneously interact at two separate binding sites of a quadruplex-duplex hybrid harbouring both quadruplex and duplex structural elements. Such a dual-specific targeting strategy would combine the sequence specificity of duplex-binders and the strong binding affinity of quadruplex-binders, potentially allowing the specific targeting of unique quadruplex structures. Future research can be directed towards the development of conjugated compounds targeting specific genomic quadruplex-duplex sites, for which the linker would be highly context-dependent in terms of length and flexibility, as well as the attachment points onto both ligands.
Crystal structures of the SAM-III/SMK riboswitch reveal the SAM-dependent translation inhibition mechanism

PubMed Central

Lu, Changrui; Smith, Angela M; Fuchs, Ryan T; Ding, Fang; Rajashankar, Kanagalaghatta; Henkin, Tina M; Ke, Ailong

2011-01-01

Three distinct classes of S-adenosyl-l-methionine (SAM)-responsive riboswitches have been identified that regulate bacterial gene expression at the levels of transcription attenuation or translation inhibition. The SMK box (SAM-III) translational riboswitch has been identified in the SAM synthetase gene in members of the Lactobacillales. Here we report the 2.2-Å crystal structure of the Enterococcus faecalis SMK box riboswitch. The Y-shaped riboswitch organizes its conserved nucleotides around a three-way junction for SAM recognition. The Shine-Dalgarno sequence, which is sequestered by base-pairing with the anti–Shine-Dalgarno sequence in response to SAM binding, also directly participates in SAM recognition. The riboswitch makes extensive interactions with the adenosine and sulfonium moieties of SAM but does not appear to recognize the tail of the methionine moiety. We captured a structural snapshot of the SMK box riboswitch sampling the near-cognate ligand S-adenosyl-l-homocysteine (SAH) in which SAH was found to adopt an alternative conformation and fails to make several key interactions. PMID:18806797
Crystal structures of the SAM-III/S[subscript MK] riboswitch reveal the SAM-dependent translation inhibition mechanism

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lu, C.; Smith, A.M.; Fuchs, R.T.

2010-01-07

Three distinct classes of S-adenosyl-L-methionine (SAM)-responsive riboswitches have been identified that regulate bacterial gene expression at the levels of transcription attenuation or translation inhibition. The SMK box (SAM-III) translational riboswitch has been identified in the SAM synthetase gene in members of the Lactobacillales. Here we report the 2.2-{angstrom} crystal structure of the Enterococcus faecalis SMK box riboswitch. The Y-shaped riboswitch organizes its conserved nucleotides around a three-way junction for SAM recognition. The Shine-Dalgarno sequence, which is sequestered by base-pairing with the anti-Shine-Dalgarno sequence in response to SAM binding, also directly participates in SAM recognition. The riboswitch makes extensive interactions withmore » the adenosine and sulfonium moieties of SAM but does not appear to recognize the tail of the methionine moiety. We captured a structural snapshot of the SMK box riboswitch sampling the near-cognate ligand S-adenosyl-L-homocysteine (SAH) in which SAH was found to adopt an alternative conformation and fails to make several key interactions.« less
Crystal structure and functional characterization of yeast YLR011wp, an enzyme with NAD(P)H-FMN and ferric iron reductase activities.

PubMed

Liger, Dominique; Graille, Marc; Zhou, Cong-Zhao; Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Blondeau, Karine; Janin, Joël; van Tilbeurgh, Herman

2004-08-13

Flavodoxins are involved in a variety of electron transfer reactions that are essential for life. Although FMN-binding proteins are well characterized in prokaryotic organisms, information is scarce for eukaryotic flavodoxins. We describe the 2.0-A resolution crystal structure of the Saccharomyces cerevisiae YLR011w gene product, a predicted flavoprotein. YLR011wp indeed adopts a flavodoxin fold, binds the FMN cofactor, and self-associates as a homodimer. Despite the absence of the flavodoxin key fingerprint motif involved in FMN binding, YLR011wp binds this cofactor in a manner very analogous to classical flavodoxins. YLR011wp closest structural homologue is the homodimeric Bacillus subtilis Yhda protein (25% sequence identity) whose homodimer perfectly superimposes onto the YLR011wp one. Yhda, whose function is not documented, has 53% sequence identity with the Bacillus sp. OY1-2 azoreductase. We show that YLR011wp has an NAD(P)H-dependent FMN reductase and a strong ferricyanide reductase activity. We further demonstrate a weak but specific reductive activity on azo dyes and nitrocompounds.
Study of Binding Interaction between Pif80 Protein Fragment and Aragonite

NASA Astrophysics Data System (ADS)

Du, Yuan-Peng; Chang, Hsun-Hui; Yang, Sheng-Yu; Huang, Shing-Jong; Tsai, Yu-Ju; Huang, Joseph Jen-Tse; Chan, Jerry Chun Chung

2016-08-01

Pif is a crucial protein for the formation of the nacreous layer in Pinctada fucata. Three non-acidic peptide fragments of the aragonite-binding domain (Pif80) are selected, which contain multiple copies of the repeat sequence DDRK, to study the interaction between non-acidic peptides and aragonite. The polypeptides DDRKDDRKGGK (Pif80-11) and DDRKDDRKGGKDDRKDDRKGGK (Pif80-22) have similar binding affinity to aragonite. Solid-state NMR data indicate that the backbones of Pif80-11 and Pif80-22 peptides bound on aragonite adopt a random-coil conformation. Pif80-11 is a lot more effective than Pif80-22 in promoting the nucleation of aragonite on the substrate of β-chitin. Our results suggest that the structural arrangement at a protein-mineral interface depends on the surface structure of the mineral substrate and the protein sequence. The side chains of the basic residues, which function as anchors to the aragonite surface, have uniform structures. The role of basic residues as anchors in protein-mineral interaction may play an important role in biomineralization.
Sequence-specific activation of the DNA sensor cGAS by Y-form DNA structures as found in primary HIV-1 cDNA.

PubMed

Herzner, Anna-Maria; Hagmann, Cristina Amparo; Goldeck, Marion; Wolter, Steven; Kübler, Kirsten; Wittmann, Sabine; Gramberg, Thomas; Andreeva, Liudmila; Hopfner, Karl-Peter; Mertens, Christina; Zillinger, Thomas; Jin, Tengchuan; Xiao, Tsan Sam; Bartok, Eva; Coch, Christoph; Ackermann, Damian; Hornung, Veit; Ludwig, Janos; Barchet, Winfried; Hartmann, Gunther; Schlee, Martin

2015-10-01

Cytosolic DNA that emerges during infection with a retrovirus or DNA virus triggers antiviral type I interferon responses. So far, only double-stranded DNA (dsDNA) over 40 base pairs (bp) in length has been considered immunostimulatory. Here we found that unpaired DNA nucleotides flanking short base-paired DNA stretches, as in stem-loop structures of single-stranded DNA (ssDNA) derived from human immunodeficiency virus type 1 (HIV-1), activated the type I interferon-inducing DNA sensor cGAS in a sequence-dependent manner. DNA structures containing unpaired guanosines flanking short (12- to 20-bp) dsDNA (Y-form DNA) were highly stimulatory and specifically enhanced the enzymatic activity of cGAS. Furthermore, we found that primary HIV-1 reverse transcripts represented the predominant viral cytosolic DNA species during early infection of macrophages and that these ssDNAs were highly immunostimulatory. Collectively, our study identifies unpaired guanosines in Y-form DNA as a highly active, minimal cGAS recognition motif that enables detection of HIV-1 ssDNA.
Ebola virus RNA editing depends on the primary editing site sequence and an upstream secondary structure.

PubMed

Mehedi, Masfique; Hoenen, Thomas; Robertson, Shelly; Ricklefs, Stacy; Dolan, Michael A; Taylor, Travis; Falzarano, Darryl; Ebihara, Hideki; Porcella, Stephen F; Feldmann, Heinz

2013-01-01

Ebolavirus (EBOV), the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.

Distribution and prediction of catalytic domains in 2-oxoglutarate dependent dioxygenases

PubMed Central

2012-01-01

Background The 2-oxoglutarate dependent superfamily is a diverse group of non-haem dioxygenases, and is present in prokaryotes, eukaryotes, and archaea. The enzymes differ in substrate preference and reaction chemistry, a factor that precludes their classification by homology studies and electronic annotation schemes alone. In this work, I propose and explore the rationale of using substrates to classify structurally similar alpha-ketoglutarate dependent enzymes. Findings Differential catalysis in phylogenetic clades of 2-OG dependent enzymes, is determined by the interactions of a subset of active-site amino acids. Identifying these with existing computational methods is challenging and not feasible for all proteins. A clustering protocol based on validated mechanisms of catalysis of known molecules, in tandem with group specific hidden markov model profiles is able to differentiate and sequester these enzymes. Access to this repository is by a web server that compares user defined unknown sequences to these pre-defined profiles and outputs a list of predicted catalytic domains. The server is free and is accessible at the following URL ( http://comp-biol.theacms.in/H2OGpred.html). Conclusions The proposed stratification is a novel attempt at classifying and predicting 2-oxoglutarate dependent function. In addition, the server will provide researchers with a tool to compare their data to a comprehensive list of HMM profiles of catalytic domains. This work, will aid efforts by investigators to screen and characterize putative 2-OG dependent sequences. The profile database will be updated at regular intervals. PMID:22862831
Fine-tuning structural RNA alignments in the twilight zone

PubMed Central

2010-01-01

Background A widely used method to find conserved secondary structure in RNA is to first construct a multiple sequence alignment, and then fold the alignment, optimizing a score based on thermodynamics and covariance. This method works best around 75% sequence similarity. However, in a "twilight zone" below 55% similarity, the sequence alignment tends to obscure the covariance signal used in the second phase. Therefore, while the overall shape of the consensus structure may still be found, the degree of conservation cannot be estimated reliably. Results Based on a combination of available methods, we present a method named planACstar for improving structure conservation in structural alignments in the twilight zone. After constructing a consensus structure by alignment folding, planACstar abandons the original sequence alignment, refolds the sequences individually, but consistent with the consensus, aligns the structures, irrespective of sequence, by a pure structure alignment method, and derives an improved sequence alignment from the alignment of structures, to be re-submitted to alignment folding, etc.. This circle may be iterated as long as structural conservation improves, but normally, one step suffices. Conclusions Employing the tools ClustalW, RNAalifold, and RNAforester, we find that for sequences with 30-55% sequence identity, structural conservation can be improved by 10% on average, with a large variation, measured in terms of RNAalifold's own criterion, the structure conservation index. PMID:20433706
Brownian dynamics simulations of sequence-dependent duplex denaturation in dynamically superhelical DNA

NASA Astrophysics Data System (ADS)

Mielke, Steven P.; Grønbech-Jensen, Niels; Krishnan, V. V.; Fink, William H.; Benham, Craig J.

2005-09-01

The topological state of DNA in vivo is dynamically regulated by a number of processes that involve interactions with bound proteins. In one such process, the tracking of RNA polymerase along the double helix during transcription, restriction of rotational motion of the polymerase and associated structures, generates waves of overtwist downstream and undertwist upstream from the site of transcription. The resulting superhelical stress is often sufficient to drive double-stranded DNA into a denatured state at locations such as promoters and origins of replication, where sequence-specific duplex opening is a prerequisite for biological function. In this way, transcription and other events that actively supercoil the DNA provide a mechanism for dynamically coupling genetic activity with regulatory and other cellular processes. Although computer modeling has provided insight into the equilibrium dynamics of DNA supercoiling, to date no model has appeared for simulating sequence-dependent DNA strand separation under the nonequilibrium conditions imposed by the dynamic introduction of torsional stress. Here, we introduce such a model and present results from an initial set of computer simulations in which the sequences of dynamically superhelical, 147 base pair DNA circles were systematically altered in order to probe the accuracy with which the model can predict location, extent, and time of stress-induced duplex denaturation. The results agree both with well-tested statistical mechanical calculations and with available experimental information. Additionally, we find that sites susceptible to denaturation show a propensity for localizing to supercoil apices, suggesting that base sequence determines locations of strand separation not only through the energetics of interstrand interactions, but also by influencing the geometry of supercoiling.
Magnetic resonance imaging of the femoral trochlea: evaluation of anatomical landmarks and grading articular cartilage in cadaveric knees.

PubMed

Muhle, Claus; Ahn, Joong Mo; Trudell, Debra; Resnick, Donald

2008-06-01

The purpose of the study was to define magnetic resonance imaging (MRI) findings before and after contrast medium opacification of the knee joint in cadaveric specimens to demonstrate anatomical landmarks of the trochlear surface in relation to the neighboring structures, and to evaluate different MRI sequences in the detection of cartilage defects of the trochlear and patellar surface of the knee. The morphology and relationship of the proximal trochlear surface to the prefemoral fat of the distal femur were investigated by use of different MR sequences before and after intra-articular gadolinium administration into the knee joint in ten cadaveric knees. Anatomic sections were subsequently obtained. In addition, evaluation of the articular surface of the trochlea was performed by two independent observers. The cartilage surfaces were graded using a 2-point system, and results were compared with macroscopic findings. Of 40 cartilage surfaces evaluated, histopathologic findings showed 9 normal surfaces, 20 containing partial-thickness defects, and 11 containing full-thickness defects. Compared with macroscopic data, sensitivity of MR sequences for the two reviewers was between 17 and 90%; specificity, 75 and 100%; positive predictive value, 75 and 100%; negative predictive value, 20 and 100%, depending on patellar or trochlea lesions. Interobserver variability for the presence of disease, which was measured using the kappa statistic, was dependent on the MR sequence used between 0.243 and 0.851. Magnetic resonance imaging sequences can be used to evaluate the cartilage of the trochlear surface with less accuracy when compared with the results of grading the articular cartilage of the patella.
Brownian dynamics simulations of sequence-dependent duplex denaturation in dynamically superhelical DNA.

PubMed

Mielke, Steven P; Grønbech-Jensen, Niels; Krishnan, V V; Fink, William H; Benham, Craig J

2005-09-22

The topological state of DNA in vivo is dynamically regulated by a number of processes that involve interactions with bound proteins. In one such process, the tracking of RNA polymerase along the double helix during transcription, restriction of rotational motion of the polymerase and associated structures, generates waves of overtwist downstream and undertwist upstream from the site of transcription. The resulting superhelical stress is often sufficient to drive double-stranded DNA into a denatured state at locations such as promoters and origins of replication, where sequence-specific duplex opening is a prerequisite for biological function. In this way, transcription and other events that actively supercoil the DNA provide a mechanism for dynamically coupling genetic activity with regulatory and other cellular processes. Although computer modeling has provided insight into the equilibrium dynamics of DNA supercoiling, to date no model has appeared for simulating sequence-dependent DNA strand separation under the nonequilibrium conditions imposed by the dynamic introduction of torsional stress. Here, we introduce such a model and present results from an initial set of computer simulations in which the sequences of dynamically superhelical, 147 base pair DNA circles were systematically altered in order to probe the accuracy with which the model can predict location, extent, and time of stress-induced duplex denaturation. The results agree both with well-tested statistical mechanical calculations and with available experimental information. Additionally, we find that sites susceptible to denaturation show a propensity for localizing to supercoil apices, suggesting that base sequence determines locations of strand separation not only through the energetics of interstrand interactions, but also by influencing the geometry of supercoiling.
Mechanism for Coordinated RNA Packaging and Genome Replication by Rotavirus Polymerase VP1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lu, Xiaohui; McDonald, Sarah M.; Tortorici, M. Alejandra

2009-04-08

Rotavirus RNA-dependent RNA polymerase VP1 catalyzes RNA synthesis within a subviral particle. This activity depends on core shell protein VP2. A conserved sequence at the 3' end of plus-strand RNA templates is important for polymerase association and genome replication. We have determined the structure of VP1 at 2.9 {angstrom} resolution, as apoenzyme and in complex with RNA. The cage-like enzyme is similar to reovirus {lambda}3, with four tunnels leading to or from a central, catalytic cavity. A distinguishing characteristic of VP1 is specific recognition, by conserved features of the template-entry channel, of four bases, UGUG, in the conserved 3' sequence.more » Well-defined interactions with these bases position the RNA so that its 3' end overshoots the initiating register, producing a stable but catalytically inactive complex. We propose that specific 3' end recognition selects rotavirus RNA for packaging and that VP2 activates the autoinhibited VP1/RNA complex to coordinate packaging and genome replication.« less
Temporal Integration of Auditory Information Is Invariant to Temporal Grouping Cues1,2,3

PubMed Central

Tsunada, Joji

2015-01-01

Abstract Auditory perception depends on the temporal structure of incoming acoustic stimuli. Here, we examined whether a temporal manipulation that affects the perceptual grouping also affects the time dependence of decisions regarding those stimuli. We designed a novel discrimination task that required human listeners to decide whether a sequence of tone bursts was increasing or decreasing in frequency. We manipulated temporal perceptual-grouping cues by changing the time interval between the tone bursts, which led to listeners hearing the sequences as a single sound for short intervals or discrete sounds for longer intervals. Despite these strong perceptual differences, this manipulation did not affect the efficiency of how auditory information was integrated over time to form a decision. Instead, the grouping manipulation affected subjects’ speed−accuracy trade-offs. These results indicate that the temporal dynamics of evidence accumulation for auditory perceptual decisions can be invariant to manipulations that affect the perceptual grouping of the evidence. PMID:26464975
The Thiamin Pyrophosphate-Motif

NASA Technical Reports Server (NTRS)

Dominiak, Paulina M.; Ciszak, Ewa M.

2003-01-01

Using databases the authors have identified a common thiamin pyrophosphate (TPP)-motif in the family of functionally diverse TPP-dependent enzymes. This common motif consists of multimeric organization of subunits, two catalytic centers, common amino acid sequence, and specific contacts to provide a flip-flop, or alternate site, mechanism of action. Each catalytic center [PP:PYR] is formed at the interface of the PP-domain binding the magnesium ion, pyrophosphate and aminopyrimidine ring of TPP, and the PYR-domain binding the aminopyrimidine ring of that cofactor. A pair of these catalytic centers constitutes the catalytic core [PP:PYR]* within these enzymes. Analysis of the structural elements of this catalytic core reveals novel definition of the common amino acid sequences, which are GX@&(G)@XXGQ, and GDGX25-30 within the PP- domain, and the E&(G)@XXG@ within the PYR-domain, where Q, corresponds to a hydrophobic amino acid. This TPP-motif provides a novel tool for annotation of TPP-dependent enzymes useful in advancing functional proteomics.
Deconstruction of the Ras switching cycle through saturation mutagenesis

PubMed Central

Bandaru, Pradeep; Shah, Neel H; Bhattacharyya, Moitrayee; Barton, John P; Kondo, Yasushi; Cofsky, Joshua C; Gee, Christine L; Chakraborty, Arup K; Kortemme, Tanja; Ranganathan, Rama; Kuriyan, John

2017-01-01

Ras proteins are highly conserved signaling molecules that exhibit regulated, nucleotide-dependent switching between active and inactive states. The high conservation of Ras requires mechanistic explanation, especially given the general mutational tolerance of proteins. Here, we use deep mutational scanning, biochemical analysis and molecular simulations to understand constraints on Ras sequence. Ras exhibits global sensitivity to mutation when regulated by a GTPase activating protein and a nucleotide exchange factor. Removing the regulators shifts the distribution of mutational effects to be largely neutral, and reveals hotspots of activating mutations in residues that restrain Ras dynamics and promote the inactive state. Evolutionary analysis, combined with structural and mutational data, argue that Ras has co-evolved with its regulators in the vertebrate lineage. Overall, our results show that sequence conservation in Ras depends strongly on the biochemical network in which it operates, providing a framework for understanding the origin of global selection pressures on proteins. DOI: http://dx.doi.org/10.7554/eLife.27810.001 PMID:28686159
Frequency-dependent seismic attenuation in the eastern United States as observed from the 2011 central Virginia earthquake and aftershock sequence

USGS Publications Warehouse

McNamara, Daniel E.; Gee, Lind; Benz, Harley M.; Chapman, Martin

2014-01-01

Ground shaking due to earthquakes in the eastern United States (EUS) is felt at significantly greater distances than in the western United States (WUS) and for some earthquakes it has been shown to display a strong preferential direction. Shaking intensity variation can be due to propagation path effects, source directivity, and/or site amplification. In this paper, we use S and Lg waves recorded from the 2011 central Virginia earthquake and aftershock sequence, in the Central Virginia Seismic Zone, to quantify attenuation as frequency‐dependent Q(f). In support of observations based on shaking intensity, we observe high Q values in the EUS relative to previous studies in the WUS with especially efficient propagation along the structural trend of the Appalachian mountains. Our analysis of Q(f) quantifies the path effects of the northeast‐trending felt distribution previously inferred from the U.S. Geological Survey (USGS) “Did You Feel It” data, historic intensity data, and the asymmetrical distribution of rockfalls and landslides.
SimRNAweb: a web server for RNA 3D structure modeling with optional restraints.

PubMed

Magnus, Marcin; Boniecki, Michał J; Dawson, Wayne; Bujnicki, Janusz M

2016-07-08

RNA function in many biological processes depends on the formation of three-dimensional (3D) structures. However, RNA structure is difficult to determine experimentally, which has prompted the development of predictive computational methods. Here, we introduce a user-friendly online interface for modeling RNA 3D structures using SimRNA, a method that uses a coarse-grained representation of RNA molecules, utilizes the Monte Carlo method to sample the conformational space, and relies on a statistical potential to describe the interactions in the folding process. SimRNAweb makes SimRNA accessible to users who do not normally use high performance computational facilities or are unfamiliar with using the command line tools. The simplest input consists of an RNA sequence to fold RNA de novo. Alternatively, a user can provide a 3D structure in the PDB format, for instance a preliminary model built with some other technique, to jump-start the modeling close to the expected final outcome. The user can optionally provide secondary structure and distance restraints, and can freeze a part of the starting 3D structure. SimRNAweb can be used to model single RNA sequences and RNA-RNA complexes (up to 52 chains). The webserver is available at http://genesilico.pl/SimRNAweb. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Molecular genetic diversity in populations of the stingless bee Plebeia remota: A case study

PubMed Central

de Oliveira Francisco, Flávio; Santiago, Leandro Rodrigues; Arias, Maria Cristina

2013-01-01

Genetic diversity is a major component of the biological diversity of an ecosystem. The survival of a population may be seriously threatened if its genetic diversity values are low. In this work, we measured the genetic diversity of the stingless bee Plebeia remota based on molecular data obtained by analyzing 15 microsatellite loci and sequencing two mitochondrial genes. Population structure and genetic diversity differed depending on the molecular marker analyzed: microsatellites showed low population structure and moderate to high genetic diversity, while mitochondrial DNA (mtDNA) showed high population structure and low diversity in three populations. Queen philopatry and male dispersal behavior are discussed as the main reasons for these findings. PMID:23569417
Molecular genetic diversity in populations of the stingless bee Plebeia remota: A case study.

PubMed

de Oliveira Francisco, Flávio; Santiago, Leandro Rodrigues; Arias, Maria Cristina

2013-03-01

Genetic diversity is a major component of the biological diversity of an ecosystem. The survival of a population may be seriously threatened if its genetic diversity values are low. In this work, we measured the genetic diversity of the stingless bee Plebeia remota based on molecular data obtained by analyzing 15 microsatellite loci and sequencing two mitochondrial genes. Population structure and genetic diversity differed depending on the molecular marker analyzed: microsatellites showed low population structure and moderate to high genetic diversity, while mitochondrial DNA (mtDNA) showed high population structure and low diversity in three populations. Queen philopatry and male dispersal behavior are discussed as the main reasons for these findings.
Mutation of domain III and domain VI in L gene conserved domain of Nipah virus

NASA Astrophysics Data System (ADS)

Jalani, Siti Aishah; Ibrahim, Nazlina

2016-11-01

Nipah virus (NiV) is the etiologic agent responsible for the respiratory illness and causes fatal encephalitis in human. NiV L protein subunit is thought to be responsible for the majority of enzymatic activities involved in viral transcription and replication. The L protein which is the viral RNA dependent RNA polymerase has high sequence homology among negative sense RNA viruses. In negative stranded RNA viruses, based on sequence alignment six conserved domain (domain I-IV) have been determined. Each domain is separated on variable regions that suggest the structure to consist concatenated functional domain. To directly address the roles of domains III and VI, site-directed mutations were constructed by the substitution of bases at sequences 2497, 2500, 5528 and 5532. Each mutated L gene can be used in future studies to test the ability for expression on in vitro translation.
Ancestral multipartite units in light-responsive plant promoters have structural features correlating with specific phototransduction pathways.

PubMed Central

Argüello-Astorga, G R; Herrera-Estrella, L R

1996-01-01

Regulation of plant gene transcription by light is mediated by multipartite cis-regulatory units. Previous attempts to identify structural features that are common to all light-responsive elements (LREs) have been unsuccessful. To address the question of what is needed to confer photoresponsiveness to a promoter, the upstream sequences from more than 110 light-regulated plant genes were analyzed by a new, phylogenetic-structural method. As a result, 30 distinct conserved DNA module arrays (CMAs) associated with light-responsive promoter regions were identified. Several of these CMAs have remained invariant throughout the evolutionary radiation of angiosperms and are conserved between homologous genes as well as between members of different gene families. The identified CMAs share a gene superfamily-specific core that correlates with the particular phytochrome-dependent transduction pathway that controls their expression, i.e. ACCTA(A/C)C(A/C) for the cGMP-dependent phenylpropanoid metabolism-associated genes, and GATA(A/T)GR for the Ca2+/calmodulin-dependent photosynthesis-associated nuclear genes. In addition to suggesting a general model for the functional and structural organization of LREs, the data obtained in this study indicate that angiosperm LREs probably evolved from complex cis-acting elements involved in regulatory processes other than photoregulation in gymnosperms. PMID:8938415
In silico methods for co-transcriptional RNA secondary structure prediction and for investigating alternative RNA structure expression.

PubMed

Meyer, Irmtraud M

2017-05-01

RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive. Copyright © 2017. Published by Elsevier Inc.
Dissecting the relationship between protein structure and sequence variation

NASA Astrophysics Data System (ADS)

Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team

2015-03-01

Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0 . 1 to 0 . 8 . Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~ 10 - 30 % of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous back-bone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.
Structural and Functional Basis of the Fidelity of Nucleotide Selection by Flavivirus RNA-Dependent RNA Polymerases

PubMed Central

Canard, Bruno

2018-01-01

Viral RNA-dependent RNA polymerases (RdRps) play a central role not only in viral replication, but also in the genetic evolution of viral RNAs. After binding to an RNA template and selecting 5′-triphosphate ribonucleosides, viral RdRps synthesize an RNA copy according to Watson-Crick base-pairing rules. The copy process sometimes deviates from both the base-pairing rules specified by the template and the natural ribose selectivity and, thus, the process is error-prone due to the intrinsic (in)fidelity of viral RdRps. These enzymes share a number of conserved amino-acid sequence strings, called motifs A–G, which can be defined from a structural and functional point-of-view. A co-relation is gradually emerging between mutations in these motifs and viral genome evolution or observed mutation rates. Here, we review our current knowledge on these motifs and their role on the structural and mechanistic basis of the fidelity of nucleotide selection and RNA synthesis by Flavivirus RdRps. PMID:29385764
Bacterial communities in full-scale wastewater treatment systems.

PubMed

Cydzik-Kwiatkowska, Agnieszka; Zielińska, Magdalena

2016-04-01

Bacterial metabolism determines the effectiveness of biological treatment of wastewater. Therefore, it is important to define the relations between the species structure and the performance of full-scale installations. Although there is much laboratory data on microbial consortia, our understanding of dependencies between the microbial structure and operational parameters of full-scale wastewater treatment plants (WWTP) is limited. This mini-review presents the types of microbial consortia in WWTP. Information is given on extracellular polymeric substances production as factor that is key for formation of spatial structures of microorganisms. Additionally, we discuss data on microbial groups including nitrifiers, denitrifiers, Anammox bacteria, and phosphate- and glycogen-accumulating bacteria in full-scale aerobic systems that was obtained with the use of molecular techniques, including high-throughput sequencing, to shed light on dependencies between the microbial ecology of biomass and the overall efficiency and functional stability of wastewater treatment systems. Sludge bulking in WWTPs is addressed, as well as the microbial composition of consortia involved in antibiotic and micropollutant removal.
Structural studies of ROK fructokinase YdhR from Bacillus subtilis : insights into substrate binding and fructose specificity.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nocek, B.; Stein, A.; Jedrzejczak, R.

2011-02-18

The main pathway of bacterial sugar phosphorylation utilizes specific phosphoenolpyruvate phosphotransferase system (PTS) enzymes. In addition to the classic PTS system, a PTS-independent secondary system has been described in which nucleotide-dependent sugar kinases are used for monosaccharide phosphorylation. Fructokinase (FK), which phosphorylates d-fructose with ATP as a cofactor, has been shown to be a member of this secondary system. Bioinformatic analysis has shown that FK is a member of the 'ROK' (bacterial Repressors, uncharacterized Open reading frames, and sugar Kinases) sequence family. In this study, we report the crystal structures of ROK FK from Bacillus subtilis (YdhR) (a) apo andmore » in the presence of (b) ADP and (c) ADP/d-fructose. All structures show that YdhR is a homodimer with a monomer composed of two similar {alpha}/{beta} domains forming a large cleft between domains that bind ADP and d-fructose. Enzymatic activity assays support YdhR function as an ATP-dependent fructose kinase.« less

Some links on this page may take you to non-federal websites. Their policies may differ from this site.