DOE Office of Scientific and Technical Information (OSTI.GOV)
Randall, Graham L.; Zechiedrich, E. L.; Pettitt, Bernard M.
2009-09-01
To understand how underwinding and overwinding the DNA helix affect its structure, we simulated 19 independent DNA systems with fixed degrees of twist using molecular dynamics in a system that does not allow writhe. Underwinding DNA induced spontaneous, sequence-dependent base flipping and local denaturation, while overwinding DNA induced the formation of Pauling-like DNA (P-DNA). The winding resulted in a bimodal state simultaneously including local structural failure and B-form DNA for both underwinding and extreme overwinding. Our simulations suggest that base flipping and local denaturation may provide a landscape influencing protein recognition of DNA sequence to affect, for example, replication, transcription, and recombination. Additionally, our findings help explain results from single-molecule experiments and demonstrate that elastic rod models are strictly valid on average only for unstressed or overwound DNA up to P-DNA formation. Finally, our data support a model in which base flipping can result from torsional stress.
The right inferior frontal gyrus processes nested non-local dependencies in music.
Cheung, Vincent K M; Meyer, Lars; Friederici, Angela D; Koelsch, Stefan
2018-02-28
Complex auditory sequences known as music have often been described as hierarchically structured. This permits the existence of non-local dependencies, which relate elements of a sequence beyond their temporal sequential order. Previous studies in music have reported differential activity in the inferior frontal gyrus (IFG) when comparing regular and irregular chord-transitions based on theories in Western tonal harmony. However, it is unclear if the observed activity reflects the interpretation of hierarchical structure as the effects are confounded by local irregularity. Using functional magnetic resonance imaging (fMRI), we found that violations to non-local dependencies in nested sequences of three-tone musical motifs in musicians elicited increased activity in the right IFG. This is in contrast to similar studies in language which typically report the left IFG in processing grammatical syntax. Effects of increasing auditory working memory demands are moreover reflected by distributed activity in frontal and parietal regions. Our study therefore demonstrates the role of the right IFG in processing non-local dependencies in music, and suggests that hierarchical processing in different cognitive domains relies on similar mechanisms that are subserved by domain-selective neuronal subpopulations.
A Generative Angular Model of Protein Structure Evolution
Golden, Michael; García-Portugués, Eduardo; Sørensen, Michael; Mardia, Kanti V.; Hamelryck, Thomas; Hein, Jotun
2017-01-01
Recently described stochastic models of protein evolution have demonstrated that the inclusion of structural information in addition to amino acid sequences leads to a more reliable estimation of evolutionary parameters. We present a generative, evolutionary model of protein structure and sequence that is valid on a local length scale. The model concerns the local dependencies between sequence and structure evolution in a pair of homologous proteins. The evolutionary trajectory between the two structures in the protein pair is treated as a random walk in dihedral angle space, which is modeled using a novel angular diffusion process on the two-dimensional torus. Coupling sequence and structure evolution in our model allows for modeling both “smooth” conformational changes and “catastrophic” conformational jumps, conditioned on the amino acid changes. The model has interpretable parameters and is comparatively more realistic than previous stochastic models, providing new insights into the relationship between sequence and structure evolution. For example, using the trained model we were able to identify an apparent sequence–structure evolutionary motif present in a large number of homologous protein pairs. The generative nature of our model enables us to evaluate its validity and its ability to simulate aspects of protein evolution conditioned on an amino acid sequence, a related amino acid sequence, a related structure or any combination thereof. PMID:28453724
Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L
2011-06-02
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. 
StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemla, A; Lang, D; Kostova, T
2010-11-29
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitate the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that share structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein.
StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position.
Camproux, A C; Tufféry, P
2005-08-05
Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence.
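The "logic of connections" between structural letters can be illustrated with a small sketch. It assumes the letter series are already known and estimates the transition matrix by simple counting, which is a simplification: the abstract describes a Hidden Markov Model that learns the letters and their transitions simultaneously. The three-letter alphabet and the example series are hypothetical.

```python
from collections import Counter


def transition_matrix(series_list, alphabet):
    """Estimate P(next letter | current letter) by counting adjacent letters."""
    counts = {a: Counter() for a in alphabet}
    for series in series_list:
        for cur, nxt in zip(series, series[1:]):
            counts[cur][nxt] += 1
    matrix = {}
    for a in alphabet:
        total = sum(counts[a].values())
        # A letter with no observed outgoing transitions keeps an all-zero row.
        matrix[a] = {b: (counts[a][b] / total if total else 0.0) for b in alphabet}
    return matrix


# Hypothetical structures already encoded in a toy three-letter alphabet.
encoded = ["AABBC", "ABBCC"]
tm = transition_matrix(encoded, "ABC")
print(tm["A"])
```

Each row of the resulting matrix sums to one whenever the letter has at least one observed successor, so it can be read directly as a conditional probability table.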
Vlahovicek, K; Munteanu, M G; Pongor, S
1999-01-01
Bending is a local conformational micropolymorphism of DNA in which the original B-DNA structure is only distorted but not extensively modified. Bending can be predicted by simple static geometry models as well as by a recently developed elastic model that incorporates sequence-dependent anisotropic bendability (SDAB). The SDAB model qualitatively explains phenomena including affinity of protein binding, kinking, as well as sequence-dependent vibrational properties of DNA. The vibrational properties of DNA segments can be studied by finite element analysis of a model subjected to an initial bending moment. The frequency spectrum is obtained by applying Fourier analysis to the displacement values in the time domain. This analysis shows that the spectrum of the bending vibrations depends quite sensitively on the sequence; for example, the spectrum of a curved sequence is characteristically different from the spectrum of straight sequence motifs of identical basepair composition. Curvature distributions are genome-specific, and pronounced differences are found between protein-coding and regulatory regions; that is, sites of extreme curvature and/or bendability are less frequent in protein-coding regions. A WWW server has been set up for the prediction of curvature and generation of 3D models from DNA sequences (http://www.icgeb.trieste.it/dna).
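The step from time-domain displacements to a frequency spectrum can be sketched with a naive discrete Fourier transform. This is only an illustration of the Fourier step: the displacement trace below is a synthetic sine wave with an assumed sampling interval, not output from a finite element model, and the function name is hypothetical.

```python
import cmath
import math


def amplitude_spectrum(samples, dt):
    """Return (frequency, amplitude) pairs from a naive discrete Fourier transform."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(samples)
        )
        spectrum.append((k / (n * dt), abs(coeff) / n))
    return spectrum


# Synthetic displacement trace: a 5 Hz oscillation sampled every 0.01 s for 1 s.
dt = 0.01
trace = [math.sin(2 * math.pi * 5.0 * i * dt) for i in range(100)]
spec = amplitude_spectrum(trace, dt)
peak_freq, peak_amp = max(spec, key=lambda fa: fa[1])
print(peak_freq, peak_amp)
```

The naive transform costs a number of operations quadratic in the trace length, which is fine for a short illustration; a fast Fourier transform would normally be used for long traces.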
NMR studies on the structure and dynamics of lac operator DNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, S.C.
Nuclear Magnetic Resonance spectroscopy was used to elucidate the relationships between structure, dynamics and function of the gene regulatory sequence corresponding to the lactose operon operator of Escherichia coli. The length of the DNA fragments examined varied from 13 to 36 base pairs, containing all or part of the operator sequence. These DNA fragments were either derived genetically or synthesized chemically. Resonances of the imino protons were assigned by one-dimensional inter-base pair nuclear Overhauser enhancement (NOE) measurements. Imino proton exchange rates were measured by saturation recovery methods. Results from the kinetic measurements show an interesting dynamic heterogeneity with a maximum opening rate centered about a GTG/CAC sequence which correlates with the biological function of the operator DNA. This particular three base pair sequence occurs frequently and often symmetrically in prokaryotic and eukaryotic DNA sites where one anticipates specific protein interaction for gene regulation. The observed sequence-dependent imino proton exchange rate may be a reflection of variation of the local structure of regulatory DNA. The results also indicate that the observed imino proton exchange rates are length dependent.
Poltev, Valeri; Anisimov, Victor M; Danilov, Victor I; Garcia, Dolores; Sanchez, Carolina; Deriabina, Alexandra; Gonzalez, Eduardo; Rivas, Francisco; Polteva, Nina
2014-06-01
Our previous DFT computations of deoxydinucleoside monophosphate complexes with Na(+)-ions (dDMPs) have demonstrated that the main characteristics of Watson-Crick (WC) right-handed duplex families are predefined in the local energy minima of dDMPs. In this work, we study the mechanisms of contribution of chemically monotonous sugar-phosphate backbone and the bases into the double helix irregularity. Geometry optimization of sugar-phosphate backbone produces energy minima matching the WC DNA conformations. Studying the conformational variability of dDMPs in response to sequence permutation, we found that simple replacement of bases in the previously fully optimized dDMPs, e.g. by constructing Pyr-Pur from Pur-Pyr, and Pur-Pyr from Pyr-Pur sequences, while retaining the backbone geometry, automatically produces the mutual base position characteristic of the target sequence. Based on that, we infer that the directionality and the preferable regions of the sugar-phosphate torsions, combined with the difference of purines from pyrimidines in ring shape, determines the sequence dependence of the structure of WC DNA. No such sequence dependence exists in dDMPs corresponding to other DNA conformations (e.g., Z-family and Hoogsteen duplexes). Unlike other duplexes, WC helix is unique by its ability to match the local energy minima of the free single strand to the preferable conformations of the duplex. Copyright © 2013 Wiley Periodicals, Inc.
A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.
Etchebest, C; Benros, C; Bornot, A; Camproux, A-C; de Brevern, A G
2007-11-01
The protein sequence world is considerably larger than the structure world. Consequently, numerous unrelated sequences may adopt similar 3D folds, and different kinds of amino acids may thus be found in similar 3D structures. By grouping the 20 amino acids into a smaller number of representative residues with similar features, the sequence world can be simplified. This clustering hence defines a reduced amino acid alphabet (reduced AAA). Numerous works have shown that protein 3D structures are composed of a limited number of building blocks, defining a structural alphabet. We previously identified such an alphabet composed of 16 representative structural motifs (five residues in length) called Protein Blocks (PBs). This alphabet makes it possible to translate the structure (3D) into a sequence of PBs (1D). Based on these two concepts, reduced AAA and PBs, we analyzed the distributions of the different kinds of amino acids and their equivalences in the structural context. Different reduced sets were considered. Recurrent amino acid associations were found in all the local structures, while others were specific to some local structures (PBs) (e.g., Cysteine, Histidine, Threonine and Serine for the alpha-helix Ncap). Some similar associations are found in other reduced AAAs, e.g., Ile with Val, or the hydrophobic aromatic residues Trp with Phe and Tyr. We highlight interesting alternative associations. This underlines the dependence on the information considered (sequence or structure). This approach, equivalent to a substitution matrix, could be useful for designing protein sequences with different features (for instance adaptation to environment) while largely preserving the 3D fold.
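The idea of a reduced amino acid alphabet can be sketched as a lookup table that maps each of the 20 residues to a group representative. The grouping below is a hypothetical one chosen for illustration (it only echoes the Ile/Val and Trp/Phe/Tyr associations mentioned above); it is not the set of clusters derived in the paper.

```python
# Hypothetical grouping for illustration only; not the clusters from the paper.
REDUCED_GROUPS = {
    "IVLM": "I",  # aliphatic hydrophobic residues
    "FWY": "F",   # aromatic residues
    "ST": "S",    # small hydroxyl residues
    "DE": "D",    # acidic residues
    "KRH": "K",   # basic residues
    "NQ": "N",    # amide residues
    "AG": "A",    # small residues
    "C": "C",
    "P": "P",
}
LOOKUP = {aa: rep for group, rep in REDUCED_GROUPS.items() for aa in group}


def reduce_sequence(seq):
    """Translate a one-letter amino acid sequence into the reduced alphabet.

    Raises KeyError for any character outside the 20 standard residues.
    """
    return "".join(LOOKUP[aa] for aa in seq.upper())


reduced = reduce_sequence("MVLSPADKTNW")
print(reduced)
```

Swapping in a different `REDUCED_GROUPS` table is enough to compare alternative reduced alphabets on the same sequences.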
Computing the Partition Function for Kinetically Trapped RNA Secondary Structures
Lorenz, William A.; Clote, Peter
2011-01-01
An RNA secondary structure is locally optimal if there is no lower energy structure that can be obtained by the addition or removal of a single base pair, where energy is defined according to the widely accepted Turner nearest neighbor model. Locally optimal structures form kinetic traps, since any evolution away from a locally optimal structure must involve energetically unfavorable folding steps. Here, we present a novel, efficient algorithm to compute the partition function over all locally optimal secondary structures of a given RNA sequence. Our software, RNAlocopt runs in time and space. Additionally, RNAlocopt samples a user-specified number of structures from the Boltzmann subensemble of all locally optimal structures. We apply RNAlocopt to show that (1) the number of locally optimal structures is far fewer than the total number of structures – indeed, the number of locally optimal structures is approximately equal to the square root of the number of all structures, (2) the structural diversity of this subensemble may be either similar to or quite different from the structural diversity of the entire Boltzmann ensemble, a situation that depends on the type of input RNA, (3) the (modified) maximum expected accuracy structure, computed by taking into account base pairing frequencies of locally optimal structures, is a more accurate prediction of the native structure than other current thermodynamics-based methods. The software RNAlocopt constitutes a technical breakthrough in our study of the folding landscape for RNA secondary structures. For the first time, locally optimal structures (kinetic traps in the Turner energy model) can be rapidly generated for long RNA sequences, previously impossible with methods that involved exhaustive enumeration. Use of locally optimal structure leads to state-of-the-art secondary structure prediction, as benchmarked against methods involving the computation of minimum free energy and of maximum expected accuracy.
Web server and source code available at http://bioinformatics.bc.edu/clotelab/RNAlocopt/. PMID:21297972
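The definition of local optimality can be made concrete with a deliberately simplified sketch. It swaps the Turner nearest neighbor model for a toy energy equal to minus the number of base pairs, so removing a pair always raises the energy and a structure is locally optimal exactly when no further pair can be added. The minimum hairpin loop size and the function names are assumptions, and this is not the RNAlocopt algorithm.

```python
# Toy model: energy = -(number of base pairs). This replaces the Turner model
# for illustration only; under it, removing a pair never lowers the energy.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}
MIN_LOOP = 3  # assumed minimum number of unpaired bases inside a hairpin


def parse_pairs(dot_bracket):
    """Return the list of (i, j) base pairs encoded by a dot-bracket string."""
    stack, pairs = [], []
    for i, ch in enumerate(dot_bracket):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            pairs.append((stack.pop(), i))
    return pairs


def can_add_pair(seq, dot_bracket):
    """True if some unpaired (i, j) can pair without crossing existing pairs."""
    pairs = parse_pairs(dot_bracket)
    n = len(seq)
    for i in range(n):
        if dot_bracket[i] != ".":
            continue
        for j in range(i + MIN_LOOP + 1, n):
            if dot_bracket[j] != "." or (seq[i], seq[j]) not in PAIRS:
                continue
            # Non-crossing: both ends of each existing pair lie inside (i, j)
            # or both lie outside it.
            if all((i < k < j) == (i < l < j) for k, l in pairs):
                return True
    return False


def is_locally_optimal(seq, dot_bracket):
    return not can_add_pair(seq, dot_bracket)


print(is_locally_optimal("GCAAAAGC", "(......)"))
print(is_locally_optimal("GCAAAAGC", "((....))"))
```

Under a realistic energy model the removal of a destabilizing pair can also lower the energy, so a full check would have to evaluate both additions and removals.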
SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.
Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael
2018-05-25
Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.
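Counting k-mers over a combined sequence and structure alphabet can be sketched as follows. The encoding used here (upper case for paired bases, lower case for unpaired bases) is an assumption made for the example and may differ from the alphabet SMARTIV actually uses; the function name is hypothetical, and no ranking or enrichment statistics are computed.

```python
from collections import Counter


def combined_kmers(seq, struct, k):
    """Count k-mers in a combined alphabet: paired bases upper case, unpaired lower case."""
    if len(seq) != len(struct):
        raise ValueError("sequence and structure must have the same length")
    combined = "".join(
        base.upper() if mark != "." else base.lower()
        for base, mark in zip(seq, struct)
    )
    return Counter(combined[i:i + k] for i in range(len(combined) - k + 1))


# Toy input: a six-nucleotide sequence with its dot-bracket structure.
counts = combined_kmers("GGAUCC", "((..))", 3)
print(counts)
```

Because each symbol carries both the nucleotide and its pairing state, the same nucleotide k-mer in a stem and in a loop is counted as two distinct combined k-mers.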
Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA
Oberstrass, Florian C.; Fernandes, Louis E.; Bryant, Zev
2012-01-01
B-DNA becomes unstable under superhelical stress and is able to adopt a wide range of alternative conformations including strand-separated DNA and Z-DNA. Localized sequence-dependent structural transitions are important for the regulation of biological processes such as DNA replication and transcription. To directly probe the effect of sequence on structural transitions driven by torque, we have measured the torsional response of a panel of DNA sequences using single molecule assays that employ nanosphere rotational probes to achieve high torque resolution. The responses of Z-forming d(pGpC)n sequences match our predictions based on a theoretical treatment of cooperative transitions in helical polymers. “Bubble” templates containing 50–100 bp mismatch regions show cooperative structural transitions similar to B-DNA, although less torque is required to disrupt strand–strand interactions. Our mechanical measurements, including direct characterization of the torsional rigidity of strand-separated DNA, establish a framework for quantitative predictions of the complex torsional response of arbitrary sequences in their biological context. PMID:22474350
USDA-ARS's Scientific Manuscript database
The human mitochondrial glutamate dehydrogenase isozymes (hGDH1 and 2) are abundant matrix-localized proteins encoded by nuclear genes. The proteins are synthesized in the cytoplasm, with an atypically long N-terminal mitochondrial targeting sequence (MTS). The results of secondary structure predi...
Predicting residue-wise contact orders in proteins by support vector regression.
Song, Jiangning; Burrage, Kevin
2006-10-03
The residue-wise contact order (RWCO) describes the sequence separations between the residue of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered as a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable information for reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences.
Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.
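The two evaluation measures quoted above, the Pearson correlation coefficient (CC) and the root mean square error (RMSE), can be computed as in the following sketch. The function names and the toy numbers are illustrative and are not data from the study.

```python
import math


def pearson_cc(predicted, observed):
    """Pearson correlation coefficient between two equal-length value lists."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    return cov / math.sqrt(var_p * var_o)


def rmse(predicted, observed):
    """Root mean square error between predictions and observations."""
    squared = sum((p - o) ** 2 for p, o in zip(predicted, observed))
    return math.sqrt(squared / len(predicted))


# Toy values only; these are not RWCO data from the paper.
print(pearson_cc([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
print(rmse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0]))
```

The CC measures how well the predicted profile tracks the shape of the observed one, while the RMSE measures the absolute size of the errors, which is why the abstract reports both.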
The red-sequence of 72 WINGS local galaxy clusters
NASA Astrophysics Data System (ADS)
Valentinuzzi, T.; Poggianti, B. M.; Fasano, G.; D'Onofrio, M.; Moretti, A.; Ramella, M.; Biviano, A.; Fritz, J.; Varela, J.; Bettoni, D.; Vulcani, B.; Moles, M.; Couch, W. J.; Dressler, A.; Kjærgaard, P.; Omizzolo, A.; Cava, A.
2011-12-01
We study the color - magnitude red sequence and blue fraction of 72 X-ray selected galaxy clusters at z = 0.04-0.07 from the WINGS survey, searching for correlations between the characteristics of the red sequence (RS) and the environment. We consider the slope and scatter of the red sequence, the number ratio of red luminous-to-faint galaxies, the blue fraction, and the fractions of ellipticals, S0s, and spirals that compose the RS. None of these quantities correlate with the cluster velocity dispersion, X-ray luminosity, number of cluster substructures, BCG prevalence over next brightest galaxies, and the spatial concentration of ellipticals. The properties of the RS, instead, depend strongly on local galaxy density. Higher density regions have a smaller RS scatter, a higher luminous-to-faint ratio, a lower blue fraction, and a lower spiral fraction on the RS. Our results clearly illustrate the prominent effect of the local density in setting the epoch when galaxies become passive and join the red sequence, as opposed to the mass of the galaxy host structure.
Two distinct DNA sequences recognized by transcription factors represent enthalpy and entropy optima
Yin, Yimeng; Das, Pratyush K; Jolma, Arttu; Zhu, Fangjie; Popov, Alexander; Xu, You; Nilsson, Lennart
2018-01-01
Most transcription factors (TFs) can bind to a population of sequences closely related to a single optimal site. However, some TFs can bind to two distinct sequences that represent two local optima in the Gibbs free energy of binding (ΔG). To determine the molecular mechanism behind this effect, we solved the structures of human HOXB13 and CDX2 bound to their two optimal DNA sequences, CAATAAA and TCGTAAA. Thermodynamic analyses by isothermal titration calorimetry revealed that both sites were bound with similar ΔG. However, the interaction with the CAA sequence was driven by change in enthalpy (ΔH), whereas the TCG site was bound with similar affinity due to smaller loss of entropy (ΔS). This thermodynamic mechanism that leads to at least two local optima likely affects many macromolecular interactions, as ΔG depends on two partially independent variables ΔH and ΔS according to the central equation of thermodynamics, ΔG = ΔH - TΔS. PMID:29638214
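A short worked example shows how two sites can reach the same ΔG through different combinations of ΔH and ΔS. The numbers are hypothetical values chosen to make the arithmetic easy (in kJ/mol, kJ/(mol·K), and K); they are not the calorimetry results from the paper.

```python
def gibbs_free_energy(delta_h, delta_s, temperature):
    """Delta G = Delta H - T * Delta S, with consistent units."""
    return delta_h - temperature * delta_s


T = 300.0  # K, assumed temperature

# Hypothetical site A: strongly favorable enthalpy, large entropy loss.
dg_enthalpy_driven = gibbs_free_energy(-60.0, -0.10, T)

# Hypothetical site B: weaker enthalpy, but a smaller entropy loss.
dg_entropy_favoured = gibbs_free_energy(-45.0, -0.05, T)

print(dg_enthalpy_driven, dg_entropy_favoured)
```

Both calls give approximately -30 kJ/mol: the 15 kJ/mol enthalpy advantage of the first site is offset by its 15 kJ/mol larger entropic penalty at 300 K.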
Evidence for the Concerted Evolution between Short Linear Protein Motifs and Their Flanking Regions
Chica, Claudia; Diella, Francesca; Gibson, Toby J.
2009-01-01
Background Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids. Results The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions. Conclusion The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise. 
PMID:19584925
Zhang, Yingzi; Hou, Yulong; Zhang, Yanjun; Hu, Yanjun; Zhang, Liang; Gao, Xiaolong; Zhang, Huixin; Liu, Wenyi
2018-04-16
A quasi-distributed liquid leakage (QDLL) sensor for local-area monitoring is proposed and experimentally demonstrated, providing a real-time and lower-cost alternative to existing local QDLL sensors. The sensor mainly consists of a flexible lamp belt (FLB) with light-emitting diodes (LEDs) and a polymer optical fiber (POF) processed with side-coupling structures. The side-coupling structures are illuminated by the LEDs one by one, forming a series of sensing probes. The light is side-coupled into the POF through the side-coupling structures, and pulse sequences are obtained from power meters connected to both ends of the POF. Each pulse represents a sensing probe, and its intensity increases when the coupling medium changes from air to liquid. The location of a leakage incident can be obtained from the position of the corresponding pulse in the output sequence. The influence of different side-coupling structures on the side-coupling ratio is investigated. The experimental results validate the detection and localization abilities of the QDLL sensor along a 1 m-long POF with a spatial resolution of 0.1 m, which can be improved by adjusting the side-coupling structure. Furthermore, the temperature dependence is studied and can be compensated.
Bandyopadhyay, Boudhayan; Goldenzweig, Adi; Unger, Tamar; Adato, Orit; Fleishman, Sarel J; Unger, Ron; Horovitz, Amnon
2017-12-15
The GroE chaperonin system in Escherichia coli comprises GroEL and GroES and facilitates ATP-dependent protein folding in vivo and in vitro. Proteins with very similar sequences and structures can differ in their dependence on GroEL for efficient folding. One potential but unverified source for GroEL dependence is frustration, wherein not all interactions in the native state are optimized energetically, thereby potentiating slow folding and misfolding. Here, we chose enhanced green fluorescent protein as a model system and subjected it to random mutagenesis, followed by screening for variants whose in vivo folding displays increased or decreased GroEL dependence. We confirmed the altered GroEL dependence of these variants with in vitro folding assays. Strikingly, mutations at positions predicted to be highly frustrated were found to correlate with decreased GroEL dependence. Conversely, mutations at positions with low frustration were found to correlate with increased GroEL dependence. Further support for this finding was obtained by showing that folding of an enhanced green fluorescent protein variant designed computationally to have reduced frustration is indeed less GroEL-dependent. Our results indicate that changes in local frustration also affect partitioning in vivo between spontaneous and chaperonin-mediated folding. Hence, the design of minimally frustrated sequences can reduce chaperonin dependence and improve protein expression levels. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Reinharz, Vladimir; Ponty, Yann; Waldispühl, Jérôme
2013-07-01
The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and, more importantly, does not allow the user to control explicitly the nucleotide distribution, such as the GC-content, in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable to or better than that of local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology outperforms both local and global approaches for sampling sequences with a specific GC-content and target structure. IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/. Supplementary data are available at Bioinformatics online.
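The quantity this abstract proposes to control, the GC-content of a designed sequence, is simple to compute. The sketch below only measures GC-content and checks it against a target; it does not reproduce IncaRNAtion's weighted sampling, and the names `gc_content`, `within_gc_target` and the default tolerance are hypothetical.

```python
def gc_content(sequence):
    """Fraction of G and C nucleotides in an RNA or DNA sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(base in "GC" for base in seq) / len(seq)


def within_gc_target(sequence, target, tolerance=0.05):
    """True if the GC-content lies within `tolerance` of `target`.

    The tolerance default is an arbitrary illustrative choice.
    """
    return abs(gc_content(sequence) - target) <= tolerance
```

A check of this kind could be used to filter or score candidate sequences produced by any design tool.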
Digital Sequences and a Time Reversal-Based Impact Region Imaging and Localization Method
Qiu, Lei; Yuan, Shenfang; Mei, Hanfei; Qian, Weifeng
2013-01-01
To reduce time and cost of damage inspection, on-line impact monitoring of aircraft composite structures is needed. A digital monitor based on an array of piezoelectric transducers (PZTs) is developed to record the impact region of impacts on-line. It is small in size, lightweight and has low power consumption, but there are two problems with the impact alarm region localization method of the digital monitor at the current stage. The first one is that the accuracy rate of the impact alarm region localization is low, especially on complex composite structures. The second problem is that the area of impact alarm region is large when a large scale structure is monitored and the number of PZTs is limited which increases the time and cost of damage inspections. To solve the two problems, an impact alarm region imaging and localization method based on digital sequences and time reversal is proposed. In this method, the frequency band of impact response signals is estimated based on the digital sequences first. Then, characteristic signals of impact response signals are constructed by sinusoidal modulation signals. Finally, the phase synthesis time reversal impact imaging method is adopted to obtain the impact region image. Depending on the image, an error ellipse is generated to give out the final impact alarm region. A validation experiment is implemented on a complex composite wing box of a real aircraft. The validation results show that the accuracy rate of impact alarm region localization is approximately 100%. The area of impact alarm region can be reduced and the number of PZTs needed to cover the same impact monitoring region is reduced by more than a half. PMID:24084123
Petkevičiūtė, D; Pasi, M; Gonzalez, O; Maddocks, J H
2014-11-10
cgDNA is a package for the prediction of sequence-dependent configuration-space free energies for B-form DNA at the coarse-grain level of rigid bases. For a fragment of any given length and sequence, cgDNA calculates the configuration of the associated free energy minimizer, i.e. the relative positions and orientations of each base, along with a stiffness matrix, which together govern differences in free energies. The model predicts non-local (i.e. beyond base-pair step) sequence dependence of the free energy minimizer. Configurations can be input or output in either the Curves+ definition of the usual helical DNA structural variables, or as a PDB file of coordinates of base atoms. We illustrate the cgDNA package by comparing predictions of free energy minimizers from (a) the cgDNA model, (b) time-averaged atomistic molecular dynamics (or MD) simulations, and (c) NMR or X-ray experimental observation, for (i) the Dickerson-Drew dodecamer and (ii) three oligomers containing A-tracts. The cgDNA predictions are rather close to those of the MD simulations, but many orders of magnitude faster to compute. Both the cgDNA and MD predictions are in reasonable agreement with the available experimental data. Our conclusion is that cgDNA can serve as a highly efficient tool for studying structural variations in B-form DNA over a wide range of sequences. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study
Chikenji, George; Fujitsuka, Yoshimi; Takada, Shoji
2006-01-01
Predicting protein tertiary structure by folding-like simulations is one of the most stringent tests of how much we understand the principle of protein folding. Currently, the most successful method for folding-based structure prediction is the fragment assembly (FA) method. Here, we address why the FA method is so successful and its lesson for the folding problem. To do so, using the FA method, we designed a structure prediction test of “chimera proteins.” In the chimera proteins, local structural preference is specific to the target sequences, whereas nonlocal interactions are only sequence-independent compaction forces. We find that these chimera proteins can find the native folds of the intact sequences with high probability indicating dominant roles of the local interactions. We further explore roles of local structural preference by exact calculation of the HP lattice model of proteins. From these results, we suggest principles of protein folding: For small proteins, compact structures that are fully compatible with local structural preference are few, one of which is the native fold. These local biases shape up the funnel-like energy landscape. PMID:16488978
Simulating protein folding initiation sites using an alpha-carbon-only knowledge-based force field
Buck, Patrick M.; Bystroff, Christopher
2015-01-01
Protein folding is a hierarchical process where structure forms locally first, then globally. Some short sequence segments initiate folding through strong structural preferences that are independent of their three-dimensional context in proteins. We have constructed a knowledge-based force field in which the energy functions are conditional on local sequence patterns, as expressed in the hidden Markov model for local structure (HMMSTR). Carbon-alpha force field (CALF) builds sequence specific statistical potentials based on database frequencies for α-carbon virtual bond opening and dihedral angles, pairwise contacts and hydrogen bond donor-acceptor pairs, and simulates folding via Brownian dynamics. We introduce hydrogen bond donor and acceptor potentials as α-carbon probability fields that are conditional on the predicted local sequence. Constant temperature simulations were carried out using 27 peptides selected as putative folding initiation sites, each 12 residues in length, representing several different local structure motifs. Each 0.6 μs trajectory was clustered based on structure. Simulation convergence or representativeness was assessed by subdividing trajectories and comparing clusters. For 21 of the 27 sequences, the largest cluster made up more than half of the total trajectory. Of these 21 sequences, 14 had cluster centers that were at most 2.6 Å root mean square deviation (RMSD) from their native structure in the corresponding full-length protein. To assess the adequacy of the energy function on nonlocal interactions, 11 full length native structures were relaxed using Brownian dynamics simulations. Equilibrated structures deviated from their native states but retained their overall topology and compactness. A simple potential that folds proteins locally and stabilizes proteins globally may enable a more realistic understanding of hierarchical folding pathways. PMID:19137613
High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.
Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie
2015-06-17
High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal to noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements display unique size signatures. We show how cell-free fragments occur in clusters along the genome, localizing to nucleosomal arrays and are preferentially cleaved at linker regions by correlating the mapping locations of these fragments with ENCODE annotation of chromatin organization. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site show a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. 
This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
2015-01-01
DNA oxidation by reactive oxygen species is nonrandom, potentially leading to accumulation of nucleobase damage and mutations at specific sites within the genome. We now present the first quantitative data for sequence-dependent formation of structurally defined oxidative nucleobase adducts along p53 gene-derived DNA duplexes using a novel isotope labeling-based approach. Our results reveal that local nucleobase sequence context differentially alters the yields of 2,2,4-triamino-2H-oxal-5-one (Z) and 8-oxo-7,8-dihydro-2′-deoxyguanosine (OG) in double stranded DNA. While both lesions are overproduced within endogenously methylated MeCG dinucleotides and at 5′ Gs in runs of several guanines, the formation of Z (but not OG) is strongly preferred at solvent-exposed guanine nucleobases at duplex ends. Targeted oxidation of MeCG sequences may be caused by a lowered ionization potential of guanine bases paired with MeC and the preferential intercalation of riboflavin photosensitizer adjacent to MeC:G base pairs. Importantly, some of the most frequently oxidized positions coincide with the known p53 lung cancer mutational “hotspots” at codons 245 (GGC), 248 (CGG), and 158 (CGC) respectively, supporting a possible role of oxidative degradation of DNA in the initiation of lung cancer. PMID:24571128
Perczel, András; Jákli, Imre; McAllister, Michael A; Csizmadia, Imre G
2003-06-06
Folding properties of small globular proteins are determined by their amino acid sequence (primary structure). This holds both for local (secondary structure) and for global conformational features of linear polypeptides and proteins composed from natural amino acid derivatives. It thus provides the rational basis of structure prediction algorithms. The shortest secondary structure element, the beta-turn, most typically adopts either a type I or a type II form, depending on the amino acid composition. Herein we investigate the sequence-dependent folding stability of both major types of beta-turns using simple dipeptide models (-Xxx-Yyy-). Gas-phase ab initio properties of 16 carefully selected and suitably protected dipeptide models (for example Val-Ser, Ala-Gly, Ser-Ser) were studied. For each backbone fold most probable side-chain conformers were considered. Fully optimized RHF/3-21G molecular structures were employed in medium level [B3LYP/6-311++G(d,p)//RHF/3-21G] energy calculations to estimate relative populations of the different backbone conformers. Our results show that the preference for beta-turn forms as calculated by quantum mechanics and observed in X-ray determined proteins correlates significantly.
Kawagoshi, Taiki; Nishida, Chizuko; Ota, Hidetoshi; Kumazawa, Yoshinori; Endo, Hideki; Matsuda, Yoichi
2008-01-01
Crocodilians have several unique karyotypic features, such as small diploid chromosome numbers (30-42) and the absence of dot-shaped microchromosomes. Of the extant crocodilian species, the Siamese crocodile (Crocodylus siamensis) has no more than 2n = 30, comprising mostly bi-armed chromosomes with large centromeric heterochromatin blocks. To investigate the molecular structures of C-heterochromatin and genomic compartmentalization in the karyotype, characterized by the disappearance of tiny microchromosomes and reduced chromosome number, we performed molecular cloning of centromeric repetitive sequences and chromosome mapping of the 18S-28S rDNA and telomeric (TTAGGG)n sequences. The centromeric heterochromatin was composed mainly of two repetitive sequence families whose characteristics were quite different. Two types of GC-rich CSI-HindIII family sequences, the 305 bp CSI-HindIII-S (G+C content, 61.3%) and 424 bp CSI-HindIII-M (63.1%), were localized to the intensely PI-stained centric regions of all chromosomes, except for chromosome 2 with PI-negative heterochromatin. The 94 bp CSI-DraI (G+C content, 48.9%) was tandem-arrayed satellite DNA and localized to chromosome 2 and four pairs of small-sized chromosomes. The chromosomal size-dependent genomic compartmentalization that is supposedly unique to the Archosauromorpha was probably lost in the crocodilian lineage with the disappearance of microchromosomes followed by the homogenization of centromeric repetitive sequences between chromosomes, except for chromosome 2.
2014-01-01
Background Protein sites evolve at different rates due to functional and biophysical constraints. It is usually considered that the main structural determinant of a site’s rate of evolution is its Relative Solvent Accessibility (RSA). However, a recent comparative study has shown that the main structural determinant is the site’s Local Packing Density (LPD). LPD is related with dynamical flexibility, which has also been shown to correlate with sequence variability. Our purpose is to investigate the mechanism that connects a site’s LPD with its rate of evolution. Results We consider two models: an empirical Flexibility Model and a mechanistic Stress Model. The Flexibility Model postulates a linear increase of site-specific rate of evolution with dynamical flexibility. The Stress Model, introduced here, models mutations as random perturbations of the protein’s potential energy landscape, for which we use simple Elastic Network Models (ENMs). To account for natural selection we assume a single active conformation and use basic statistical physics to derive a linear relationship between site-specific evolutionary rates and the local stress of the mutant’s active conformation. We compare both models on a large and diverse dataset of enzymes. In a protein-by-protein study we found that the Stress Model outperforms the Flexibility Model for most proteins. Pooling all proteins together we show that the Stress Model is strongly supported by the total weight of evidence. Moreover, it accounts for the observed nonlinear dependence of sequence variability on flexibility. Finally, when mutational stress is controlled for, there is very little remaining correlation between sequence variability and dynamical flexibility. Conclusions We developed a mechanistic Stress Model of evolution according to which the rate of evolution of a site is predicted to depend linearly on the local mutational stress of the active conformation. 
Such local stress is proportional to LPD, so that this model explains the relationship between LPD and evolutionary rate. Moreover, the model also accounts for the nonlinear dependence between evolutionary rate and dynamical flexibility. PMID:24716445
Electronic fingerprints of DNA bases on graphene.
Ahmed, Towfiq; Kilina, Svetlana; Das, Tanmoy; Haraldsen, Jason T; Rehr, John J; Balatsky, Alexander V
2012-02-08
We calculate the electronic local density of states (LDOS) of DNA nucleotide bases (A,C,G,T), deposited on graphene. We observe significant base-dependent features in the LDOS in an energy range within a few electronvolts of the Fermi level. These features can serve as electronic fingerprints for the identification of individual bases in scanning tunneling spectroscopy (STS) experiments that perform image and site dependent spectroscopy on biomolecules. Thus the fingerprints of DNA-graphene hybrid structures may provide an alternative route to DNA sequencing using STS. © 2012 American Chemical Society
Widespread signatures of local mRNA folding structure selection in four Dengue virus serotypes
2015-01-01
Background It is known that mRNA folding can affect and regulate various gene expression steps both in living organisms and in viruses. Previous studies have recognized functional RNA structures in the genome of the Dengue virus. However, these studies usually focused either on the viral untranslated regions or on very specific and limited regions at the beginning of the coding sequences, in a limited number of strains, and without considering evolutionary selection. Results Here we performed the first large scale comprehensive genomics analysis of selection for local mRNA folding strength in the Dengue virus coding sequences, based on a total of 1,670 genomes and 4 serotypes. Our analysis identified clusters of positions along the coding regions that may undergo a conserved evolutionary selection for strong or weak local folding maintained across different viral variants. Specifically, we recognized 53-66 clusters for strong folding and 49-73 clusters for weak folding (depending on serotype), each an aggregate of positions with significantly conserved folding energy signals (related to partially overlapping local genomic regions). In addition, up to 7% of these positions were found to be conserved in more than 90% of the viral genomes. Although some of the identified positions undergo frequent synonymous / non-synonymous substitutions, the selection for folding strength therein is preserved, and thus cannot be trivially explained based on sequence conservation alone. Conclusions The fact that many of the positions with significant folding related signals are conserved among different Dengue variants suggests that a better understanding of the mRNA structures in the corresponding regions may promote the development of prospective anti-Dengue vaccination strategies. The comparative genomics approach described here can be employed in the future for detecting functional regions in other pathogens with very high mutation rates. PMID:26449467
Photophysical Characterization of Enhanced 6-Methylisoxanthopterin Fluorescence in Duplex DNA.
Moreno, Andrew; Knee, J L; Mukerji, Ishita
2016-12-08
The structure and dynamic motions of bases in DNA duplexes and other constructs are important for understanding mechanisms of selectivity and recognition of DNA-binding proteins. The fluorescent guanine analogue, 6-methylisoxanthopterin (6-MI), is well suited to this purpose as it exhibits an unexpected 3- to 4-fold increase in relative quantum yield upon duplex formation when incorporated into the following sequences: ATFAA, AAFTA, or ATFTA (where F represents 6-MI). To better understand some of the factors leading to the 6-MI fluorescence increase upon duplex formation, we characterized the effect of local sequence and structural perturbations on 6-MI photophysics through temperature melts, quantum yield measurements, fluorescence quenching assays, and fluorescence lifetime measurements. By examining 21 sequences we have determined that the duplex-enhanced fluorescence (DEF) depends on the composition of bases adjacent to 6-MI and the presence of adenines at locations n ± 2 from the probe. Investigation of duplex stability and local solvent accessibility measurements support a model in which the DEF arises from a constrained geometry of 6-MI in the duplex, which remains H-bonded to cytosine, stacked with adjacent bases and inaccessible to quenchers. Perturbation of DNA structure through the introduction of an unpaired base 3' to 6-MI or a mismatched basepair increases 6-MI dynamic motion, leading to fluorescence quenching and a reduction in quantum yield. Molecular dynamics simulations suggest the enhanced fluorescence results from a greater degree of twist at the X-F step relative to the quenched duplexes examined. These results point to a model where adenine residues located at n ± 2 from 6-MI induce a structural geometry with greater twist in the duplex that hinders local motion, reducing dynamic quenching and producing an increase in 6-MI fluorescence.
Polarization-dependent DANES study on vertically-aligned ZnO nanorods
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sun, Chengjun; Park, Chang-In; Jin, Zhenlan
2016-05-01
The local structure and local density of states of vertically-aligned ZnO nanorods were examined using polarization-dependent diffraction anomalous near edge structure (DANES) measurements on c-oriented ZnO nanorods at the Zn K edge, with the incident x-ray electric field parallel and perpendicular to the x-ray momentum transfer direction. Orientation-dependent local structures determined by DANES were comparable with polarization-dependent EXAFS results. Unlike other techniques, polarization-dependent DANES can uniquely describe the orientation-dependent local structural properties and the local density of states of a selected element in selected-phased crystals of compounds or mixed-phased structures.
Effect of sequence-dependent rigidity on plectoneme localization in dsDNA
NASA Astrophysics Data System (ADS)
Medalion, Shlomi; Rabin, Yitzhak
2016-04-01
We use Monte-Carlo simulations to study the effect of variable rigidity on plectoneme formation and localization in supercoiled double-stranded DNA. We show that the presence of soft sequences increases the number of plectoneme branches and that the edges of the branches tend to be localized at these sequences. We propose an experimental approach to test our results in vitro, and discuss the possible role played by plectoneme localization in the search process of transcription factors for their targets (promoter regions) on the bacterial genome.
Local backbone structure prediction of proteins
De Brevern, Alexandre G.; Benros, Cristina; Gautier, Romain; Valadié, Hélène; Hazout, Serge; Etchebest, Catherine
2004-01-01
Summary A statistical analysis of the PDB structures has led us to define a new set of small 3D structural prototypes called Protein Blocks (PBs). This structural alphabet includes 16 PBs, each of which is defined by the (φ, Ψ) dihedral angles of 5 consecutive residues. The amino acid distributions observed in sequence windows encompassing these PBs are used to predict, by a Bayesian approach, the local 3D structure of proteins from their sequences alone. LocPred is a software tool that allows users to submit a protein sequence and performs a prediction in terms of PBs. The prediction results are given both textually and graphically. PMID:15724288
Yang, A S; Hitz, B; Honig, B
1996-06-21
The stability of beta-turns is calculated as a function of sequence and turn type with a Monte Carlo sampling technique. The conformational energy of four internal hydrogen-bonded turn types, I, I', II and II', is obtained by evaluating their gas phase energy with the CHARMM force field and accounting for solvation effects with the Finite Difference Poisson-Boltzmann (FDPB) method. All four turn types are found to be less stable than the coil state, independent of the sequence in the turn. The free-energy penalties associated with turn formation vary between 1.6 kcal/mol and 7.7 kcal/mol, depending on the sequence and turn type. Differences in turn stability arise mainly from intraresidue interactions within the two central residues of the turn. For each combination of the two central residues, except for -Gly-Gly-, the most stable beta-turn type is always found to occur most commonly in native proteins. The fact that a model based on local interactions accounts for the observed preference of specific sequences suggests that long-range tertiary interactions tend to play a secondary role in determining turn conformation. In contrast, for beta-hairpins, long-range interactions appear to dominate. Specifically, due to the right-handed twist of beta-strands, type I' turns for -Gly-Gly- are found to occur with high frequency, even when local energetics would dictate otherwise. The fact that any combination of two residues is found able to adopt a relatively low-energy turn structure explains why the amino acid sequence in turns is highly variable. The calculated free-energy cost of turn formation, when combined with related numbers obtained for alpha-helices and beta-sheets, suggests a model for the initiation of protein folding based on metastable fragments of secondary structure.
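To give a feel for the free-energy penalties quoted in this abstract, the sketch below converts a penalty into an equilibrium turn population. It assumes a simple two-state turn/coil equilibrium at 298.15 K, which is an illustrative assumption and not the Monte Carlo/FDPB procedure the authors used; the function name `turn_population` is hypothetical.

```python
import math

GAS_CONSTANT_KCAL = 1.987204e-3  # kcal / (mol K)


def turn_population(delta_g_kcal, temperature_k=298.15):
    """Equilibrium fraction in the turn state for a two-state turn/coil model.

    delta_g_kcal: free energy of the turn relative to the coil, in kcal/mol
    (positive values mean the turn is less stable than the coil).
    """
    rt = GAS_CONSTANT_KCAL * temperature_k
    weight = math.exp(-delta_g_kcal / rt)  # Boltzmann weight of turn vs coil
    return weight / (1.0 + weight)
```

Under these assumptions, a penalty of 1.6 kcal/mol corresponds to a turn population of roughly 6%, while 7.7 kcal/mol corresponds to only a few parts per million, which is consistent with the abstract's statement that all four turn types are less stable than the coil state.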
Variations in Nuclear Localization Strategies Among Pol X Family Enzymes.
Kirby, Thomas W; Pedersen, Lars C; Gabel, Scott A; Gassman, Natalie R; London, Robert E
2018-06-22
Despite the essential roles of pol X family enzymes in DNA repair, information about the structural basis of their nuclear import is limited. Recent studies revealed the unexpected presence of a functional NLS in DNA polymerase β, indicating the importance of active nuclear targeting, even for enzymes likely to leak into and out of the nucleus. The current studies further explore the active nuclear transport of these enzymes by identifying and structurally characterizing the functional NLS sequences in the three remaining human pol X enzymes: terminal deoxynucleotidyl transferase (TdT), DNA polymerase μ (pol μ), and DNA polymerase λ (pol λ). NLS identifications are based on Importin α (Impα) binding affinity determined by fluorescence polarization of fluorescein-labeled NLS peptides, X-ray crystallographic analysis of the Impα∆IBB•NLS complexes, and fluorescence-based subcellular localization studies. All three polymerases use NLS sequences located near their N-terminus; TdT and pol μ utilize monopartite NLS sequences, while pol λ utilizes a bipartite sequence, unique among the pol X family members. The pol μ NLS has relatively weak measured affinity for Impα, due in part to its proximity to the N-terminus that limits non-specific interactions of flanking residues preceding the NLS. However, this effect is partially mitigated by an N-terminal sequence unsupportive of Met1 removal by methionine aminopeptidase, leading to a 3-fold increase in affinity when the N-terminal methionine is present. Nuclear targeting is unique to each pol X family enzyme with variations dependent on the structure and unique functional role of each polymerase. This article is protected by copyright. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...
2016-03-09
The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequences in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.
Acylation-dependent protein export in Leishmania.
Denny, P W; Gokool, S; Russell, D G; Field, M C; Smith, D F
2000-04-14
The surface of the protozoan parasite Leishmania is unusual in that it consists predominantly of glycosylphosphatidylinositol-anchored glycoconjugates and proteins. Additionally, a family of hydrophilic acylated surface proteins (HASPs) has been localized to the extracellular face of the plasma membrane in infective parasite stages. These surface polypeptides lack a recognizable endoplasmic reticulum secretory signal sequence, transmembrane spanning domain, or glycosylphosphatidylinositol-anchor consensus sequence, indicating that novel mechanisms are involved in their transport and localization. Here, we show that the N-terminal domain of HASPB contains primary structural information that directs both N-myristoylation and palmitoylation and is essential for correct localization of the protein to the plasma membrane. Furthermore, the N-terminal 18 amino acids of HASPB, encoding the dual acylation site, are sufficient to target the heterologous Aequorea victoria green fluorescent protein to the cell surface of Leishmania. Mutagenesis of the predicted acylated residues confirms that modification by both myristate and palmitate is required for correct trafficking. These data suggest that HASPB is a representative of a novel class of proteins whose translocation onto the surface of eukaryotic cells is dependent upon a "non-classical" pathway involving N-myristoylation/palmitoylation. Significantly, HASPB is also translocated on to the extracellular face of the plasma membrane of transfected mammalian cells, indicating that the export signal for HASPB is recognized by a higher eukaryotic export mechanism.
Kwasigroch, Jean Marc; Rooman, Marianne
2006-07-15
Prelude&Fugue are bioinformatics tools aiming at predicting the local 3D structure of a protein from its amino acid sequence in terms of seven backbone torsion angle domains, using database-derived potentials. Prelude(&Fugue) computes all lowest free energy conformations of a protein or protein region, ranked by increasing energy, and possibly satisfying some interresidue distance constraints specified by the user. (Prelude&)Fugue detects sequence regions whose predicted structure is significantly preferred relative to other conformations in the absence of tertiary interactions. These programs can be used for predicting secondary structure, tertiary structure of short peptides, flickering early folding sequences and peptides that adopt a preferred conformation in solution. They can also be used for detecting structural weaknesses, i.e. sequence regions that are not optimal with respect to the tertiary fold. http://babylone.ulb.ac.be/Prelude_and_Fugue.
Nagamitsu, Teruyoshi; Yasuda, Mika; Saito-Morooka, Fuki; Inoue, Maki N.; Nishiyama, Mio; Goka, Koichi; Sugiura, Shinji; Maeto, Kaoru; Okabe, Kimiko; Taki, Hisatomo
2016-01-01
Declines in honeybee populations have been a recent concern. Although causes of the declines remain unclear, environmental factors may be responsible. We focused on the potential environmental determinants of local populations of wild honeybees, Apis cerana japonica, in Japan. This subspecies has little genetic variation in terms of its mitochondrial DNA sequences, and genetic variations at nuclear loci are as yet unknown. We estimated the genetic structure and environmental determinants of local genetic diversity in nuclear microsatellite genotypes of fathers and mothers, inferred from workers collected at 139 sites. The genotypes of fathers and mothers showed weak isolation by distance and negligible genetic structure. The local genetic diversity was high in central Japan, decreasing toward the peripheries, and depended on the climate and land use characteristics of the sites. The local genetic diversity decreased as the annual precipitation increased, and increased as the proportion of urban and paddy field areas increased. Positive effects of natural forest area, which have also been observed in terms of forager abundance in farms, were not detected with respect to the local genetic diversity. The findings suggest that A. cerana japonica forms a single population connected by gene flow in its main distributional range, and that climate and landscape properties potentially affect its local genetic diversity. PMID:27898704
(Pea)nuts and bolts of visual narrative: Structure and meaning in sequential image comprehension
Cohn, Neil; Paczynski, Martin; Jackendoff, Ray; Holcomb, Phillip J.; Kuperberg, Gina R.
2012-01-01
Just as syntax differentiates coherent sentences from scrambled word strings, the comprehension of sequential images must also use a cognitive system to distinguish coherent narrative sequences from random strings of images. We conducted experiments analogous to two classic studies of language processing to examine the contributions of narrative structure and semantic relatedness to processing sequential images. We compared four types of comic strips: 1) Normal sequences with both structure and meaning, 2) Semantic Only sequences (in which the panels were related to a common semantic theme, but had no narrative structure), 3) Structural Only sequences (narrative structure but no semantic relatedness), and 4) Scrambled sequences of randomly-ordered panels. In Experiment 1, participants monitored for target panels in sequences presented panel-by-panel. Reaction times were slowest to panels in Scrambled sequences, intermediate in both Structural Only and Semantic Only sequences, and fastest in Normal sequences. This suggests that both semantic relatedness and narrative structure offer advantages to processing. Experiment 2 measured ERPs to all panels across the whole sequence. The N300/N400 was largest to panels in both the Scrambled and Structural Only sequences, intermediate in Semantic Only sequences and smallest in the Normal sequences. This implies that a combination of narrative structure and semantic relatedness can facilitate semantic processing of upcoming panels (as reflected by the N300/N400). Also, panels in the Scrambled sequences evoked a larger left-lateralized anterior negativity than panels in the Structural Only sequences. This localized effect was distinct from the N300/N400, and appeared despite the fact that these two sequence types were matched on local semantic relatedness between individual panels. These findings suggest that sequential image comprehension uses a narrative structure that may be independent of semantic relatedness. 
Altogether, we argue that the comprehension of visual narrative is guided by an interaction between structure and meaning. PMID:22387723
A sequence-dependent rigid-base model of DNA
NASA Astrophysics Data System (ADS)
Gonzalez, O.; Petkevičiutė, D.; Maddocks, J. H.
2013-02-01
A novel hierarchy of coarse-grain, sequence-dependent, rigid-base models of B-form DNA in solution is introduced. The hierarchy depends on both the assumed range of energetic couplings, and the extent of sequence dependence of the model parameters. A significant feature of the models is that they exhibit the phenomenon of frustration: each base cannot simultaneously minimize the energy of all of its interactions. As a consequence, an arbitrary DNA oligomer has an intrinsic or pre-existing stress, with the level of this frustration dependent on the particular sequence of the oligomer. Attention is focussed on the particular model in the hierarchy that has nearest-neighbor interactions and dimer sequence dependence of the model parameters. For a Gaussian version of this model, a complete coarse-grain parameter set is estimated. The parameterized model allows, for an oligomer of arbitrary length and sequence, a simple and explicit construction of an approximation to the configuration-space equilibrium probability density function for the oligomer in solution. The training set leading to the coarse-grain parameter set is itself extracted from a recent and extensive database of a large number of independent, atomic-resolution molecular dynamics (MD) simulations of short DNA oligomers immersed in explicit solvent. The Kullback-Leibler divergence between probability density functions is used to make several quantitative assessments of our nearest-neighbor, dimer-dependent model, which is compared against others in the hierarchy to assess various assumptions pertaining both to the locality of the energetic couplings and to the level of sequence dependence of its parameters. It is also compared directly against all-atom MD simulation to assess its predictive capabilities. The results show that the nearest-neighbor, dimer-dependent model can successfully resolve sequence effects both within and between oligomers. 
For example, due to the presence of frustration, the model can successfully predict the nonlocal changes in the minimum energy configuration of an oligomer that are consequent upon a local change of sequence at the level of a single point mutation.
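For reference, when both probability densities are Gaussian, the Kullback-Leibler divergence mentioned in the abstract has a closed form. The expression below is the standard textbook result, written with illustrative symbols that do not appear in the abstract (means \mu_0, \mu_1; covariance matrices \Sigma_0, \Sigma_1; dimension k of the configuration space). Whether the authors evaluate the divergence in exactly this form is an assumption.

```latex
D_{\mathrm{KL}}\bigl(\mathcal{N}(\mu_0,\Sigma_0)\,\|\,\mathcal{N}(\mu_1,\Sigma_1)\bigr)
  = \tfrac{1}{2}\Bigl[
      \operatorname{tr}\bigl(\Sigma_1^{-1}\Sigma_0\bigr)
      + (\mu_1-\mu_0)^{\mathsf T}\,\Sigma_1^{-1}\,(\mu_1-\mu_0)
      - k
      + \ln\frac{\det\Sigma_1}{\det\Sigma_0}
    \Bigr]
```

The divergence is zero only when the two Gaussians coincide, and it is not symmetric in its two arguments, so the order in which a coarse-grain model and a reference density are compared matters.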
A sequence-dependent rigid-base model of DNA.
Gonzalez, O; Petkevičiūtė, D; Maddocks, J H
2013-02-07
A novel hierarchy of coarse-grain, sequence-dependent, rigid-base models of B-form DNA in solution is introduced. The hierarchy depends on both the assumed range of energetic couplings, and the extent of sequence dependence of the model parameters. A significant feature of the models is that they exhibit the phenomenon of frustration: each base cannot simultaneously minimize the energy of all of its interactions. As a consequence, an arbitrary DNA oligomer has an intrinsic or pre-existing stress, with the level of this frustration dependent on the particular sequence of the oligomer. Attention is focussed on the particular model in the hierarchy that has nearest-neighbor interactions and dimer sequence dependence of the model parameters. For a Gaussian version of this model, a complete coarse-grain parameter set is estimated. The parameterized model allows, for an oligomer of arbitrary length and sequence, a simple and explicit construction of an approximation to the configuration-space equilibrium probability density function for the oligomer in solution. The training set leading to the coarse-grain parameter set is itself extracted from a recent and extensive database of a large number of independent, atomic-resolution molecular dynamics (MD) simulations of short DNA oligomers immersed in explicit solvent. The Kullback-Leibler divergence between probability density functions is used to make several quantitative assessments of our nearest-neighbor, dimer-dependent model, which is compared against others in the hierarchy to assess various assumptions pertaining both to the locality of the energetic couplings and to the level of sequence dependence of its parameters. It is also compared directly against all-atom MD simulation to assess its predictive capabilities. The results show that the nearest-neighbor, dimer-dependent model can successfully resolve sequence effects both within and between oligomers. 
For example, due to the presence of frustration, the model can successfully predict the nonlocal changes in the minimum energy configuration of an oligomer that are consequent upon a local change of sequence at the level of a single point mutation.
Knowledge-based prediction of protein backbone conformation using a structural alphabet.
Vetrivel, Iyanar; Mahajan, Swapnil; Tyagi, Manoj; Hoffmann, Lionel; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; de Brevern, Alexandre G; Cadet, Frédéric; Offmann, Bernard
2017-01-01
Libraries of structural prototypes that abstract protein local structures are known as structural alphabets and have proven to be very useful in various aspects of protein structure analyses and predictions. One such library, Protein Blocks, is composed of 16 standard 5-residue-long structural prototypes. This form of analysis involves representing a protein's structure as a string of Protein Blocks. Predicting the local structure of a protein in terms of protein blocks is the general objective of this work. A new approach, PB-kPRED, is proposed towards this aim. It involves (i) organizing the structural knowledge in the form of a database of pentapeptide fragments extracted from all protein structures in the PDB and (ii) applying a knowledge-based algorithm that does not rely on any secondary structure predictions and/or sequence alignment profiles, to scan this database and predict the most probable backbone conformations for the protein local structures. PB-kPRED does, however, preferentially use structural information from homologues when it is available. The predictions were evaluated rigorously on 15,544 query proteins representing a non-redundant subset of the PDB filtered at 30% sequence identity cut-off. We have shown that the kPRED method was able to achieve mean accuracies ranging from 40.8% to 66.3% depending on the availability of homologues. The impact of the different strategies for scanning the database on the prediction was evaluated and is discussed. Our results highlight the usefulness of the method in the context of proteins without any known structural homologues. A scoring function that gives a good estimate of the accuracy of prediction was further developed. This score estimates very well the accuracy of the algorithm (R² of 0.82). An online version of the tool is provided freely for non-commercial usage at http://www.bo-protscience.fr/kpred/.
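The two-step idea described above (store pentapeptide fragments with their observed Protein Block, then scan the store to predict a query) can be illustrated with a toy lookup table. This is only a sketch under stated assumptions, not the PB-kPRED algorithm: the function names, the rule of labelling each pentapeptide with the block of its central residue, the majority vote, and the example sequences are all hypothetical.

```python
from collections import Counter, defaultdict


def build_database(examples):
    """Map each pentapeptide to a count of Protein Blocks seen at its centre.

    `examples` is an iterable of (sequence, pb_string) pairs of equal length.
    Labelling a fragment by its central residue is an assumption of this sketch.
    """
    db = defaultdict(Counter)
    for seq, pbs in examples:
        for i in range(len(seq) - 4):
            db[seq[i:i + 5]][pbs[i + 2]] += 1
    return db


def predict(seq, db, unknown="-"):
    """Predict one block per residue by majority vote; '-' where no fragment matches."""
    out = [unknown] * len(seq)
    for i in range(len(seq) - 4):
        counts = db.get(seq[i:i + 5])
        if counts:
            out[i + 2] = counts.most_common(1)[0][0]
    return "".join(out)


def accuracy(predicted, observed):
    """Fraction of positions where the predicted block equals the observed one."""
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


# Invented toy data: one "known structure" and one query.
toy_db = build_database([("ACDEFGH", "ddmmmdd")])
print(predict("ACDEFKK", toy_db))
```

A real predictor would need a fallback for pentapeptides absent from the database and a way to weight fragments from homologues, neither of which is modelled here.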
Tomcho, Jeremy C; Tillman, Magdalena R; Znosko, Brent M
2015-09-01
Predicting the secondary structure of RNA is an intermediate in predicting RNA three-dimensional structure. Commonly, determining RNA secondary structure from sequence uses free energy minimization and nearest neighbor parameters. Current algorithms utilize a sequence-independent model to predict free energy contributions of dinucleotide bulges. To determine if a sequence-dependent model would be more accurate, short RNA duplexes containing dinucleotide bulges with different sequences and nearest neighbor combinations were optically melted to derive thermodynamic parameters. These data suggested energy contributions of dinucleotide bulges were sequence-dependent, and a sequence-dependent model was derived. This model assigns free energy penalties based on the identity of nucleotides in the bulge (3.06 kcal/mol for two purines, 2.93 kcal/mol for two pyrimidines, 2.71 kcal/mol for 5'-purine-pyrimidine-3', and 2.41 kcal/mol for 5'-pyrimidine-purine-3'). The predictive model also includes a 0.45 kcal/mol penalty for an A-U pair adjacent to the bulge and a -0.28 kcal/mol bonus for a G-U pair adjacent to the bulge. The new sequence-dependent model results in predicted values within, on average, 0.17 kcal/mol of experimental values, a significant improvement over the sequence-independent model. This model and new experimental values can be incorporated into algorithms that predict RNA stability and secondary structure from sequence.
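The parameters quoted above can be collected into a small lookup, shown here as a sketch. The numerical values come from the abstract; the function name, the input format, and the assumption that the adjacent-pair terms are simply added once per neighbouring A-U or G-U pair are illustrative and not taken from the paper.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "U"}

# Free energy penalties (kcal/mol) from the abstract, keyed by the
# (5' nucleotide class, 3' nucleotide class) of the bulge: R = purine, Y = pyrimidine.
BULGE_PENALTY = {
    ("R", "R"): 3.06,
    ("Y", "Y"): 2.93,
    ("R", "Y"): 2.71,
    ("Y", "R"): 2.41,
}
AU_ADJACENT_PENALTY = 0.45   # A-U pair adjacent to the bulge
GU_ADJACENT_BONUS = -0.28    # G-U pair adjacent to the bulge


def base_class(nucleotide):
    """Return 'R' for a purine and 'Y' for a pyrimidine."""
    if nucleotide in PURINES:
        return "R"
    if nucleotide in PYRIMIDINES:
        return "Y"
    raise ValueError(f"not an RNA nucleotide: {nucleotide!r}")


def dinucleotide_bulge_energy(bulge, adjacent_pairs=()):
    """Estimate the bulge contribution in kcal/mol.

    `bulge` is the two bulged nucleotides written 5'->3' (e.g. "CA").
    `adjacent_pairs` lists the base pairs flanking the bulge (e.g. ["AU", "GC"]).
    Summing one term per adjacent pair is an assumption of this sketch.
    """
    if len(bulge) != 2:
        raise ValueError("expected exactly two bulged nucleotides")
    energy = BULGE_PENALTY[(base_class(bulge[0]), base_class(bulge[1]))]
    for pair in adjacent_pairs:
        members = frozenset(pair.upper())
        if members == frozenset("AU"):
            energy += AU_ADJACENT_PENALTY
        elif members == frozenset("GU"):
            energy += GU_ADJACENT_BONUS
    return round(energy, 2)


print(dinucleotide_bulge_energy("CA", ["AU", "GC"]))
```

Keying the table on purine/pyrimidine class rather than on the 16 possible dinucleotides mirrors how the abstract states the model.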
Piatkowski, Pawel; Kasprzak, Joanna M; Kumar, Deepak; Magnus, Marcin; Chojnowski, Grzegorz; Bujnicki, Janusz M
2016-01-01
RNA encompasses an essential part of all known forms of life. The functions of many RNA molecules are dependent on their ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that either utilize information derived from known structures of other RNA molecules (by way of template-based modeling) or attempt to simulate the physical process of RNA structure formation (by way of template-free modeling). All computational methods suffer from various limitations that make theoretical models less reliable than high-resolution experimentally determined structures. This chapter provides a protocol for computational modeling of RNA 3D structure that overcomes major limitations by combining two complementary approaches: template-based modeling that is capable of predicting global architectures based on similarity to other molecules but often fails to predict local unique features, and template-free modeling that can predict the local folding, but is limited to modeling the structure of relatively small molecules. Here, we combine the use of a template-based method ModeRNA with a template-free method SimRNA. ModeRNA requires a sequence alignment of the target RNA sequence to be modeled with a template of the known structure; it generates a model that predicts the structure of a conserved core and provides a starting point for modeling of variable regions. SimRNA can be used to fold small RNAs (<80 nt) without any additional structural information, and to refold parts of models for larger RNAs that have a correctly modeled core. ModeRNA can be either downloaded, compiled and run locally or run through a web interface at http://genesilico.pl/modernaserver/ . 
SimRNA is currently available to download for local use as a precompiled software package at http://genesilico.pl/software/stand-alone/simrna and as a web server at http://genesilico.pl/SimRNAweb. For model optimization we use QRNAS, available at http://genesilico.pl/qrnas.
Liu, Chang
2017-01-01
The spatial organization of the genome in the nucleus is critical for many cellular processes. It has been broadly accepted that the packing of chromatin inside the nucleus is not random, but structured at several hierarchical levels. The Hi-C method combines Chromatin Conformation Capture and high-throughput sequencing, which allows interrogating genome-wide chromatin interactions. Depending on the sequencing depth, chromatin packing patterns derived from Hi-C experiments can be viewed on a chromosomal scale or at a local genic level. Here, I describe a protocol of plant in situ Hi-C library preparation, which covers procedures starting from tissue fixation to library amplification.
Export of FepA::PhoA fusion proteins to the outer membrane of Escherichia coli K-12.
Murphy, C K; Klebba, P E
1989-11-01
A library of fepA::phoA gene fusions was generated in order to study the structure and secretion of the Escherichia coli K-12 ferric enterobactin receptor, FepA. All of the fusion proteins contained various lengths of the amino-terminal portion of FepA fused in frame to the catalytic portion of bacterial alkaline phosphatase. Localization of FepA::PhoA fusion proteins in the cell envelope was dependent on the number of residues of mature FepA present at the amino terminus. Hybrids containing up to one-third of the amino-terminal portion of FepA fractionated with their periplasm, while those containing longer sequences of mature FepA were exported to the outer membrane. Outer membrane-localized fusion proteins expressed FepA sequences on the external face of the outer membrane and alkaline phosphatase moieties in the periplasmic space. From sequence determinations of the fepA::phoA fusion joints, residues within FepA which may be exposed on the periplasmic side of the outer membrane were identified.
Bandyopadhyay, Deepak; Huan, Jun; Prins, Jan; Snoeyink, Jack; Wang, Wei; Tropsha, Alexander
2009-11-01
Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name used as a vertex label) are connected by geometrical proximity edges. The approach employs two steps. First, it uses a fast subgraph mining algorithm to find all occurrences of family-specific labeled subgraphs for all well characterized protein structural and functional families. Second, it queries a new structure for occurrences of a set of motifs characteristic of a known family, using a graph index to speed up Ullman's subgraph isomorphism algorithm. The confidence of function inference from structure depends on the number of family-specific motifs found in the query structure compared with their distribution in a large non-redundant database of proteins. This method can assign a new structure to a specific functional family in cases where sequence alignments, sequence patterns, structural superposition and active site templates fail to provide accurate annotation.
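The graph representation described above (residue-labelled vertices joined by proximity edges) can be sketched as follows. The abstract does not say how proximity is defined, so this example swaps in a plain distance cutoff between one representative point per residue; the 8.0 cutoff, the function name, and the coordinates are assumptions made for illustration only.

```python
import math
from itertools import combinations


def contact_graph(residues, cutoff=8.0):
    """Build a labelled residue graph from (name, (x, y, z)) tuples.

    Returns (labels, edges): `labels[i]` is the residue name used as the
    vertex label, and `edges` is a set of index pairs (i, j) with i < j whose
    representative points lie within `cutoff` of each other.
    The distance cutoff is a stand-in for whatever proximity rule is used in the paper.
    """
    labels = [name for name, _ in residues]
    edges = set()
    for (i, (_, a)), (j, (_, b)) in combinations(enumerate(residues), 2):
        if math.dist(a, b) <= cutoff:
            edges.add((i, j))
    return labels, edges


# Invented coordinates for three residues.
toy = [("ALA", (0.0, 0.0, 0.0)), ("GLY", (3.0, 4.0, 0.0)), ("SER", (20.0, 0.0, 0.0))]
print(contact_graph(toy))
```

Subgraph mining and subgraph isomorphism search would then operate on the `(labels, edges)` pair; those steps are not shown.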
Porras, Pablo; McDonagh, Brian; Pedrajas, Jose Rafael; Bárcena, J Antonio; Padilla, C Alicia
2010-04-01
We have previously shown that glutaredoxin 2 (Grx2) from Saccharomyces cerevisiae localizes at 3 different subcellular compartments, cytosol, mitochondrial matrix and outer membrane, as the result of different post-translational processing of one single gene. Having set the mechanism responsible for this remarkable phenomenon, we have now aimed at defining whether this diversity of subcellular localizations correlates with differences in structure and function of the Grx2 isoforms. We have determined the N-terminal sequence of the soluble mitochondrial matrix Grx2 by mass spectrometry and have determined the exact cleavage site by Mitochondrial Processing Peptidase (MPP). As a consequence of this cleavage, the mitochondrial matrix Grx2 isoform possesses a basic tetrapeptide extension at the N-terminus compared to the cytosolic form. A functional consequence of this structural difference is that mitochondrial Grx2 displays a markedly higher activity in the catalysis of GSSG reduction by the mitochondrial dithiol dihydrolipoamide. We have prepared Grx2 mutants affected on key residues inside the presequence to direct the protein to one single cellular compartment (either the cytosol, the mitochondrial membrane or the matrix) and have analyzed their functional phenotypes. Strains expressing Grx2 only in the cytosol are as sensitive to H2O2 as strains lacking the gene, whereas those expressing Grx2 exclusively in the mitochondrial matrix are more resistant. Mutations on key basic residues drastically affect the cellular fate of the protein, showing that evolutionary diversification of Grx2 structural and functional properties is strictly dependent on the sequence of the targeting signal peptide.
Local dependence in random graph models: characterization, properties and statistical inference
Schweinberger, Michael; Handcock, Mark S.
2015-01-01
Dependent phenomena, such as relational, spatial and temporal phenomena, tend to be characterized by local dependence in the sense that units which are close in a well-defined sense are dependent. In contrast with spatial and temporal phenomena, though, relational phenomena tend to lack a natural neighbourhood structure in the sense that it is unknown which units are close and thus dependent. Owing to the challenge of characterizing local dependence and constructing random graph models with local dependence, many conventional exponential family random graph models induce strong dependence and are not amenable to statistical inference. We take first steps to characterize local dependence in random graph models, inspired by the notion of finite neighbourhoods in spatial statistics and M-dependence in time series, and we show that local dependence endows random graph models with desirable properties which make them amenable to statistical inference. We show that random graph models with local dependence satisfy a natural domain consistency condition which every model should satisfy, but conventional exponential family random graph models do not satisfy. In addition, we establish a central limit theorem for random graph models with local dependence, which suggests that random graph models with local dependence are amenable to statistical inference. We discuss how random graph models with local dependence can be constructed by exploiting either observed or unobserved neighbourhood structure. In the absence of observed neighbourhood structure, we take a Bayesian view and express the uncertainty about the neighbourhood structure by specifying a prior on a set of suitable neighbourhood structures. We present simulation results and applications to two real world networks with ‘ground truth’. PMID:26560142
Musinova, Yana R; Kananykhina, Eugenia Y; Potashnikova, Daria M; Lisitsyna, Olga M; Sheval, Eugene V
2015-01-01
The majority of known nucleolar proteins are freely exchanged between the nucleolus and the surrounding nucleoplasm. One way proteins are retained in the nucleoli is by the presence of specific amino acid sequences, namely nucleolar localization signals (NoLSs). The mechanism by which NoLSs retain proteins inside the nucleoli is still unclear. Here, we present data showing that the charge-dependent (electrostatic) interactions of NoLSs with nucleolar components lead to nucleolar accumulation as follows: (i) known NoLSs are enriched in positively charged amino acids, but the NoLS structure is highly heterogeneous, and it is not possible to identify a consensus sequence for this type of signal; (ii) in two analyzed proteins (NF-κB-inducing kinase and HIV-1 Tat), the NoLS corresponds to a region that is enriched for positively charged amino acid residues; substituting charged amino acids with non-charged ones reduced the nucleolar accumulation in proportion to the charge reduction, and nucleolar accumulation efficiency was strongly correlated with the predicted charge of the tested sequences; and (iii) sequences containing only lysine or arginine residues (which were referred to as imitative NoLSs, or iNoLSs) are accumulated in the nucleoli in a charge-dependent manner. The results of experiments with iNoLSs suggested that charge-dependent accumulation inside the nucleoli was dependent on interactions with nucleolar RNAs. The results of this work are consistent with the hypothesis that nucleolar protein accumulation by NoLSs can be determined by the electrostatic interaction of positively charged regions with nucleolar RNAs rather than by any sequence-specific mechanism.
Kasaliwal, Rajeev; Sankhe, Shilpa S; Lila, Anurag R; Budyal, Sweta R; Jagtap, Varsha S; Sarathi, Vijaya; Kakade, Harshal; Bandgar, Tushar; Menon, Padmavathy S; Shah, Nalini S
2013-06-01
Various techniques have been attempted to increase the yield of magnetic resonance imaging (MRI) for localization of pituitary microadenomas in corticotropin (ACTH)-dependent Cushing's syndrome (CS). The aim of this study was to compare the performance of dynamic contrast spin echo (DC-SE) and volume interpolated 3D-spoiled gradient echo (VI-SGE) MR sequences in the diagnostic evaluation of ACTH-dependent CS. Data were analysed retrospectively from a series of ACTH-dependent CS patients treated over a 2-year period at a tertiary care referral centre (2009-2011). Thirty-six patients (24 female and 12 male) were diagnosed with ACTH-dependent CS during the study period. All patients underwent MRI by both sequences during a single examination. Cases with negative and equivocal pituitary MR imaging underwent corticotropin-releasing hormone (CRH) stimulated bilateral inferior petrosal sinus sampling (BIPSS) to confirm pituitary origin of the ACTH excess state. Thirty patients were finally diagnosed with Cushing's disease (CD) [based on histopathological proof of adenoma and/or remission (partial/complete) of hypercortisolism postsurgery]. Six patients were diagnosed with histopathologically proven ectopic CS. Of 30 patients with CD, 24 patients had microadenomas and 6 patients had macroadenomas. The DC-SE MRI sequence was able to identify microadenomas in 16 of 24 patients, whereas the postcontrast VI-SGE sequence was able to identify microadenomas in 21 of 24 patients. All six patients with ectopic CS had negative pituitary MR imaging by both techniques (specificity: 100%). The VI-SGE MR sequence was better for localization of pituitary microadenomas, particularly when the DC-SE MR sequence is negative or equivocal, and should be used in addition to the DC-SE MR sequence for the evaluation of ACTH-dependent CS.
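The detection counts reported above translate directly into per-sequence detection rates for microadenomas. The short calculation below only restates that arithmetic; the function name is illustrative, and no statistical comparison between the two sequences is implied.

```python
def detection_rate(detected, total):
    """Fraction of confirmed microadenomas that a sequence identified."""
    return detected / total


dc_se = detection_rate(16, 24)    # DC-SE: 16 of 24 microadenomas
vi_sge = detection_rate(21, 24)   # VI-SGE: 21 of 24 microadenomas

print(f"DC-SE: {dc_se:.1%}, VI-SGE: {vi_sge:.1%}")
```

With the counts from the abstract this gives roughly 66.7% for DC-SE and 87.5% for VI-SGE.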
Wang, Shuo; Nanjunda, Rupesh; Aston, Karl; Bashkin, James K.; Wilson, W. David
2012-01-01
In order to better understand the effects of β-alanine (β) substitution and the number of heterocycles on DNA binding affinity and selectivity, the interactions of an eight-ring hairpin polyamide (PA) and two β derivatives as well as a six-heterocycle analog have been investigated with their cognate DNA sequence, 5′-TGGCTT-3′. Binding selectivity and the effects of β have been investigated with the cognate and five mutant DNAs. A set of powerful and complementary methods have been employed for both energetic and structural evaluations: UV-melting, biosensor-surface plasmon resonance, isothermal titration calorimetry, circular dichroism and a DNA ligation ladder global structure assay. The reduced number of heterocycles in the six-ring PA weakens the binding affinity; however, the smaller PA aggregates significantly less than the larger PAs, and allows us to obtain the binding thermodynamics. The PA-DNA binding enthalpy is large and negative with a large negative ΔCp, and is the primary driving component of the Gibbs free energy. The complete SPR binding results clearly show that β substitutions can substantially weaken the binding affinity of hairpin PAs in a position-dependent manner. More importantly, the changes in PA binding to the mutant DNAs further confirm the position-dependent effects on PA-DNA interaction affinity. Comparison of mutant DNA sequences also shows a different effect in recognition of T•A versus A•T base pairs. The effects of DNA mutations on binding of a single PA as well as the effects of the position of β substitution on binding tell a clear and very important story about sequence dependent binding of PAs to DNA. PMID:23167504
Processing multiple non-adjacent dependencies: evidence from sequence learning
de Vries, Meinou H.; Petersson, Karl Magnus; Geukes, Sebastian; Zwitserlood, Pienie; Christiansen, Morten H.
2012-01-01
Processing non-adjacent dependencies is considered to be one of the hallmarks of human language. Assuming that sequence-learning tasks provide a useful way to tap natural-language-processing mechanisms, we cross-modally combined serial reaction time and artificial-grammar learning paradigms to investigate the processing of multiple nested (A1A2A3B3B2B1) and crossed dependencies (A1A2A3B1B2B3), containing either three or two dependencies. Both reaction times and prediction errors highlighted problems with processing the middle dependency in nested structures (A1A2A3B3_B1), reminiscent of the ‘missing-verb effect’ observed in English and French, but not with crossed structures (A1A2A3B1_B3). Prior linguistic experience did not play a major role: native speakers of German and Dutch—which permit nested and crossed dependencies, respectively—showed a similar pattern of results for sequences with three dependencies. As for sequences with two dependencies, reaction times and prediction errors were similar for both nested and crossed dependencies. The results suggest that constraints on the processing of multiple non-adjacent dependencies are determined by the specific ordering of the non-adjacent dependencies (i.e. nested or crossed), as well as the number of non-adjacent dependencies to be resolved (i.e. two or three). Furthermore, these constraints may not be specific to language but instead derive from limitations on structured sequence learning. PMID:22688641
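The difference between the nested and crossed orderings described above can be made concrete by generating both sequence types and measuring how far apart each dependent pair sits. This is a minimal sketch; the function names are illustrative, and the labels A1..An and B1..Bn simply follow the notation used in the abstract.

```python
def nested(n):
    """A1 A2 ... An Bn ... B2 B1: the innermost pair is adjacent."""
    return [f"A{i}" for i in range(1, n + 1)] + [f"B{i}" for i in range(n, 0, -1)]


def crossed(n):
    """A1 A2 ... An B1 B2 ... Bn: the pairs keep their original order."""
    return [f"A{i}" for i in range(1, n + 1)] + [f"B{i}" for i in range(1, n + 1)]


def dependency_spans(sequence):
    """Distance (in positions) between each Ai and its matching Bi."""
    position = {symbol: index for index, symbol in enumerate(sequence)}
    n = len(sequence) // 2
    return {i: position[f"B{i}"] - position[f"A{i}"] for i in range(1, n + 1)}


print(nested(3), dependency_spans(nested(3)))
print(crossed(3), dependency_spans(crossed(3)))
```

In the nested ordering the three dependencies span 5, 3 and 1 positions, whereas in the crossed ordering every dependency spans the same 3 positions.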
Domain-specific learning of grammatical structure in musical and phonological sequences.
Bly, Benjamin Martin; Carrión, Ricardo E; Rasch, Björn
2009-01-01
Artificial grammar learning depends on acquisition of abstract structural representations rather than domain-specific representational constraints, or so many studies tell us. Using an artificial grammar task, we compared learning performance in two stimulus domains in which respondents have differing tacit prior knowledge. We found that despite grammatically identical sequence structures, learning was better for harmonically related chord sequences than for letter name sequences or harmonically unrelated chord sequences. We also found transfer effects within the musical and letter name tasks, but not across the domains. We conclude that knowledge acquired in implicit learning depends not only on abstract features of structured stimuli, but that the learning of regularities is in some respects domain-specific and strongly linked to particular features of the stimulus domain.
Song, Jiangning; Burrage, Kevin; Yuan, Zheng; Huber, Thomas
2006-03-09
The majority of peptide bonds in proteins occur in the trans conformation. However, for proline residues, a considerable fraction of prolyl peptide bonds adopt the cis form. Proline cis/trans isomerization is known to play a critical role in protein folding, splicing, cell signaling and transmembrane active transport. Accurate prediction of proline cis/trans isomerization in proteins would have many important applications towards the understanding of protein structure and function. In this paper, we propose a new approach to predict proline cis/trans isomerization in proteins using a support vector machine (SVM). Preliminary results indicated that Radial Basis Function (RBF) kernels could lead to better prediction performance than polynomial and linear kernel functions. We used single sequence information of different local window sizes, amino acid compositions of different local sequences, multiple sequence alignments obtained from PSI-BLAST and the secondary structure information predicted by PSIPRED. We explored these different sequence encoding schemes in order to investigate their effects on the prediction performance. The training and testing of this approach were performed on a newly enlarged dataset of 2424 non-homologous proteins determined by X-ray diffraction, using 5-fold cross-validation. A window size of 11 provided the best performance for determining proline cis/trans isomerization based on the single amino acid sequence. Using multiple sequence alignments in the form of PSI-BLAST profiles significantly improved the prediction performance: the prediction accuracy increased from 62.8% with single sequence to 69.8%, and the Matthews Correlation Coefficient (MCC) improved from 0.26 with single local sequence to 0.40.
Furthermore, when coupled with the secondary structure information predicted by PSIPRED, our method yielded a prediction accuracy of 71.5% and an MCC of 0.43, which are 9% and 0.17 higher, respectively, than the values achieved with single sequence information. A new method has been developed to predict proline cis/trans isomerization in proteins based on a support vector machine, which used the single amino acid sequence with different local window sizes, the amino acid compositions of local sequences flanking centered proline residues, the position-specific scoring matrices (PSSMs) extracted by PSI-BLAST and the predicted secondary structures generated by PSIPRED. The successful application of the SVM approach in this study reinforces that SVM is a powerful tool for predicting proline cis/trans isomerization in proteins and for biological sequence analysis.
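The two quantities this abstract relies on, a fixed-size local window centered on each proline and the Matthews Correlation Coefficient, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the helper names `proline_windows` and `mcc` and the padding character `X` are hypothetical, and the SVM training itself is not shown.

```python
import math


def proline_windows(sequence, window=11, pad="X"):
    """Return (position, window_string) for every proline, centered on the proline.

    Positions near the termini are padded with `pad` (an assumed convention).
    """
    if window % 2 == 0:
        raise ValueError("window must be odd so the proline sits in the center")
    half = window // 2
    padded = pad * half + sequence + pad * half
    return [
        (i, padded[i:i + window])  # residue i sits at index i + half in `padded`
        for i, residue in enumerate(sequence)
        if residue == "P"
    ]


def mcc(tp, tn, fp, fn):
    """Matthews Correlation Coefficient from confusion-matrix counts."""
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By common convention, return 0 when any marginal total is zero.
    return (tp * tn - fp * fn) / denominator if denominator else 0.0


if __name__ == "__main__":
    print(proline_windows("MKTAYPLAGSPE", window=11))
    print(mcc(tp=40, tn=45, fp=5, fn=10))
```

The improvements reported in the abstract follow directly from its own figures: 71.5 - 62.8 = 8.7 percentage points (reported as 9%) and 0.43 - 0.26 = 0.17 in MCC.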
Molzan, Manuela; Ottmann, Christian
2013-03-01
Myeloid leukemia factor 1 (MLF1) is associated with the development of leukemic diseases such as acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS). However, information on the physiological function of MLF1 is limited and mostly derived from studies identifying MLF1 interaction partners like CSN3, MLF1IP, MADM, Manp and the 14-3-3 proteins. The 14-3-3-binding site surrounding S34 is one of the only known functional features of the MLF1 sequence, along with one nuclear export sequence (NES) and two nuclear localization sequences (NLS). It was recently shown that the subcellular localization of mouse MLF1 is dependent on 14-3-3 proteins. Based on these findings, we investigated whether the subcellular localization of human MLF1 was also directly 14-3-3-dependent. Live cell imaging with GFP-fused human MLF1 was used to study the effects of mutations and deletions on its subcellular localization. Surprisingly, we found that the subcellular localization of full-length human MLF1 is 14-3-3-independent, and is probably regulated by other as-yet-unknown proteins.
Gerresheim, Gesche K; Dünnes, Nadia; Nieder-Röhrmann, Anika; Shalamova, Lyudmila A; Fricke, Markus; Hofacker, Ivo; Höner Zu Siederdissen, Christian; Marz, Manja; Niepmann, Michael
2017-02-01
We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.
50 years of DNA ‘Breathing’: Reflections on Old and New Approaches
von Hippel, Peter H.; Johnson, Neil P.; Marcus, Andrew H.
2015-01-01
The coding sequences for genes, and much other regulatory information involved in genome expression, are located ‘inside’ the DNA duplex. Thus the ‘macromolecular machines’ that read out this information from the base sequence of the DNA must somehow access the DNA ‘interior’. Double-stranded (ds) DNA is a highly structured and cooperatively stabilized system at physiological temperatures, but is also only marginally stable and undergoes a cooperative ‘melting phase transition’ at temperatures not far above physiological. Furthermore, due to its length and heterogeneous sequence, with AT-rich segments being less stable than GC-rich segments, the DNA genome ‘melts’ in a multistate fashion. Therefore the DNA genome must also manifest thermally driven structural (‘breathing’) fluctuations at physiological temperatures that should reflect the heterogeneity of the dsDNA stability near the melting temperature. Thus many of the breathing fluctuations of dsDNA are likely also to be sequence dependent, and could well contain information that should be ‘readable’ and useable by regulatory proteins and protein complexes in site-specific binding reactions involving dsDNA ‘opening’. Our laboratory has been involved in studying the breathing fluctuations of duplex DNA for about 50 years. In this ‘Reflections’ article we present a relatively chronological overview of these studies, starting with the use of simple chemical probes (such as hydrogen exchange, formaldehyde and simple DNA ‘melting’ proteins) to examine the local stability of the dsDNA structure, and culminating in sophisticated spectroscopic approaches that can be used to monitor the breathing-dependent interactions of regulatory complexes with their duplex DNA targets in ‘real time’. PMID:23840028
Preservation of protein clefts in comparative models.
Piedra, David; Lois, Sergi; de la Cruz, Xavier
2008-01-16
Comparative, or homology, modelling of protein structures is the most widely used prediction method when the target protein has homologues of known structure. Given that the quality of a model may vary greatly, several studies have been devoted to identifying the factors that influence modelling results. These studies usually consider the protein as a whole, and only a few provide a separate discussion of the behaviour of biologically relevant features of the protein. Given the value of the latter for many applications, here we extended previous work by analysing the preservation of native protein clefts in homology models. We chose to examine clefts because of their role in protein function/structure, as they are usually the locus of protein-protein interactions, host the enzymes' active site, or, in the case of protein domains, can also be the locus of domain-domain interactions that lead to the structure of the whole protein. We studied how the largest cleft of a protein varies in comparative models. To this end, we analysed a set of 53507 homology models that cover the whole sequence identity range, with a special emphasis on medium and low similarities. More precisely, we examined how cleft quality (measured using six complementary parameters related to both global shape and local atomic environment) depends on the sequence identity between target and template proteins. In addition to this general analysis, we also explored the impact of a number of factors on cleft quality, and found that the relationship between quality and sequence identity varies depending on cleft rank amongst the set of protein clefts (when ordered according to size) and on the number of aligned residues. We have examined cleft quality in homology models at a range of sequence identity levels. Our results provide a detailed view of how quality is affected by distinct parameters and thus may help users of comparative modelling to determine the final quality and applicability of their cleft models.
In addition, the large variability in model quality that we observed within each sequence bin, with good models present even at low sequence identities (between 20% and 30%), indicates that properly developed identification methods could be used to recover good cleft models in this sequence range.
NASA Astrophysics Data System (ADS)
Meyer, Sam; Everaers, Ralf
2015-02-01
The histone-DNA interaction in the nucleosome is a fundamental mechanism of genomic compaction and regulation, which remains largely unknown despite increasing structural knowledge of the complex. In this paper, we propose a framework for the extraction of a nanoscale histone-DNA force-field from a collection of high-resolution structures, which may be adapted to a larger class of protein-DNA complexes. We applied the procedure to a large crystallographic database extended by snapshots from molecular dynamics simulations. The comparison of the structural models first shows that, at histone-DNA contact sites, the DNA base-pairs are shifted outwards locally, consistent with locally repulsive forces exerted by the histones. The second step shows that the various force profiles of the structures under analysis derive locally from a unique, sequence-independent, quadratic repulsive force-field, while the sequence preferences are entirely due to internal DNA mechanics. We have thus obtained the first knowledge-derived nanoscale interaction potential for histone-DNA in the nucleosome. The conformations obtained by relaxation of nucleosomal DNA with high-affinity sequences in this potential accurately reproduce the experimental values of binding preferences. Finally we address the more generic binding mechanisms relevant to the 80% genomic sequences incorporated in nucleosomes, by computing the conformation of nucleosomal DNA with sequence-averaged properties. This conformation differs from those found in crystals, and the analysis suggests that repulsive histone forces are related to local stretch tension in nucleosomal DNA, mostly between adjacent contact points. This tension could play a role in the stability of the complex.
Bedbrook, Claire N; Yang, Kevin K; Rice, Austin J; Gradinaru, Viviana; Arnold, Frances H
2017-10-01
There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. A MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location in order to function. The sequence and structural determinants of these processes are complex and highly constrained. Here we describe a predictive, machine-learning approach that captures this complexity to facilitate successful MP engineering and design. Machine learning on carefully-chosen training sequences made by structure-guided SCHEMA recombination has enabled us to accurately predict the rare sequences in a diverse library of channelrhodopsins (ChRs) that express and localize to the plasma membrane of mammalian cells. These light-gated channel proteins of microbial origin are of interest for neuroscience applications, where expression and localization to the plasma membrane is a prerequisite for function. We trained Gaussian process (GP) classification and regression models with expression and localization data from 218 ChR chimeras chosen from a 118,098-variant library designed by SCHEMA recombination of three parent ChRs. We use these GP models to identify ChRs that express and localize well and show that our models can elucidate sequence and structure elements important for these processes. We also used the predictive models to convert a naturally occurring ChR incapable of mammalian localization into one that localizes well. PMID:29059183
Molecular dynamics study of some non-hydrogen-bonding base pair DNA strands
NASA Astrophysics Data System (ADS)
Tiwari, Rakesh K.; Ojha, Rajendra P.; Tiwari, Gargi; Pandey, Vishnudatt; Mall, Vijaysree
2018-05-01
In order to elucidate the structural activity of hydrophobic modified DNA, the DMMO2-D5SICS base pair is introduced as a constituent in different sets of 12-mer and 14-mer DNA sequences for molecular dynamics (MD) simulation in explicit water solvent. The AMBER 14 force field was employed for each set of duplexes during the 200 ns production-dynamics simulation in an orthogonal water box, using the Particle-Mesh-Ewald (PME) method under infinite periodic boundary conditions (PBC), to determine conformational parameters of the complex. The force-field parameters of the modified base pair were calculated with the Gaussian code using Hartree-Fock ab initio methodology. RMSD results reveal that the conformation of the duplex is sequence dependent and that the binding energy of the complex depends on the position of the modified base pair in the nucleic acid strand. We found that non-bonding energy made a significant contribution to stabilising this type of duplex in comparison to electrostatic energy. The distortion produced within the strands by this type of base pair was local and destabilised duplex integrity near the substitution. Moreover, the binding energy of the duplex depends on the position of the hydrophobic base-pair substitution and on the DNA sequence, which strongly supports the corresponding experimental study.
Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G.; Gelly, Jean-Christophe
2016-01-01
Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation —with Protein Blocks—, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the ‘Hard’ category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/. PMID:27319297
Genetic Rearrangements Can Modify Chromatin Features at Epialleles
Foerster, Andrea M.; Dinh, Huy Q.; Sedman, Laura; Wohlrab, Bonnie; Mittelsten Scheid, Ortrun
2011-01-01
Analogous to genetically distinct alleles, epialleles represent heritable states of different gene expression from sequence-identical genes. Alleles and epialleles both contribute to phenotypic heterogeneity. While alleles originate from mutation and recombination, the source of epialleles is less well understood. We analyze active and inactive epialleles that were found at a transgenic insert with a selectable marker gene in Arabidopsis. Both converse expression states are stably transmitted to progeny. The silent epiallele was previously shown to change its state upon loss-of-function of trans-acting regulators and drug treatments. We analyzed the composition of the epialleles, their chromatin features, their nuclear localization, transcripts, and homologous small RNA. After mutagenesis by T-DNA transformation of plants carrying the silent epiallele, we found new active alleles. These switches were associated with different, larger or smaller, and non-overlapping deletions or rearrangements in the 3′ regions of the epiallele. These cis-mutations caused different degrees of gene expression stability depending on the nature of the sequence alteration, the consequences for transcription and transcripts, and the resulting chromatin organization upstream. This illustrates a tight dependence of epigenetic regulation on local structures and indicates that sequence alterations can cause epigenetic changes at some distance in regions not directly affected by the mutation. Similar effects may also be involved in gene expression and chromatin changes in the vicinity of transposon insertions or excisions, recombination events, or DNA repair processes and could contribute to the origin of new epialleles. PMID:22028669
The Thiamine-Pyrophosphate-Motif
NASA Technical Reports Server (NTRS)
Ciszak, Ewa; Dominiak, Paulina
2004-01-01
Thiamin pyrophosphate (TPP), a derivative of vitamin B1, is a cofactor for enzymes performing catalysis in pathways of energy production including the well known decarboxylation of α-keto acid dehydrogenases followed by transketolation. TPP-dependent enzymes constitute a structurally and functionally diverse group exhibiting multimeric subunit organization, multiple domains and two chemically equivalent catalytic centers. Annotation of functional TPP-dependent enzymes, therefore, has not been trivial due to low sequence similarity related to this complex organization. Our approach to analysis of structures of known TPP-dependent enzymes reveals for the first time features common to this group, which we have termed the TPP-motif. The TPP-motif consists of specific spatial arrangements of structural elements and their specific contacts to provide for a flip-flop, or alternate site, enzymatic mechanism of action. Analysis of structural elements entrained in the flip-flop action displayed by TPP-dependent enzymes reveals a novel definition of the common amino acid sequences. These sequences allow for annotation of TPP-dependent enzymes, thus advancing functional proteomics. Further details of three-dimensional structures of TPP-dependent enzymes will be discussed.
The effects of DNA supercoiling on G-quadruplex formation.
Sekibo, Doreen A T; Fox, Keith R
2017-12-01
Guanine-rich DNAs can fold into four-stranded structures that contain stacks of G-quartets. Bioinformatics studies have revealed that G-rich sequences with the potential to adopt these structures are unevenly distributed throughout genomes, and are especially found in gene promoter regions. With the exception of the single-stranded telomeric DNA, all genomic G-rich sequences will always be present along with their C-rich complements, and quadruplex formation will be in competition with the corresponding Watson-Crick duplex. Quadruplex formation must therefore first require local dissociation (melting) of the duplex strands. Since negative supercoiling is known to facilitate the formation of alternative DNA structures, we have investigated G-quadruplex formation within negatively supercoiled DNA plasmids. Plasmids containing multiple copies of (G3T)n and (G3T4)n repeats, were probed with dimethylsulphate, potassium permanganate and S1 nuclease. While dimethylsulphate footprinting revealed some evidence for G-quadruplex formation in (G3T)n sequences, this was not affected by supercoiling, and permanganate failed to detect exposed thymines in the loop regions. (G3T4)n sequences were not protected from DMS and showed no reaction with permanganate. Similarly, both S1 nuclease and 2D gel electrophoresis of DNA topoisomers did not detect any supercoil-dependent structural transitions. These results suggest that negative supercoiling alone is not sufficient to drive G-quadruplex formation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cooper, David N.; Bacolla, Albino; Férec, Claude; Vasquez, Karen M.; Kehrer-Sawatzki, Hildegard; Chen, Jian-Min
2011-01-01
Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher-order features of the genomic architecture. The human genome is now recognized to contain ‘pervasive architectural flaws’ in that certain DNA sequences are inherently mutation-prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of non-canonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair, and may serve to increase mutation frequencies in generalized fashion (i.e. both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease. PMID:21853507
How the Sequence of a Gene Specifies Structural Symmetry in Proteins
Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin
2015-01-01
Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
[A turning point in the knowledge of the structure-function-activity relations of elastin].
Alix, A J
2001-01-01
In this review we present the latest results of our research group on the molecular structures (at the atomic level) of tropoelastin, elastin and elastin-derived peptides, studied essentially with bioinformatics methods (theoretical predictions and molecular modelling) linked to experimental circular dichroism spectroscopic studies. We had already characterized both the local secondary structure and some parts of the tertiary structure of the tropoelastin and elastin molecules (human, bovine...), using theoretical predictions (local secondary structure, linear epitopes...) and/or experimental data (optical spectroscopic methods: Raman scattering, infrared absorption, circular dichroism). Except for the cross-linking regions, which are in helical conformations, the whole tropoelastin structure displays many beta-reverse turns, which usually belong to irregular structures in proteins. These turns play a key role in the orientation of other, regular structures (alpha-helix, beta-strand), and are thus very important in the 3D architecture of the native protein. This is particularly true for human tropoelastin, because its sequence is rich in glycines and prolines, and these residues are frequently found in beta-turns (a beta-turn is made of four consecutive residues stabilized by a hydrogen bond). Several types of beta-turns can be defined by the values of the dihedral angles phi and psi of the two central residues. Thus, using a very recently updated set of propensities for the amino acid residues to belong to given types of reverse beta-turns (extracted from a reference set of known 3D structures of globular proteins), we have determined, with our in-house software COUDES, the distribution and characterization of the possible turn types for all possible tetrapeptides of the human tropoelastin sequence. The locations and/or types of these reverse beta-turns show a regularity and are not all random.
This confirms our hypothesis that the intramolecular elasticity of tropoelastin could be explained by the possibility of transitions between conformations involving short beta-strands and beta-turns. This result is of great interest for the construction (by molecular biology) of elastic biomaterials derived from the elastin sequence (particularly the elastin-derived peptides corresponding to the sequence exon 21--(exon 24--exon 24...)). Our study also makes it possible to predict the conformations of specific elastin-derived peptides that could have interesting biological activity. Peptides resulting from the degradation of elastin, the insoluble polymer of tropoelastin that is responsible for the elasticity of vertebrate tissues, can induce biological effects, notably the regulation of matrix metalloproteinase (MMP) activity. Recently, it was proposed that some elastin-derived hexapeptides resulting from circular permutations of VGVAPG (a sequence repeated three times in exon 24 of human tropoelastin) regulate MMP-1 production and activation. This effect depends on the presence of the tropoelastin-specific membrane receptor, the 67 kDa EBP (Elastin Binding Protein). Our results, obtained using both circular dichroism spectroscopy and linear predictions, confirmed the hypothesis of a structure-dependent mechanism, with a type VIII beta-turn possibly occurring on the first four residues of the GXXPG consensus sequence, which is present only among the active peptides. Thus, we have performed extensive molecular dynamics studies, in both implicit and explicit solvent, on these active and inactive elastin-derived hexapeptides.
Using our own pattern-recognition analysis of the types of beta-reverse-turns visited along the molecular dynamics trajectory, we found that active and inactive peptides form two clearly distinct conformational groups, in which active peptides preferentially adopt a conformation close to a type VIII GXXP beta-reverse-turn. The structural role of the C-terminal G residue could also be explained. Additional molecular simulations on (VGVAPG)2 and (VGVAPG)3 show the formation of two or three GXXP tetrapeptides adopting a structure close to a type VIII beta-reverse-turn, suggesting a local conformational preference for this motif. This observation of a specific single and/or repeated structural motif is in agreement with the circular dichroism spectra of the (VGVAPG)1, (VGVAPG)2 and (VGVAPG)3 peptides, and it can therefore be proposed that their biological activities have to be linear. The final aim of this type of work is to better understand the sequence/structure/function/activity relationships of these structured peptides, in order to propose specific sequences (corresponding to specific structures) for the best biological activity.
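The idea of assigning a beta-turn type from the phi and psi angles of the two central residues can be sketched as below. This is not the COUDES software or the authors' trajectory analysis. The canonical angle values are the commonly quoted textbook ones for types I, II and VIII, and the 30-degree tolerance and the function names are assumptions made for illustration.

```python
# Commonly quoted ideal dihedral angles (degrees): (phi_i+1, psi_i+1, phi_i+2, psi_i+2).
# Only three turn types are listed; a full classifier would cover more.
CANONICAL_TURNS = {
    "I": (-60, -30, -90, 0),
    "II": (-60, 120, 80, 0),
    "VIII": (-60, -30, -120, 120),
}


def angle_diff(a, b):
    """Smallest absolute difference between two angles in degrees (handles wraparound)."""
    return abs((a - b + 180) % 360 - 180)


def classify_turn(phi1, psi1, phi2, psi2, tol=30):
    """Return the turn type whose ideal angles all lie within `tol` degrees, else None."""
    observed = (phi1, psi1, phi2, psi2)
    for name, ideal in CANONICAL_TURNS.items():
        if all(angle_diff(o, i) <= tol for o, i in zip(observed, ideal)):
            return name
    return None


def turn_type_fraction(frames, turn_type):
    """Fraction of trajectory frames (4-tuples of angles) assigned to `turn_type`."""
    if not frames:
        return 0.0
    hits = sum(1 for frame in frames if classify_turn(*frame) == turn_type)
    return hits / len(frames)


if __name__ == "__main__":
    frames = [(-62, -28, -118, 125), (-60, -35, -95, 5), (-58, -30, -125, 110)]
    print(turn_type_fraction(frames, "VIII"))
```

Applied to the central-residue dihedrals of a GXXP tetrapeptide in each frame, a fraction like this gives a rough measure of how often a peptide sits near a type VIII geometry.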
Yamaguchi, Kosuke; Hada, Masashi; Fukuda, Yuko; Inoue, Erina; Makino, Yoshinori; Katou, Yuki; Shirahige, Katsuhiko; Okada, Yuki
2018-06-26
The question of whether retained histones in the sperm genome localize to gene-coding regions or gene deserts has been debated for years. Previous contradictory observations are likely caused by the non-uniform sensitivity of sperm chromatin to micrococcal nuclease (MNase) digestion. Sperm chromatin has a highly condensed but heterogeneous structure and is composed of 90%∼99% protamines and 1%∼10% histones. In this study, we utilized nucleoplasmin (NPM) to improve the solubility of sperm chromatin by removing protamines in vitro. NPM treatment efficiently solubilized histones while maintaining quality and quantity. Chromatin immunoprecipitation sequencing (ChIP-seq) analyses using NPM-treated sperm demonstrated the predominant localization of H4 to distal intergenic regions, whereas modified histones exhibited a modification-dependent preferential enrichment in specific genomic elements, such as H3K4me3 at CpG-rich promoters and H3K9me3 in satellite repeats, respectively, implying the existence of machinery protecting modified histones from eviction. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Yoo, Soonmoon; Kim, Hak Hee; Kim, Paul; Donnelly, Christopher J.; Kalinski, Ashley L.; Vuppalanchi, Deepika; Park, Michael; Lee, Seung Joon; Merianda, Tanuja T.; Perrone-Bizzozero, Nora I.; Twiss, Jeffery L.
2013-01-01
Localized translation of axonal mRNAs contributes to developmental and regenerative axon growth. Although untranslated regions (UTRs) of many different axonal mRNAs appear to drive their localization, there has been no consensus RNA structure responsible for this localization. We recently showed that limited expression of ZBP1 protein restricts axonal localization of both β-actin and GAP-43 mRNAs. β-actin 3′UTR has a defined element for interaction with ZBP1, but GAP-43 mRNA shows no homology to this RNA sequence. Here, we show that an AU-rich element (ARE) in GAP-43’s 3′UTR is necessary and sufficient for its axonal localization. Axonal GAP-43 mRNA levels increase after in vivo injury, and GAP-43 mRNA shows an increased half-life in regenerating axons. GAP-43 mRNA interacts with both HuD and ZBP1, and HuD and ZBP1 coimmunoprecipitate in an RNA-dependent fashion. Reporter mRNA with the GAP-43 ARE competes with endogenous β-actin mRNA for axonal localization and decreases axon length and branching similar to the β-actin 3′UTR competing with endogenous GAP-43 mRNA. Conversely, overexpressing GAP-43 coding sequence with its 3′UTR ARE increases axonal elongation, and this effect is lost when just the ARE is deleted from GAP-43’s 3′UTR. PMID:23586486
Using local chromatin structure to improve CRISPR/Cas9 efficiency in zebrafish.
Chen, Yunru; Zeng, Shiyang; Hu, Ruikun; Wang, Xiangxiu; Huang, Weilai; Liu, Jiangfang; Wang, Luying; Liu, Guifen; Cao, Ying; Zhang, Yong
2017-01-01
Although CRISPR/Cas9 has been successfully applied in zebrafish, considerable variations in efficiency have been observed for different gRNAs. The workload and cost of zebrafish mutant screening are largely dependent on the mutation rate of injected embryos; therefore, selecting more effective gRNAs is especially important for zebrafish mutant construction. Besides sequence features, local chromatin structures may affect CRISPR/Cas9 efficiency, but these effects remain largely unexplored. In the only related study in zebrafish, nucleosome organization was not found to have an effect on CRISPR/Cas9 efficiency, which is inconsistent with recent studies in vitro and in mammalian cell lines. To understand the effects of local chromatin structure on CRISPR/Cas9 efficiency in zebrafish, we first determined that CRISPR/Cas9 introduced genome editing mainly before the dome stage. Based on this observation, we reanalyzed our published nucleosome organization profiles and generated chromatin accessibility profiles in the 256-cell and dome stages using ATAC-seq technology. Our study demonstrated that chromatin accessibility showed a positive correlation with CRISPR/Cas9 efficiency, but we did not observe a clear correlation between nucleosome organization and CRISPR/Cas9 efficiency. We constructed an online database for zebrafish gRNA selection based on local chromatin structure features that could prove beneficial to zebrafish homozygous mutant construction via CRISPR/Cas9.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2007-12-11
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2002-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2010-11-09
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2000-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann; Dahlberg, James E.
2005-04-05
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
NASA Astrophysics Data System (ADS)
Choi, Jin-Hyuck; Klinger, Yann; Ferry, Matthieu; Ritz, Jean-François; Kurtz, Robin; Rizza, Magali; Bollinger, Laurent; Davaasambuu, Battogtokh; Tsend-Ayush, Nyambayar; Demberel, Sodnomsambuu
2018-02-01
In 1905, 14 days apart, two M 8 continental strike-slip earthquakes, the Tsetserleg and Bulnay earthquakes, occurred on the Bulnay fault system, in Mongolia. Together, they ruptured four individual faults, with a total length of 676 km. Using submetric optical satellite images "Pleiades" with ground resolution of 0.5 m, complemented by field observation, we mapped in detail the entire surface rupture associated with this earthquake sequence. Surface rupture along the main Bulnay fault is 388 km in length, striking nearly E-W. The rupture is formed by a series of fault segments that are 29 km long on average, separated by geometric discontinuities. Although there is a difference of about 2 m in the average slip between the western and eastern parts of the Bulnay rupture, along-fault slip variations are overall limited, resulting in a smooth slip distribution, except for local slip deficit at segment boundaries. We show that damage, including short branches and secondary faulting, associated with the rupture propagation, occurred significantly more often along the western part of the Bulnay rupture, while the eastern part of the rupture appears more localized and thus possibly structurally simpler. Ultimately, the difference of slip between the western and eastern parts of the rupture is attributed to this difference of rupture localization, associated at first order with a lateral change in the local geology. Damage associated with rupture branching appears to be located asymmetrically along the extensional side of the strike-slip rupture and shows a strong dependence on structural geologic inheritance.
Bentley, Anna M.; Normand, Guillaume; Hoyt, Jonathan
2007-01-01
The mitotic cyclins promote cell division by binding and activating cyclin-dependent kinases (CDKs). Each cyclin has a unique pattern of subcellular localization that plays a vital role in regulating cell division. During mitosis, cyclin B1 is known to localize to centrosomes, microtubules, and chromatin. To determine the mechanisms of cyclin B1 localization in M phase, we imaged full-length and mutant versions of human cyclin B1-enhanced green fluorescent protein in live cells by using spinning disk confocal microscopy. In addition to centrosome, microtubule, and chromatin localization, we found that cyclin B1 also localizes to unattached kinetochores after nuclear envelope breakdown. Kinetochore recruitment of cyclin B1 required the kinetochore proteins Hec1 and Mad2, and it was stimulated by microtubule destabilization. Mutagenesis studies revealed that cyclin B1 is recruited to kinetochores through both CDK1-dependent and -independent mechanisms. In contrast, localization of cyclin B1 to chromatin and centrosomes is independent of CDK1 binding. The N-terminal domain of cyclin B1 is necessary and sufficient for chromatin association, whereas centrosome recruitment relies on sequences within the cyclin box. Our data support a role for cyclin B1 function at unattached kinetochores, and they demonstrate that separable and distinct sequence elements target cyclin B1 to kinetochores, chromatin, and centrosomes during mitosis. PMID:17881737
A Comparative Study of Human Saposins.
Garrido-Arandia, María; Cuevas-Zuviría, Bruno; Díaz-Perales, Araceli; Pacios, Luis F
2018-02-14
Saposins are small proteins implicated in trafficking and loading of lipids onto Cluster of Differentiation 1 (CD1) receptor proteins that in turn present lipid antigens to T cells and a variety of T-cell receptors, thus playing a crucial role in innate and adaptive immune responses in humans. Despite their low sequence identity, the four types of human saposins share a similar folding pattern consisting of four helices linked by three conserved disulfide bridges. However, their lipid-binding abilities, as well as their activities in extracting, transporting and loading a variety of sphingo- and phospholipids from biological membranes onto CD1 molecules, display two striking characteristics: a strong pH-dependence and a structural change between a compact, closed conformation and an open conformation. In this work, we present a comparative computational study of structural, electrostatic, and dynamic features of human saposins based upon their available experimental structures. By means of structural alignments, surface analyses, calculation of pH-dependent protonation states, Poisson-Boltzmann electrostatic potentials, and molecular dynamics simulations at three pH values representative of biological media where saposins fulfill their function, our results shed light on their intrinsic features. The similarities and differences in this class of proteins depend on tiny variations of local structural details that allow saposins to be key players in triggering responses in the human immune system.
Elucidation of Peptide-Directed Palladium Surface Structure for Biologically Tunable Nanocatalysts
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bedford, Nicholas M.; Ramezani-Dakhel, Hadi; Slocik, Joseph M.
Peptide-enabled synthesis of inorganic nanostructures represents an avenue to access catalytic materials with tunable and optimized properties. This is achieved via peptide complexity and programmability that is missing in traditional ligands for catalytic nanomaterials. Unfortunately, there is limited information available to correlate peptide sequence to particle structure and catalytic activity to date. As such, the application of peptide-enabled nanocatalysts remains limited to trial and error approaches. In this paper, a hybrid experimental and computational approach is introduced to systematically elucidate biomolecule-dependent structure/function relationships for peptide-capped Pd nanocatalysts. Synchrotron X-ray techniques were used to uncover substantial particle surface structural disorder, which was dependent upon the amino acid sequence of the peptide capping ligand. Nanocatalyst configurations were then determined directly from experimental data using reverse Monte Carlo methods and further refined using molecular dynamics simulation, obtaining thermodynamically stable peptide-Pd nanoparticle configurations. Sequence-dependent catalytic property differences for C-C coupling and olefin hydrogenation were then elucidated by identification of the catalytic active sites at the atomic level and quantitative prediction of relative reaction rates. This hybrid methodology provides a clear route to determine peptide-dependent structure/function relationships, enabling the generation of guidelines for catalyst design through rational tailoring of peptide sequences.
Elucidation of peptide-directed palladium surface structure for biologically tunable nanocatalysts.
Bedford, Nicholas M; Ramezani-Dakhel, Hadi; Slocik, Joseph M; Briggs, Beverly D; Ren, Yang; Frenkel, Anatoly I; Petkov, Valeri; Heinz, Hendrik; Naik, Rajesh R; Knecht, Marc R
2015-05-26
Peptide-enabled synthesis of inorganic nanostructures represents an avenue to access catalytic materials with tunable and optimized properties. This is achieved via peptide complexity and programmability that is missing in traditional ligands for catalytic nanomaterials. Unfortunately, there is limited information available to correlate peptide sequence to particle structure and catalytic activity to date. As such, the application of peptide-enabled nanocatalysts remains limited to trial and error approaches. In this paper, a hybrid experimental and computational approach is introduced to systematically elucidate biomolecule-dependent structure/function relationships for peptide-capped Pd nanocatalysts. Synchrotron X-ray techniques were used to uncover substantial particle surface structural disorder, which was dependent upon the amino acid sequence of the peptide capping ligand. Nanocatalyst configurations were then determined directly from experimental data using reverse Monte Carlo methods and further refined using molecular dynamics simulation, obtaining thermodynamically stable peptide-Pd nanoparticle configurations. Sequence-dependent catalytic property differences for C-C coupling and olefin hydrogenation were then elucidated by identification of the catalytic active sites at the atomic level and quantitative prediction of relative reaction rates. This hybrid methodology provides a clear route to determine peptide-dependent structure/function relationships, enabling the generation of guidelines for catalyst design through rational tailoring of peptide sequences.
Chen, Mingchen; Lin, Xingcheng; Zheng, Weihua; Onuchic, José N; Wolynes, Peter G
2016-08-25
The associative memory, water mediated, structure and energy model (AWSEM) is a coarse-grained force field with transferable tertiary interactions that incorporates local-in-sequence energetic biases using bioinformatically derived structural information about peptide fragments with locally similar sequences that we call memories. The memory information from the Protein Data Bank (PDB) guides proper protein folding. The structural information about available sequences in the database varies in quality and can sometimes lead to frustrated free energy landscapes locally. One way out of this difficulty is to construct the input fragment memory information from all-atom simulations of portions of the complete polypeptide chain. In this paper, we investigate this approach, first put forward by Kwac and Wolynes, in a more complete way by studying its structure prediction capabilities for six α-helical proteins. This scheme, which we call the atomistic associative memory, water mediated, structure and energy model (AAWSEM), amounts to an ab initio protein structure prediction method that starts from the ground up without using bioinformatic input. The free energy profiles from AAWSEM show that atomistic fragment memories are sufficient to guide the correct folding when tertiary forces are included. AAWSEM combines the efficiency of coarse-grained simulations on the full protein level with the local structural accuracy achievable from all-atom simulations of only parts of a large protein. The results suggest that a hybrid use of atomistic fragment memory and database memory in structural predictions may well be optimal for many practical applications.
Spink, N; Brown, D G; Skelly, J V; Neidle, S
1994-01-01
The bis-benzimidazole drug Hoechst 33258 has been co-crystallized with the dodecanucleotide sequence d(CGCAAATTTGCG)2. The structure has been solved by molecular replacement and refined to an R factor of 18.5% for 2125 reflections collected on a Xentronics area detector. The drug is bound in the minor groove, at the five base-pair site 5'-ATTTG and is in a unique orientation. This is displaced by one base pair in the 5' direction compared to previously-determined structures of this drug with the sequence d(CGCGAATTCGCG)2. Reasons for this difference in behaviour are discussed in terms of several sequence-dependent structural features of the DNA, with particular reference to differences in propeller twist and minor-groove width. PMID:7515488
Locally adaptive MR intensity models and MRF-based segmentation of multiple sclerosis lesions
NASA Astrophysics Data System (ADS)
Galimzianova, Alfiia; Lesjak, Žiga; Likar, Boštjan; Pernuš, Franjo; Špiclin, Žiga
2015-03-01
Neuroimaging biomarkers are an important paraclinical tool used to characterize a number of neurological diseases; however, their extraction requires accurate and reliable segmentation of normal and pathological brain structures. For MR images of healthy brains, the intensity models of normal-appearing brain tissue (NABT) in combination with Markov random field (MRF) models are known to give reliable and smooth NABT segmentation. However, the presence of pathology, MR intensity bias and natural tissue-dependent intensity variability together represent difficult challenges for a reliable estimation of the NABT intensity model based on MR images. In this paper, we propose a novel method for segmentation of normal and pathological structures in brain MR images of multiple sclerosis (MS) patients that is based on a locally-adaptive NABT model, a robust method for the estimation of model parameters and an MRF-based segmentation framework. Experiments on multi-sequence brain MR images of 27 MS patients show that, compared to a whole-brain model and to the widely used Expectation-Maximization Segmentation (EMS) method, the locally-adaptive NABT model increases the accuracy of MS lesion segmentation.
Recognition of Local DNA Structures by p53 Protein
Brázda, Václav; Coufal, Jan
2017-01-01
p53 plays critical roles in regulating cell cycle, apoptosis, senescence and metabolism and is commonly mutated in human cancer. These roles are achieved by interaction with other proteins, but particularly by interaction with DNA. As a transcription factor, p53 is well known to bind consensus target sequences in linear B-DNA. Recent findings indicate that p53 binds with higher affinity to target sequences that form cruciform DNA structure. Moreover, p53 binds very tightly to non-B DNA structures and local DNA structures are increasingly recognized to influence the activity of wild-type and mutant p53. Apart from cruciform structures, p53 binds to quadruplex DNA, triplex DNA, DNA loops, bulged DNA and hemicatenane DNA. In this review, we describe local DNA structures and summarize information about interactions of p53 with these structural DNA motifs. These recent data provide important insights into the complexity of the p53 pathway and the functional consequences of wild-type and mutant p53 activation in normal and tumor cells. PMID:28208646
Maleki, Ehsan; Babashah, Hossein; Koohi, Somayyeh; Kavehvash, Zahra
2017-07-01
This paper presents an optical processing approach for exploring a large number of genome sequences. Specifically, we propose an optical correlator for global alignment and an extended moiré matching technique for local analysis of spatially coded DNA, whose output is fed to a novel three-dimensional artificial neural network for local DNA alignment. All-optical implementation of the proposed 3D artificial neural network is developed and its accuracy is verified in Zemax. Thanks to its parallel processing capability, the proposed structure performs local alignment of 4 million sequences of 150 base pairs in a few seconds, which is much faster than its electrical counterparts, such as the basic local alignment search tool.
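For readers unfamiliar with what "local alignment" computes, the following is a minimal sketch of the classic Smith-Waterman scoring recurrence that conventional electronic tools approximate. It is not the optical method described in the abstract above; the function name and the scoring parameters (match, mismatch, gap) are illustrative assumptions.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-1):
    """Return the best local alignment score between strings a and b.

    Illustrative scoring values; a linear gap penalty is assumed.
    """
    prev = [0] * (len(b) + 1)  # previous row of the scoring matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, prev[j - 1] + sub, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("ACACACTA", "AGCACACA"))
```

The cost is proportional to the product of the two sequence lengths, which is why aligning millions of reads is a natural target for parallel hardware.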
Metal Cations in G-Quadruplex Folding and Stability
NASA Astrophysics Data System (ADS)
Bhattacharyya, Debmalya; Mirihana Arachchilage, Gayan; Basu, Soumitra
2016-09-01
This review is focused on the structural and physico-chemical aspects of metal cation coordination to G-Quadruplexes (GQ) and their effects on GQ stability and conformation. G-Quadruplex structures are non-canonical secondary structures formed by both DNA and RNA. G-quadruplexes regulate a wide range of important biochemical processes. Besides the sequence requirements, the coordination of monovalent cations in the GQ is essential for its formation and determines the stability and polymorphism of GQ structures. The nature, location and dynamics of the cation coordination and their impact on the overall GQ stability are dependent on several factors such as the ionic radii, hydration energy and the bonding strength to the O6 of guanines. The intracellular monovalent cation concentration and the localized ion concentrations determine the formation of GQs and can potentially dictate their regulatory roles. A wide range of biochemical and biophysical studies on an array of GQ-enabling sequences has generated, at a minimum, a knowledge base that often allows us to predict the stability of GQs in the presence of physiologically relevant metal ions; however, prediction of the conformation of such GQs is still out of reach.
An improved stochastic fractal search algorithm for 3D protein structure prediction.
Zhou, Changjun; Sun, Chuan; Wang, Bin; Wang, Xiaojun
2018-05-03
Protein structure prediction (PSP) is a significant area for biological information research, disease treatment, drug development and related fields. In this paper, three-dimensional structures of proteins are predicted based on the known amino acid sequences, and the structure prediction problem is transformed into a typical NP problem by an AB off-lattice model. This work applies a novel improved Stochastic Fractal Search algorithm (ISFS) to solve the problem. The Stochastic Fractal Search algorithm (SFS) is an effective evolutionary algorithm that performs well in exploring the search space but sometimes falls into local minima. To overcome this weakness, Lévy flight and internal feedback information are introduced in ISFS. In the experiments, simulations are conducted with the ISFS algorithm on Fibonacci sequences and real peptide sequences. Experimental results show that ISFS performs more efficiently and robustly in terms of finding the global minimum and avoiding getting stuck in local minima.
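The "Fibonacci sequences" used as benchmarks for AB off-lattice models are strings over the two residue types A and B. The abstract does not state how they are built; the sketch below assumes the construction commonly used in the AB-model literature (S0 = "A", S1 = "B", and each later sequence is the concatenation of the two before it). The function name is hypothetical.

```python
def fibonacci_ab_sequence(n):
    """Return the n-th Fibonacci AB sequence.

    Assumed convention: S0 = "A", S1 = "B", S(i+1) = S(i-1) + S(i).
    """
    if n < 0:
        raise ValueError("n must be non-negative")
    prev, curr = "A", "B"
    if n == 0:
        return prev
    for _ in range(n - 1):
        # Concatenate the two preceding sequences, older one first.
        prev, curr = curr, prev + curr
    return curr


for n in range(7):
    print(n, fibonacci_ab_sequence(n))
```

Under this convention the sequence lengths follow the Fibonacci numbers (1, 1, 2, 3, 5, 8, 13, ...), so benchmark sizes grow quickly with n.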
Modulating Transmembrane α-Helix Interactions through pH-Sensitive Boundary Residues.
Ng, Derek P; Deber, Charles M
2016-08-09
Changes in pH can alter the structure and activity of proteins and may be used by the cell to control molecular function. This coupling can also be used in non-native applications through the design of pH-sensitive biomolecules. For example, the pH (low) insertion peptide (pHLIP) can spontaneously insert into a lipid bilayer when the pH decreases. We have previously shown that the α-helicity and helix-helix interactions of the TM2 α-helix of the proteolipid protein (PLP) are sensitive to the local hydrophobicity at its C-terminus. Given that there is an ionizable residue (Glu-88) at the C-terminus of this transmembrane (TM) segment, we hypothesized that changing the ionization state of this residue through pH may alter the local hydrophobicity of the peptide enough to affect both its secondary structure and helix-helix interactions. To examine this phenomenon, we synthesized peptide analogues of the PLP TM2 α-helix (wild-type sequence (66)AFQYVIYGTASFFFLYGALLLAEGF(90)). Using circular dichroism and Förster resonance energy transfer in the membrane-mimetic detergent sodium dodecyl sulfate, we found that a decrease in pH increases both peptide α-helicity and the extent of self-association. This pH-dependent effect is due specifically to the presence of Glu-88 at the C-terminus. Additional experiments in which Phe-90 was mutated to residues of varying hydrophobicities indicated that the strength of this effect is dependent on the local hydrophobicity near Glu-88. Our results have implications for the design of TM peptide switches and improve our understanding of how membrane protein structure and activity can be regulated through local molecular environmental changes.
Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong
2015-03-01
The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted to GenBank under the accession number KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have an 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG
2015-01-01
The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted to GenBank under the accession number KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have an 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
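The ORF figures quoted in the abstract above can be checked with simple codon arithmetic. The sketch below assumes that the 567 bp count includes the stop codon (not stated in the abstract) and uses the common rule of thumb of roughly 110 Da per residue for a rough mass estimate; both function names are hypothetical.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp nucleotides.

    Assumption: the stop codon is counted in orf_bp unless stated otherwise.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


def approx_mass_kda(n_residues, avg_residue_da=110.0):
    """Rough protein mass in kDa from an assumed average residue mass."""
    return n_residues * avg_residue_da / 1000.0


length = orf_protein_length(567)
print(length, round(approx_mass_kda(length), 2))
```

With these assumptions, 567 bp corresponds to 189 codons and therefore 188 residues, and the rough mass estimate lands near 21 kDa, in line with the 20.4 kDa calculated from the actual sequence.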
HARMONY: a server for the assessment of protein structures
Pugalenthi, G.; Shameer, K.; Srinivasan, N.; Sowdhamini, R.
2006-01-01
Protein structure validation is an important step in computational modeling and structure determination. Stereochemical assessment of protein structures examines internal parameters such as bond lengths and Ramachandran (φ,ψ) angles. Gross structure prediction methods such as the inverse folding procedure, and structure determination especially at low resolution, can sometimes give rise to models that are incorrect due to assignment of misfolds or mistracing of electron density maps. Such errors are not reflected as strain in internal parameters. HARMONY is a procedure that examines the compatibility between the sequence and the structure of a protein by assigning scores to individual residues and their amino acid exchange patterns after considering their local environments. Local environments are described by the backbone conformation, solvent accessibility and hydrogen bonding patterns. We are now providing HARMONY through a web server such that users can submit their protein structure files and, if required, the alignment of homologous sequences. Scores are mapped onto the structure for subsequent examination, which is also useful for recognizing regions of possible local errors in protein structures. HARMONY server is located at PMID:16844999
Dependency Structures for Statistical Machine Translation
ERIC Educational Resources Information Center
Bach, Nguyen
2012-01-01
Dependency structures represent a sentence as a set of dependency relations. Normally the dependency structures from a tree connect all the words in a sentence. One of the most defining characteristics of dependency structures is the ability to bring long distance dependency between words to local dependency structures. Another main attraction of…
The Star Formation Histories of Disk Galaxies: The Live, the Dead, and the Undead
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oemler, Augustus Jr; Dressler, Alan; Abramson, Louis E.
We reexamine the properties of local galaxy populations using published surveys of star formation, structure, and gas content. After recalibrating star formation measures, we are able to reliably measure specific star formation rates well below that of the so-called “main sequence” of star formation versus mass. We find an unexpectedly large population of quiescent galaxies with star formation rates intermediate between the main sequence and passive populations and with disproportionately high star formation rates. We demonstrate that a tight main sequence is a natural outcome of most histories of star formation and has little astrophysical significance but that the quiescent population requires additional astrophysics to explain its properties. Using a simple model for disk evolution based on the observed dependence of star formation on gas content in local galaxies, and assuming simple histories of cold gas inflow, we show that the evolution of galaxies away from the main sequence can be attributed to the depletion of gas due to star formation after a cutoff of gas inflow. The quiescent population is composed of galaxies in which the density of disk gas has fallen below a threshold for star formation probably set by disk stability. The evolution of galaxies beyond the quiescent state to gas exhaustion and the end of star formation requires another process, probably wind-driven mass loss. The environmental dependence of the three galaxy populations is consistent with recent numerical modeling, which indicates that cold gas inflows into galaxies are truncated at earlier epochs in denser environments.
Sequence-dependent response of DNA to torsional stress: a potential biological regulation mechanism.
Reymer, Anna; Zakrzewska, Krystyna; Lavery, Richard
2018-02-28
Torsional restraints on DNA change in time and space during the life of the cell and are an integral part of processes such as gene expression, DNA repair and packaging. The mechanical behavior of DNA under torsional stress has been studied on a mesoscopic scale, but little is known concerning its response at the level of individual base pairs and the effects of base pair composition. To answer this question, we have developed a geometrical restraint that can accurately control the total twist of a DNA segment during all-atom molecular dynamics simulations. By applying this restraint to four different DNA oligomers, we are able to show that DNA responds to both under- and overtwisting in a very heterogeneous manner. Certain base pair steps, in specific sequence environments, are able to absorb most of the torsional stress, leaving other steps close to their relaxed conformation. This heterogeneity also affects the local torsional modulus of DNA. These findings suggest that modifying torsional stress on DNA could act as a modulator for protein binding via the heterogeneous changes in local DNA structure. PMID:29267977
Evolutionary and biophysical relationships among the papillomavirus E2 proteins.
Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael
2009-01-01
Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that are associated with risk of virally caused malignancy. The amino acid sequence, 3D structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.
Structural mechanics of DNA wrapping in the nucleosome.
Battistini, Federica; Hunter, Christopher A; Gardiner, Eleanor J; Packer, Martin J
2010-02-19
Experimental X-ray crystal structures and a database of calculated structural parameters of DNA octamers were used in combination to analyse the mechanics of DNA bending in the nucleosome core complex. The 1kx5 X-ray crystal structure of the nucleosome core complex was used to determine the relationship between local structure at the base-step level and the global superhelical conformation observed for nucleosome-bound DNA. The superhelix is characterised by a large curvature (597 degrees) in one plane and very little curvature (10 degrees) in the orthogonal plane. Analysis of the curvature at the level of 10-step segments shows that there is a uniform curvature of 30 degrees per helical turn throughout most of the structure but that there are two sharper kinks of 50 degrees at +/-2 helical turns from the central dyad base pair. The curvature is due almost entirely to the base-step parameter roll. There are large periodic variations in roll, which are in phase with the helical twist and account for 500 degrees of the total curvature. Although variations in the other base-step parameters perturb the local path of the DNA, they make minimal contributions to the total curvature. This implies that DNA bending in the nucleosome is achieved using the roll-slide-twist degree of freedom previously identified as the major degree of freedom in naked DNA oligomers. The energetics of bending into a nucleosome-bound conformation were therefore analysed using a database of structural parameters that we have previously developed for naked DNA oligomers. The minimum energy roll, the roll flexibility force constant and the maximum and minimum accessible roll values were obtained for each base step in the relevant octanucleotide context to account for the effects of conformational coupling that vary with sequence context. 
The distribution of base-step roll values and corresponding strain energy required to bend DNA into the nucleosome-bound conformation defined by the 1kx5 structure were obtained by applying a constant bending moment. When a single bending moment was applied to the entire sequence, the local details of the calculated structure did not match the experiment. However, when local 10-step bending moments were applied separately, the calculated structure showed excellent agreement with experiment. This implies that the protein applies variable bending forces along the DNA to maintain the superhelical path required for nucleosome wrapping. In particular, the 50 degrees kinks are constraints imposed by the protein rather than a feature of the 1kx5 DNA sequence. The kinks coincide with a relatively flexible region of the sequence, and this is probably a prerequisite for high-affinity nucleosome binding, but the bending strain energy is significantly higher at these points than for the rest of the sequence. In the most rigid regions of the sequence, a higher strain energy is also required to achieve the standard 30 degrees curvature per helical turn. We conclude that matching of the DNA sequence to the local roll periodicity required to achieve bending, together with the increased flexibility required at the kinks, determines the sequence selectivity of DNA wrapping in the nucleosome.
Cytomegalovirus Basic Phosphoprotein (pUL32) Binds to Capsids In Vitro through Its Amino One-Third
Baxter, Michael K.; Gibson, Wade
2001-01-01
The cytomegalovirus (CMV) basic phosphoprotein (BPP) is a component of the tegument. It remains with the nucleocapsid fraction under conditions that remove most other tegument proteins from the virion, suggesting a direct and perhaps tight interaction with the capsid. As a step toward localizing this protein within the molecular structure of the virion and understanding its function during infection, we have investigated the BPP-capsid interaction. In this report we present evidence that the BPP interacts selectively, through its amino one-third, with CMV capsids. Radiolabeled simian CMV (SCMV) BPP, synthesized in vitro, bound to SCMV B-capsids, and C-capsids to a lesser extent, following incubation with either isolated capsids or lysates of infected cells. Human CMV (HCMV) BPP (pUL32) also bound to SCMV capsids, and SCMV BPP likewise bound to HCMV capsids, indicating that the sequence(s) involved is conserved between the two proteins. Analysis of SCMV BPP truncation mutants localized the capsid-binding region to the amino one-third of the molecule—the portion of BPP showing the greatest sequence conservation between the SCMV and HCMV homologs. This general approach may have utility in studying the interactions of other proteins with conformation-dependent binding sites. PMID:11435566
Development of a Novel Technology for Label Free DNA Sequencing
2012-05-21
of the C-H bond stretch vibrations in the planes of the corresponding DNA bases, and in the higher-frequency side, sequence-identifier region is...composed of the N-H bond stretch vibrations in the planes of the corresponding DNA bases. In addition, the sequence-identifier dividing region almost...regions are localized at the corresponding DNA bases and exhibit a definable dependence on the sequence form of the codons under study. Final
Sequence dependency of canonical base pair opening in the DNA double helix
Villa, Alessandra
2017-01-01
The flipping-out of a DNA base from the double helical structure is a key step of many cellular processes, such as DNA replication, modification and repair. Base pair opening is the first step of base flipping and the exact mechanism is still not well understood. We investigate sequence effects on base pair opening using extensive classical molecular dynamics simulations targeting the opening of 11 different canonical base pairs in two DNA sequences. Two popular biomolecular force fields are applied. To enhance sampling and calculate free energies, we bias the simulation along a simple distance coordinate using a newly developed adaptive sampling algorithm. The simulation is guided back and forth along the coordinate, allowing for multiple opening pathways. We compare the calculated free energies with those from an NMR study and check assumptions of the model used for interpreting the NMR data. Our results further show that the neighboring sequence is an important factor for the opening free energy, but also indicate that other sequence effects may play a role. All base pairs are observed to have a propensity for opening toward the major groove. The preferred opening base is cytosine for GC base pairs, while for AT there is sequence-dependent competition between the two bases. For AT opening, we identify two non-canonical base pair interactions contributing to a local minimum in the free energy profile. For both AT and CG we observe long-lived interactions with water and with sodium ions at specific sites on the open base pair. PMID:28369121
Brain-wide mapping of neural activity controlling zebrafish exploratory locomotion
Dunn, Timothy W; Mu, Yu; Narayan, Sujatha; Randlett, Owen; Naumann, Eva A; Yang, Chao-Tsung; Schier, Alexander F
2016-01-01
In the absence of salient sensory cues to guide behavior, animals must still execute sequences of motor actions in order to forage and explore. How such successive motor actions are coordinated to form global locomotion trajectories is unknown. We mapped the structure of larval zebrafish swim trajectories in homogeneous environments and found that trajectories were characterized by alternating sequences of repeated turns to the left and to the right. Using whole-brain light-sheet imaging, we identified activity relating to the behavior in specific neural populations that we termed the anterior rhombencephalic turning region (ARTR). ARTR perturbations biased swim direction and reduced the dependence of turn direction on turn history, indicating that the ARTR is part of a network generating the temporal correlations in turn direction. We also find suggestive evidence for ARTR mutual inhibition and ARTR projections to premotor neurons. Finally, simulations suggest the observed turn sequences may underlie efficient exploration of local environments. DOI: http://dx.doi.org/10.7554/eLife.12741.001 PMID:27003593
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pichon, L.; Carn, G.; Bouric, P.
1996-03-01
Positional cloning strategies for the hemochromatosis gene have previously concentrated on a target area restricted to a maximum genomic expanse of 400 kb around the HLA-A and HLA-F loci. Recently, the candidate region has been extended to 2-3 Mb on the distal side of the MHC. In this study, 10 coding sequences [hemochromatosis candidate genes (HCG) I to X] were isolated by cDNA selection using YACs covering the HLA-A/HLA-F subregion. Two of these (HCG II and HCG IV) belong to multigene families, as well as other sequences already described in this region, i.e., P5, pMC 6.7, and HLA class I. Fingerprinting of the four YACs overlapping the region was performed and allowed partial localization of the different multigene family sequences on each YAC without defining their exact positions. Fingerprinting on cosmids isolated from the ICRF chromosome 6-specific cosmid library allowed more precise localization of the redundant sequences in all of the multigene families and revealed their apparent organization in clusters. Further examination of these intertwined sequences demonstrated that this structural organization resulted from a succession of complex phenomena, including duplications and contractions. This study presents a precise description of the structural organization of the HLA-A/HLA-F region and a determination of the sequences involved in the megabase size polymorphism observed among the A3, A24, and A31 haplotypes. 29 refs., 2 figs., 2 tabs.
SeqHound: biological sequence and structure database as a platform for bioinformatics research
2002-01-01
Background SeqHound has been developed as an integrated biological sequence, taxonomy, annotation and 3-D structure database system. It provides a high-performance server platform for bioinformatics research in a locally-hosted environment. Results SeqHound is based on the National Center for Biotechnology Information data model and programming tools. It offers daily updated contents of all Entrez sequence databases in addition to 3-D structural data and information about sequence redundancies, sequence neighbours, taxonomy, complete genomes, functional annotation including Gene Ontology terms and literature links to PubMed. SeqHound is accessible via a web server through a Perl, C or C++ remote API or an optimized local API. It provides functionality necessary to retrieve specialized subsets of sequences, structures and structural domains. Sequences may be retrieved in FASTA, GenBank, ASN.1 and XML formats. Structures are available in ASN.1, XML and PDB formats. Emphasis has been placed on complete genomes, taxonomy, domain and functional annotation as well as 3-D structural functionality in the API, while fielded text indexing functionality remains under development. SeqHound also offers a streamlined WWW interface for simple web-user queries. Conclusions The system has proven useful in several published bioinformatics projects such as the BIND database and offers a cost-effective infrastructure for research. SeqHound will continue to develop and be provided as a service of the Blueprint Initiative at the Samuel Lunenfeld Research Institute. The source code and examples are available under the terms of the GNU public license at the Sourceforge site http://sourceforge.net/projects/slritools/ in the SLRI Toolkit. PMID:12401134
Gong, Wei; Russell, Michael; Suzuki, Keiko; Riabowol, Karl
2006-04-01
ING1 is a type II tumor suppressor that affects cell growth, stress signaling, apoptosis, and DNA repair by altering chromatin structure and regulating transcription. Decreased ING1 expression is seen in several human cancers, and mislocalization has been noted in diverse types of cancer cells. Aberrant targeting may, therefore, functionally inactivate ING1. Bioinformatics analysis identified a sequence between the nuclear localization sequence and plant homeodomain domains of ING1 that closely matched the binding motif of 14-3-3 proteins that target cargo proteins to specific subcellular locales. We find that the widely expressed p33(ING1b) splicing isoform of ING1 interacts with members of the 14-3-3 family of proteins and that this interaction is regulated by the phosphorylation status of ING1. 14-3-3 binding resulted in significant amounts of p33(ING1b) protein being tethered in the cytoplasm. As shown previously, ectopic expression of p33(ING1b) increased levels of the p21(Waf1) cyclin-dependent kinase inhibitor upon UV-induced DNA damage. Overexpression of 14-3-3 inhibited the up-regulation of p21(Waf1) by p33(ING1b), consistent with the idea that mislocalization blocks at least one of ING1's biological activities. These data support the idea that the 14-3-3 proteins play a crucial role in regulating the activity of p33(ING1b) by directing its subcellular localization.
Deciphering the shape and deformation of secondary structures through local conformation analysis.
Baussand, Julie; Camproux, Anne-Claude
2011-02-01
Protein deformation has been extensively analysed through global methods based on RMSD, torsion angles and Principal Components Analysis calculations. Here we use a local approach, able to distinguish among the different backbone conformations within loops, α-helices and β-strands, to address the question of secondary structures' shape variation within proteins and deformation at interface upon complexation. Using a structural alphabet, we translated the 3D structures of large sets of protein-protein complexes into sequences of structural letters. The shape of the secondary structures can be assessed by the structural letters that modeled them in the structural sequences. The distribution analysis of the structural letters in the three protein compartments (surface, core and interface) reveals that secondary structures tend to adopt preferential conformations that differ among the compartments. The local description of secondary structures highlights that curved conformations are preferred on the surface while straight ones are preferred in the core. Interfaces display a mixture of local conformations either preferred in core or surface. The analysis of the structural letters transition occurring between protein-bound and unbound conformations shows that the deformation of secondary structure is tightly linked to the compartment preference of the local conformations. The conformation of secondary structures can be further analysed and detailed thanks to a structural alphabet which allows a better description of protein surface, core and interface in terms of secondary structures' shape and deformation. Induced-fit modification tendencies described here should be valuable information to identify and characterize regions under strong structural constraints for functional reasons. PMID:21284872
A phase transition in energy-filtered RNA secondary structures.
Han, Hillary S W; Reidys, Christian M
2012-10-01
In this article we study the effect of energy parameters on minimum free energy (mfe) RNA secondary structures. Employing a simplified combinatorial energy model that is only dependent on the diagram representation and is not sequence-specific, we prove the following dichotomy result. Mfe structures derived via the Turner energy parameters contain only finitely many complex irreducible substructures, and just minor parameter changes produce a class of mfe structures that contain a large number of small irreducibles. We localize the exact point at which the distribution of irreducibles experiences this phase transition from a discrete limit to a central limit distribution and, subsequently, put our result into the context of quantifying the effect of sparsification of the folding of these respective mfe structures. We show that the sparsification of realistic mfe structures leads to a constant time and space reduction, and that the sparsification of the folding of structures with modified parameters leads to a linear time and space reduction. We, furthermore, identify the limit distribution at the phase transition as a Rayleigh distribution.
Roux-Rouquie, M; Marilley, M
2000-09-15
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed.
Association mining of dependency between time series
NASA Astrophysics Data System (ADS)
Hafez, Alaaeldin
2001-03-01
Time series analysis is considered a crucial component of strategic control over a broad variety of disciplines in business, science and engineering. Time series data is a sequence of observations collected over intervals of time. Each time series describes a phenomenon as a function of time. Analysis of time series data includes discovering trends (or patterns) in a time series sequence. In the last few years, data mining has emerged and been recognized as a new technology for data analysis. Data mining is the process of discovering potentially valuable patterns, associations, trends, sequences and dependencies in data. Data mining techniques can discover information that many traditional business analysis and statistical techniques fail to deliver. In this paper, we adapt and innovate data mining techniques to analyze time series data. By using data mining techniques, maximal frequent patterns are discovered and used in predicting future sequences or trends, where trends describe the behavior of a sequence. In order to include different types of time series (e.g. irregular and non-systematic), we consider past frequent patterns of the same time sequences (local patterns) and of other dependent time sequences (global patterns). We use the word 'dependent' instead of the word 'similar' for emphasis on real life time series where two time series sequences could be completely different (in values, shapes, etc.), but they still react to the same conditions in a dependent way. In this paper, we propose the Dependence Mining Technique that could be used in predicting time series sequences. The proposed technique consists of three phases: (a) for all time series sequences, generate their trend sequences, (b) discover maximal frequent trend patterns and generate pattern vectors (to keep information of frequent trend patterns), and (c) use trend pattern vectors to predict future time series sequences.
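The three phases named in this abstract can be illustrated with a small sketch. It is not the authors' Dependence Mining Technique: the up/down/steady encoding, the fixed pattern length (instead of maximal frequent patterns), the support counting, and all function names are assumptions made purely for illustration.

```python
from collections import Counter


def trend_sequence(values, tol=0.0):
    """Phase (a), assumed encoding: 'U' (up), 'D' (down), 'S' (steady)."""
    symbols = []
    for prev, cur in zip(values, values[1:]):
        diff = cur - prev
        symbols.append("U" if diff > tol else "D" if diff < -tol else "S")
    return "".join(symbols)


def frequent_patterns(trends, length, min_support):
    """Phase (b), simplified: fixed-length patterns found in >= min_support series."""
    counts = Counter()
    for t in trends:
        # Count each pattern once per series, so support = number of series.
        counts.update({t[i:i + length] for i in range(len(t) - length + 1)})
    return {p: c for p, c in counts.items() if c >= min_support}


def predict_next(trend, patterns):
    """Phase (c), simplified: vote for the symbol that completes a frequent pattern."""
    if not patterns:
        return None
    k = len(next(iter(patterns)))
    suffix = trend[-(k - 1):] if k > 1 else ""
    votes = Counter()
    for pattern, support in patterns.items():
        if pattern[:-1] == suffix:
            votes[pattern[-1]] += support
    return votes.most_common(1)[0][0] if votes else None


trends = [trend_sequence([1, 2, 3, 2, 3, 4]), trend_sequence([5, 6, 7, 6, 7])]
patterns = frequent_patterns(trends, length=3, min_support=2)
next_symbol = predict_next("DUU", patterns)
```

Here the two toy series differ in their values but share the trend patterns "UUD" and "UDU", which is the sense of "dependent" rather than "similar" series described in the abstract.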
SFESA: a web server for pairwise alignment refinement by secondary structure shifts.
Tong, Jing; Pei, Jimin; Grishin, Nick V
2015-09-03
Protein sequence alignment is essential for a variety of tasks such as homology modeling and active site prediction. Alignment errors remain the main cause of low-quality structure models. A bioinformatics tool to refine alignments is needed to make protein alignments more accurate. We developed the SFESA web server to refine pairwise protein sequence alignments. Compared to the previous version of SFESA, which required a set of 3D coordinates for a protein, the new server will search a sequence database for the closest homolog with an available 3D structure to be used as a template. For each alignment block defined by secondary structure elements in the template, SFESA evaluates alignment variants generated by local shifts and selects the best-scoring alignment variant. A scoring function that combines the sequence score of profile-profile comparison and the structure score of template-derived contact energy is used for evaluation of alignments. PROMALS pairwise alignments refined by SFESA are more accurate than those produced by current advanced alignment methods such as HHpred and CNFpred. In addition, SFESA also improves alignments generated by other software. SFESA is a web-based tool for alignment refinement, designed for researchers to compute, refine, and evaluate pairwise alignments with a combined sequence and structure scoring of alignment blocks. To our knowledge, the SFESA web server is the only tool that refines alignments by evaluating local shifts of secondary structure elements. The SFESA web server is available at http://prodata.swmed.edu/sfesa.
From Globular Clusters to Tidal Dwarfs: Structure Formation in Tidal Tails
NASA Astrophysics Data System (ADS)
Knierman, K.; Hunsberger, S.; Gallagher, S.; Charlton, J.; Whitmore, B.; Hibbard, J.; Kundu, A.; Zaritsky, D.
1999-12-01
Galaxy interactions trigger star formation in tidal debris. How does this star formation depend on the local and global physical conditions? Using WFPC2/HST images, we investigate the range of structure within tidal tails of four classic “Toomre Sequence” mergers: NGC 4038/9 (“Antennae”), NGC 7252 (“Atoms for Peace”), NGC 3921, and NGC 3256. These tails contain a variety of stellar associations with sizes from globular clusters up to dwarf Irregulars. We explore whether there is a continuum between the two extremes. Our eight fields sample seven tidal tails at a variety of stages in the evolutionary sequence. Some of these tails are rich in HI while others are HI poor. Large tidal dwarfs are embedded in three of the tails. Using V and I WFPC2 images, we measure luminosities and colors of substructures within the tidal tails. The properties of globular cluster candidates in the tails will be contrasted with those of the hundreds of young clusters in the central regions of these mergers. We address whether globular clusters form and survive in the tidal tails and whether tidal dwarfs are composed of only young stars. By comparing the properties of structures in the tails of the four mergers with different ages, we examine systematic evolution of structure along the evolutionary sequence and as a function of HI content. We acknowledge support from NASA through STScI, and from NSF for an REU supplement for Karen Knierman.
DNA Barcode Sequence Identification Incorporating Taxonomic Hierarchy and within Taxon Variability
Little, Damon P.
2011-01-01
For DNA barcoding to succeed as a scientific endeavor, an accurate and expeditious query sequence identification method is needed. Although a global multiple-sequence alignment can be generated for some barcoding markers (e.g. COI, rbcL), not all barcoding markers are as structurally conserved (e.g. matK). Thus, algorithms that depend on global multiple-sequence alignments are not universally applicable. Some sequence identification methods that use local pairwise alignments (e.g. BLAST) are unable to accurately differentiate between highly similar sequences and are not designed to cope with hierarchic phylogenetic relationships or within taxon variability. Here, I present a novel alignment-free sequence identification algorithm, BRONX, that accounts for observed within taxon variability and hierarchic relationships among taxa. BRONX identifies short variable segments and corresponding invariant flanking regions in reference sequences. These flanking regions are used to score variable regions in the query sequence without the production of a global multiple-sequence alignment. By incorporating observed within taxon variability into the scoring procedure, misidentifications arising from shared alleles/haplotypes are minimized. An explicit treatment of more inclusive terminals allows for separate identifications to be made for each taxonomic level and/or for user-defined terminals. BRONX performs better than all other methods when there is imperfect overlap between query and reference sequences (e.g. mini-barcode queries against a full-length barcode database). BRONX consistently produced better identifications at the genus level for all query types. PMID:21857897
The amino acid motif L/IIxxFE defines a novel actin-binding sequence in PDZ-RhoGEF
Banerjee, Jayashree; Fischer, Christopher C.; Wedegaertner, Philip B.
2009-01-01
PDZ-RhoGEF is a member of the regulator of G protein signaling (RGS) domain-containing RhoGEFs (RGS-RhoGEFs) that link activated heterotrimeric G protein α subunits of the G12 family to activation of the small GTPase RhoA. Unique among the RGS-RhoGEFs, PDZ-RhoGEF contains a short sequence that localizes the protein to the actin cytoskeleton. In this report, we demonstrate that the actin-binding domain, located between amino acids 561–585, directly binds to F-actin in vitro. Extensive mutagenesis identifies isoleucine 568, isoleucine 569, phenylalanine 572, and glutamic acid 573 as necessary for binding to actin and for co-localization with the actin cytoskeleton in cells. These results define a novel actin-binding sequence in PDZ-RhoGEF with a critical amino acid motif of IIxxFE. Moreover, sequence analysis identifies a similar actin-binding motif in the N-terminus of the RhoGEF frabin, and, as with PDZ-RhoGEF, mutagenesis and actin interaction experiments demonstrate a motif of LIxxFE, consisting of the key amino acids leucine 23, isoleucine 24, phenylalanine 27, and glutamic acid 28. Taken together, results with PDZ-RhoGEF and frabin identify a novel actin binding sequence. Lastly, inducible dimerization of the actin-binding region of PDZ-RhoGEF revealed a dimerization-dependent actin bundling activity in vitro. PDZ-RhoGEF exists in cells as a dimer, raising the possibility that PDZ-RhoGEF could influence actin structure independent of its ability to activate RhoA. PMID:19618964
Modulation of frustration in folding by sequence permutation.
Nobrega, R Paul; Arora, Karunesh; Kathuria, Sagar V; Graceffa, Rita; Barrea, Raul A; Guo, Liang; Chakravarthy, Srinivas; Bilsel, Osman; Irving, Thomas C; Brooks, Charles L; Matthews, C Robert
2014-07-22
Folding of globular proteins can be envisioned as the contraction of a random coil unfolded state toward the native state on an energy surface rough with local minima trapping frustrated species. These substructures impede productive folding and can serve as nucleation sites for aggregation reactions. However, little is known about the relationship between frustration and its underlying sequence determinants. Chemotaxis response regulator Y (CheY), a 129-amino acid bacterial protein, has been shown previously to populate an off-pathway kinetic trap in the microsecond time range. The frustration has been ascribed to premature docking of the N- and C-terminal subdomains or, alternatively, to the formation of an unproductive local-in-sequence cluster of branched aliphatic side chains, isoleucine, leucine, and valine (ILV). The roles of the subdomains and ILV clusters in frustration were tested by altering the sequence connectivity using circular permutations. Surprisingly, the stability and buried surface area of the intermediate could be increased or decreased depending on the location of the termini. Comparison with the results of small-angle X-ray-scattering experiments and simulations points to the accelerated formation of a more compact, on-pathway species for the more stable intermediate. The effect of chain connectivity in modulating the structures and stabilities of the early kinetic traps in CheY is better understood in terms of the ILV cluster model. However, the subdomain model captures the requirement for an intact N-terminal domain to access the native conformation. Chain entropy and aliphatic-rich sequences play crucial roles in biasing the early events leading to frustration in the folding of CheY.
Structural polymorphism at LCR and its role in beta-globin gene regulation.
Kukreti, Shrikant; Kaur, Harpreet; Kaushik, Mahima; Bansal, Aparna; Saxena, Sarika; Kaushik, Shikha; Kukreti, Ritushree
2010-09-01
Information on the secondary structures and conformational manifestations of eukaryotic DNA and their biological significance with reference to gene regulation and expression is limited. The human beta-globin gene Locus Control Region (LCR), a dominant regulator of globin gene expression, is a contiguous piece of DNA with five tissue-specific DNase I-hypersensitive sites (HSs). Since these HSs have a high density of transcription factor binding sites, structural interdependencies between HSs and different promoters may directly or indirectly regulate LCR functions. Mutations and SNPs may stabilize or destabilize the local secondary structures, affecting gene expression through changes in the protein-DNA recognition patterns. Various palindromic or quasi-palindromic segments within the LCR could cause structural polymorphism and geometrical switching of DNA. This emphasizes the importance of understanding the sequence-dependent variations of the DNA structure. Such structural motifs might act as regulatory elements. The local conformational variability of a DNA segment or the action of a DNA-specific protein is key to creating and maintaining active chromatin domains and affects transcription of various tissue-specific beta-globin genes. We summarize here the current status of beta-globin LCR structure and function. Further structural studies at the molecular level and functional genomics might solve the regulatory puzzles that control the beta-globin gene locus. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.
Bobrova, E V; Liakhovetskiĭ, V A; Borshchevskaia, E R
2011-01-01
The dependence of errors during reproduction of a sequence of hand movements without visual feedback on the previous right- and left-hand performance ("prehistory") and on positions in space of sequence elements (random or ordered by the explicit rule) was analyzed. It was shown that the preceding information about the ordered positions of the sequence elements was used during right-hand movements, whereas left-hand movements were performed with involvement of the information about the random sequence. The data testify to a central mechanism of the analysis of spatial structure of sequence elements. This mechanism activates movement coding specific for the left hemisphere (vector coding) in case of an ordered sequence structure and positional coding specific for the right hemisphere in case of a random sequence structure.
Neurophysiology of Hungarian subject-verb dependencies with varying intervening complexity.
Jolsvai, Hajnal; Sussman, Elyse; Csuhaj, Roland; Csépe, Valéria
2011-12-01
Non-adjacent dependencies are thought to be more costly to process than sentences wherein dependents immediately follow or precede what they depend on. In English, locality effects have been revealed, while in languages with rich case marking (German and Hindi) sentence-final structures show anti-locality effects. The motivation of the current study is to test whether locality effects can be directly applied to a language that is typologically different from those investigated so far. Hungarian is a "topic prominent" language; it permits variation in word order for semantic reasons, including SVO word order. Hungarian also has a rich morphological system (e.g., a rich case system) and postpositions to indicate grammatical functions. In the present ERP study, Hungarian subject-verb dependencies were compared by manipulating the mismatch of number agreement between the sentence's initial noun phrase and the sentence's final intransitive verb, as well as the complexity of the intervening sentence material interrupting the dependencies. Possible lexical class and frequency or cloze-probability effects were revealed for the first two words of the intervening sentence material when a separate baseline was used for each word. At the third word of the intervening material and at the main verb, ERPs were not modulated by complexity, but at the verb ERPs were enhanced by grammaticality. Ungrammatical sentences enlarged the amplitude of both LAN and P600 components at the main verb. These results are in line with studies suggesting that the retrieval of the first element of a dependency is not influenced by distance from the second element, as the first element is directly accessible when needed for integration (e.g., McElree, 2000). Copyright © 2011 Elsevier B.V. All rights reserved.
Huang, Xin; Gollin, Susanne M.; Raja, Siva; Godfrey, Tony E.
2002-01-01
Amplification of chromosomal band 11q13 is a common event in human cancer. It has been reported in about 45% of head and neck carcinomas and in other cancers including esophageal, breast, liver, lung, and bladder cancer. To understand the mechanism of 11q13 amplification and to identify the potential oncogene(s) driving it, we have fine-mapped the structure of the amplicon in oral squamous cell carcinoma cell lines and localized the proximal and distal breakpoints. A 5-Mb physical map of the region has been prepared from which sequence is available. We quantified copy number of sequence-tagged site markers at 42–550 kb intervals along the length of the amplicon and defined the amplicon core and breakpoints by using TaqMan-based quantitative microsatellite analysis. The core of the amplicon maps to a 1.5-Mb region. The proximal breakpoint localizes to two intervals between sequence-tagged site markers, 550 kb and 160 kb in size, and the distal breakpoint maps to a 250 kb interval. The cyclin D1 gene maps to the amplicon core, as do two new expressed sequence tag clusters. We have analyzed one of these expressed sequence tag clusters and now report that it contains a previously uncharacterized gene, TAOS1 (tumor amplified and overexpressed sequence 1), which is both amplified and overexpressed in oral cancer cells. The data suggest that TAOS1 may be an amplification-dependent candidate oncogene with a role in the development and/or progression of human tumors, including oral squamous cell carcinomas. The approach described here should be useful for characterizing amplified genomic regions in a wide variety of tumors. PMID:12172009
A Method for WD40 Repeat Detection and Secondary Structure Prediction
Wang, Yang; Jiang, Fan; Zhuo, Zhu; Wu, Xian-Hui; Wu, Yun-Dong
2013-01-01
WD40-repeat proteins (WD40s), one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar β-propeller structures despite diversified sequences. A program, WDSP (WD40 repeat protein Structure Predictor), has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves better identification of proteins with multiple WD40 domains by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in a jack-knife test reaches 93.7%. A disease-related protein, LRRK2, was used as a representative example to demonstrate the structure prediction. PMID:23776530
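The Q3 accuracy quoted above is the standard three-state measure: the percentage of residues whose predicted secondary-structure state (helix, strand, or coil) matches the reference assignment. A minimal sketch follows; the single-letter state codes `H`, `E`, `C` and the function name are illustrative choices, not taken from WDSP.

```python
def q3_accuracy(predicted: str, observed: str) -> float:
    """Percentage of residues whose three-state label is predicted correctly.

    Both arguments are per-residue strings of state codes, for example
    'H' (helix), 'E' (strand) and 'C' (coil).
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("sequences must not be empty")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)
```

An average Q3, as reported in the abstract, would be the mean of this value over the proteins in the test set.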
Paiardini, Alessandro; Bossa, Francesco; Pascarella, Stefano
2004-01-01
The wealth of biological information provided by structural and genomic projects opens new prospects of understanding life and evolution at the molecular level. In this work, it is shown how computational approaches can be exploited to pinpoint protein structural features that remain invariant over long evolutionary periods in the fold-type I, PLP-dependent enzymes. A nonredundant set of 23 superposed crystallographic structures belonging to this superfamily was built. Members of this family typically display high structural conservation despite low sequence identity. For each structure, a multiple-sequence alignment of orthologous sequences was obtained, and the 23 alignments were merged using the structural information to obtain a comprehensive multiple alignment of 921 sequences of fold-type I enzymes. The structurally conserved regions (SCRs), the evolutionarily conserved residues, and the conserved hydrophobic contacts (CHCs) were extracted from this data set, using both sequence and structural information. The results of this study identified a structural pattern of hydrophobic contacts shared by all of the superfamily members of fold-type I enzymes and involved in native interactions. This profile highlights the presence of a nucleus for this fold, in which residues participating in the most conserved native interactions exhibit preferential evolutionary conservation that correlates significantly (r = 0.70) with the extent of mean hydrophobic contact value of their apolar fraction. PMID:15498941
Protein structure prediction with local adjust tabu search algorithm
2014-01-01
Background: Protein folding structure prediction is one of the most challenging problems in the bioinformatics domain. Because of the complexity of realistic protein structures, a simplified structure model and a computational method must be adopted in the research. The AB off-lattice model is one such simplified model, which considers only two classes of amino acids, hydrophobic (A) residues and hydrophilic (B) residues. Results: The main work of this paper is to discuss how to find the lowest-energy configurations in the 2D and 3D off-lattice models by using Fibonacci sequences and real protein sequences. In order to avoid falling into local minima and to converge faster to the global minimum, we introduce a novel method (SATS) to the protein structure problem, which combines the simulated annealing algorithm and the tabu search algorithm. Various strategies, such as the new encoding strategy, the adaptive neighborhood generation strategy and the local adjustment strategy, are adopted successfully for high-speed searching of the optimal conformation, which corresponds to the lowest energy of the protein sequence. Experimental results show that some of the results obtained by the improved SATS are better than those reported in the previous literature, and we can be sure that the lowest-energy folding states for short Fibonacci sequences have been found. Conclusions: Although the off-lattice models are not very realistic, they can reflect some important characteristics of realistic proteins. The 3D off-lattice model resembles the native folding structure of a realistic protein more closely than the 2D off-lattice model does. In addition, compared with some previous research, the proposed hybrid algorithm can search the spatial folding structure of a protein chain more effectively and more quickly. PMID:25474708
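The abstract does not spell out the energy function or how the Fibonacci test sequences are built. The sketch below assumes the form commonly used for the 2D AB off-lattice model in the general literature: unit bond lengths, a bending term of (1 - cos θ)/4 per interior residue, and a Lennard-Jones-like term between non-adjacent residues with coupling 1 (AA), 1/2 (BB) and -1/2 (AB). The Fibonacci recursion S0 = A, S1 = B, S(i+1) = S(i-1) + S(i) is likewise the usual convention, not something stated in the abstract. All function names are illustrative.

```python
import math
from typing import List, Tuple


def fibonacci_sequence(n: int) -> str:
    """Return S_n with S_0 = 'A', S_1 = 'B', S_(i+1) = S_(i-1) + S_i."""
    if n < 0:
        raise ValueError("n must be non-negative")
    prev, cur = "A", "B"
    if n == 0:
        return prev
    for _ in range(n - 1):
        prev, cur = cur, prev + cur
    return cur


def coupling(a: str, b: str) -> float:
    """Assumed species-dependent coefficient: AA 1, BB 1/2, AB -1/2."""
    xi = {"A": 1.0, "B": -1.0}
    return (1.0 + xi[a] + xi[b] + 5.0 * xi[a] * xi[b]) / 8.0


def coords_from_angles(angles: List[float]) -> List[Tuple[float, float]]:
    """Build a unit-bond 2D chain from the bend angles at interior residues."""
    points = [(0.0, 0.0), (1.0, 0.0)]
    heading = 0.0
    for theta in angles:
        heading += theta  # a bend angle of 0 keeps the chain straight
        x, y = points[-1]
        points.append((x + math.cos(heading), y + math.sin(heading)))
    return points


def ab_energy_2d(sequence: str, angles: List[float]) -> float:
    """Energy of a 2D AB chain under the assumed bending + pair potential."""
    n = len(sequence)
    if len(angles) != n - 2:
        raise ValueError("need one bend angle per interior residue")
    points = coords_from_angles(angles)
    bending = sum(0.25 * (1.0 - math.cos(t)) for t in angles)
    pair = 0.0
    for i in range(n - 2):
        for j in range(i + 2, n):  # skip bonded neighbours
            dx = points[i][0] - points[j][0]
            dy = points[i][1] - points[j][1]
            r6 = (dx * dx + dy * dy) ** 3
            pair += 4.0 * (1.0 / (r6 * r6) - coupling(sequence[i], sequence[j]) / r6)
    return bending + pair
```

A search method such as the simulated annealing and tabu search hybrid described above would treat the list of bend angles as the variables to optimize and `ab_energy_2d` as the objective.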
Pieper-Fürst, U.; Madkour, M. H.; Mayer, F.; Steinbüchel, A.
1994-01-01
The N-terminal amino acid sequence of the polyhydroxyalkanoic acid (PHA) granule-associated M(r)-15,500 protein of Rhodococcus ruber (the GA14 protein) was analyzed. The sequence revealed that the corresponding structural gene is represented by open reading frame 3, encoding a protein with a calculated M(r) of 14,175 which was recently localized downstream of the PHA synthase gene (U. Pieper and A. Steinbüchel, FEMS Microbiol. Lett. 96:73-80, 1992). A recombinant strain of Escherichia coli XL1-Blue carrying the hybrid plasmid (pSKXA10*) with open reading frame 3 overexpressed the GA14 protein. The GA14 protein was subsequently purified in a three-step procedure including chromatography on DEAE-Sephacel, phenyl-Sepharose CL-4B, and Superose 12. Determination of the molecular weight by gel filtration as well as electron microscopic studies indicates that a tetrameric structure of the recombinant, native GA14 protein is most likely. Immunoelectron microscopy demonstrated a localization of the GA14 protein at the periphery of PHA granules as well as close to the cell membrane in R. ruber. Investigations of PHA-leaky and PHA-negative mutants of R. ruber indicated that expression of the GA14 protein depended strongly on PHA synthesis. PMID:8021220
Lateral trends and vertical sequences in estuarine sediments, Willapa Bay, Washington
Clifton, H. Edward; Phillips, L.
1980-01-01
Willapa Bay is a sizable estuary on the southern coast of Washington. Relatively unmodified in a geologic sense by human activity, the bay provides an excellent example of modern depositional facies in an estuarine setting. Studies of these deposits indicate that consistent lateral trends exist in sediment texture and sedimentary structures. The texture changes from sandy at the mouth of the bay to muddy in its upper parts. In any part of the bay, sediment is coarsest in the channel bottoms, where lag deposits accumulate. The sediment tends to fine in an upslope direction and is finest in supratidal flat deposits of silt and clay. The nature of sedimentary structures depends on the combination of physical and biological processes and sediment textures. Bedforms exist wherever the bed is sandy. In the main tidal channels, sandwaves and dunes up to 4 meters high occur. In tributary channels and at the margins of the main channel, at shallower depths and under less intense currents, the structures are generally less than a meter high. Current ripples occur in the sandy bed of all of the tidal channels and in runoff channels across the tidal flat. Symmetric long-crested ripples are produced by wave action over the sandy intertidal flat. Internal structures in the bay's sediment depend not only on the nature of the bedform but also on the rate of bioturbation relative to physical processes. Under fields of large sandwaves or dunes, medium- to large-scale tabular and trough crossbedding predominates. This crossbedding generally is unidirectional, reflecting the locally dominant current (ebb or flood). Ripple bedding predominates elsewhere in sandy sediment within the channels. Where sand transport is diminished, as on the floor of the upper tributary channels, bioturbation exceeds the rate of production of physical structures and bedding is destroyed. 
The depositional banks in such areas tend to be sites of rapid sediment accumulation, and bedding in the form of interlayered sand (commonly ripple bedded) and mud persists. On intertidal flats the sediment accumulates slowly and bioturbation erases nearly all physical structures. Bedding is preserved only where deposition is locally rapid, as in topographic depressions or on the depositional banks of runoff channels, or where faunal activity is inhibited, as beneath mounds of blue-green algae. The rate of sedimentation is slower still on the supratidal flats, but the general paucity of faunal activity allows the preservation of thin alternations of fine sand, silt or clay. The lateral migration of the tidal channels produces vertical sequences in which topographically higher facies are superposed on one another. Near the mouth of the estuary the upward sequence: lag deposit → crossbedded sand → ripple or planar-bedded sand is typical. The crossbedding shows a general upward decrease in thickness and a progression from trough to tabular units. In the main tidal channel in the central estuary and in sandy tributary channels, the typical vertical sequence resembles that near the mouth, with the exception that the sequence is capped by bioturbated sandy or muddy tide flat deposits. In the upper estuary, where muddy sediment predominates, a typical sequence shows the progression: bioturbated lag deposit → gently dipping interlaminated sand and mud layers of the accretionary bank → bioturbated mud flat deposits → thinly laminated fine supratidal deposits.
Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.
Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F
1984-01-01
The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019
Pan, Xiaoyong; Shen, Hong-Bin
2018-05-02
RNA-binding proteins (RBPs) account for 5–10% of the eukaryotic proteome and play key roles in many biological processes, e.g. gene regulation. Experimental detection of RBP binding sites is still time-intensive and costly. Computational prediction of RBP binding sites using patterns learned from existing annotations is a fast alternative. From the biological point of view, the local structure context derived from local sequences will be recognized by specific RBPs. However, in computational modeling using deep learning, to the best of our knowledge, only global representations of entire RNA sequences have been employed; the local sequence information is ignored in the deep model construction process. In this study, we present a computational method, iDeepE, to predict RNA-protein binding sites from RNA sequences by combining global and local convolutional neural networks (CNNs). For the global CNN, we pad the RNA sequences to the same length. For the local CNN, we split an RNA sequence into multiple overlapping fixed-length subsequences, where each subsequence is a signal channel of the whole sequence. Next, we train deep CNNs on the multiple subsequences and on the padded sequences, respectively, to learn high-level features. Finally, the outputs from the local and global CNNs are combined to improve the prediction. iDeepE demonstrates better performance than state-of-the-art methods on two large-scale datasets derived from CLIP-seq. We also find that the local CNN runs 1.8 times faster than the global CNN with comparable performance when using GPUs. Our results show that iDeepE has captured experimentally verified binding motifs. Availability: https://github.com/xypan1232/iDeepE. Contact: xypan172436@gmail.com or hbshen@sjtu.edu.cn. Supplementary data are available at Bioinformatics online.
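The two input preparations described in this abstract, padding whole sequences to a common length for the global network and cutting each sequence into overlapping fixed-length windows for the local network, can be sketched as follows. This is only an illustration of the preprocessing idea, not code from the iDeepE repository; the padding symbol `N`, the window and overlap sizes, and the function names are assumptions.

```python
from typing import List


def pad_to_same_length(seqs: List[str], pad: str = "N") -> List[str]:
    """Right-pad every sequence to the length of the longest one (global view)."""
    if not seqs:
        return []
    target = max(len(s) for s in seqs)
    return [s.ljust(target, pad) for s in seqs]


def split_overlapping(seq: str, window: int, overlap: int, pad: str = "N") -> List[str]:
    """Cut a sequence into fixed-length windows sharing `overlap` bases (local view).

    The final window is right-padded so that every piece has length `window`.
    """
    if not 0 <= overlap < window:
        raise ValueError("overlap must satisfy 0 <= overlap < window")
    step = window - overlap
    pieces = []
    for start in range(0, max(len(seq) - overlap, 1), step):
        pieces.append(seq[start:start + window].ljust(window, pad))
    return pieces
```

Each returned window would then be encoded (for example one-hot) and fed to the local network as one channel, while the padded full-length sequences feed the global network.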
Sequence periodicity in nucleosomal DNA and intrinsic curvature.
Nair, T Murlidharan
2010-05-17
Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy, the more rigid the DNA helix; thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understanding their sequence-dependent structures. Higher order DNA structures and the distribution of molecular bend loci associated with 146-base nucleosome core DNA sequences from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility that context-dependent curvature of varying degrees is associated with nucleosomal DNA.
Aftershocks driven by afterslip and fluid pressure sweeping through a fault-fracture mesh
Ross, Zachary E.; Rollins, Christopher; Cochran, Elizabeth S.; Hauksson, Egill; Avouac, Jean-Philippe; Ben-Zion, Yehuda
2017-01-01
A variety of physical mechanisms are thought to be responsible for the triggering and spatiotemporal evolution of aftershocks. Here we analyze a vigorous aftershock sequence and postseismic geodetic strain that occurred in the Yuha Desert following the 2010 Mw 7.2 El Mayor-Cucapah earthquake. About 155,000 detected aftershocks occurred in a network of orthogonal faults and exhibit features of two distinct mechanisms for aftershock triggering. The earliest aftershocks were likely driven by afterslip that spread away from the main shock with the logarithm of time. A later pulse of aftershocks swept again across the Yuha Desert with square root time dependence and swarm-like behavior; together with local geological evidence for hydrothermalism, these features suggest that the events were driven by fluid diffusion. The observations illustrate how multiple driving mechanisms and the underlying fault structure jointly control the evolution of an aftershock sequence.
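The two migration laws contrasted in this abstract can be written down in a few lines. The sketch below is illustrative only: the logarithmic front uses a hypothetical two-parameter form (a slope per decade of time and a reference time), and the diffusive front uses the widely quoted triggering-front relation r = sqrt(4πDt) for a hydraulic diffusivity D, which the abstract itself does not state.

```python
import math


def log_front(t: float, slope: float, t_ref: float) -> float:
    """Hypothetical afterslip-style front: distance grows with log10 of time."""
    if t <= 0 or t_ref <= 0:
        raise ValueError("times must be positive")
    return slope * math.log10(t / t_ref)


def diffusion_front(diffusivity: float, t: float) -> float:
    """Assumed fluid-diffusion front: distance grows with the square root of time."""
    if diffusivity < 0 or t < 0:
        raise ValueError("diffusivity and time must be non-negative")
    return math.sqrt(4.0 * math.pi * diffusivity * t)
```

The distinguishing signatures are that the logarithmic front advances by a fixed distance for every tenfold increase in time, whereas the diffusive front doubles its distance when time quadruples.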
Hills, Ronald D.; Kathuria, Sagar V.; Wallace, Louise A.; Day, Iain J.; Brooks, Charles L.; Matthews, C. Robert
2010-01-01
The thermodynamic hypothesis of Anfinsen postulates that structures and stabilities of globular proteins are determined by their amino acid sequences. Chain topology, however, is known to influence the folding reaction, in that motifs with a preponderance of local interactions typically fold more rapidly than those with a larger fraction of non-local interactions. Together, the topology and sequence can modulate the energy landscape and influence the rate at which the protein folds to the native conformation. To explore the relationship of sequence and topology in the folding of βα–repeat proteins, which are dominated by local interactions, a combined experimental and simulation analysis was performed on two members of the flavodoxin-like, α/β/α sandwich fold. Spo0F and the N-terminal receiver domain of NtrC (NT-NtrC) have similar topologies but low sequence identity, enabling a test of the effects of sequence on folding. Experimental results demonstrated that both response-regulator proteins fold via parallel channels through highly structured sub-millisecond intermediates before accessing their cis prolyl peptide bond-containing native conformations. Global analysis of the experimental results preferentially places these intermediates off the productive folding pathway. Sequence-sensitive Gō-model simulations conclude that frustration in the folding in Spo0F, corresponding to the appearance of the off-pathway intermediate, reflects competition for intra-subdomain van der Waals contacts between its N- and C-terminal subdomains. The extent of transient, premature structure appears to correlate with the number of isoleucine, leucine and valine (ILV) side-chains that form a large sequence-local cluster involving the central β-sheet and helices α2, α3 and α4. The failure to detect the off-pathway species in the simulations of NT-NtrC may reflect the reduced number of ILV side-chains in its corresponding hydrophobic cluster. 
The location of the hydrophobic clusters in the structure may also be related to the differing functional properties of these response regulators. Comparison with the results of previous experimental and simulation analyses on the homologous CheY argues that prematurely-folded unproductive intermediates are a common property of the βα-repeat motif. PMID:20226790
Detection of nucleic acid sequences by invader-directed cleavage
Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.
NASA Astrophysics Data System (ADS)
Mielke, Steven P.; Grønbech-Jensen, Niels; Krishnan, V. V.; Fink, William H.; Benham, Craig J.
2005-09-01
The topological state of DNA in vivo is dynamically regulated by a number of processes that involve interactions with bound proteins. In one such process, the tracking of RNA polymerase along the double helix during transcription, restriction of rotational motion of the polymerase and associated structures, generates waves of overtwist downstream and undertwist upstream from the site of transcription. The resulting superhelical stress is often sufficient to drive double-stranded DNA into a denatured state at locations such as promoters and origins of replication, where sequence-specific duplex opening is a prerequisite for biological function. In this way, transcription and other events that actively supercoil the DNA provide a mechanism for dynamically coupling genetic activity with regulatory and other cellular processes. Although computer modeling has provided insight into the equilibrium dynamics of DNA supercoiling, to date no model has appeared for simulating sequence-dependent DNA strand separation under the nonequilibrium conditions imposed by the dynamic introduction of torsional stress. Here, we introduce such a model and present results from an initial set of computer simulations in which the sequences of dynamically superhelical, 147 base pair DNA circles were systematically altered in order to probe the accuracy with which the model can predict location, extent, and time of stress-induced duplex denaturation. The results agree both with well-tested statistical mechanical calculations and with available experimental information. Additionally, we find that sites susceptible to denaturation show a propensity for localizing to supercoil apices, suggesting that base sequence determines locations of strand separation not only through the energetics of interstrand interactions, but also by influencing the geometry of supercoiling.
Mielke, Steven P; Grønbech-Jensen, Niels; Krishnan, V V; Fink, William H; Benham, Craig J
2005-09-22
The topological state of DNA in vivo is dynamically regulated by a number of processes that involve interactions with bound proteins. In one such process, the tracking of RNA polymerase along the double helix during transcription, restriction of rotational motion of the polymerase and associated structures, generates waves of overtwist downstream and undertwist upstream from the site of transcription. The resulting superhelical stress is often sufficient to drive double-stranded DNA into a denatured state at locations such as promoters and origins of replication, where sequence-specific duplex opening is a prerequisite for biological function. In this way, transcription and other events that actively supercoil the DNA provide a mechanism for dynamically coupling genetic activity with regulatory and other cellular processes. Although computer modeling has provided insight into the equilibrium dynamics of DNA supercoiling, to date no model has appeared for simulating sequence-dependent DNA strand separation under the nonequilibrium conditions imposed by the dynamic introduction of torsional stress. Here, we introduce such a model and present results from an initial set of computer simulations in which the sequences of dynamically superhelical, 147 base pair DNA circles were systematically altered in order to probe the accuracy with which the model can predict location, extent, and time of stress-induced duplex denaturation. The results agree both with well-tested statistical mechanical calculations and with available experimental information. Additionally, we find that sites susceptible to denaturation show a propensity for localizing to supercoil apices, suggesting that base sequence determines locations of strand separation not only through the energetics of interstrand interactions, but also by influencing the geometry of supercoiling.
Observing Holliday junction branch migration one step at a time
NASA Astrophysics Data System (ADS)
Ha, Taekjip
2004-03-01
During genetic recombination, two homologous DNA molecules undergo strand exchange to form a four-way DNA (Holliday) junction and the recognition and processing of this species by branch migration and junction resolving enzymes determine the outcome. We have used single molecule fluorescence techniques to study two intrinsic structural dynamics of the Holliday junction, stacking conformer transitions and spontaneous branch migration. Our studies show that the dynamics of branch migration, resolved with one base pair resolution, is determined by the stability of conformers which in turn depends on the local DNA sequences. Therefore, the energy landscape of Holliday junction branch migration is not uniform, but is rugged.
Bedford, Nicholas M; Hughes, Zak E; Tang, Zhenghua; Li, Yue; Briggs, Beverly D; Ren, Yang; Swihart, Mark T; Petkov, Valeri G; Naik, Rajesh R; Knecht, Marc R; Walsh, Tiffany R
2016-01-20
Peptide-enabled nanoparticle (NP) synthesis routes can create and/or assemble functional nanomaterials under environmentally friendly conditions, with properties dictated by complex interactions at the biotic/abiotic interface. Manipulation of this interface through sequence modification can provide the capability for material properties to be tailored to create enhanced materials for energy, catalysis, and sensing applications. Fully realizing the potential of these materials requires a comprehensive understanding of sequence-dependent structure/function relationships that is presently lacking. In this work, the atomic-scale structures of a series of peptide-capped Au NPs are determined using a combination of atomic pair distribution function analysis of high-energy X-ray diffraction data and advanced molecular dynamics (MD) simulations. The Au NPs produced with different peptide sequences exhibit varying degrees of catalytic activity for the exemplar reaction 4-nitrophenol reduction. The experimentally derived atomic-scale NP configurations reveal sequence-dependent differences in structural order at the NP surface. Replica exchange with solute-tempering MD simulations are then used to predict the morphology of the peptide overlayer on these Au NPs and identify factors determining the structure/catalytic properties relationship. We show that the amount of exposed Au surface, the underlying surface structural disorder, and the interaction strength of the peptide with the Au surface all influence catalytic performance. A simplified computational prediction of catalytic performance is developed that can potentially serve as a screening tool for future studies. Our approach provides a platform for broadening the analysis of catalytic peptide-enabled metallic NP systems, potentially allowing for the development of rational design rules for property enhancement.
Roux-Rouquie, Magali; Marilley, Monique
2000-01-01
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed. PMID:10982860
Domain Specificity of MAP3K Family Members, MLK and Tak1, for JNK Signaling in Drosophila
Stronach, Beth; Lennox, Ashley L.; Garlena, Rebecca A.
2014-01-01
A highly diverse set of protein kinases functions as early responders in the mitogen- and stress-activated protein kinase (MAPK/SAPK) signaling pathways. For instance, humans possess 14 MAPK kinase kinases (MAP3Ks) that activate Jun kinase (JNK) signaling downstream. A major challenge is to decipher the selective and redundant functions of these upstream MAP3Ks. Taking advantage of the relative simplicity of Drosophila melanogaster as a model system, we assessed MAP3K signaling specificity in several JNK-dependent processes during development and stress response. Our approach was to generate molecular chimeras between two MAP3K family members, the mixed lineage kinase, Slpr, and the TGF-β activated kinase, Tak1, which share 32% amino acid identity across the kinase domain but otherwise differ in sequence and domain structure, and then test the contributions of various domains for protein localization, complementation of mutants, and activation of signaling. We found that overexpression of the wild-type kinases stimulated JNK signaling in alternate contexts, so cells were capable of responding to both MAP3Ks, but with distinct outcomes. Relative to wild-type, the catalytic domain swaps compensated weakly or not at all, despite having a shared substrate, the JNK kinase Hep. Tak1 C-terminal domain-containing constructs were inhibitory in Tak1 signaling contexts, including tumor necrosis factor-dependent cell death and innate immune signaling; however, depressing antimicrobial gene expression did not necessarily cause phenotypic susceptibility to infection. These same constructs were neutral in the context of Slpr-dependent developmental signaling, reflecting differential subcellular protein localization and by inference, point of activation. Altogether, our findings suggest that the selective deployment of a particular MAP3K can be attributed in part to its inherent sequence differences, cellular localization, and binding partner availability. PMID:24429281
Impact of target mRNA structure on siRNA silencing efficiency: A large-scale study.
Gredell, Joseph A; Berger, Angela K; Walton, S Patrick
2008-07-01
The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5'- and 3'-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5'-end or 3'-end were silenced, on average, approximately 10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. 
(c) 2008 Wiley Periodicals, Inc.
Impact of target mRNA structure on siRNA silencing efficiency: a large-scale study
Gredell, Joseph A.; Berger, Angela K.; Walton, S. Patrick
2009-01-01
The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5’- and 3’-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5’-end or 3’-end were silenced, on average, ~10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. PMID:18306428
A protein-dependent side-chain rotamer library.
Bhuyan, Md Shariful Islam; Gao, Xin
2011-12-14
Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries. In this paper, we propose a new type of backbone-dependent rotamer library, which encodes structural information of all the spatially neighboring residues. We call it protein-dependent rotamer libraries. Given any rotamer library and a protein backbone structure, we first model the protein structure as a Markov random field. Then the marginal distributions are estimated by the inference algorithms, without doing global optimization or search. The rotamers from the given library are then re-ranked and associated with the updated probabilities. Experimental results demonstrate that the proposed protein-dependent libraries significantly outperform the widely used backbone-dependent libraries in terms of the side-chain prediction accuracy and the rotamer ranking ability. Furthermore, without global optimization/search, the side-chain prediction power of the protein-dependent library is still comparable to the global-search-based side-chain prediction methods.
Peptide Folding and Translocation Across the Water-Membrane Interface
NASA Technical Reports Server (NTRS)
Pohorille, Andrew; Chang, Sherwood (Technical Monitor)
1997-01-01
The ability of small peptides to organize at aqueous interfaces was examined by performing a series of large-scale, molecular dynamics computer simulations of several peptides composed of two amino acids, nonpolar leucine (L) and polar glutamine (Q). The peptides differed in size and sequence of the amino acids. Studies on dipeptides LL, LQ, QL and QQ were extended to two heptamers, LQQLLQL and LQLQLQL, designed to maximize interfacial stability of an alpha-helix and a beta-strand, respectively, by exposing polar side chains to water and nonpolar side chains to a nonpolar phase. Finally, a transition of an undecamer, composed entirely of leucine residues, from a disordered structure in water to an alpha-helix in a nonpolar phase representing the interior of the membrane was investigated. Complete folding of a peptide in solution was accomplished for the first time in computer simulations. The simulations revealed several basic principles governing the sequence-dependent organization of peptides at interfaces. Short peptides tend to accumulate at interfaces and acquire ordered structures, provided that they have a proper sequence of polar and nonpolar amino acids. The dominant factor determining the interfacial structure of peptides is the hydrophobic effect, which is manifested at aqueous interfaces as a tendency for polar and nonpolar groups of the solute to segregate into the aqueous and nonpolar phases, respectively. If peptides consist of nonpolar residues only, they become inserted into the nonpolar phase. As demonstrated by the example of the leucine undecamer, such peptides fold into an alpha-helix as they partition into the nonpolar medium. The folding proceeds through an intermediate, called 3-10-helix, which remains in equilibrium with the alpha-helix. Once in the nonpolar environment, the peptides can readily change their orientation with respect to the interface from parallel to perpendicular, especially in response to local electric fields.
The ability of nonpolar peptides to modify both the structure and orientation with changing external conditions may have provided a simple mechanism of transmitting signals from the environment to the interior of a cell.
The C-Terminal Sequence of RhoB Directs Protein Degradation through an Endo-Lysosomal Pathway
Ramos, Irene; Herrera, Mónica; Stamatakis, Konstantinos
2009-01-01
Background Protein degradation is essential for cell homeostasis. Targeting of proteins for degradation is often achieved by specific protein sequences or posttranslational modifications such as ubiquitination. Methodology/Principal Findings By using biochemical and genetic tools we have monitored the localization and degradation of endogenous and chimeric proteins in live primary cells by confocal microscopy and ultra-structural analysis. Here we identify an eight amino acid sequence from the C-terminus of the short-lived GTPase RhoB that directs the rapid degradation of both RhoB and chimeric proteins bearing this sequence through a lysosomal pathway. Elucidation of the RhoB degradation pathway unveils a mechanism dependent on protein isoprenylation and palmitoylation that involves sorting of the protein into multivesicular bodies, mediated by the ESCRT machinery. Moreover, RhoB sorting is regulated by late endosome specific lipid dynamics and is altered in human genetic lipid traffic disease. Conclusions/Significance Our findings characterize a short-lived cytosolic protein that is degraded through a lysosomal pathway. In addition, we define a novel motif for protein sorting and rapid degradation, which allows controlling protein levels by means of clinically used drugs. PMID:19956591
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data
Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo
2018-01-01
RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. 
Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
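The V-measure values quoted above (over 0.95 for SARNAclust versus 0.083 for GraphClust) are the harmonic mean of cluster homogeneity and completeness. The following is a minimal sketch of that standard metric, not code from the SARNAclust pipeline; the function name `v_measure` and the toy labels in the usage note are assumptions made for illustration.

```python
from collections import Counter
from math import log


def v_measure(true_labels, cluster_labels):
    """Harmonic mean of homogeneity and completeness (illustrative sketch)."""
    n = len(true_labels)
    if n == 0 or n != len(cluster_labels):
        raise ValueError("label lists must be non-empty and of equal length")

    classes = Counter(true_labels)        # counts per true motif class
    clusters = Counter(cluster_labels)    # counts per predicted cluster
    joint = Counter(zip(true_labels, cluster_labels))

    def entropy(counts):
        return -sum(c / n * log(c / n) for c in counts.values())

    h_class = entropy(classes)
    h_cluster = entropy(clusters)
    # Conditional entropies H(class | cluster) and H(cluster | class).
    h_class_given_cluster = -sum(
        cnt / n * log(cnt / clusters[k]) for (_, k), cnt in joint.items()
    )
    h_cluster_given_class = -sum(
        cnt / n * log(cnt / classes[c]) for (c, _), cnt in joint.items()
    )

    homogeneity = 1.0 if h_class == 0 else 1.0 - h_class_given_cluster / h_class
    completeness = 1.0 if h_cluster == 0 else 1.0 - h_cluster_given_class / h_cluster
    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)
```

With toy labels, `v_measure([0, 0, 1, 1], [1, 1, 0, 0])` scores a perfect clustering regardless of how clusters are named, while `v_measure([0, 0, 1, 1], [0, 1, 0, 1])` scores a clustering that carries no information about the classes.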
Iwamoto, Susumu; Tokumasu, Seiji; Suyama, Yoshihisa; Kakishima, Makoto
2005-01-01
We investigated intraspecific diversity and genetic structures of a saprotrophic fungus--Thysanophora penicillioides--based on sequences of nuclear ribosomal internal transcribed spacer (ITS) in 15 discontinuous Abies mariesii forests of Japan. In such a well-defined morphological species, numerous unexpected ITS variations were revealed: 12 ITS sequence types detected in 254 isolates collected from 15 local populations were classified into five ITS sequence groups. Maximally, four ITS groups consisted of seven ITS types coexisting in one population. However, group 1 was dominant with approximately 65%; in particular, one haplotype, 1a, was most dominant with approximately 60% in respective populations. Therefore, few differences were recognized in genetic structure among local populations, implying that the gene flow of each lineage of the fungus occurs among local populations without geographic limitations. However, minor haplotypes in some ITS groups were found only in restricted areas, suggesting that they might expand steadily from their places of origin to neighboring A. mariesii forests. Aggregating sequence data of seven European strains and four North American strains from various substrates to those of Japanese strains, 18 ITS sequence types and 28 variable sites were recognized. They were clustered into nine lineages by phylogenetic analyses of the beta-tubulin and combined ITS and beta-tubulin datasets. According to phylogenetic species recognition by the concordance of genealogies, respective lineages correspond to phylogenetic species. Plural phylogenetic species coexist in a local population in an A. mariesii forest in Japan.
A generative, probabilistic model of local protein structure.
Boomsma, Wouter; Mardia, Kanti V; Taylor, Charles C; Ferkinghoff-Borg, Jesper; Krogh, Anders; Hamelryck, Thomas
2008-07-01
Despite significant progress in recent years, protein structure prediction maintains its status as one of the prime unsolved problems in computational biology. One of the key remaining challenges is an efficient probabilistic exploration of the structural space that correctly reflects the relative conformational stabilities. Here, we present a fully probabilistic, continuous model of local protein structure in atomic detail. The generative model makes efficient conformational sampling possible and provides a framework for the rigorous analysis of local sequence-structure correlations in the native state. Our method represents a significant theoretical and practical improvement over the widely used fragment assembly technique by avoiding the drawbacks associated with a discrete and nonprobabilistic approach.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gordon, R.D.; Fieles, W.E.; Schotland, D.L.
1987-01-01
A peptide corresponding to amino acid residues 1783-1794, near the C terminus of the eel (Electrophorus electricus) sodium channel primary sequence, has been synthesized and used to raise an antiserum in rabbits. This antiserum specifically recognized the peptide in a solid-phase radioimmunoassay. Specificity of the antiserum for the native channel protein was shown by its specific binding to a 280-kDa protein in immunoblots of eel electroplax membrane proteins. The antiserum also specifically labeled the innervated membrane of the eel electroplax in immunofluorescent studies. The membrane topology of the peptide recognized by this antiserum was proved in binding studies using oriented electroplax membrane vesicles. These vesicles were 98% right-side-out as determined by [³H]saxitoxin binding. Binding of the antipeptide antiserum to this fraction was measured before and after permeabilization with 0.01% saponin. Specific binding to intact vesicles was low, but this binding increased 10-fold after permeabilization, implying a cytoplasmic orientation for the peptide. Confirmation for this orientation was then sought by localizing the antibody bound to intact electroplax cells with immunogold electron microscopy. The data imply that the region of the sodium channel primary sequence near the C terminus that is recognized by the antiserum is localized on the cytoplasmic side of the membrane; this localization provides some further constraints on models of sodium channel tertiary structure.
Sequencing of Dust Filter Production Process Using Design Structure Matrix (DSM)
NASA Astrophysics Data System (ADS)
Sari, R. M.; Matondang, A. R.; Syahputri, K.; Anizar; Siregar, I.; Rizkya, I.; Ursula, C.
2018-01-01
A metal casting company produces machinery spare parts for manufacturers. One of its products is a dust filter, which is used in most palm oil mills. Because the product is so widely used, the company often has problems handling it. One problem is a disordered production process caused by poor job sequencing: important jobs that should be completed first are carried out last, while less important jobs that could be completed later are carried out first. A Design Structure Matrix (DSM) is used to analyse and determine priorities in the production process. DSM analysis orders the production process through dependency sequencing. The resulting dependency sequence shows the process order according to the inter-process linkages, considering the activities that come before and after each step. Finally, the analysis identifies coupled activities among metal smelting, refining, grinding, cutting container castings, metal expenditure of molds, metal casting, coating processes, and manufacture of sand molds.
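The dependency sequencing described in this abstract can be sketched as follows. This is a generic illustration under stated assumptions, not the authors' procedure: the abstract gives neither the dependency matrix nor the algorithm, so the activities A to D, the matrix values, and the function name `sequence_dsm` are hypothetical. The matrix convention assumed here is that `dsm[i][j] == 1` means activity `i` depends on activity `j`.

```python
def sequence_dsm(names, dsm):
    """Order activities so each comes after the activities it depends on.

    Returns (ordered, unresolved). `unresolved` holds activities that could
    not be scheduled because they are coupled (mutually dependent) or sit
    downstream of a coupled group.
    """
    remaining = set(range(len(names)))
    order = []
    while remaining:
        # An activity is ready when none of its dependencies is still pending.
        ready = sorted(
            i for i in remaining
            if not any(dsm[i][j] for j in remaining if j != i)
        )
        if not ready:
            break  # only coupled or blocked activities are left
        order.extend(ready)
        remaining -= set(ready)
    return [names[i] for i in order], [names[i] for i in sorted(remaining)]


# Hypothetical example: B needs A; C and D need each other; D also needs B.
names = ["A", "B", "C", "D"]
dsm = [
    [0, 0, 0, 0],  # A depends on nothing
    [1, 0, 0, 0],  # B depends on A
    [0, 0, 0, 1],  # C depends on D
    [0, 1, 1, 0],  # D depends on B and C
]
ordered, unresolved = sequence_dsm(names, dsm)
```

Here `ordered` lists the activities that can be sequenced directly, and `unresolved` flags the group that would need to be treated together as coupled activities.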
Relationships between residue Voronoi volume and sequence conservation in proteins.
Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung
2018-02-01
Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume - van der Waals volume) / Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
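The RSV definition given in this abstract is a one-line ratio, and a small sketch makes the 0-1 range concrete. The function name `relative_space_of_voronoi` and the volumes in the usage note are assumptions for illustration only; computing the Voronoi and van der Waals volumes themselves is outside the scope of this sketch.

```python
def relative_space_of_voronoi(voronoi_volume, vdw_volume):
    """RSV = (Voronoi volume - van der Waals volume) / Voronoi volume.

    Both volumes must be in the same units (for example cubic angstroms).
    """
    if voronoi_volume <= 0:
        raise ValueError("Voronoi volume must be positive")
    if not 0 <= vdw_volume <= voronoi_volume:
        raise ValueError("van der Waals volume must lie between 0 and the Voronoi volume")
    return (voronoi_volume - vdw_volume) / voronoi_volume
```

For a hypothetical residue with a Voronoi volume of 150 and a van der Waals volume of 100 (same units), `relative_space_of_voronoi(150.0, 100.0)` gives one third, meaning a third of the residue's Voronoi cell is free space; a residue that fills its cell completely gives 0.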
Common 5S rRNA variants are likely to be accepted in many sequence contexts
NASA Technical Reports Server (NTRS)
Zhang, Zhengdong; D'Souza, Lisa M.; Lee, Youn-Hyung; Fox, George E.
2003-01-01
Over evolutionary time RNA sequences which are successfully fixed in a population are selected from among those that satisfy the structural and chemical requirements imposed by the function of the RNA. These sequences together comprise the structure space of the RNA. In principle, a comprehensive understanding of RNA structure and function would make it possible to enumerate which specific RNA sequences belong to a particular structure space and which do not. We are using bacterial 5S rRNA as a model system to attempt to identify principles that can be used to predict which sequences do or do not belong to the 5S rRNA structure space. One promising idea is the very intuitive notion that frequently seen sequence changes in an aligned data set of naturally occurring 5S rRNAs would be widely accepted in many other 5S rRNA sequence contexts. To test this hypothesis, we first developed well-defined operational definitions for a Vibrio region of the 5S rRNA structure space and what is meant by a highly variable position. Fourteen sequence variants (10 point changes and 4 base-pair changes) were identified in this way, which, by the hypothesis, would be expected to incorporate successfully in any of the known sequences in the Vibrio region. All 14 of these changes were constructed and separately introduced into the Vibrio proteolyticus 5S rRNA sequence where they are not normally found. Each variant was evaluated for its ability to function as a valid 5S rRNA in an E. coli cellular context. It was found that 93% (13/14) of the variants tested are likely valid 5S rRNAs in this context. In addition, seven variants were constructed that, although present in the Vibrio region, did not meet the stringent criteria for a highly variable position. In this case, 86% (6/7) are likely valid. As a control we also examined seven variants that are seldom or never seen in the Vibrio region of 5S rRNA sequence space. In this case only two of seven were found to be potentially valid. 
The results demonstrate that changes that occur multiple times in a local region of RNA sequence space in fact usually will be accepted in any sequence context in that same local region.
Guo, Kang-kang; Tang, Qing-hai; Zhang, Yan-ming; Kang, Kai; He, Lei
2011-05-18
The membrane topology and molecular mechanisms for endoplasmic reticulum (ER) localization of classical swine fever virus (CSFV) non-structural 2 (NS2) protein are unclear. We attempted to elucidate the subcellular localization of this protein and the molecular mechanisms responsible for that localization. The NS2 gene was amplified by reverse transcription polymerase chain reaction, and the transmembrane region and hydrophilicity of the NS2 protein were predicted by bioinformatics analysis. Twelve cDNAs of the NS2 gene were amplified by the PCR deletion method and cloned into a eukaryotic expression vector, which was transfected into a swine umbilical vein endothelial cell line (SUVEC). Subcellular localization of the NS2 protein was characterized by confocal microscopy, and western blots were carried out to analyze protein expression. Our results showed that the -NH2 terminal of the CSFV NS2 protein was highly hydrophobic and the protein localized in the ER. At least four transmembrane regions and two internal signal peptide sequences (amino acids 103-138 and 220-262) were identified and thought to be critical for its trans-localization to the ER. This is the first study to identify the internal signal peptide sequences of the CSFV NS2 protein and its subcellular localization, providing the foundation for further exploration of this protein's function and its role in CSFV pathogenesis.
Sequence periodicity in nucleosomal DNA and intrinsic curvature
2010-01-01
Background: Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy, the more rigid the DNA helix; thus it is natural to expect that sequences involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understanding their sequence-dependent structures. Results: Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequences from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. Conclusions: The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility that context-dependent curvature of varying degrees is associated with nucleosomal DNA. PMID:20487515
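The idea of periodic dinucleotide recurrence can be illustrated with a small spacing histogram. This is only a sketch of that general notion, not the curvature model or the curvature-dispersion calculation used in the study; the function names and the toy sequence are hypothetical.

```python
from collections import Counter


def dinucleotide_positions(seq, dinuc):
    """Return every 0-based start index at which `dinuc` occurs (overlaps allowed)."""
    seq = seq.upper()
    dinuc = dinuc.upper()
    return [i for i in range(len(seq) - 1) if seq[i:i + 2] == dinuc]


def spacing_histogram(seq, dinuc, max_lag=30):
    """Count distances (in bases) between all pairs of occurrences, up to max_lag."""
    pos = dinucleotide_positions(seq, dinuc)
    counts = Counter()
    for a in range(len(pos)):
        for b in range(a + 1, len(pos)):
            lag = pos[b] - pos[a]
            if lag > max_lag:
                break  # positions are sorted, so later lags only grow
            counts[lag] += 1
    return counts


# Toy sequence (an assumption for illustration): "AA" placed every 10 bases.
toy = ("AA" + "C" * 8) * 3
hist = spacing_histogram(toy, "AA", max_lag=15)
print(hist)
```

A peak in such a histogram near the helical repeat would be one simple signature of the kind of periodicity the abstract refers to.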
A class of covariate-dependent spatiotemporal covariance functions
Reich, Brian J; Eidsvik, Jo; Guindani, Michele; Nail, Amy J; Schmidt, Alexandra M.
2014-01-01
In geostatistics, it is common to model spatially distributed phenomena through an underlying stationary and isotropic spatial process. However, these assumptions are often untenable in practice because of the influence of local effects in the correlation structure. Therefore, there has been prolonged interest in the literature in providing flexible and effective ways to model non-stationarity in the spatial effects. Arguably, due to the local nature of the problem, we might envision that the correlation structure would be highly dependent on local characteristics of the domain of study, namely the latitude, longitude and altitude of the observation sites, as well as other locally defined covariate information. In this work, we provide a flexible and computationally feasible way of allowing the correlation structure of the underlying processes to depend on local covariate information. We discuss the properties of the induced covariance functions and methods to assess their dependence on local covariate information by means of a simulation study and the analysis of data observed at ozone-monitoring stations in the Southeast United States. PMID:24772199
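One generic way to let correlation depend on covariates is to mix stationary kernels with covariate-dependent weights. The sketch below is an assumption made for illustration and should not be read as the authors' exact specification; the softmax weights, the coefficient pairs in `betas`, and the exponential kernels are all hypothetical choices. Each term has the form f_k(s) f_k(s') K_k(d), so the sum remains a valid (positive semi-definite) covariance.

```python
import math


def softmax_weights(x, betas):
    """Covariate-dependent mixture weights w_k(x); betas are hypothetical (b0, b1) pairs."""
    scores = [b0 + b1 * x for (b0, b1) in betas]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def covariance(s1, x1, s2, x2, betas, ranges):
    """Non-stationary covariance between sites s1, s2 with scalar covariates x1, x2.

    Sums exponential kernels with range parameters `ranges`, each weighted by
    sqrt(w_k(x1) * w_k(x2)), so the variance at zero distance equals 1.
    """
    d = math.dist(s1, s2)
    w1 = softmax_weights(x1, betas)
    w2 = softmax_weights(x2, betas)
    return sum(
        math.sqrt(a * b) * math.exp(-d / r)
        for a, b, r in zip(w1, w2, ranges)
    )


betas = [(0.0, 2.0), (0.0, -2.0)]   # hypothetical coefficients
ranges = [10.0, 1.0]                # long-range and short-range components

# Same separation (distance 1), different covariate values.
c_high = covariance((0.0, 0.0), 1.0, (1.0, 0.0), 1.0, betas, ranges)
c_low = covariance((0.0, 0.0), -1.0, (1.0, 0.0), -1.0, betas, ranges)
print(c_high, c_low)
```

In this toy setting, two pairs of sites at the same separation have different correlations purely because their covariate values differ, which is the behaviour the abstract describes.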
NASA Astrophysics Data System (ADS)
Mantel, Claire; Korhonen, Jari; Pedersen, Jesper M.; Bech, Søren; Andersen, Jakob Dahl; Forchhammer, Søren
2015-01-01
This paper focuses on the influence of ambient light on the perceived quality of videos displayed on liquid crystal displays (LCDs) with local backlight dimming. A subjective test assessing the quality of videos with two backlight dimming methods under three lighting conditions, i.e. no light, a low light level (5 lux) and a higher light level (60 lux), was organized to collect subjective data. Results show that participants prefer the method exploiting local dimming possibilities to the conventional full backlight but that this preference varies depending on the ambient light level. The clear preference for one method under the low light conditions decreases under high ambient light, confirming that ambient light significantly attenuates the perception of the leakage defect (light leaking through dark pixels). Results are also highly dependent on the content of the sequence, which can modulate the effect of the ambient light from having an important influence on the quality grades to having no influence at all.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bedford, Nicholas M.; Hughes, Zak E.; Tang, Zhenghua
Peptide-enabled nanoparticle (NP) synthesis routes can create and/or assemble functional nanomaterials under environmentally friendly conditions, with properties dictated by complex interactions at the biotic/abiotic interface. Manipulation of this interface through sequence modification can provide the capability for material properties to be tailored to create enhanced materials for energy, catalysis, and sensing applications. Fully realizing the potential of these materials requires a comprehensive understanding of sequence-dependent structure/function relationships that is presently lacking. In this work, the atomic-scale structures of a series of peptide-capped Au NPs are determined using a combination of atomic pair distribution function analysis of high-energy X-ray diffraction data and advanced molecular dynamics (MD) simulations. The Au NPs produced with different peptide sequences exhibit varying degrees of catalytic activity for the exemplar reaction 4-nitrophenol reduction. The experimentally derived atomic-scale NP configurations reveal sequence-dependent differences in structural order at the NP surface. Replica exchange with solute-tempering MD simulations are then used to predict the morphology of the peptide overlayer on these Au NPs and identify factors determining the structure/catalytic properties relationship. We show that the amount of exposed Au surface, the underlying surface structural disorder, and the interaction strength of the peptide with the Au surface all influence catalytic performance. A simplified computational prediction of catalytic performance is developed that can potentially serve as a screening tool for future studies. Our approach provides a platform for broadening the analysis of catalytic peptide-enabled metallic NP systems, potentially allowing for the development of rational design rules for property enhancement.
Pan, Feng; Man, Viet Hoang; Roland, Christopher; Sagui, Celeste
2018-04-26
Expansions of both GGC and CCG sequences lead to a number of expandable, trinucleotide repeat (TR) neurodegenerative diseases. Understanding of these diseases involves, among other things, the structural characterization of the atypical DNA and RNA secondary structures. We have performed molecular dynamics simulations of (GCC)n and (GGC)n homoduplexes in order to characterize their conformations, stability, and dynamics. Each TR has two reading frames, which results in eight nonequivalent RNA/DNA homoduplexes, characterized by CpG or GpC steps between the Watson-Crick base pairs. Free energy maps for the eight homoduplexes indicate that the C-mismatches prefer anti-anti conformations, while G-mismatches prefer anti-syn conformations. Comparison between three modifications of the DNA AMBER force field shows good agreement for the mismatch free energy maps. The mismatches in DNA-GCC (but not CCG) are extrahelical, forming an extended e-motif. The mismatched duplexes exhibit characteristic sequence-dependent step twist, with strong variations in the G-rich sequences and the e-motif. The distribution of Na+ is highly localized around the mismatches, especially G-mismatches. In the e-motif, there is strong Na+ binding by two G(N7) atoms belonging to the pseudo GpC step created when cytosines are extruded and by extrahelical cytosines. Finally, we used a novel technique based on fast melting by means of an infrared laser pulse to classify the relative stability of the different DNA-CCG and -GGC homoduplexes.
NASA Astrophysics Data System (ADS)
Sitaula, R. P.; Aschoff, J.
2013-12-01
Regional-scale sequence stratigraphic correlation, well log analysis, syntectonic unconformity mapping, isopach maps, and depositional environment maps of the upper Mesaverde Group (UMG) in Uinta basin, Utah suggest higher accommodation in northeastern part (Natural Buttes area) and local development of lacustrine facies due to increased subsidence caused by uplift of San Rafael Swell (SRS) in southern and Uinta Uplift in northern parts. Recently discovered lacustrine facies in Natural Buttes area are completely different than the dominant fluvial facies in outcrops along Book Cliffs and could have implications for significant amount of tight-gas sand production from this area. Data used for sequence stratigraphic correlation, isopach maps and depositional environmental maps include > 100 well logs, 20 stratigraphic profiles, 35 sandstone thin sections and 10 outcrop-based gamma ray profiles. Seven 4th order depositional sequences (~0.5 my duration) are identified and correlated within UMG. Correlation was constructed using a combination of fluvial facies and stacking patterns in outcrops, chert-pebble conglomerates and tidally influenced strata. These surfaces were extrapolated into subsurface by matching GR profiles. GR well logs and core log of Natural Buttes area show intervals of coarsening upward patterns suggesting possible lacustrine intervals that might contain high TOC. Locally, younger sequences are completely truncated across SRS whereas older sequences are truncated and thinned toward SRS. The cycles of truncation and thinning represent phases of SRS uplift. Thinning possibly related with the Uinta Uplift is also observed in northwestern part. Paleocurrents are consistent with interpretation of periodic segmentation and deflection of sedimentation. Regional paleocurrents are generally E-NE-directed in Sequences 1-4, and N-directed in Sequences 5-7. 
From the isopach maps and paleocurrent directions it can be interpreted that uplift of the SRS changed the route of sediment supply from west to southwest. Locally, paleocurrents are highly variable near the SRS, further suggesting that the UMG basin fill was partitioned by uplift of the SRS. Sandstone composition analysis also suggests that uplift of the SRS caused the source rocks of the upper sequences to differ from those of the lower sequences. In conclusion, we suggest that the Uinta basin was episodically partitioned during deposition of the UMG due to uplift of Laramide structures in the basin, and that accommodation was localized in the northeastern part. Understanding of structural controls on accommodation, sedimentation patterns and depositional environments will aid prediction of the best-producing gas reservoirs.
Membrane raft association is a determinant of plasma membrane localization.
Diaz-Rohrer, Blanca B; Levental, Kandice R; Simons, Kai; Levental, Ilya
2014-06-10
The lipid raft hypothesis proposes lateral domains driven by preferential interactions between sterols, sphingolipids, and specific proteins as a central mechanism for the regulation of membrane structure and function; however, experimental limitations in defining raft composition and properties have prevented unequivocal demonstration of their functional relevance. Here, we establish a quantitative, functional relationship between raft association and subcellular protein sorting. By systematic mutation of the transmembrane and juxtamembrane domains of a model transmembrane protein, linker for activation of T-cells (LAT), we generated a panel of variants possessing a range of raft affinities. These mutations revealed palmitoylation, transmembrane domain length, and transmembrane sequence to be critical determinants of membrane raft association. Moreover, plasma membrane (PM) localization was strictly dependent on raft partitioning across the entire panel of unrelated mutants, suggesting that raft association is necessary and sufficient for PM sorting of LAT. Abrogation of raft partitioning led to mistargeting to late endosomes/lysosomes because of a failure to recycle from early endosomes. These findings identify structural determinants of raft association and validate lipid-driven domain formation as a mechanism for endosomal protein sorting.
Membrane raft association is a determinant of plasma membrane localization
Diaz-Rohrer, Blanca B.; Levental, Kandice R.; Simons, Kai; Levental, Ilya
2014-01-01
The lipid raft hypothesis proposes lateral domains driven by preferential interactions between sterols, sphingolipids, and specific proteins as a central mechanism for the regulation of membrane structure and function; however, experimental limitations in defining raft composition and properties have prevented unequivocal demonstration of their functional relevance. Here, we establish a quantitative, functional relationship between raft association and subcellular protein sorting. By systematic mutation of the transmembrane and juxtamembrane domains of a model transmembrane protein, linker for activation of T-cells (LAT), we generated a panel of variants possessing a range of raft affinities. These mutations revealed palmitoylation, transmembrane domain length, and transmembrane sequence to be critical determinants of membrane raft association. Moreover, plasma membrane (PM) localization was strictly dependent on raft partitioning across the entire panel of unrelated mutants, suggesting that raft association is necessary and sufficient for PM sorting of LAT. Abrogation of raft partitioning led to mistargeting to late endosomes/lysosomes because of a failure to recycle from early endosomes. These findings identify structural determinants of raft association and validate lipid-driven domain formation as a mechanism for endosomal protein sorting. PMID:24912166
Taddei, Angela; Schober, Heiko; Gasser, Susan M.
2010-01-01
The budding yeast nucleus, like those of other eukaryotic species, is highly organized with respect to both chromosomal sequences and enzymatic activities. At the nuclear periphery interactions of nuclear pores with chromatin, mRNA, and transport factors promote efficient gene expression, whereas centromeres, telomeres, and silent chromatin are clustered and anchored away from pores. Internal nuclear organization appears to be function-dependent, reflecting localized sites for tRNA transcription, rDNA transcription, ribosome assembly, and DNA repair. Recent advances have identified new proteins involved in the positioning of chromatin and have allowed testing of the functional role of higher-order chromatin organization. The unequal distribution of silent information regulatory factors and histone modifying enzymes, which arises in part from the juxtaposition of telomeric repeats, has been shown to influence chromatin-mediated transcriptional repression. Other localization events suppress unwanted recombination. These findings highlight the contribution budding yeast genetics and cytology have made to dissecting the functional role of nuclear structure. PMID:20554704
Sequence-similar, structure-dissimilar protein pairs in the PDB.
Kosloff, Mickey; Kolodny, Rachel
2008-05-01
It is often assumed that in the Protein Data Bank (PDB), two proteins with similar sequences will also have similar structures. Accordingly, it has proved useful to develop subsets of the PDB from which "redundant" structures have been removed, based on a sequence-based criterion for similarity. Similarly, when predicting protein structure using homology modeling, if a template structure for modeling a target sequence is selected by sequence alone, this implicitly assumes that all sequence-similar templates are equivalent. Here, we show that this assumption is often not correct and that standard approaches to create subsets of the PDB can lead to the loss of structurally and functionally important information. We have carried out sequence-based structural superpositions and geometry-based structural alignments of a large number of protein pairs to determine the extent to which sequence similarity ensures structural similarity. We find many examples where two proteins that are similar in sequence have structures that differ significantly from one another. The source of the structural differences usually has a functional basis. The number of such protein pairs that are identified and the magnitude of the dissimilarity depend on the approach that is used to calculate the differences; in particular, sequence-based structure superpositioning will identify a larger number of structurally dissimilar pairs than geometry-based structural alignments. When two sequences can be aligned in a statistically meaningful way, sequence-based structural superpositioning provides a meaningful measure of structural differences. This approach and geometry-based structure alignments reveal somewhat different information, and one or the other might be preferable in a given application. Our results suggest that in some cases, notably homology modeling, the common use of nonredundant datasets, culled from the PDB based on sequence, may mask important structural and functional information.
We have established a database of sequence-similar, structurally dissimilar protein pairs that will help address this problem (http://luna.bioc.columbia.edu/rachel/seqsimstrdiff.htm).
Absence of auditory 'global interference' in autism.
Foxton, Jessica M; Stewart, Mary E; Barnard, Louise; Rodgers, Jacqui; Young, Allan H; O'Brien, Gregory; Griffiths, Timothy D
2003-12-01
There has been considerable recent interest in the cognitive style of individuals with Autism Spectrum Disorder (ASD). One theory, that of weak central coherence, concerns an inability to combine stimulus details into a coherent whole. Here we test this theory in the case of sound patterns, using a new definition of the details (local structure) and the coherent whole (global structure). Thirteen individuals with a diagnosis of autism or Asperger's syndrome and 15 control participants were administered auditory tests, where they were required to match local pitch direction changes between two auditory sequences. When the other local features of the sequence pairs were altered (the actual pitches and relative time points of pitch direction change), the control participants obtained lower scores compared with when these details were left unchanged. This can be attributed to interference from the global structure, defined as the combination of the local auditory details. In contrast, the participants with ASD did not obtain lower scores in the presence of such mismatches. This was attributed to the absence of interference from an auditory coherent whole. The results are consistent with the presence of abnormal interactions between local and global auditory perception in ASD.
Disconnecting structure and dynamics in glassy thin films
Sussman, Daniel M.; Cubuk, Ekin D.; Liu, Andrea J.
2017-01-01
Nanometrically thin glassy films depart strikingly from the behavior of their bulk counterparts. We investigate whether the dynamical differences between a bulk and thin film polymeric glass former can be understood by differences in local microscopic structure. Machine learning methods have shown that local structure can serve as the foundation for successful, predictive models of particle rearrangement dynamics in bulk systems. By contrast, in thin glassy films, we find that particles at the center of the film and those near the surface are structurally indistinguishable despite exhibiting very different dynamics. Next, we show that structure-independent processes, already present in bulk systems and demonstrably different from simple facilitated dynamics, are crucial for understanding glassy dynamics in thin films. Our analysis suggests a picture of glassy dynamics in which two dynamical processes coexist, with relative strengths that depend on the distance from an interface. One of these processes depends on local structure and is unchanged throughout most of the film, while the other is purely Arrhenius, does not depend on local structure, and is strongly enhanced near the free surface of a film. PMID:28928147
Majoros, William H.; Campbell, Michael S.; Holt, Carson; DeNardo, Erin K.; Ware, Doreen; Allen, Andrew S.; Yandell, Mark; Reddy, Timothy E.
2017-01-01
Abstract Motivation: The accurate interpretation of genetic variants is critical for characterizing genotype–phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. Results: We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE ('Assessing Changes to Exons') converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. Availability and Implementation: ACE is written in open-source C++ and Perl and is available from geneprediction.org/ACE. Contact: myandell@genetics.utah.edu or tim.reddy@duke.edu. Supplementary information: Supplementary information is available at Bioinformatics online. PMID:28011790
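The first step described above, turning phased genotype calls into explicit haplotype sequences, can be sketched in a few lines. This is a simplified illustration and not ACE's implementation (ACE itself is written in C++ and Perl); the function name, the tuple layout of the variants, and the assumption that variants never overlap are all choices made here for the example.

```python
def build_haplotypes(reference, variants):
    """Apply phased variants to a reference string and return two haplotype sequences.

    variants: list of (pos, ref, alt, genotype) tuples with a 0-based pos and a
    phased genotype such as "0|1". Variants are assumed not to overlap.
    """
    haplotypes = []
    for hap_index in (0, 1):
        seq = reference
        # Work right to left so earlier coordinates stay valid after indels.
        for pos, ref, alt, genotype in sorted(variants, reverse=True):
            allele = genotype.split("|")[hap_index]
            if allele != "1":
                continue
            if seq[pos:pos + len(ref)] != ref:
                raise ValueError(f"reference mismatch at position {pos}")
            seq = seq[:pos] + alt + seq[pos + len(ref):]
        haplotypes.append(seq)
    return haplotypes


# Toy data (hypothetical): one SNP, one insertion, one deletion.
toy_reference = "ACGTACGT"
toy_variants = [
    (1, "C", "T", "0|1"),     # SNP on the second haplotype only
    (4, "A", "AGG", "1|1"),   # homozygous insertion
    (6, "GT", "G", "1|0"),    # deletion on the first haplotype only
]
hap_a, hap_b = build_haplotypes(toy_reference, toy_variants)
print(hap_a, hap_b)
```

Applying edits from the rightmost position first avoids having to track coordinate shifts introduced by insertions and deletions.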
Majoros, William H; Campbell, Michael S; Holt, Carson; DeNardo, Erin K; Ware, Doreen; Allen, Andrew S; Yandell, Mark; Reddy, Timothy E
2017-05-15
The accurate interpretation of genetic variants is critical for characterizing genotype-phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE ('Assessing Changes to Exons') converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. ACE is written in open-source C++ and Perl and is available from geneprediction.org/ACE. Contact: myandell@genetics.utah.edu or tim.reddy@duke.edu. Supplementary information is available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
Chu, Chien-Hsin; Chang, Lung-Chun; Hsu, Hong-Ming; Wei, Shu-Yi; Liu, Hsing-Wei; Lee, Yu; Kuo, Chung-Chi; Indra, Dharmu; Chen, Chinpan; Ong, Shiou-Jeng; Tai, Jung-Hsiang
2011-01-01
Nuclear proteins usually contain specific peptide sequences, referred to as nuclear localization signals (NLSs), for nuclear import. These signals remain unexplored in the protozoan pathogen, Trichomonas vaginalis. The nuclear import of a Myb2 transcription factor was studied here using immunodetection of a hemagglutinin-tagged Myb2 overexpressed in the parasite. The tagged Myb2 was localized to the nucleus as punctate signals. With mutations of its polybasic sequences, 48KKQK51 and 61KR62, Myb2 was localized to the nucleus, but the signal was diffusive. When fused to a C-terminal non-nuclear protein, the Myb2 sequence spanning amino acid (aa) residues 48 to 143, which is embedded within the R2R3 DNA-binding domain (aa 40 to 156), was essential and sufficient for efficient nuclear import of a bacterial tetracycline repressor (TetR), and yet the transport efficiency was reduced with an additional fusion of a firefly luciferase to TetR, while classical NLSs from the simian virus 40 T-antigen had no function in this assay system. Myb2 nuclear import and DNA-binding activity were substantially perturbed with mutation of a conserved isoleucine (I74) in helix 2 to proline that altered secondary structure and ternary folding of the R2R3 domain. Disruption of DNA-binding activity alone by point mutation of a lysine residue, K51, preceding the structural domain had little effect on Myb2 nuclear localization, suggesting that nuclear translocation of Myb2, which requires an ordered structural domain, is independent of its DNA binding activity. These findings provide useful information for testing whether myriad Mybs in the parasite use a common module to regulate nuclear import. PMID:22021237
NASA Astrophysics Data System (ADS)
Wertgeim, Igor I.
2018-02-01
We investigate stationary and non-stationary solutions of nonlinear equations of the long-wave approximation for the Marangoni convection caused by a localized source of heat or a surface active impurity (surfactant) in a thin horizontal layer of a viscous incompressible fluid with a free surface. The distribution of heat or concentration flux is determined by the uniform vertical gradient of temperature or impurity concentration, distorted by the imposition of a slightly inhomogeneous heating or of surfactant, localized in the horizontal plane. The lower boundary of the layer is considered thermally insulated or impermeable, whereas the upper boundary is free and deformable. The equations obtained in the long-wave approximation are formulated in terms of the amplitudes of the temperature distribution or impurity concentration, deformation of the surface, and vorticity. For a simplification of the problem, a sequence of nonlinear equations is obtained, which in the simplest form leads to a nonlinear Schrödinger equation with a localized potential. The basic state of the system, its dependence on the parameters and stability are investigated. For stationary solutions localized in the region of the surface tension inhomogeneity, domains of parameters corresponding to different spatial patterns are delineated.
Parniewski, P; Galazka, G; Wilk, A; Klysik, J
1989-01-01
Synthetic sequence GATCC(AG)7ATCG(AT)4CG(AG)7 was cloned into a plasmid and its structural behavior under the influence of supercoiling was analysed by chemical modification under a variety of experimental conditions. It was found that this sequence adopts at least two different non-B conformations depending on -delta and pH values. Moreover, the 12-nucleotide-long non-pur.pyr spacer region separating two identical (AG)7 blocks does not provide a significant energy barrier protecting against the formation of unusual structures. PMID:2644622
Pierucci, Debora; Brumme, Thomas; Girard, Jean-Christophe; Calandra, Matteo; Silly, Mathieu G; Sirotti, Fausto; Barbier, Antoine; Mauri, Francesco; Ouerghi, Abdelkarim
2016-09-15
The transport properties of few-layer graphene are the direct result of a peculiar band structure near the Dirac point. Here, for epitaxial graphene grown on SiC, we determine the effect of charge transfer from the SiC substrate on the local density of states (LDOS) of trilayer graphene using scanning tunneling microscopy/spectroscopy and angle-resolved photoemission spectroscopy (ARPES). Different spectra are observed and are attributed to the existence of two stable polytypes of trilayer: Bernal (ABA) and rhombohedral (ABC) stacking. Their electronic properties strongly depend on the charge transfer from the substrate. We show that the LDOS of ABC stacking shows an additional peak located above the Dirac point in comparison with the LDOS of ABA stacking. The observed LDOS features, reflecting the underlying symmetry of the two polytypes, were reproduced by explicit calculations within density functional theory (DFT) including the charge transfer from the substrate. These findings demonstrate the pronounced effect of stacking order and charge transfer on the electronic structure of trilayer or few-layer graphene. Our approach represents a significant step toward understanding the electronic properties of graphene layers under an electric field.
Sequence-Mandated, Distinct Assembly of Giant Molecules
Zhang, Wei; Lu, Xinlin; Mao, Jialin; ...
2017-10-24
Although controlling the primary structure of synthetic polymers is itself a great challenge, the potential of sequence control for tailoring hierarchical structures remains to be exploited, especially in the creation of new and unconventional phases. A series of model amphiphilic chain-like giant molecules was designed and synthesized by interconnecting both hydrophobic and hydrophilic molecular nanoparticles in precisely defined sequence and composition to investigate their sequence-dependent phase structures. Not only did compositional variation change the self-assembled supramolecular phases, but specific sequences also induced unconventional phase formation, including Frank-Kasper phases. The formation mechanism was attributed to the conformational change driven by the collective hydrogen bonding and the sequence-mandated topology of the molecules. Lastly, these results show that sequence control in synthetic polymers can have a dramatic impact on polymer properties and self-assembly.
2009-01-01
Background Polymerase chain reaction (PCR) is very useful in many areas of molecular biology research. It is commonly observed that PCR success is critically dependent on the design of an effective primer pair. Current tools for primer design do not adequately address the problem of PCR failure due to mis-priming on target-related sequences and structural variations in the genome. Methods We have developed an integrated graphical web-based application for primer design, called RExPrimer, which was written in Python. The software uses Primer3 as the core primer-design algorithm. Locally stored sequence information and genomic variant information were hosted on MySQL v5.0 and were incorporated into RExPrimer. Results RExPrimer provides many functionalities for improved PCR primer design. Several databases, namely annotated human SNP databases, an insertion/deletion (indel) polymorphism database, a pseudogene database, and structural genomic variation databases, were integrated into RExPrimer, enabling effective validation of the resulting primers without leaving the website. By incorporating these databases, the primers reported by RExPrimer avoid mis-priming to related sequences (e.g. pseudogene, segmental duplication) as well as possible PCR failure because of structural polymorphisms (SNP, indel, and copy number variation (CNV)). To prevent mismatching caused by unexpected SNPs in the designed primers, in particular at the 3' end (SNP-in-Primer), several SNP databases covering a broad range of population-specific SNP information are utilized to report SNPs present in the primer sequences. Population-specific SNP information also helps customize primer design for a specific population. Furthermore, RExPrimer offers a user-friendly graphical interface that uses scalable vector graphics images to present the resulting primers intuitively along with the corresponding gene structure.
In this study, we demonstrated the program's effectiveness in successfully generating primers for strongly homologous sequences. Conclusion The improvements for primer design incorporated into RExPrimer were demonstrated to be effective in designing primers for challenging PCR experiments. Integration of SNP and structural variation databases allows for robust primer design for a variety of PCR applications, irrespective of the sequence complexity in the region of interest. This software is freely available at http://www4a.biotec.or.th/rexprimer. PMID:19958502
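The SNP-in-Primer check described in this abstract can be illustrated with a small sketch. This is not RExPrimer's code: the function name, the coordinate convention (1-based, inclusive genomic positions) and the 5-base 3'-end window are all assumptions made for illustration only.

```python
def snps_in_primer(start, length, snp_positions, strand="+", window=5):
    """Report known SNPs that fall inside a primer footprint.

    Hypothetical helper, not part of RExPrimer. Coordinates are 1-based
    genomic positions; `window` is an assumed size for the 3' end region.
    Returns a list of (position, is_near_3prime_end) tuples.
    """
    end = start + length - 1  # last genomic position covered by the primer
    hits = []
    for pos in sorted(snp_positions):
        if start <= pos <= end:
            # On the plus strand the 3' end is the highest coordinate;
            # on the minus strand it is the lowest coordinate.
            distance = end - pos if strand == "+" else pos - start
            hits.append((pos, distance < window))
    return hits


# A 20-base forward primer covering positions 100..119.
print(snps_in_primer(100, 20, [95, 105, 118]))
```

A SNP flagged as near the 3' end would be the more worrying case, since the abstract singles out 3'-end mismatches as a cause of PCR failure.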
Boutin, Arnaud; Pinsard, Basile; Boré, Arnaud; Carrier, Julie; Fogel, Stuart M; Doyon, Julien
2018-04-01
Sleep benefits motor memory consolidation. This mnemonic process is thought to be mediated by thalamo-cortical spindle activity during NREM-stage2 sleep episodes as well as changes in striatal and hippocampal activity. However, direct experimental evidence supporting the contribution of such sleep-dependent physiological mechanisms to motor memory consolidation in humans is lacking. In the present study, we combined EEG and fMRI sleep recordings following practice of a motor sequence learning (MSL) task to determine whether spindle oscillations support sleep-dependent motor memory consolidation by transiently synchronizing and coordinating specialized cortical and subcortical networks. To that end, we conducted EEG source reconstruction on spindle epochs in both cortical and subcortical regions using novel deep-source localization techniques. Coherence-based metrics were adopted to estimate functional connectivity between cortical and subcortical structures over specific frequency bands. Our findings not only confirm the critical and functional role of NREM-stage2 sleep spindles in motor skill consolidation, but provide first-time evidence that spindle oscillations [11-17 Hz] may be involved in sleep-dependent motor memory consolidation by locally reactivating and functionally binding specific task-relevant cortical and subcortical regions within networks including the hippocampus, putamen, thalamus and motor-related cortical regions. Copyright © 2018 Elsevier Inc. All rights reserved.
Discretized torsional dynamics and the folding of an RNA chain.
Fernández, A; Salthú, R; Cendra, H
1999-08-01
The aim of this work is to implement a discrete coarse codification of local torsional states of the RNA chain backbone in order to explore the long-time limit dynamics and ultimately obtain a coarse solution to the RNA folding problem. A discrete representation of the soft-mode dynamics is turned into an algorithm for a rough structure prediction. The algorithm itself is inherently parallel, as it evaluates concurrent folding possibilities by pattern recognition, but it may be implemented in a personal computer as a chain of perturbation-translation-renormalization cycles performed on a binary matrix of local topological constraints. This requires suitable representational tools and a periodic quenching of the dynamics for system renormalization. A binary coding of local topological constraints associated with each structural motif is introduced, with each local topological constraint corresponding to a local torsional state. This treatment enables us to adopt a computation time step far larger than hydrodynamic drag time scales. Accordingly, the solvent is no longer treated as a hydrodynamic drag medium. Instead we incorporate its capacity for forming local conformation-dependent dielectric domains. Each translation of the matrix of local topological constraints (LTM's) depends on the conformation-dependent local dielectric created by a confined solvent. Folding pathways are resolved as transitions between patterns of locally encoded structural signals which change within the 1 ns-100 ms time scale range. These coarse folding pathways are generated by a search at regular intervals for structural patterns in the LTM. Each pattern is recorded as a base-pairing pattern (BPP) matrix, a consensus-evaluation operation subject to a renormalization feedback loop. Since several mutually conflicting consensus evaluations might occur at a given time, the need arises for a probabilistic approach appropriate for an ensemble of RNA molecules. 
Thus, a statistical dynamics of consensus formation is determined by the time evolution of the base pairing probability matrix. These dynamics are generated for a functional RNA molecule, a representative of the so-called group I ribozymes, in order to test the model. The resulting ensemble of conformations is sharply peaked and the most probable structure features the predominance of all phylogenetically conserved intrachain helices tantamount to ribozyme function. Furthermore, the magnesium-aided cooperativity that leads to the shaping of the catalytic core is elucidated. Once the predictive folding algorithm has been implemented, the validity of the so-called "adiabatic approximation" is tested. This approximation requires that conformational microstates be lumped up into BPP's which are treated as quasiequilibrium states, while folding pathways are coarsely represented as sequences of BPP transitions. To test the validity of this adiabatic ansatz, a computation of the coarse Shannon information entropy sigma associated to the specific partition of conformation space into BPP's is performed taking into account the LTM evolution and contrasted with the adiabatic computation. The results reveal a subordination of torsional microstate dynamics to BPP transitions within time scales relevant to folding. This adiabatic entrainment in the long-time limit is thus identified as responsible for the expediency of the folding process.
NASA Astrophysics Data System (ADS)
Oosthuizen, Carel J.; Cowley, Paul D.; Kyle, Scotty R.; Bloomer, Paulette
2016-12-01
Physical and/or physiological constraints are assumed to isolate fish populations confined to or dependent on estuarine habitats. Strong isolation by distance is thus expected to affect connectivity. Such structuring has important implications for sustainable utilisation and replenishment of estuarine stocks that are heavily exploited. Here we present a preliminary investigation of the phylogenetic relationships of the riverbream (Acanthopagrus species) along the southern African coast and the geographic genetic structure of what appears to be a locally endemic species or lineage. Mitochondrial DNA (mtDNA) cytochrome b sequences support the notion that the species occurring along the southern African coast is A. vagus and not A. berda as previously thought. Yet, the taxonomy of this widespread Indo-West Pacific species or species-complex requires more in-depth investigation. No genetic differentiation was detected among estuarine populations of A. vagus based on the analyses of mtDNA ND2 gene sequences and 10 polymorphic nuclear microsatellite markers. The star-like genealogy and statistical analyses are consistent with a recent population expansion event. Spatial analyses of microsatellite genotypes fail to reject the null hypothesis of panmixia, indicative of a recent population expansion or ongoing gene flow between different estuaries. The northern localities were identified as containing most of the observed variation. This study not only provides insight into the phylogenetic relationship of A. vagus relative to other Acanthopagrus species but also sheds light on the demographic history and contemporary gene flow of the species.
Modeling DNA bubble formation at the atomic scale
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beleva, V; Rasmussen, K. O.; Garcia, A. E.
We describe the fluctuations of double stranded DNA molecules using a minimalist Go model over a wide range of temperatures. Minimalist models allow us to describe, at the atomic level, the opening and formation of bubbles in DNA double helices. This model includes all the geometrical constraints in helix melting imposed by the 3D structure of the molecule. The DNA forms melted bubbles within double helices. These bubbles form and break as a function of time. The equilibrium average number of broken base pairs shows a sharp change as a function of T. We observe a temperature profile of sequence-dependent bubble formation similar to those measured by Zeng et al. Long nucleic acid molecules melt partially through the formation of bubbles. It is known that CG rich sequences melt at higher temperatures than AT rich sequences. The melting temperature, however, is not solely determined by the CG content, but by the sequence through base stacking and solvent interactions. Recently, models that incorporate the sequence and nonlinear dynamics of DNA double strands have shown that DNA exhibits very rich dynamics. Recent extensions of the Bishop-Peyrard model show that fluctuations in the DNA structure lead to opening in localized regions, and that these regions in the DNA are associated with transcription initiation sites. 1D and 2D models of DNA may contain enough information about stacking and base pairing interactions, but lack the coupling between twisting, bending and base pair opening imposed by the double helical structure of DNA that all atom models easily describe. However, the complexity of the energy function used in all atom simulations (including solvent, ions, etc) does not allow for the description of DNA folding/unfolding events that occur in the microsecond time scale.
Memory and other properties of multiple test procedures generated by entangled graphs.
Maurer, Willi; Bretz, Frank
2013-05-10
Methods for addressing multiplicity in clinical trials have attracted much attention during the past 20 years. They include the investigation of new classes of multiple test procedures, such as fixed sequence, fallback and gatekeeping procedures. More recently, sequentially rejective graphical test procedures have been introduced to construct and visualize complex multiple test strategies. These methods propagate the local significance level of a rejected null hypothesis to not-yet rejected hypotheses. In the graph defining the test procedure, hypotheses together with their local significance levels are represented by weighted vertices and the propagation rule by weighted directed edges. An algorithm provides the rules for updating the local significance levels and the transition weights after rejecting an individual hypothesis. These graphical procedures have no memory in the sense that the origin of the propagated significance level is ignored in subsequent iterations. However, in some clinical trial applications, memory is desirable to reflect the underlying dependence structure of the study objectives. In such cases, it would allow the further propagation of significance levels to be dependent on their origin and thus reflect the grouped parent-descendant structures of the hypotheses. We will give examples of such situations and show how to induce memory and other properties by convex combination of several individual graphs. The resulting entangled graphs provide an intuitive way to represent the underlying relative importance relationships between the hypotheses, are as easy to perform as the original individual graphs, remain sequentially rejective and control the familywise error rate in the strong sense. Copyright © 2012 John Wiley & Sons, Ltd.
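The update rule for a single (memoryless) sequentially rejective graph, as described in this abstract, can be sketched as follows. This is a minimal illustration of the standard weight-propagation algorithm for one graph, not the entangled-graph construction of the paper; the function name and the representation of local levels as weight fractions of alpha are assumptions.

```python
def graphical_test(p_values, weights, transitions, alpha=0.05):
    """Sequentially rejective graphical procedure for a single graph.

    weights[j] * alpha is the local significance level of hypothesis j;
    transitions[j][l] is the fraction of that level passed to l when j
    is rejected. Returns the indices of rejected hypotheses in order.
    """
    w = list(weights)
    g = [row[:] for row in transitions]
    active = set(range(len(p_values)))
    rejected = []
    while True:
        candidates = [j for j in sorted(active) if p_values[j] <= w[j] * alpha]
        if not candidates:
            return rejected
        j = candidates[0]
        active.remove(j)
        rejected.append(j)
        # Propagate the local level of the rejected hypothesis.
        for l in active:
            w[l] += w[j] * g[j][l]
        # Rewire the edges so that they bypass the removed vertex j.
        new_g = [row[:] for row in g]
        for l in active:
            for k in active:
                denom = 1.0 - g[l][j] * g[j][l]
                if l == k or denom <= 0.0:
                    new_g[l][k] = 0.0
                else:
                    new_g[l][k] = (g[l][k] + g[l][j] * g[j][k]) / denom
        g = new_g
        w[j] = 0.0


# Holm procedure for three hypotheses expressed as a graph.
holm = [[0.0, 0.5, 0.5], [0.5, 0.0, 0.5], [0.5, 0.5, 0.0]]
print(graphical_test([0.01, 0.04, 0.02], [1 / 3, 1 / 3, 1 / 3], holm))
```

Note that the update uses only the current weights and edges, which is the "no memory" property the abstract refers to: the origin of a propagated level is not tracked.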
Identification of a Herbal Powder by Deoxyribonucleic Acid Barcoding and Structural Analyses.
Sheth, Bhavisha P; Thaker, Vrinda S
2015-10-01
Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. To identify a herbal powder obtained from a herbalist in the local vicinity of Rajkot, Gujarat, using deoxyribonucleic acid (DNA) barcoding and molecular tools. The DNA was extracted from a herbal powder and selected Cassia species, followed by the polymerase chain reaction (PCR) and sequencing of the rbcL barcode locus. Thereafter the sequences were subjected to National Center for Biotechnology Information (NCBI) basic local alignment search tool (BLAST) analysis, followed by the protein three-dimensional structure determination of the rbcL protein from the herbal powder and Cassia species namely Cassia fistula, Cassia tora and Cassia javanica (sequences obtained in the present study), Cassia roxburghii, and Cassia abbreviata (sequences retrieved from GenBank). Further, the multiple and pairwise structural alignments were carried out in order to identify the herbal powder. The nucleotide sequences obtained from the selected species of Cassia were submitted to GenBank (Accession No. JX141397, JX141405, JX141420). The NCBI BLAST analysis of the rbcL protein from the herbal powder showed an equal sequence similarity (with reference to different parameters like E value, maximum identity, total score, query coverage) to C. javanica and C. roxburghii. In order to solve the ambiguities of the BLAST result, a protein structural approach was implemented. The protein homology models obtained in the present study were submitted to the protein model database (PM0079748-PM0079753). The pairwise structural alignment of the herbal powder (as template) and C. javanica and C. roxburghii (as targets individually) revealed a close similarity of the herbal powder with C. javanica.
A strategy as used here, incorporating the integrated use of DNA barcoding and protein structural analyses could be adopted, as a novel rapid and economic procedure, especially in cases when protein coding loci are considered. Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. A herbal powder was obtained from a herbalist in the local vicinity of Rajkot, Gujarat. An integrated approach using DNA barcoding and structural analyses was carried out to identify the herbal powder. The herbal powder was identified as Cassia javanica L.
Lorenzo, J Ramiro; Alonso, Leonardo G; Sánchez, Ignacio E
2015-01-01
Asparagine residues in proteins undergo spontaneous deamidation, a post-translational modification that may act as a molecular clock for the regulation of protein function and turnover. Asparagine deamidation is modulated by protein local sequence, secondary structure and hydrogen bonding. We present NGOME, an algorithm able to predict non-enzymatic deamidation of internal asparagine residues in proteins in the absence of structural data, using sequence-based predictions of secondary structure and intrinsic disorder. Compared to previous algorithms, NGOME does not require three-dimensional structures yet yields better predictions than available sequence-only methods. Four case studies of specific proteins show how NGOME may help the user identify deamidation-prone asparagine residues, often related to protein gain of function, protein degradation or protein misfolding in pathological processes. A fifth case study applies NGOME at a proteomic scale and unveils a correlation between asparagine deamidation and protein degradation in yeast. NGOME is freely available as a webserver at the National EMBnet node Argentina, URL: http://www.embnet.qb.fcen.uba.ar/ in the subpage "Protein and nucleic acid structure and sequence analysis".
3D coupled heat and mass transfer processes at the scale of sedimentary basins
NASA Astrophysics Data System (ADS)
Cacace, M.; Scheck-Wenderoth, M.; Kaiser, B. O.
2014-12-01
We use coupled 3D simulations of fluid flow, heat, and mass transport based on a 3D structural model of a complex geological setting, the Northeast German Basin (NEGB). The geological structure of the NEGB is characterized by a relatively thick layer of Permian Zechstein salt, structured in different diapirs (up to 5000 m thick) and pillows locally reaching nearly the surface. Salt is thermally more conductive than other sediments, hydraulically impervious but highly soluble. Thus salt structures have a first-order influence on the temperature distribution, the deep flow regime and the salinity of groundwater-bearing aquifers. In addition, the post-Permian sedimentary sequence is vertically subdivided into several aquifers and aquitards. The shallow Quaternary to late Tertiary freshwater aquifer is separated from the underlying Mesozoic saline aquifers by an embedded Tertiary clay-enriched aquitard (Rupelian Aquitard). An important feature of this aquitard is that hydraulic connections between the upper and lower aquifers exist in areas where the Rupelian Aquitard is missing (hydrogeological windows). By means of 3D numerical simulations we explore the role of heat conduction, pressure, and density driven groundwater flow as well as fluid viscosity-related and salinity-dependent effects on the resulting flow and temperature fields. Our results suggest that the regional temperature distribution within the basin results from interactions between regional pressure forces and thermal diffusion locally enhanced by thermal conductivity contrasts between the different sedimentary rocks and the highly conductive salt. Buoyancy forces triggered by temperature-dependent fluid density variations affect the internal thermal configuration only locally. Locations, geometry, and wavelengths of convective thermal anomalies are mainly controlled by the permeability field and thickness values of the respective geological layers.
Numerical results from 3D thermo-haline numerical simulations suggest that hydrogeological windows act as preferential domains of hydraulic interconnectivity between the different aquifers at depth, and enable vigorous heat and mass transport which causes a mixing of warm and saline groundwater with cold and less saline groundwater within both aquifers.
Predicting PDZ domain mediated protein interactions from structure
2013-01-01
Background PDZ domains are structural protein domains that recognize simple linear amino acid motifs, often at protein C-termini, and mediate protein-protein interactions (PPIs) in important biological processes, such as ion channel regulation, cell polarity and neural development. PDZ domain-peptide interaction predictors have been developed based on domain and peptide sequence information. Since domain structure is known to influence binding specificity, we hypothesized that structural information could be used to predict new interactions compared to sequence-based predictors. Results We developed a novel computational predictor of PDZ domain and C-terminal peptide interactions using a support vector machine trained with PDZ domain structure and peptide sequence information. Performance was estimated using extensive cross validation testing. We used the structure-based predictor to scan the human proteome for ligands of 218 PDZ domains and show that the predictions correspond to known PDZ domain-peptide interactions and PPIs in curated databases. The structure-based predictor is complementary to the sequence-based predictor, finding unique known and novel PPIs, and is less dependent on training–testing domain sequence similarity. We used a functional enrichment analysis of our hits to create a predicted map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes, some only found by the structure-based predictor. Based on this analysis, we predict novel PDZ domain involvement in xenobiotic metabolism and suggest new interactions for other processes including wound healing and Wnt signalling. Conclusions We built a structure-based predictor of PDZ domain-peptide interactions, which can be used to scan C-terminal proteomes for PDZ interactions. 
We also show that the structure-based predictor finds many known PDZ mediated PPIs in human that were not found by our previous sequence-based predictor and is less dependent on training–testing domain sequence similarity. Using both predictors, we defined a functional map of human PDZ domain biology and predict novel PDZ domain function. Users may access our structure-based and previous sequence-based predictors at http://webservice.baderlab.org/domains/POW. PMID:23336252
Algorithm, applications and evaluation for protein comparison by Ramanujan Fourier transform.
Zhao, Jian; Wang, Jiasong; Hua, Wei; Ouyang, Pingkai
2015-12-01
The amino acid sequence of a protein determines its chemical properties, chain conformation and biological functions. Protein sequence comparison is of great importance to identify similarities of protein structures and infer their functions. Many properties of a protein correspond to the low-frequency signals within the sequence. Low frequency modes in protein sequences are linked to the secondary structures, membrane protein types, and sub-cellular localizations of the proteins. In this paper, we present Ramanujan Fourier transform (RFT) with a fast algorithm to analyze the low-frequency signals of protein sequences. The RFT method is applied to similarity analysis of protein sequences with the Resonant Recognition Model (RRM). The results show that the proposed fast RFT method on protein comparison is more efficient than commonly used discrete Fourier transform (DFT). RFT can detect common frequencies as significant feature for specific protein families, and the RFT spectrum heat-map of protein sequences demonstrates the information conservation in the sequence comparison. The proposed method offers a new tool for pattern recognition, feature extraction and structural analysis on protein sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
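For readers unfamiliar with the transform named in this abstract, the sketch below computes Ramanujan sums c_q(n) directly from their definition and uses them in a simple finite-length estimate of a Ramanujan Fourier coefficient. It is a naive illustration, not the fast algorithm of the paper, and the function names and the finite-average form of the coefficient are assumptions.

```python
from math import cos, gcd, pi


def ramanujan_sum(q, n):
    """c_q(n): sum of cos(2*pi*k*n/q) over 1 <= k <= q with gcd(k, q) = 1."""
    total = sum(cos(2 * pi * k * n / q) for k in range(1, q + 1) if gcd(k, q) == 1)
    return round(total)  # the sum is always an integer


def euler_phi(q):
    """Number of integers in 1..q that are coprime to q."""
    return sum(1 for k in range(1, q + 1) if gcd(k, q) == 1)


def rft_coefficient(signal, q):
    """Finite-length estimate of the q-th Ramanujan Fourier coefficient.

    Assumes signal[0] corresponds to n = 1 and averages over the whole
    signal; a numeric encoding of the protein sequence is left to the caller.
    """
    acc = sum(x * ramanujan_sum(q, n) for n, x in enumerate(signal, start=1))
    return acc / (euler_phi(q) * len(signal))


# A signal built from c_3(n) has coefficient 1 at q = 3 and 0 at q = 2.
signal = [ramanujan_sum(3, n) for n in range(1, 13)]
print(rft_coefficient(signal, 3), rft_coefficient(signal, 2))
```

Because c_q(n) takes only integer values and is periodic with period q, a coefficient of this kind picks out period-q structure in the encoded sequence.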
Effects of the local structure dependence of evaporation fields on field evaporation behavior
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yao, Lan; Marquis, Emmanuelle A., E-mail: emarq@umich.edu; Withrow, Travis
2015-12-14
Accurate three dimensional reconstructions of atomic positions and full quantification of the information contained in atom probe microscopy data rely on understanding the physical processes taking place during field evaporation of atoms from needle-shaped specimens. However, the modeling framework for atom probe microscopy has only limited quantitative justification. Building on the continuum field models previously developed, we introduce a more physical approach with the selection of evaporation events based on density functional theory calculations. This model reproduces key features observed experimentally in terms of sequence of evaporation, evaporation maps, and depth resolution, and provides insights into the physical limit for spatial resolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Deutscher, J.; Pevec, B.; Beyreuther, K.
1986-10-21
The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolytic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system, HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S. faecalis has now been determined. [³²P]P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.
Sequence-dependent DNA deformability studied using molecular dynamics simulations.
Fujii, Satoshi; Kono, Hidetoshi; Takenaka, Shigeori; Go, Nobuhiro; Sarai, Akinori
2007-01-01
Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained by the simulations is consistent with that by the crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can highly depend on the flanking base pairs. The conformations of xATx tetramers show the most rigidity and are not affected by the flanking base pairs and the xYRx show by contrast the greatest flexibility and change their conformations depending on the base pairs at both ends, suggesting tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA and provide insight into the mechanism of indirect readout during protein-DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein-DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.
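The figure of 136 tetrameric sequences quoted in this abstract follows from treating a double-stranded tetramer and its reverse complement as the same sequence: of the 4^4 = 256 tetramers, 16 are self-complementary, which gives (256 - 16) / 2 + 16 = 136. The short enumeration below checks this count; the function names are illustrative.

```python
from itertools import product

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the sequence of the opposite strand, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def unique_tetramers():
    """Enumerate tetramers, keeping one representative per duplex."""
    seen = set()
    for bases in product("ACGT", repeat=4):
        seq = "".join(bases)
        # A tetramer and its reverse complement describe the same duplex.
        seen.add(min(seq, reverse_complement(seq)))
    return sorted(seen)


print(len(unique_tetramers()))  # 136
```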
iPARTS2: an improved tool for pairwise alignment of RNA tertiary structures, version 2.
Yang, Chung-Han; Shih, Cheng-Ting; Chen, Kun-Tze; Lee, Po-Han; Tsai, Ping-Han; Lin, Jian-Cheng; Yen, Ching-Yu; Lin, Tiao-Yin; Lu, Chin Lung
2016-07-08
Since its first release in 2010, iPARTS has become a valuable tool for globally or locally aligning two RNA 3D structures. It was implemented by a structural alphabet (SA)-based approach, which uses an SA of 23 letters to reduce RNA 3D structures into 1D sequences of SA letters and applies traditional sequence alignment to these SA-encoded sequences for determining their global or local similarity. In this version, we have re-implemented iPARTS into a new web server iPARTS2 by constructing a totally new SA, which consists of 92 elements with each carrying both information of base and backbone geometry for a representative nucleotide. This SA is significantly different from the one used in iPARTS, because the latter consists of only 23 elements with each carrying only the backbone geometry information of a representative nucleotide. Our experimental results have shown that iPARTS2 outperforms its previous version iPARTS and also achieves better accuracy than other popular tools, such as SARA, SETTER and RASS, in RNA alignment quality and function prediction. iPARTS2 takes as input two RNA 3D structures in the PDB format and outputs their global or local alignments with graphical display. iPARTS2 is now available online at http://genome.cs.nthu.edu.tw/iPARTS2/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
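The core idea in this abstract, reducing 3D structures to 1D strings over a structural alphabet and then applying ordinary sequence alignment, can be sketched with a plain global alignment score. The scoring values below (match +1, mismatch -1, gap -1) and the one-character-per-letter encoding are assumptions for illustration; iPARTS2's actual alphabet, scoring matrix and gap penalties are not given here.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment score of two letter strings.

    `a` and `b` stand in for structures already encoded as strings of
    structural-alphabet letters; the scoring scheme is hypothetical.
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


print(global_alignment_score("ABCD", "ABD"))
```

A local (Smith-Waterman style) variant would additionally clamp each cell at zero and report the maximum cell rather than the final one.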
Featured Article: Nuclear export of opioid growth factor receptor is CRM1 dependent.
Kren, Nancy P; Zagon, Ian S; McLaughlin, Patricia J
2016-02-01
Opioid growth factor receptor (OGFr) facilitates growth inhibition in the presence of its specific ligand opioid growth factor (OGF), chemically termed [Met(5)]-enkephalin. The function of the OGF-OGFr axis requires the receptor to translocate to the nucleus. However, the mechanism of nuclear export of OGFr is unknown. In this study, endogenous OGFr, as well as exogenously expressed OGFr-EGFP, demonstrated significant nuclear accumulation in response to leptomycin B (LMB), an inhibitor of CRM1-dependent nuclear export, suggesting that OGFr is exported in a CRM1-dependent manner. One consensus sequence for a nuclear export signal (NES) was identified. Mutation of the associated leucines, L217 L220 L223 and L225, to alanine resulted in decreased nuclear accumulation. NES-EGFP responded to LMB, indicating that this sequence is capable of functioning as an export signal in isolation. To determine why the sequence functions differently in isolation than as a full length protein, the localization of subNES was evaluated in the presence and absence of MG132, a potent inhibitor of proteasomal degradation. MG132 had no effect on subNES localization. The role of tandem repeats located at the C-terminus of OGFr in nuclear trafficking was examined. Six of seven tandem repeats were removed to form deltaTR. DeltaTR localized exclusively to the nucleus, indicating that the tandem repeats may contribute to the localization of the receptor. Similar to the loss of cellular proliferation activity (i.e. inhibition) recorded with subNES, deltaTR also demonstrated a significant loss of inhibitory activity, indicating that the repeats may be integral to receptor function. These experiments reveal that OGFr contains one functional NES, L217 L220 L223 and L225, and can be exported from the nucleus in a CRM1-dependent manner. © 2015 by the Society for Experimental Biology and Medicine.
NASA Astrophysics Data System (ADS)
Dixit, V. K.; Porwal, S.; Singh, S. D.; Sharma, T. K.; Ghosh, Sandip; Oak, S. M.
2014-02-01
Temperature dependence of the photoluminescence (PL) peak energy of bulk and quantum well (QW) structures is studied by using a new phenomenological model for including the effect of localized states. In general an anomalous S-shaped temperature dependence of the PL peak energy is observed for many materials which is usually associated with the localization of excitons in band-tail states that are formed due to potential fluctuations. Under such conditions, the conventional models of Varshni, Viña and Passler fail to replicate the S-shaped temperature dependence of the PL peak energy and provide inconsistent and unrealistic values of the fitting parameters. The proposed formalism persuasively reproduces the S-shaped temperature dependence of the PL peak energy and provides an accurate determination of the exciton localization energy in bulk and QW structures along with the appropriate values of material parameters. An example of a strained InAs0.38P0.62/InP QW is presented by performing detailed temperature and excitation intensity dependent PL measurements and subsequent in-depth analysis using the proposed model. Versatility of the new formalism is tested on a few other semiconductor materials, e.g. GaN, nanotextured GaN, AlGaN and InGaN, which are known to have a significant contribution from the localized states. A quantitative evaluation of the fractional contribution of the localized states is essential for understanding the temperature dependence of the PL peak energy of bulk and QW structures having a large contribution of the band-tail states.
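To see why a conventional model cannot reproduce an S-shaped curve, consider the Varshni relation E(T) = E0 - alpha*T^2 / (T + beta), one of the three conventional models named in this abstract. For positive alpha and beta it decreases monotonically with temperature, so it cannot follow the redshift, blueshift, redshift sequence of an S-shape. The sketch below evaluates it; the parameter values are illustrative GaAs-like numbers chosen as an assumption, not values from the paper, and the paper's own localized-state model is not reproduced here.

```python
def varshni_energy(temperature, e0, alpha, beta):
    """Varshni relation: E(T) = E0 - alpha * T**2 / (T + beta).

    temperature and beta are in kelvin, e0 in eV, alpha in eV/K.
    """
    return e0 - alpha * temperature**2 / (temperature + beta)


# Illustrative (assumed) parameters, roughly GaAs-like.
E0, ALPHA, BETA = 1.519, 5.405e-4, 204.0
temperatures = [0, 50, 100, 150, 200, 250, 300]
energies = [varshni_energy(t, E0, ALPHA, BETA) for t in temperatures]

# The curve only ever decreases, so no S-shape can be fitted.
print(all(later < earlier for earlier, later in zip(energies, energies[1:])))
```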
DNA Shape Dominates Sequence Affinity in Nucleosome Formation
NASA Astrophysics Data System (ADS)
Freeman, Gordon S.; Lequieu, Joshua P.; Hinckley, Daniel M.; Whitmer, Jonathan K.; de Pablo, Juan J.
2014-10-01
Nucleosomes provide the basic unit of compaction in eukaryotic genomes, and the mechanisms that dictate their position at specific locations along a DNA sequence are of central importance to genetics. In this Letter, we employ molecular models of DNA and proteins to elucidate various aspects of nucleosome positioning. In particular, we show how DNA's histone affinity is encoded in its sequence-dependent shape, including subtle deviations from the ideal straight B-DNA form and local variations of minor groove width. By relying on high-precision simulations of the free energy of nucleosome complexes, we also demonstrate that, depending on DNA's intrinsic curvature, histone binding can be dominated by bending interactions or electrostatic interactions. More generally, the results presented here explain how sequence, manifested as the shape of the DNA molecule, dominates molecular recognition in the problem of nucleosome positioning.
XPD-dependent activation of apoptosis in response to triplex-induced DNA damage
Kaushik Tiwari, Meetu; Rogers, Faye A.
2013-01-01
DNA sequences capable of forming triplexes are prevalent in the human genome and have been found to be intrinsically mutagenic. Consequently, a balance between DNA repair and apoptosis is critical to counteract their effect on genomic integrity. Using triplex-forming oligonucleotides to synthetically create altered helical distortions, we have determined that pro-apoptotic pathways are activated by the formation of triplex structures. Moreover, the TFIIH factor, XPD, occupies a central role in triggering apoptosis in response to triplex-induced DNA strand breaks. Here, we show that triplexes are capable of inducing XPD-independent double strand breaks, which result in the formation of γH2AX foci. XPD was subsequently recruited to the triplex-induced double strand breaks and co-localized with γH2AX at the damage site. Furthermore, phosphorylation of H2AX tyrosine 142 was found to stimulate the signaling pathway of XPD-dependent apoptosis. We suggest that this mechanism may play an active role in minimizing genomic instability induced by naturally occurring noncanonical structures, perhaps protecting against cancer initiation. PMID:23913414
Zamiri, Bita; Reddy, Kaalak; Macgregor, Robert B; Pearson, Christopher E
2014-02-21
Certain DNA and RNA sequences can form G-quadruplexes, which can affect genetic instability, promoter activity, RNA splicing, RNA stability, and neurite mRNA localization. Amyotrophic lateral sclerosis and frontotemporal dementia can be caused by expansion of a (GGGGCC)n repeat in the C9orf72 gene. Mutant r(GGGGCC)n- and r(GGCCCC)n-containing transcripts aggregate in nuclear foci, possibly sequestering repeat-binding proteins such as ASF/SF2 and hnRNPA1, suggesting a toxic RNA pathogenesis, as occurs in myotonic dystrophy. Furthermore, the C9orf72 repeat RNA was recently demonstrated to undergo the noncanonical repeat-associated non-AUG translation (RAN translation) into pathologic dipeptide repeats in patient brains, a process that is thought to depend upon RNA structure. We previously demonstrated that the r(GGGGCC)n RNA forms repeat tract length-dependent G-quadruplex structures that bind the ASF/SF2 protein. Here we show that the cationic porphyrin (5,10,15,20-tetra(N-methyl-4-pyridyl) porphyrin (TMPyP4)), which can bind some G-quadruplex-forming sequences, can bind and distort the G-quadruplex formed by r(GGGGCC)8, and this ablates the interaction of either hnRNPA1 or ASF/SF2 with the repeat. These findings provide proof of concept that nucleic acid binding small molecules, such as TMPyP4, can distort the secondary structure of the C9orf72 repeat, which may beneficially disrupt protein interactions, which may ablate either protein sequestration and/or RAN translation into potentially toxic dipeptides. Disruption of secondary structure formation of the C9orf72 RNA repeats may be a viable therapeutic avenue, as well as a means to test the role of RNA structure upon RAN translation.
Single-Molecule Denaturation Mapping of Genomic DNA in Nanofluidic Channels
NASA Astrophysics Data System (ADS)
Reisner, Walter; Larsen, Niels; Kristensen, Anders; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik
2009-03-01
We have developed a new DNA barcoding technique based on the partial denaturation of extended fluorescently labeled DNA molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions and can be explained by calculations of sequence-dependent melting probability. We believe this melting mapping technology is the first optically based single molecule technique sensitive to genome wide sequence variation that does not require an additional enzymatic labeling or restriction scheme.
Quantification of loading in biomechanical testing: the influence of dissection sequence.
Funabashi, Martha; El-Rich, Marwan; Prasad, Narasimha; Kawchuk, Gregory N
2015-09-18
Sequential dissection is a technique used to investigate loads experienced by articular tissues. When the joint of interest is tested in an unconstrained manner, its kinematics change with each tissue removal. To address this limitation, sufficiently rigid robots are used to constrain joint kinematics. While this approach can quantify loads experienced by each tissue, it does not assure similar results when removal order is changed. Specifically, structure loading is assumed to be independent of removal order if the structure behaves linearly (i.e. principle of superposition applies), but dependent on removal order when response is affected by material and/or geometry nonlinearities and/or viscoelasticity (e.g. biological tissues). Therefore, this experiment was conducted to evaluate if structure loading created through robotic testing is dependent on the order in which connectors are removed. Six identical models were 3D printed. Each model was composed of 2 rigid bodies and 3 connecting structures with nonlinear time-dependent behavior. To these models, pure rotations were applied about a predefined static center of rotation using a parallel robot. A unique dissection sequence was used for each of the six models and the same movements applied robotically after each dissection. When comparing the moments experienced by each structure between different removal sequences, a statistically significant difference (p<0.05) was observed. These results suggest that even in an optimized environment, the sequence in which nonlinear viscoelastic structures are removed influences model loading. These findings support prior work suggesting that tissue loads obtained from robotic testing are specific to removal order. Copyright © 2015 Elsevier Ltd. All rights reserved.
Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita
2017-06-01
The unavailability of the reproductive structure and unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21 tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive an individual sequence-structure matrix and gain more resolution in the phylogenetic reconstruction. The results showed that ITS sequence-structure is a reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny over the sole sequence based one underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the only sequence-based analysis for phylogenetic relationship assessment of bamboo.
Protein functional features are reflected in the patterns of mRNA translation speed.
López, Daniel; Pazos, Florencio
2015-07-09
The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency also play a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This supports the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.
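To make the scale of this degeneracy concrete, the following Python sketch counts how many distinct coding sequences can encode a given peptide under the standard genetic code (NCBI translation table 1). It is an illustration only and not part of the study's analysis. The function name and example peptides are hypothetical, and codons are written in the DNA alphabet (T in place of U).

```python
from itertools import product
from math import prod

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
# ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of codons available for each amino acid (and for stop).
CODONS_PER_AA = {}
for aa in CODON_TABLE.values():
    CODONS_PER_AA[aa] = CODONS_PER_AA.get(aa, 0) + 1


def count_synonymous_mrnas(peptide):
    """Number of distinct coding sequences that translate to `peptide`."""
    return prod(CODONS_PER_AA[aa] for aa in peptide)


print(count_synonymous_mrnas("MW"))   # Met and Trp each have a single codon
print(count_synonymous_mrnas("MLS"))  # Leu and Ser each have six codons
```

Even a short peptide quickly reaches a very large number of synonymous sequences, which is the search space within which local properties such as secondary structure and tRNA availability can vary.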
Network evolution induced by asynchronous stimuli through spike-timing-dependent plasticity.
Yuan, Wu-Jie; Zhou, Jian-Fang; Zhou, Changsong
2013-01-01
In the sensory neural system, external asynchronous stimuli play an important role in perceptual learning, associative memory and map development. However, the organization of structure and dynamics of neural networks induced by external asynchronous stimuli is not well understood. Spike-timing-dependent plasticity (STDP) is a typical synaptic plasticity that has been extensively found in the sensory systems and that has received much theoretical attention. This synaptic plasticity is highly sensitive to correlations between pre- and postsynaptic firings. Thus, STDP is expected to play an important role in response to external asynchronous stimuli, which can induce segregative pre- and postsynaptic firings. In this paper, we study the impact of external asynchronous stimuli on the organization of structure and dynamics of neural networks through STDP. We construct a two-dimensional spatial neural network model with local connectivity and sparseness, and use external currents to stimulate alternately on different spatial layers. The external currents imposed alternately on spatial layers can be regarded here as external asynchronous stimuli. Through extensive numerical simulations, we focus on the effects of stimulus number and inter-stimulus timing on synaptic connecting weights and the property of propagation dynamics in the resulting network structure. Interestingly, the resulting feedforward structure induced by stimulus-dependent asynchronous firings and its propagation dynamics both reflect the underlying property of STDP. The results imply a possible important role of STDP in generating feedforward structure and collective propagation activity required for experience-dependent map plasticity in developing in vivo sensory pathways and cortices. The relevance of the results to cue-triggered recall of learned temporal sequences, an important cognitive function, is briefly discussed as well. Furthermore, this finding suggests a potential application for examining STDP by measuring neural population activity in a cultured neural network.
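The sensitivity of STDP to the relative timing of pre- and postsynaptic spikes can be illustrated with the widely used pair-based exponential window. The Python sketch below assumes that generic textbook form. The abstract does not state which STDP rule or parameters the authors used, so the function name, amplitudes and time constants are hypothetical.

```python
import math


def stdp_weight_change(delta_t_ms, a_plus=0.01, a_minus=0.012,
                       tau_plus_ms=20.0, tau_minus_ms=20.0):
    """Pair-based STDP window, with delta_t = t_post - t_pre in milliseconds.

    Pre-before-post (delta_t > 0) potentiates the synapse;
    post-before-pre (delta_t < 0) depresses it.
    All parameter defaults are illustrative assumptions.
    """
    if delta_t_ms > 0:
        return a_plus * math.exp(-delta_t_ms / tau_plus_ms)
    if delta_t_ms < 0:
        return -a_minus * math.exp(delta_t_ms / tau_minus_ms)
    return 0.0


for dt in (-40, -10, 10, 40):
    print(f"delta_t = {dt:+4d} ms   dw = {stdp_weight_change(dt):+.5f}")
```

Under a rule of this kind, a stimulus that consistently makes one layer fire shortly before another strengthens connections in that direction and weakens the reverse ones, which is the intuition behind the feedforward structure described in the abstract.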
Functional specificity of a Hox protein mediated by the recognition of minor groove structure.
Joshi, Rohit; Passner, Jonathan M; Rohs, Remo; Jain, Rinku; Sosinsky, Alona; Crickmore, Michael A; Jacob, Vinitha; Aggarwal, Aneel K; Honig, Barry; Mann, Richard S
2007-11-02
The recognition of specific DNA-binding sites by transcription factors is a critical yet poorly understood step in the control of gene expression. Members of the Hox family of transcription factors bind DNA by making nearly identical major groove contacts via the recognition helices of their homeodomains. In vivo specificity, however, often depends on extended and unstructured regions that link Hox homeodomains to a DNA-bound cofactor, Extradenticle (Exd). Using a combination of structure determination, computational analysis, and in vitro and in vivo assays, we show that Hox proteins recognize specific Hox-Exd binding sites via residues located in these extended regions that insert into the minor groove but only when presented with the correct DNA sequence. Our results suggest that these residues, which are conserved in a paralog-specific manner, confer specificity by recognizing a sequence-dependent DNA structure instead of directly reading a specific DNA sequence.
Fluorescent DNA-templated silver nanoclusters
NASA Astrophysics Data System (ADS)
Lin, Ruoqian
Because of the ultra-small size and biocompatibility of silver nanoclusters, they have attracted much research interest for their applications in biolabeling. Among the many ways of synthesizing silver nanoclusters, the DNA-templated method is particularly attractive: the high tunability of DNA sequences provides another degree of freedom for controlling the chemical and photophysical properties. However, systematic studies of how DNA sequences and concentrations control the photophysical properties are still lacking. The aim of this thesis is to investigate the binding mechanisms between silver clusters and single-stranded DNAs. Here we report the synthesis and characterization of DNA-templated silver nanoclusters and provide a systematic interrogation of the effects of DNA concentrations and sequences, including lengths and secondary structures. We performed a series of syntheses utilizing five different sequences to explore the optimal synthesis condition. By characterizing samples with UV-vis and fluorescence spectroscopy, we identified the most suitable reactant ratio and synthesis conditions. Two of the sequences were chosen for further concentration dependence studies and sequence dependence studies. We found that cytosine-rich sequences are more likely to produce silver nanoclusters with stronger fluorescence signals; however, sequences with hairpin secondary structures are more capable of stabilizing silver nanoclusters. In addition, the fluorescence peak emission intensities and wavelengths of the DNA-templated silver clusters have sequence-dependent fingerprints. This potentially can be applied to sequence sensing in the future. However, the current conclusions are not yet definitive; there is still difficulty in formulating general rules for DNA strand design and silver nanocluster production. Further investigation of more sequences could resolve these questions in the future.
Scieglinska, D; Widłak, W; Konopka, W; Poutanen, M; Rahman, N; Huhtaniemi, I; Krawczyk, Z
2001-01-01
The rat Hst70 gene and its mouse counterpart Hsp70.2 belong to the family of Hsp70 heat shock genes and are specifically expressed in male germ cells. Previous studies regarding the structure of the 5' region of the transcription unit of these genes as well as localization of the 'cis' elements conferring their testis-specific expression gave contradictory results [Widlak, Markkula, Krawczyk, Kananen and Huhtaniemi (1995) Biochim. Biophys. Acta 1264, 191-200; Dix, Rosario-Herrle, Gotoh, Mori, Goulding, Barret and Eddy (1996) Dev. Biol. 174, 310-321]. In the present paper we solve these controversies and show that the 5' untranslated region (UTR) of the Hst70 gene contains an intron which is localized similar to that of the mouse Hsp70.2 gene. Reverse transcriptase-mediated PCR, Northern blotting and RNase protection analysis revealed that the transcription initiation of both genes starts at two main distant sites, and one of them is localized within the intron. As a result two populations of Hst70 gene transcripts with similar sizes but different 5' UTR structures can be detected in total testicular RNA. Functional analysis of the Hst70 gene promoter in transgenic mice and transient transfection assays proved that the DNA fragment of approx. 360 bp localized upstream of the ATG transcription start codon is the minimal promoter required for testis-specific expression of the HST70/chloramphenicol acetyltransferase transgene. These experiments also suggest that the expression of the gene may depend on 'cis' regulatory elements localized within exon 1 and the intron sequences. PMID:11563976
Dispersal, mating events and fine-scale genetic structure in the lesser flat-headed bats.
Hua, Panyu; Zhang, Libiao; Guo, Tingting; Flanders, Jon; Zhang, Shuyi
2013-01-01
Population genetic structure has important consequences in evolutionary processes and conservation genetics in animals. Fine-scale population genetic structure depends on the pattern of landscape, the permanent movement of individuals, and the dispersal of their genes during temporary mating events. The lesser flat-headed bat (Tylonycteris pachypus) is a nonmigratory Asian bat species that roosts in small groups within the internodes of bamboo stems and the habitats are fragmented. Our previous parentage analyses revealed considerable extra-group mating in this species. To assess the spatial limits and sex-biased nature of gene flow in the same population, we used 20 microsatellite loci and mtDNA sequencing of the ND2 gene to quantify genetic structure among 54 groups of adult flat-headed bats, at nine localities in South China. AMOVA and F(ST) estimates revealed significant genetic differentiation among localities. Alternatively, the pairwise F(ST) values among roosting groups appeared to be related to the incidence of associated extra-group breeding, suggesting the impact of mating events on fine-scale genetic structure. Global spatial autocorrelation analyses showed positive genetic correlation for up to 3 km, indicating the role of fragmented habitat and the specialized social organization as a barrier in the movement of individuals among bamboo forests. The male-biased dispersal pattern resulted in weaker spatial genetic structure between localities among males than among females, and fine-scale analyses supported that relatedness levels within internodes were higher among females than among males. Finally, only females were more related to their same sex roost mates than to individuals from neighbouring roosts, suggestive of natal philopatry in females.
Automatic Tool for Local Assembly Structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whole community shotgun sequencing of total DNA (i.e. metagenomics) and total RNA (i.e. metatranscriptomics) has provided a wealth of information on microbial community structure, predicted functions, and metabolic networks, and is even able to reconstruct complete genomes directly. Here we present ATLAS (Automatic Tool for Local Assembly Structures), a comprehensive pipeline for assembly, annotation, and genomic binning of metagenomic and metatranscriptomic data with an integrated framework for Multi-Omics. This will provide an open source tool for the Multi-Omic community at large.
2006-01-01
Most eukaryotic mRNAs are monocistronic and translated by cap-dependent initiation. LINE-1 RNA is exceptional because it is naturally dicistronic, encoding two proteins essential for retrotransposition, ORF1p and ORF2p. Here, we show that sequences upstream of ORF1 and ORF2 in mouse L1 function as internal ribosome entry sites (IRESes). Deletion analysis of the ORF1 IRES indicates that RNA structure is critical for its function. Conversely, the ORF2 IRES localizes to 53 nt near the 3′ end of ORF1, and appears to depend upon sequence rather than structure. The 40 nt intergenic region (IGR) is not essential for ORF2 IRES function or retrotransposition. Because of strong cis-preference for both proteins during L1 retrotransposition, correct stoichiometry of the two proteins can only be achieved post-transcriptionally. Although the precise stoichiometry is unknown, the retrotransposition intermediate likely contains hundreds of ORF1ps for every ORF2p, together with one L1 RNA. IRES-mediated translation initiation is a well-established mechanism of message-specific regulation, hence, unique mechanisms for the recognition and control of these two IRESes in the L1 RNA could explain differences in translational efficiency of ORF1 and ORF2. In addition, translational regulation may provide an additional layer of control on L1 retrotransposition efficiency, thereby protecting the integrity of the genome. PMID:16464823
Computational Modeling of Proteins based on Cellular Automata: A Method of HP Folding Approximation.
Madain, Alia; Abu Dalhoum, Abdel Latif; Sleit, Azzam
2018-06-01
The design of a protein folding approximation algorithm is not straightforward even when a simplified model is used. The folding problem is a combinatorial problem, where approximation and heuristic algorithms are usually used to find near-optimal folds of protein primary structures. Approximation algorithms provide guarantees on the distance to the optimal solution. The folding approximation approach proposed here depends on two-dimensional cellular automata to fold proteins presented in a well-studied simplified model called the hydrophobic-hydrophilic model. Cellular automata are discrete computational models that rely on local rules to produce some overall global behavior. One-third and one-fourth approximation algorithms choose a subset of the hydrophobic amino acids to form H-H contacts. Those algorithms start with finding a point to fold the protein sequence into two sides where one side ignores H's at even positions and the other side ignores H's at odd positions. In addition, blocks or groups of amino acids fold the same way according to a predefined normal form. We intend to improve approximation algorithms by considering all hydrophobic amino acids and folding based on the local neighborhood instead of using normal forms. The CA does not assume a fixed folding point. The proposed approach guarantees one half approximation minus the H-H endpoints. This guaranteed lower bound applies to short sequences only. This is proved as the core and the folds of the protein will have two identical sides for all short sequences.
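The quantity that such algorithms try to maximize in the two-dimensional hydrophobic-hydrophilic (HP) model is the number of H-H contacts: pairs of H residues that are neighbors on the square lattice but not consecutive in the chain. The Python sketch below scores a given fold in that way. It illustrates the scoring of the model only, not the cellular-automaton folding procedure of the paper, and the function name and example fold are hypothetical.

```python
def count_hh_contacts(sequence, coords):
    """Count H-H contacts of an HP sequence folded on a 2D square lattice.

    `sequence` is a string over {"H", "P"}; `coords` lists the (x, y)
    lattice position of each residue, in chain order.
    """
    if len(sequence) != len(coords):
        raise ValueError("sequence and coords must have the same length")
    if len(set(coords)) != len(coords):
        raise ValueError("fold is not self-avoiding")
    for (x1, y1), (x2, y2) in zip(coords, coords[1:]):
        if abs(x1 - x2) + abs(y1 - y2) != 1:
            raise ValueError("consecutive residues must be lattice neighbors")

    index_at = {pos: i for i, pos in enumerate(coords)}
    contacts = 0
    for i, (x, y) in enumerate(coords):
        if sequence[i] != "H":
            continue
        # Look only right and up so that each lattice pair is counted once.
        for neighbor in ((x + 1, y), (x, y + 1)):
            j = index_at.get(neighbor)
            if j is not None and sequence[j] == "H" and abs(i - j) > 1:
                contacts += 1
    return contacts


# A four-residue chain folded into a unit square brings its two ends together.
square_fold = [(0, 0), (1, 0), (1, 1), (0, 1)]
print(count_hh_contacts("HPPH", square_fold))
```

An approximation guarantee such as "one half" is a statement about this count relative to the best achievable count for the same sequence.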
Score distributions of gapped multiple sequence alignments down to the low-probability tail
NASA Astrophysics Data System (ADS)
Fieth, Pascal; Hartmann, Alexander K.
2016-08-01
Assessing the significance of alignment scores of optimally aligned DNA or amino acid sequences can be achieved via the knowledge of the score distribution of random sequences. But this requires obtaining the distribution in the biologically relevant high-scoring region, where the probabilities are exponentially small. For gapless local alignments of infinitely long sequences this distribution is known analytically to follow a Gumbel distribution. Distributions for gapped local alignments and global alignments of finite lengths can only be obtained numerically. To obtain results for the small-probability region, specific statistical mechanics-based rare-event algorithms can be applied. In previous studies, this was achieved for pairwise alignments. They showed that, contrary to results from previous simple sampling studies, strong deviations from the Gumbel distribution occur in case of finite sequence lengths. Here we extend the studies to multiple sequence alignments with gaps, which are much more relevant for practical applications in molecular biology. We study the distributions of scores over a large range of the support, reaching probabilities as small as 10^-160, for global and local (sum-of-pair scores) multiple alignments. We find that even after suitable rescaling, eliminating the sequence-length dependence, the distributions for multiple alignment differ from the pairwise alignment case. Furthermore, we also show that the previously discussed Gaussian correction to the Gumbel distribution needs to be refined, also for the case of pairwise alignments.
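As a point of reference for the Gumbel law mentioned in this abstract, the Python sketch below evaluates the upper-tail probability P(S >= s) = 1 - exp(-exp(-lambda*(s - u))) and shows why the extreme tail needs numerical care. It covers the reference distribution only, not the rare-event sampling used in the study, and the location u and rate lambda used here are hypothetical values.

```python
import math


def gumbel_tail_probability(score, location, rate):
    """Upper tail of a Gumbel law with CDF exp(-exp(-rate * (score - location))).

    math.expm1 keeps precision when the result is far below machine epsilon.
    """
    return -math.expm1(-math.exp(-rate * (score - location)))


def naive_gumbel_tail(score, location, rate):
    """Direct evaluation of 1 - CDF; underflows to 0.0 in the far tail."""
    return 1.0 - math.exp(-math.exp(-rate * (score - location)))


# Hypothetical parameters, for illustration only.
LOCATION, RATE = 0.0, 1.0
for s in (0.0, 10.0, 400.0):
    print(s, gumbel_tail_probability(s, LOCATION, RATE),
          naive_gumbel_tail(s, LOCATION, RATE))
```

Far from the location parameter the tail behaves like exp(-lambda*(s - u)), so tiny p-values remain representable as floating-point numbers even though they can never be reached by simple sampling, which is the motivation for the rare-event algorithms described above.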
Ringwald, M; Schuh, R; Vestweber, D; Eistetter, H; Lottspeich, F; Engel, J; Dölz, R; Jähnig, F; Epplen, J; Mayer, S
1987-01-01
We have determined the amino acid sequence of the Ca2+-dependent cell adhesion molecule uvomorulin as it appears on the cell surface. The extracellular part of the molecule exhibits three internally repeated domains of 112 residues which are most likely generated by gene duplication. Each of the repeated domains contains two highly conserved units which could represent putative Ca2+-binding sites. Secondary structure predictions suggest that the putative Ca2+-binding units are located in external loops at the surface of the protein. The protein sequence exhibits a single membrane-spanning region and a cytoplasmic domain. Sequence comparison reveals extensive homology to the chicken L-CAM. Both uvomorulin and L-CAM are identical in 65% of their entire amino acid sequence suggesting a common origin for both CAMs. PMID:3501370
Underwound DNA under Tension: Structure, Elasticity, and Sequence-Dependent Behaviors
NASA Astrophysics Data System (ADS)
Sheinin, Maxim Y.; Forth, Scott; Marko, John F.; Wang, Michelle D.
2011-09-01
DNA melting under torsion plays an important role in a wide variety of cellular processes. In the present Letter, we have investigated DNA melting at the single-molecule level using an angular optical trap. By directly measuring force, extension, torque, and angle of DNA, we determined the structural and elastic parameters of torsionally melted DNA. Our data reveal that under moderate forces, the melted DNA assumes a left-handed structure as opposed to an open bubble conformation and is highly torsionally compliant. We have also discovered that at low forces melted DNA properties are highly dependent on DNA sequence. These results provide a more comprehensive picture of the global DNA force-torque phase diagram.
Daroles, Laura; Gribaudo, Simona; Doulazmi, Mohamed; Scotto-Lomassese, Sophie; Dubacq, Caroline; Mandairon, Nathalie; Greer, Charles August; Didier, Anne; Trembleau, Alain; Caillé, Isabelle
2016-07-15
In the adult brain, structural plasticity allowing gain or loss of synapses remodels circuits to support learning. In fragile X syndrome, the absence of fragile X mental retardation protein (FMRP) leads to defects in plasticity and learning deficits. FMRP is a master regulator of local translation but its implication in learning-induced structural plasticity is unknown. Using an olfactory learning task requiring adult-born olfactory bulb neurons and cell-specific ablation of FMRP, we investigated whether learning shapes adult-born neuron morphology during their synaptic integration and its dependence on FMRP. We used alpha subunit of the calcium/calmodulin-dependent kinase II (αCaMKII) mutant mice with altered dendritic localization of αCaMKII messenger RNA, as well as a reporter of αCaMKII local translation to investigate the role of this FMRP messenger RNA target in learning-dependent structural plasticity. Learning induces profound changes in dendritic architecture and spine morphology of adult-born neurons that are prevented by ablation of FMRP in adult-born neurons and rescued by a metabotropic glutamate receptor 5 antagonist. Moreover, dendritically translated αCaMKII is necessary for learning and associated structural modifications, and learning triggers an FMRP-dependent increase of αCaMKII dendritic translation in adult-born neurons. Our results strongly suggest that FMRP mediates structural plasticity of olfactory bulb adult-born neurons to support olfactory learning through αCaMKII local translation. This reveals a new role for FMRP-regulated dendritic local translation in learning-induced structural plasticity. This might be of clinical relevance for the understanding of critical period disruption in autism spectrum disorder patients, among which fragile X syndrome is the primary monogenic cause. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Penno, Christophe; Sharma, Virag; Coakley, Arthur; O'Connell Motherway, Mary; van Sinderen, Douwe; Lubkowska, Lucyna; Kireeva, Maria L; Kashlev, Mikhail; Baranov, Pavel V; Atkins, John F
2015-04-21
Escherichia coli and yeast DNA-dependent RNA polymerases are shown to mediate efficient nascent transcript stem loop formation-dependent RNA-DNA hybrid realignment. The realignment was discovered on the heteropolymeric sequence T5C5 and yields transcripts lacking a C residue within a corresponding U5C4. The sequence studied is derived from a Roseiflexus insertion sequence (IS) element where the resulting transcriptional slippage is required for transposase synthesis. The stability of the RNA structure, the proximity of the stem loop to the slippage site, the length and composition of the slippage site motif, and the identity of its 3' adjacent nucleotides (nt) are crucial for transcripts lacking a single C. In many respects, the RNA structure requirements for this slippage resemble those for hairpin-dependent transcription termination. In a purified in vitro system, the slippage efficiency ranges from 5% to 75% depending on the concentration ratios of the nucleotides specified by the slippage sequence and the 3' nt context. The only previous proposal of stem loop mediated slippage, which was in Ebola virus expression, was based on incorrect data interpretation. We propose a mechanical slippage model involving the RNAP translocation state as the main motor in slippage directionality and efficiency. It is distinct from previously described models, including the one proposed for paramyxovirus, where following random movement efficiency is mainly dependent on the stability of the new realigned hybrid. In broadening the scope for utilization of transcription slippage for gene expression, the stimulatory structure provides parallels with programmed ribosomal frameshifting at the translation level.
The Cotton Centromere Contains a Ty3-gypsy-like LTR Retroelement
Luo, Song; Mach, Jennifer; Abramson, Bradley; Ramirez, Rolando; Schurr, Robert; Barone, Pierluigi; Copenhaver, Gregory; Folkerts, Otto
2012-01-01
The centromere is a repeat-rich structure essential for chromosome segregation; with the long-term aim of understanding centromere structure and function, we set out to identify cotton centromere sequences. To isolate centromere-associated sequences from cotton (Gossypium hirsutum), we surveyed tandem and dispersed repetitive DNA in the genus. Centromere-associated elements in other plants include tandem repeats and, in some cases, centromere-specific retroelements. Examination of cotton genomic survey sequences for tandem repeats yielded sequences that did not localize to the centromere. However, among the repetitive sequences we also identified a gypsy-like LTR retrotransposon (Centromere Retroelement Gossypium, CRG) that localizes to the centromere region of all chromosomes in domestic upland cotton, Gossypium hirsutum, the major commercially grown cotton. The location of the functional centromere was confirmed by immunostaining with antiserum to the centromere-specific histone CENH3, which co-localizes with CRG hybridization on metaphase mitotic chromosomes. G. hirsutum is an allotetraploid composed of A and D genomes, and CRG is also present in the centromere regions of other AD cotton species. Furthermore, FISH and genomic dot blot hybridization revealed that CRG is found in D-genome diploid cotton species, but not in A-genome diploid species, indicating that this retroelement may have invaded the A-genome centromeres during allopolyploid formation and amplified during evolutionary history. CRG is also found in other diploid Gossypium species, including B and E2 genome species, but not in the C, E1, F, and G genome species tested. Isolation of this centromere-specific retrotransposon from Gossypium provides a probe for further understanding of centromere structure, and a tool for future engineering of centromere mini-chromosomes in this important crop species. PMID:22536361
p53 Specifically Binds Triplex DNA In Vitro and in Cells
Brázdová, Marie; Tichý, Vlastimil; Helma, Robert; Bažantová, Pavla; Polášková, Alena; Krejčí, Aneta; Petr, Marek; Navrátilová, Lucie; Tichá, Olga; Nejedlý, Karel; Bennink, Martin L.; Subramaniam, Vinod; Bábková, Zuzana; Martínek, Tomáš; Lexa, Matej; Adámik, Matej
2016-01-01
Triplex DNA is implicated in a wide range of biological activities, including regulation of gene expression and genomic instability leading to cancer. The tumor suppressor p53 is a central regulator of cell fate in response to different types of insults. Sequence- and structure-specific modes of DNA recognition are core attributes of the p53 protein. The focus of this work is the structure-specific binding of p53 to DNA containing triplex-forming sequences in vitro and in cells and the effect on p53-driven transcription. This is the first DNA binding study of full-length p53 and its deletion variants to both intermolecular and intramolecular T.A.T triplexes. We demonstrate that the interaction of p53 with the intermolecular T.A.T triplex is comparable to the recognition of the CTG-hairpin non-B DNA structure. Using deletion mutants, we determined the C-terminal DNA binding domain of p53 to be crucial for triplex recognition. Furthermore, strong p53 recognition of intramolecular T.A.T triplexes (H-DNA), stabilized by negative superhelicity in plasmid DNA, was detected by competition and immunoprecipitation experiments, and visualized by AFM. Moreover, chromatin immunoprecipitation revealed p53 binding to a T.A.T-forming sequence in vivo. Enhanced reporter transactivation by p53 upon insertion of a triplex-forming sequence into a plasmid with a p53 consensus sequence was observed by luciferase reporter assays. An in silico scan of human regulatory regions for the simultaneous presence of both the consensus sequence and T.A.T motifs identified a set of candidate p53 target genes, and p53-dependent activation of several of them (ABCG5, ENOX1, INSR, MCC, NFAT5) was confirmed by RT-qPCR. Our results show that the T.A.T triplex comprises a new class of p53 binding sites targeted by p53 in a DNA structure-dependent mode in vitro and in cells. The contribution of p53 DNA structure-dependent binding to the regulation of transcription is discussed. PMID:27907175
Towards Long-Range RNA Structure Prediction in Eukaryotic Genes.
Pervouchine, Dmitri D
2018-06-15
The ability to form an intramolecular structure plays a fundamental role in eukaryotic RNA biogenesis. Proximate regions in the primary transcripts fold into a local secondary structure, which is then hierarchically assembled into a tertiary structure that is stabilized by RNA-binding proteins and long-range intramolecular base pairings. While the local RNA structure can be predicted reasonably well for short sequences, long-range structure at the scale of eukaryotic genes remains problematic from the computational standpoint. The aim of this review is to list functional examples of long-range RNA structures, to summarize current comparative methods of structure prediction, and to highlight their advances and limitations in the context of long-range RNA structures. Most comparative methods implement the “first-align-then-fold” principle, i.e., they operate on multiple sequence alignments, while functional RNA structures often reside in non-conserved parts of the primary transcripts. The opposite “first-fold-then-align” approach is currently explored to a much lesser extent. Developing novel methods in both directions will improve the performance of comparative RNA structure analysis and help discover novel long-range structures, their higher-order organization, and RNA–RNA interactions across the transcriptome.
Lee, Hui Sun; Im, Wonpil
2016-04-01
Molecular recognition by protein mostly occurs in a local region on the protein surface. Thus, an efficient computational method for accurate characterization of protein local structural conservation is necessary to better understand biology and drug design. We present a novel local structure alignment tool, G-LoSA. G-LoSA aligns protein local structures in a sequence order independent way and provides a GA-score, a chemical feature-based and size-independent structure similarity score. Our benchmark validation shows the robust performance of G-LoSA to the local structures of diverse sizes and characteristics, demonstrating its universal applicability to local structure-centric comparative biology studies. In particular, G-LoSA is highly effective in detecting conserved local regions on the entire surface of a given protein. In addition, the applications of G-LoSA to identifying template ligands and predicting ligand and protein binding sites illustrate its strong potential for computer-aided drug design. We hope that G-LoSA can be a useful computational method for exploring interesting biological problems through large-scale comparison of protein local structures and facilitating drug discovery research and development. G-LoSA is freely available to academic users at http://im.compbio.ku.edu/GLoSA/. © 2016 The Protein Society.
NASA Astrophysics Data System (ADS)
Zhang, Yanhua; Sorjonen-Ward, Peter; Ord, Alison; Kontinen, Asko
2015-04-01
Numerical simulations of geological processes may be used in several ways. On the one hand there is an analytical, or forensic approach, analogous to geophysical inversion, to constrain boundary conditions and to demonstrate how a particular geological process or sequence of events is feasible, or even probable. Alternatively, or additionally, modeling of earth processes can be used in a predictive sense, where forward modeling of various scenarios representing different initial states and applied boundary conditions and processes can provide generic or specific insights - depending on model complexity - which may be applied to problems as diverse as geohazard risk assessment and mineral exploration. These two approaches are complementary, and either may be emphasized, depending on the degree of understanding or density of data in a given study area. Here we review how the results of modeling can be used to develop and test structural scenarios and hypotheses and how they can be integrated with new data sets, in this case, deep crustal and upper crustal high resolution reflection seismic data acquired in recent years in the Paleoproterozoic Outokumpu ore district in eastern Finland. A range of process models have been devised and run for the Outokumpu mineral system, including coupled convective reactive transport models, coupled thermomechanical models assessing thermal regimes in rifting, and coupled mechanical and fluid flow models, but here we focus on the results of mechanical modeling using the finite element code FLAC. Models designed at different scales have provided simple and plausible solutions that affirm the geometric and kinematic scenarios based on regional and mine-scale structural data. At regional scale, FLAC models effectively simulated the partitioning of deformation into NW-SE trending ductile shear zones and domains where coeval folding and thrusting have NE-trending axial trends. 
At a more detailed district scale, development of local duplexing during folding of lithologically and mechanically diverse layered sequences - the serpentinites of the Outokumpu assemblage and the enclosing metaturbidites - was demonstrated in the FLAC simulations. The overall geometry is very reminiscent of dilational zones in ramp-flat imbricate fault systems that facilitate orogenic gold mineralization. At Outokumpu, however, there is no evidence for hydrothermal transport of copper during regional metamorphism and deformation, yet the overall tabular form of the deposit demands significant structural mobilization. Hence, the system may be regarded as closed during peak metamorphic conditions, with essentially local remobilization and redistribution of components, possibly locally facilitated by decarbonation and dehydration reactions within altered metaperidotite lenses. Although we also simulated permeability structures and local fluid pathways in and around lenticular bodies - serpentinite proxies - that were both stronger and weaker than enclosing rock units, it must be admitted that there are few experimental or theoretically calculated constraints on rock behavior under such conditions. Thus, there are some earth environments and processes that still elude our modeling capacity with respect to thermodynamics and rheological behavior, in particular the role of diffusion and the mechanical behaviour of rocks dominated by quartz-sulfide mineralogy subjected to amphibolite facies conditions for tens of millions of years.
Du, Yushen; Wu, Nicholas C; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting; Sun, Ren
2016-11-01
Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. To fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. 
Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available. Copyright © 2016 Du et al.
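As a rough illustration of how sequencing counts from a profiling experiment of this kind can be turned into a per-mutation score, the sketch below computes the log enrichment of a mutant relative to wild type before and after selection. The abstract does not give the authors' formula; the function name `relative_fitness`, the pseudocount, and any read counts passed to it are assumptions made purely for illustration.

```python
import math


def relative_fitness(mut_pre, wt_pre, mut_post, wt_post, pseudocount=0.5):
    """Log10 enrichment of a mutant relative to wild type across selection.

    Hypothetical scoring convention (not taken from the abstract):
    score = log10( (mut_post / wt_post) / (mut_pre / wt_pre) ).
    A pseudocount avoids division by zero for mutants that drop out.
    """
    pre_ratio = (mut_pre + pseudocount) / (wt_pre + pseudocount)
    post_ratio = (mut_post + pseudocount) / (wt_post + pseudocount)
    return math.log10(post_ratio / pre_ratio)
```

Under this convention a score near zero means the mutant replicates about as well as wild type, and a strongly negative score flags a candidate essential residue, which would then still need to be separated from stability effects as the abstract describes.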
Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins.
Raimondi, Daniele; Orlando, Gabriele; Pancsa, Rita; Khan, Taushif; Vranken, Wim F
2017-08-18
Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states.
Tubby family proteins are adapters for ciliary trafficking of integral membrane proteins
Shimada, Issei S.; Loriot, Evan
2017-01-01
The primary cilium is a paradigmatic organelle for studying compartmentalized signaling; however, unlike soluble protein trafficking, processes targeting integral membrane proteins to cilia are poorly understood. In this study, we determine that the tubby family protein TULP3 functions as a general adapter for ciliary trafficking of structurally diverse integral membrane cargo, including multiple reported and novel rhodopsin family G protein–coupled receptors (GPCRs) and the polycystic kidney disease–causing polycystin 1/2 complex. The founding tubby family member TUB also localizes to cilia similar to TULP3 and determines trafficking of a subset of these GPCRs to neuronal cilia. Using minimal ciliary localization sequences from GPCRs and fibrocystin (also implicated in polycystic kidney disease), we demonstrate these motifs to be sufficient and TULP3 dependent for ciliary trafficking. We propose a three-step model for TULP3/TUB-mediated ciliary trafficking, including the capture of diverse membrane cargo by the tubby domain in a phosphoinositide 4,5-bisphosphate (PI(4,5)P2)-dependent manner, ciliary delivery by intraflagellar transport complex A binding to the TULP3/TUB N terminus, and subsequent release into PI(4,5)P2-deficient ciliary membrane. PMID:28154160
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pestov, Nikolay B.; Dmitriev, Ruslan I.; Kostina, Maria B.
Highlights: ▶ Full-length secretory pathway Ca-ATPase (SPCA2) cloned from rat duodenum. ▶ ATP2C2 gene (encoding SPCA2) exists only in genomes of Tetrapoda. ▶ Rat and pig SPCA2 are expressed in intestines, lung and some secretory glands. ▶ Subcellular localization of SPCA2 may depend on tissue type. ▶ In rat duodenum, SPCA2 is localized in plasma membrane-associated compartments. Abstract: Secretory pathway Ca-ATPases are less well characterized mammalian calcium pumps than plasma membrane Ca-ATPases and sarco-endoplasmic reticulum Ca-ATPases. Here we report analysis of molecular evolution, alternative splicing, tissue-specific expression and subcellular localization of the second isoform of the secretory pathway Ca-ATPase (SPCA2), the product of the ATP2C2 gene. The primary structure of SPCA2 from rat duodenum deduced from the full-length transcript contains 944 amino acid residues, and exhibits 65% sequence identity with known SPCA1. The rat SPCA2 sequence is also highly homologous to putative human protein KIAA0703; however, the latter seems to have an aberrant N-terminus originating from intron 2. The tissue-specificity of SPCA2 expression is different from ubiquitous SPCA1. Rat SPCA2 transcripts were detected predominantly in gastrointestinal tract, lung, trachea, lactating mammary gland, skin and preputial gland. In the newborn pig, the expression profile is very similar with one remarkable exception: porcine bulbourethral gland gave the strongest signal. Upon overexpression in cultured cells, SPCA2 shows an intracellular distribution with remarkable enrichment in Golgi. However, in vivo SPCA2 may be localized in compartments that differ among various tissues: it is intracellular in epidermis, but enriched in plasma membranes of the intestinal epithelium. Analysis of SPCA2 sequences from various vertebrate species argues that the ATP2C2 gene radiated from ATP2C1 (encoding SPCA1) during adaptation of tetrapod ancestors to terrestrial habitats.
Cao, Guangli; Meng, Xiangkun; Xue, Renyu; Zhu, Yuexiong; Zhang, Xiaorong; Pan, Zhonghua; Zheng, Xiaojian; Gong, Chengliang
2012-07-01
A novel Bombyx mori cypovirus 1 was isolated from infected silkworm larvae and tentatively designated Bombyx mori cypovirus 1 isolate Suzhou (BmCPV-SZ). The complete nucleotide sequences of genomic segments S1-S10 from BmCPV-SZ were determined. All segments possessed a single open reading frame; however, bioinformatic evidence suggested a short overlapping coding sequence in S1. Each BmCPV-SZ segment possessed the conserved terminal sequences AGUAA and GUUAGCC at the 5' and 3' ends, respectively. The conserved A/G at the -3 position in relation to the AUG codon could be found in the BmCPV-SZ genome, and it was postulated that this conserved A/G may be the most important nucleotide for efficient translation initiation in cypoviruses (CPVs). Examination of the putative amino acid sequences encoded by BmCPV-SZ revealed some characteristic motifs. Homology searches showed that viral structural proteins VP1, VP3, and VP4 had localized homologies with proteins of Rice ragged stunt virus, a member of the genus Oryzavirus within the family Reoviridae. A phylogenetic tree based on RNA-dependent RNA polymerase sequences demonstrated that CPV is more closely related to Rice ragged stunt virus and Aedes pseudoscutellaris reovirus than to other members of Reoviridae, suggesting that they may have originated from common ancestors.
Sequence and Structure Dependent DNA-DNA Interactions
NASA Astrophysics Data System (ADS)
Kopchick, Benjamin; Qiu, Xiangyun
Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention, and "random" DNA sequences are typically used in biophysical studies. However, ~50% of the human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present a series of quantitative measurements of the DNA-DNA forces with the osmotic stress method on different DNA sequences, from short repeats to the most frequent sequences in the genome, and to modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate on the causes in terms of the differences in hydration shell and DNA surface structures.
Length-independent structural similarities enrich the antibody CDR canonical class model.
Nowak, Jaroslaw; Baker, Terry; Georges, Guy; Kelm, Sebastian; Klostermann, Stefan; Shi, Jiye; Sridharan, Sudharsan; Deane, Charlotte M
2016-01-01
Complementarity-determining regions (CDRs) are antibody loops that make up the antigen binding site. Here, we show that all CDR types have structurally similar loops of different lengths. Based on these findings, we created length-independent canonical classes for the non-H3 CDRs. Our length variable structural clusters show strong sequence patterns suggesting either that they evolved from the same original structure or result from some form of convergence. We find that our length-independent method not only clusters a larger number of CDRs, but also predicts canonical class from sequence better than the standard length-dependent approach. To demonstrate the usefulness of our findings, we predicted cluster membership of CDR-L3 sequences from 3 next-generation sequencing datasets of the antibody repertoire (over 1,000,000 sequences). Using the length-independent clusters, we can structurally classify an additional 135,000 sequences, which represents a ∼20% improvement over the standard approach. This suggests that our length-independent canonical classes might be a highly prevalent feature of antibody space, and could substantially improve our ability to accurately predict the structure of novel CDRs identified by next-generation sequencing.
A local structure model for network analysis
Casleton, Emily; Nordman, Daniel; Kaiser, Mark
2017-04-01
The statistical analysis of networks is a popular research topic with ever widening applications. Exponential random graph models (ERGMs), which specify a model through interpretable, global network features, are common for this purpose. In this study we introduce a new class of models for network analysis, called local structure graph models (LSGMs). In contrast to an ERGM, a LSGM specifies a network model through local features and allows for an interpretable and controllable local dependence structure. In particular, LSGMs are formulated by a set of full conditional distributions for each network edge, e.g., the probability of edge presence/absence, depending on neighborhoods of other edges. Additional model features are introduced to aid in specification and to help alleviate a common issue (occurring also with ERGMs) of model degeneracy. Finally, the proposed models are demonstrated on a network of tornadoes in Arkansas where a LSGM is shown to perform significantly better than a model without local dependence.
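To make the idea of a full conditional distribution for a single edge concrete, here is a minimal sketch of one way such a conditional could be written: a centered autologistic form in which the log-odds of an edge shift with the states of neighbouring edges. The abstract does not state the exact parameterization, so the functional form, the name `edge_conditional_prob`, and the parameters `kappa` (baseline edge probability) and `eta` (dependence strength) are assumptions.

```python
import math


def edge_conditional_prob(neighbor_edges, kappa=0.2, eta=0.5):
    """P(edge present | states of neighbouring edges), assumed form.

    neighbor_edges: iterable of 0/1 indicators for the edges in this
    edge's neighbourhood. With eta = 0 (no local dependence) or an empty
    neighbourhood, the probability reduces to the baseline kappa.
    """
    baseline_logit = math.log(kappa / (1.0 - kappa))
    # Centering each neighbour at kappa keeps kappa interpretable as the
    # marginal-scale edge probability (an assumption of this sketch).
    shift = eta * sum(y - kappa for y in neighbor_edges)
    return 1.0 / (1.0 + math.exp(-(baseline_logit + shift)))
```

With a positive `eta`, an edge whose neighbours are mostly present becomes more likely than the baseline, and one whose neighbours are mostly absent becomes less likely; setting `eta` to zero recovers a model without local dependence, the comparison case mentioned in the abstract.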
Sequence dependent aggregation of peptides and fibril formation
NASA Astrophysics Data System (ADS)
Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.
2017-09-01
Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.
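The two ingredients of a Monte Carlo study of HP sequences, an energy that rewards hydrophobic contacts and a Metropolis acceptance rule, can be sketched as follows. This is a deliberately simplified 2D lattice toy, not the authors' coarse-grained model (which also includes correlated side chain orientations); the function names, the contact strength `eps`, and the square-lattice geometry are all assumptions.

```python
import math
import random


def hh_contact_energy(sequence, coords, eps=1.0):
    """Energy = -eps per non-bonded H-H pair sitting on adjacent lattice sites.

    sequence: string of 'H' and 'P'; coords: list of (x, y) lattice sites,
    one per bead, in chain order.
    """
    energy = 0.0
    n = len(sequence)
    for i in range(n):
        for j in range(i + 2, n):  # skip chain-bonded neighbours (j = i + 1)
            if sequence[i] == "H" and sequence[j] == "H":
                dx = abs(coords[i][0] - coords[j][0])
                dy = abs(coords[i][1] - coords[j][1])
                if dx + dy == 1:  # nearest neighbours on the square lattice
                    energy -= eps
    return energy


def metropolis_accept(delta_e, temperature, rng=random):
    """Standard Metropolis rule: always accept downhill moves, accept
    uphill moves with probability exp(-delta_e / temperature)."""
    if delta_e <= 0:
        return True
    return rng.random() < math.exp(-delta_e / temperature)
```

In a full simulation one would propose a local move, compute the energy change with `hh_contact_energy`, and keep the move when `metropolis_accept` returns true; scanning the temperature then locates an aggregation transition of the kind the abstract compares across sequences.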
Akrami, Haleh; Moghimi, Sahar
2017-01-01
We investigated the role of culture in processing hierarchical syntactic structures in music. We examined whether violations of non-local dependencies manifest in event-related potentials (ERPs) for Western and Iranian excerpts by recording EEG while participants passively listened to sequences of modified/original excerpts. We also investigated oscillatory and synchronization properties of brain responses during processing of hierarchical structures. For the Western excerpt, subjective ratings of conclusiveness were marginally significant and the difference in the ERP components fell short of significance. However, ERP and behavioral results showed that while listening to culturally familiar music, subjects comprehended whether or not the hierarchical syntactic structure was fulfilled. Irregularities in the hierarchical structures of the Iranian excerpt elicited an early negativity in the central regions bilaterally, followed by two later negativities at 450-700 ms and 750-950 ms. The latter manifested throughout the scalp. Moreover, violations of hierarchical structure in the Iranian excerpt were associated with (i) an early decrease in the long range alpha phase synchronization, (ii) an early increase in the oscillatory activity in the beta band over the central areas, and (iii) a late decrease in the theta band phase synchrony between left anterior and right posterior regions. Results suggest that rhythmic structures and melodic fragments, representative of Iranian music, created a familiar context in which recognition of complex non-local syntactic structures was feasible for Iranian listeners. Processing of neural responses to the Iranian excerpt indicated neural mechanisms for processing of hierarchical syntactic structures in music at different levels of cortical integration.
Effect of Methylation on Local Mechanics and Hydration Structure of DNA.
Teng, Xiaojing; Hwang, Wonmuk
2018-04-24
Cytosine methylation affects mechanical properties of DNA and potentially alters the hydration fingerprint for recognition by proteins. The atomistic origin for these effects is not well understood, and we address this via all-atom molecular dynamics simulations. We find that the stiffness of the methylated dinucleotide step changes marginally, whereas the neighboring steps become stiffer. Stiffening is further enhanced for consecutively methylated steps, providing a mechanistic origin for the effect of hypermethylation. Steric interactions between the added methyl groups and the nonpolar groups of the neighboring nucleotides are responsible for the stiffening in most cases. By constructing hydration maps, we found that methylation also alters the surface hydration structure in distinct ways. Its resistance to deformation may contribute to the stiffening of DNA for deformational modes lacking steric interactions. These results highlight the sequence- and deformational-mode-dependent effects of cytosine methylation. Copyright © 2018 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Characterization and chromosomal localization of the gene for human rhodopsin kinase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Khani, S.C.; Yamamoto, S.; Dryja, T.P.
1996-08-01
G-protein-dependent receptor kinases (GRKs) play a key role in the adaptation of receptors to persistent stimuli. In rod photoreceptors rhodopsin kinase (RK) mediates rapid desensitization of rod photoreceptors to light by catalyzing phosphorylation of the visual pigment rhodopsin. To study the structure and mechanism of GRKs in human photoreceptors, we have isolated and characterized cDNA and genomic clones derived from the human RK locus using a bovine rhodopsin kinase cDNA fragment as a probe. The RK locus, assigned to chromosome 13 band q34, is composed of seven exons that encode a protein 92% identical in amino acid sequence to bovine rhodopsin kinase. The marked difference between the structure of this gene and that of another recently cloned human GRK gene suggests the existence of a wide evolutionary gap between members of the GRK gene family. 39 refs., 3 figs.
Chin, Wutharath; Dognon, Jean-Pierre; Piuzzi, François; Tardivel, Benjamin; Dimicoli, Iliana; Mons, Michel
2005-01-19
Laser desorption of model peptides coupled to laser spectroscopic techniques enables the gas-phase observation of genuine secondary structures of biology. Spectroscopic evidence for the formation of beta-turns in gas-phase peptide chains containing glycine and phenylalanine residues establishes the intrinsic stability of these forms and their ability to compete with other stable structures. The precise characterization of local minima on the potential energy surface from IR spectroscopy constitutes an acute assessment for the state-of-the-art quantum mechanical calculations also presented. The observation of different types of beta-turns depending upon the residue order within the sequence is found to be consistent with the residue propensities in beta-turns of proteins, which suggests that the prevalence of glycine in type II and II' turns stems essentially from an energetic origin, already at play under isolated conditions.
NASA Astrophysics Data System (ADS)
Haugstad, A.; Battisti, D. S.; Armour, K.
2016-12-01
Earth's climate sensitivity depends critically on the strength of radiative feedbacks linking surface warming to changes in top-of-atmosphere (TOA) radiation. Many studies use a simplistic idea of radiative feedbacks, either by treating them as global mean quantities, or by assuming they can be defined uniquely by geographic location and thus that TOA radiative response depends only on local surface warming. For example, a uniform increase in sea-surface temperature has been widely used as a surrogate for global warming (e.g., Cess et al. 1990 and the CMIP 'aqua4k' simulations), with the assumption that this produces the same radiative feedbacks as those arising from a doubling of carbon dioxide, even though the spatial patterns of warming differ. However, evidence suggests that these assumptions are not valid, and local feedbacks may be integrally dependent on the structure of warming or type of climate forcing applied (Rose et al. 2014). This study thus investigates the following questions: to what extent do local feedbacks depend on the structure and type of forcing applied? And, to what extent do they depend on the pattern of surface temperature change induced by that forcing? Using an idealized framework of an aquaplanet atmosphere-only model, we show that radiative feedbacks are indeed dependent on the large-scale structure of warming and type of forcing applied. For example, the climate responds very differently to two forcings of equal global magnitude but applied in different global regions; the pattern of local feedbacks arising from uniform warming is not the same as that arising from polar amplified warming; and the same local feedbacks can be induced by distinct forcing patterns, provided that they produce the same pattern of surface temperature change. These findings suggest that the so-called 'efficacies' of climate forcings can be understood simply in terms of how local feedbacks depend on the temperature patterns they induce.
Cicconi, Alessandro; Micheli, Emanuela; Vernì, Fiammetta; Jackson, Alison; Gradilla, Ana Citlali; Cipressa, Francesca; Raimondo, Domenico; Bosso, Giuseppe; Wakefield, James G.; Ciapponi, Laura; Cenci, Giovanni; Gatti, Maurizio
2017-01-01
Drosophila telomeres are sequence-independent structures maintained by transposition to chromosome ends of three specialized retroelements rather than by telomerase activity. Fly telomeres are protected by the terminin complex that includes the HOAP, HipHop, Moi and Ver proteins. These are fast evolving, non-conserved proteins that localize and function exclusively at telomeres, protecting them from fusion events. We have previously suggested that terminin is the functional analogue of shelterin, the multi-protein complex that protects human telomeres. Here, we use electrophoretic mobility shift assay (EMSA) and atomic force microscopy (AFM) to show that Ver preferentially binds single-stranded DNA (ssDNA) with no sequence specificity. We also show that Moi and Ver form a complex in vivo. Although these two proteins are mutually dependent for their localization at telomeres, Moi neither binds ssDNA nor facilitates Ver binding to ssDNA. Consistent with these results, we found that Ver-depleted telomeres form RPA and γH2AX foci, like the human telomeres lacking the ssDNA-binding POT1 protein. Collectively, our findings suggest that Drosophila telomeres possess a ssDNA overhang like the other eukaryotes, and that the terminin complex is architecturally and functionally similar to shelterin. PMID:27940556
A hidden markov model derived structural alphabet for proteins.
Camproux, A C; Gautier, R; Tufféry, P
2004-06-04
Understanding and predicting protein structures depends on the complexity and the accuracy of the models used to represent them. We have set up a hidden Markov model that discretizes protein backbone conformation as a series of overlapping fragments (states) four residues in length. This approach learns simultaneously the geometry of the states and their connections. We obtain, using a statistical criterion, an optimal systematic decomposition of the conformational variability of the protein peptidic chain in 27 states with strong connection logic. This result is stable over different protein sets. Our model fits well the previous knowledge related to protein architecture organisation and seems able to capture some subtle details of protein organisation, such as helix sub-level organisation schemes. Taking into account the dependence between the states results in a description of local protein structure of low complexity. On average, the model makes use of only 8.3 states among 27 to describe each position of a protein structure. Although we use short fragments, the learning process on entire protein conformations captures the logic of the assembly on a larger scale. Using such a model, the structure of proteins can be reconstructed with an average accuracy close to 1.1 Å root-mean-square deviation and for a complexity of only 3. Finally, we also observe that sequence specificity increases with the number of states of the structural alphabet. Such models can constitute a very relevant approach to the analysis of protein architecture, in particular for protein structure prediction.
Hirota, Tadao; Hirohata, Tetsuo; Mashima, Hiroshi; Satoh, Toshiyuki; Obara, Yoshiaki
2004-11-01
Genetic structure of the large Japanese field mouse populations in suburban landscape of West Tokyo, Japan was determined using mitochondrial DNA control region sequence. Samples were collected from six habitats linked by forests and green tract along the Tama River, and from two forests segregated by urban areas from those continuous habitats. Thirty-five haplotypes were detected in 221 animals. Four to eight haplotypes were found within each local population belonging to the continuous landscape. Some haplotypes were shared by two or three adjacent local populations. On the other hand, two isolated habitats were occupied by one or two indigenous haplotypes. Significant genetic differentiation between all pairs of local populations, except for one pair in the continuous habitats, was found by analysis of molecular variance (amova). The geographical distance between habitats did not explain the large variance of pairwise F(ST)-values among local populations. F(ST)-values between local populations segregated by urban areas were higher than those between local populations in the continuous habitat, regardless of geographical distance. The results of this study demonstrated quantitatively that urban areas inhibit the migration of Apodemus speciosus, whereas a linear green tract along a river functions as a corridor. Moreover, it preserves the metapopulation structure of A. speciosus as well as the corridors in suburban landscape.
Eroglu, Duygu Yilmaz; Ozmutlu, H Cenk
2014-01-01
We developed mixed integer programming (MIP) models and hybrid genetic-local search algorithms for the scheduling problem of unrelated parallel machines with job sequence- and machine-dependent setup times and with the job splitting property. The first contribution of this paper is to introduce novel algorithms that perform splitting and scheduling simultaneously with a variable number of subjobs. We propose a simple chromosome structure constituted by random key numbers in a hybrid genetic-local search algorithm (GAspLA). Random key numbers are used frequently in genetic algorithms, but they create additional difficulty when hybrid factors in local search are implemented. We developed algorithms that adapt the results of local search into the genetic algorithm with a minimum number of relocation operations on the genes' random key numbers. This is the second contribution of the paper. The third contribution is three new MIP models that perform splitting and scheduling simultaneously. The fourth contribution is the implementation of GAspLAMIP, which lets us verify the optimality of GAspLA for the studied combinations. The proposed methods are tested on a set of problems taken from the literature, and the results validate the effectiveness of the proposed algorithms.
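The random-key chromosome described in this abstract can be illustrated with a minimal sketch. The decoding rule used here (the integer part of each key selects the machine, the fractional part orders the jobs on that machine) is a common convention in the random-key literature and is an assumption; the abstract does not state the authors' exact encoding, and job splitting and setup times are left out. The function name `decode_random_keys` is hypothetical.

```python
from collections import defaultdict


def decode_random_keys(keys):
    """Decode a random-key chromosome into per-machine job sequences.

    Assumed convention (not taken from the abstract): for job j,
    int(keys[j]) is the machine index and the fractional part of
    keys[j] sets the job's position on that machine.
    """
    schedule = defaultdict(list)
    for job, key in enumerate(keys):
        machine = int(key)
        schedule[machine].append((key - machine, job))
    # Sorting by fractional key yields the processing order on each machine.
    return {m: [job for _, job in sorted(jobs)]
            for m, jobs in sorted(schedule.items())}


# Five jobs on two machines; keys are illustrative values only.
chromosome = [0.71, 1.25, 0.12, 1.90, 0.45]
print(decode_random_keys(chromosome))
```

Because any vector of real-valued keys decodes to a feasible schedule, standard crossover never produces an invalid chromosome; the difficulty the abstract mentions arises when a local search changes the schedule and the keys have to be rewritten to match.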
Influence of DNA sequence on the structure of minicircles under torsional stress
Wang, Qian; Irobalieva, Rossitza N.; Chiu, Wah; Schmid, Michael F.; Fogg, Jonathan M.; Zechiedrich, Lynn
2017-01-01
The sequence dependence of the conformational distribution of DNA under various levels of torsional stress is an important unsolved problem. Combining theory and coarse-grained simulations shows that the DNA sequence and a structural correlation due to topology constraints of a circle are the main factors that dictate the 3D structure of a 336 bp DNA minicircle under torsional stress. We found that DNA minicircle topoisomers can have multiple bend locations under high torsional stress and that the positions of these sharp bends are determined by the sequence, and by a positive mechanical correlation along the sequence. We showed that simulations and theory are able to provide sequence-specific information about individual DNA minicircles observed by cryo-electron tomography (cryo-ET). We provided a sequence-specific cryo-ET tomogram fitting of DNA minicircles, registering the sequence within the geometric features. Our results indicate that the conformational distribution of minicircles under torsional stress can be designed, which has important implications for using minicircle DNA for gene therapy. PMID:28609782
Structure and Sequence Search on Aptamer-Protein Docking
NASA Astrophysics Data System (ADS)
Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie
2015-03-01
Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in living systems, especially through gene regulation. However, short nucleic acid sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer binding, we use GPU-enabled molecular dynamics simulations both to examine the conformational flexibility of the aptamer in the absence of binding to thrombin and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.
Rotondi, Kenneth S; Gierasch, Lila M
2003-07-08
The experiments described here explore the role of local sequence in the folding of cellular retinoic acid binding protein I (CRABP I). This is a 136-residue, 10-stranded, antiparallel beta-barrel protein with seven beta-hairpins and is a member of the intracellular lipid binding protein (iLBP) family. The relative roles of local and global sequence information in governing the folding of this class of proteins are not well-understood. In question is whether the beta-turns are locally defined by short-range interactions within their sequences, and are thus able to play an active role in reducing the conformational space available to the folding chain, or whether the turns are passive, relying upon global forces to form. Short (six- and seven-residue) peptides corresponding to the seven CRABP I turns were analyzed by circular dichroism and NMR for their tendencies to take up the conformations they adopt in the context of the native protein. The results indicate that two of the peptides, encompassing turns III and IV in CRABP I, have a strong intrinsic bias to form native turns. Intriguingly, these turns are on linked hairpins in CRABP I and represent the best-conserved turns in the iLBP family. These results suggest that local sequence may play an important role in narrowing the conformational ensemble of CRABP I during folding.
A protein block based fold recognition method for the annotation of twilight zone sequences.
Suresh, V; Ganesan, K; Parthasarathy, S
2013-03-01
The description of protein backbone was recently improved with a group of structural fragments called Structural Alphabets instead of the regular three states (Helix, Sheet and Coil) secondary structure description. Protein Blocks is one of the Structural Alphabets used to describe each and every region of protein backbone including the coil. According to de Brevern (2000) the Protein Blocks has 16 structural fragments and each one has 5 residues in length. Protein Blocks fragments are highly informative among the available Structural Alphabets and it has been used for many applications. Here, we present a protein fold recognition method based on Protein Blocks for the annotation of twilight zone sequences. In our method, we align the predicted Protein Blocks of a query amino acid sequence with a library of assigned Protein Blocks of 953 known folds using the local pair-wise alignment. The alignment results with z-value ≥ 2.5 and P-value ≤ 0.08 are predicted as possible folds. Our method is able to recognize the possible folds for nearly 35.5% of the twilight zone sequences with their predicted Protein Block sequence obtained by pb_prediction, which is available at Protein Block Export server.
Overcoming Sequence Misalignments with Weighted Structural Superposition
Khazanov, Nickolay A.; Damm-Ganamet, Kelly L.; Quang, Daniel X.; Carlson, Heather A.
2012-01-01
An appropriate structural superposition identifies similarities and differences between homologous proteins that are not evident from sequence alignments alone. We have coupled our Gaussian-weighted RMSD (wRMSD) tool with a sequence aligner and seed extension (SE) algorithm to create a robust technique for overlaying structures and aligning sequences of homologous proteins (HwRMSD). HwRMSD overcomes errors in the initial sequence alignment that would normally propagate into a standard RMSD overlay. SE can generate a corrected sequence alignment from the improved structural superposition obtained by wRMSD. HwRMSD’s robust performance and its superiority over standard RMSD are demonstrated over a range of homologous proteins. Its better overlay results in corrected sequence alignments with good agreement to HOMSTRAD. Finally, HwRMSD is compared to established structural alignment methods: FATCAT, SSM, CE, and Dalilite. Most methods are comparable at placing residue pairs within 2 Å, but HwRMSD places many more residue pairs within 1 Å, providing a clear advantage. Such high accuracy is essential in drug design, where small distances can have a large impact on computational predictions. This level of accuracy is also needed to correct sequence alignments in an automated fashion, especially for omics-scale analysis. HwRMSD can align homologs with low sequence identity and large conformational differences, cases where both sequence-based and structural-based methods may fail. The HwRMSD pipeline overcomes the dependency of structural overlays on initial sequence pairing and removes the need to determine the best sequence-alignment method, substitution matrix, and gap parameters for each unique pair of homologs. PMID:22733542
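To make the idea of Gaussian weighting concrete, the sketch below scores a set of already-superposed residue pairs. It assumes a weight of exp(-d^2 / scale) per pair, with the distance d in ångströms and a scale of 5 Å^2; that functional form and default are assumptions made for illustration, not details given in the abstract. The iterative re-superposition and the seed extension step of HwRMSD are not reproduced, and `gaussian_weighted_rmsd` is a hypothetical name.

```python
import math


def gaussian_weighted_rmsd(coords_a, coords_b, scale=5.0):
    """Return (weighted RMSD, plain RMSD) for paired 3-D points.

    Assumption: each pair is weighted by exp(-d**2 / scale), so pairs
    that are already close dominate and distant pairs contribute little.
    The coordinates are taken as already superposed.
    """
    sq_dists = [sum((p - q) ** 2 for p, q in zip(a, b))
                for a, b in zip(coords_a, coords_b)]
    weights = [math.exp(-d2 / scale) for d2 in sq_dists]
    total_weight = sum(weights)
    if total_weight == 0.0:
        raise ValueError("all pairs are too far apart to receive any weight")
    weighted = sum(w * d2 for w, d2 in zip(weights, sq_dists)) / total_weight
    plain = sum(sq_dists) / len(sq_dists)
    return math.sqrt(weighted), math.sqrt(plain)


# Two well-matched pairs plus one pair displaced by 10 angstroms,
# standing in for a flexible loop or a misaligned residue.
a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
b = [(0.1, 0.0, 0.0), (1.0, 0.1, 0.0), (2.0, 10.0, 0.0)]
wrmsd, rmsd = gaussian_weighted_rmsd(a, b)
print(f"weighted: {wrmsd:.3f}  plain: {rmsd:.3f}")
```

Down-weighting the outlier pair is what would let a single badly placed residue stop dominating the score, which is consistent with the abstract's claim that initial alignment errors need not propagate into the overlay.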
Kachhap, Sangita; Singh, Balvinder
2015-01-01
In most homeodomain-DNA complexes, glutamine or lysine is present at the 50th position and interacts with the 5th and 6th nucleotides of the core recognition region. Molecular dynamics simulations of the Msx-1-DNA complex (Q50-TG) and its variant complexes, that is, specific (Q50K-CC), nonspecific (Q50-CC) having a mutation in DNA, and (Q50K-TG) having a mutation in protein, have been carried out. Analysis of protein-DNA interactions and the structure of DNA in specific and nonspecific complexes shows that amino acid residues use the sequence-dependent shape of DNA to interact. The binding free energies of all four complexes were analysed to define the role of the amino acid residue at the 50th position in terms of binding strength, considering the effect of variation in DNA on the stability of protein-DNA complexes. The order of stability of protein-DNA complexes shows that specific complexes are more stable than nonspecific ones. Decomposition analysis shows that N-terminal amino acid residues contribute maximally to the binding free energy of protein-DNA complexes. Among specific protein-DNA complexes, K50 contributes more than Q50 towards binding free energy in the respective complexes. The sequence dependence of the local conformation of DNA enables Q50/Q50K to make hydrogen bonds with nucleotide(s) of DNA. The changes in the amino acid sequence of the protein are accommodated and stabilized around the TAAT core region of DNA having variation in nucleotides.
A computational proposal for designing structured RNA pools for in vitro selection of RNAs.
Kim, Namhee; Gan, Hin Hark; Schlick, Tamar
2007-04-01
Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems.
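A minimal sketch of how a mixing matrix could turn a starting sequence into a nonrandom pool is given below. It assumes each position is drawn independently, with `mixing_matrix[x][y]` read as the probability that nucleotide `x` in the starting sequence appears as `y` in a pool member. The abstract's five matrix classes, the graph theory analysis and the clustering step are not reproduced, and `sample_pool` is a hypothetical name.

```python
import random

NUCLEOTIDES = "ACGU"


def sample_pool(start_seq, mixing_matrix, pool_size, seed=0):
    """Draw pool_size sequences by mutating each position independently.

    Assumption: mixing_matrix[x][y] is the probability that nucleotide x
    of the starting sequence appears as y; each row should sum to 1.
    """
    rng = random.Random(seed)
    pool = []
    for _ in range(pool_size):
        seq = "".join(
            rng.choices(NUCLEOTIDES,
                        weights=[mixing_matrix[x][y] for y in NUCLEOTIDES])[0]
            for x in start_seq
        )
        pool.append(seq)
    return pool


# Illustrative matrix: keep each base with probability 0.7, otherwise
# switch to one of the other three bases with equal probability.
biased = {x: {y: 0.7 if x == y else 0.1 for y in NUCLEOTIDES}
          for x in NUCLEOTIDES}
for member in sample_pool("GGGAAACCC", biased, pool_size=3):
    print(member)
```

An identity matrix reproduces the starting sequence exactly and a uniform matrix (all entries 0.25) gives a fully random pool, so the matrix entries interpolate between those two extremes.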
Novel complex MAD phasing and RNase H structural insights using selenium oligonucleotides
DOE Office of Scientific and Technical Information (OSTI.GOV)
Abdur, Rob; Gerlits, Oksana O.; Gan, Jianhua
2014-02-01
Selenium-derivatized oligonucleotides may facilitate phase determination and high-resolution structure determination for protein–nucleic acid crystallography. The Se atom-specific mutagenesis (SAM) strategy may also enhance the study of nuclease catalysis. The crystal structures of protein–nucleic acid complexes are commonly determined using selenium-derivatized proteins via MAD or SAD phasing. Here, the first protein–nucleic acid complex structure determined using selenium-derivatized nucleic acids is reported. The RNase H–RNA/DNA complex is used as an example to demonstrate the proof of principle. The high-resolution crystal structure indicates that this selenium replacement results in a local subtle unwinding of the RNA/DNA substrate duplex, thereby shifting the RNA scissile phosphate closer to the transition state of the enzyme-catalyzed reaction. It was also observed that the scissile phosphate forms a hydrogen bond to the water nucleophile and helps to position the water molecule in the structure. Consistently, it was discovered that the substitution of a single O atom by a Se atom in a guide DNA sequence can largely accelerate RNase H catalysis. These structural and catalytic studies shed new light on the guide-dependent RNA cleavage.
NASA Astrophysics Data System (ADS)
Sethaphong, Latsavongsakda
This work examines smart material properties of rational self-assembly and molecular recognition found in nano-biosystems. Exploiting the sequence and structural information encoded within nucleic acids and proteins will permit programmed synthesis of nanomaterials and help create molecular machines that may carry out new roles involving chemical catalysis and bioenergy. Responsive to different ionic environments through self-reorganization, nucleic acids (NAs) are nature's signature smart material; organisms such as viruses and bacteria use features of NAs to react to their environment and orchestrate their lifecycle. Furthermore, nucleic acid systems (both RNA and DNA) are currently exploited as scaffolds; recent applications have been showcased to build bioelectronics and biotemplated nanostructures via directed assembly of multidimensional nanoelectronic devices [1]. Since the most stable and rudimentary structure of nucleic acids is the helical duplex, these were modeled in order to examine the influence of the microenvironment, sequence, and cation-dependent perturbations of their canonical forms. Due to their negatively charged phosphate backbone, NAs rely on counterions to overcome the inherent repulsive forces that arise from the assembly of two complementary strands. As a realistic model system, we chose the HIV-TAR helix (PDB ID: 397D) to study the effect of specific sequence motifs on cation sequestration. At physiologically relevant concentrations of sodium and potassium ions, we observed sequence-based effects where purine stretches were adept in retaining high residency cations. The transitional space between adenine and guanosine nucleotides (ApG step) in a sequence proved the most favorable. This work was the first to directly show these subtle interactions of sequence-based cationic sequestration and may be useful for controlling metallization of nucleic acids in conductive nanowires.
Extending the study further, we explored the degree to which the structure of NA duplexes alone interacted with cations distinct from a specific sequence. Under physiologically relevant conditions, a duplex of RNA polyguanine-polycytidine was highly responsive and able to sequester cations to the middle of the purine stretches. The least responsive structure was a DNA polyadenine-polythymine duplex. A random sequence DNA duplex contorted into an RNA-like helix resulted in cationic dynamics similar to RNA systems. These studies showed that cation diffusive binding events in nucleic acid duplex structures are sequence specific and heavily influenced by structural aspects of helical forms, which account for much of the differences observed. Although structural information in nucleic acids is encoded within their sequence, linking amino acid sequence to protein structure is murkier; the structural information within proteins is encoded by the folding process itself: a complex phenomenon driven toward the equilibrium state of the active conformation. Upwards of two thirds of a protein's sequence can be substituted with similar amino acids without significantly perturbing its function; conserved residues of about 10% seem to be vital; since evolutionary selection pressure in proteins operates three-dimensionally, a linear sequence is only partially informative. We explored this problem by folding de-novo the cytosolic portion of the membrane protein, cellulose synthase, CESA1 from upland cotton, Gossypium hirsutum (Ghcesa1). The cytoplasmic region was generated by homology modeling and refined with molecular dynamics. These mutations impair local structural flexibility which likely results in cellulose that is produced at a lower rate and is less crystalline. Additional modeling of fragments of cellulose synthases from the model plant, Arabidopsis thaliana, offered novel insights into the function of conserved cytosolic domains within plant cellulose synthases.
Transport mechanisms related to the transmembrane region revealed significant differences between plants and a bacterial complex. These studies generated possible mutations that may allow for the creation of new synthases and identified other avenues of research in order to develop technologies that may alter the crystallinity and other useful properties of cellulose. 1. Karplus, K., SAM-T08, HMM-based protein structure prediction. Nucleic Acids Research, 2009. 37: p. W492-W497.
Tobias, Fernando; Keiderling, Timothy A
2016-05-10
Poly(glutamic acid) at low pH self-assembles after incubation at higher temperature into fibrils composed of antiparallel sheets that are stacked in a β2-type structure whose amide carbonyls have bifurcated H-bonds involving the side chains from the next sheet. Oligomers of Glu can also form such structures, and isotope labeling has provided insight into their out-of-register antiparallel structure [Biomacromolecules 2013, 14, 3880-3891]. In this paper we report IR and VCD spectra and transmission electron micrograph (TEM) images for a series of alternately sequenced oligomers, Lys-(Aaa-Glu)5-Lys-NH2, where Aaa was varied over a variety of polar, aliphatic, or aromatic residues. Their spectral and TEM data show that these oligopeptides self-assemble into different structures, both local and morphological, that are dependent on both the nature of the Aaa side chains and growth conditions employed. Such alternate peptides substituted with small or polar residues, Ala and Thr, do not yield fibrils; but with β-branched aliphatic residues, Val and Ile, that could potentially pack with Glu side chains, these oligopeptides do show evidence of β2-stacking. By contrast, for Leu, with longer side chains, only β1-stacking is seen while with even larger Phe side chains, either β-form can be detected separately, depending on preparation conditions. These structures are dependent on high temperature incubation after reducing the pH and in some cases after sonication of initial fibril forms and reincubation. Some of these fibrillar peptides, but not all, show enhanced VCD, which can offer evidence for formation of long, multistrand, often twisted structures. Substitution of Glu with residues having selected side chains yields a variety of morphologies, leading to both β1- and β2-structures, that overall suggests two different packing modes for the hydrophobic side chains depending on size and type.
ECE-imaging of the H-mode pedestal (invited).
Tobias, B J; Austin, M E; Boom, J E; Burrell, K H; Classen, I G J; Domier, C W; Luhmann, N C; Nazikian, R; Snyder, P B
2012-10-01
A synthetic diagnostic has been developed that reproduces the highly structured electron cyclotron emission (ECE) spectrum radiated from the edge region of H-mode discharges. The modeled dependence on local perturbations of the equilibrium plasma pressure allows for interpretation of ECE data for diagnosis of local quantities. Forward modeling of the diagnostic response in this region allows for improved mapping of the observed fluctuations to flux surfaces within the plasma, allowing for the poloidal mode number of coherent structures to be resolved. In addition, other spectral features that are dependent on both T(e) and n(e) contain information about pedestal structure and the electron energy distribution of localized phenomena, such as edge filaments arising during edge-localized mode (ELM) activity.
NASA Astrophysics Data System (ADS)
Tian, Wen-Yan; Kuang, Xiao-Yu; Li, Hui-Fang; Li, Yan-Fang; Ying-Li
2009-01-01
A theoretical method for studying the inter-relation between the local structure and EPR spectra is established by diagonalizing the complete energy matrices. For [M(H2O)6]XCl6:Mn2+ (M = Zn, Mg, Cd, Ca; X = Pt, Sn) systems, the calculated results demonstrate that the local structures around the octahedral Mn2+ centers in the doped systems are very similar despite the host crystals being different. Furthermore, it is shown that the EPR zero-field parameter D depends simultaneously on the local structure parameters R and θ while (a - F) depends mainly on R, whether the doped systems are at liquid-nitrogen temperature or room temperature.
Evolution of ribozymes in the presence of a mineral surface
Stephenson, James D.; Popović, Milena; Bristow, Thomas F.
2016-01-01
Mineral surfaces are often proposed as the sites of critical processes in the emergence of life. Clay minerals in particular are thought to play significant roles in the origin of life including polymerizing, concentrating, organizing, and protecting biopolymers. In these scenarios, the impact of minerals on biopolymer folding is expected to influence evolutionary processes. These processes include both the initial emergence of functional structures in the presence of the mineral and the subsequent transition away from the mineral-associated niche. The initial evolution of function depends upon the number and distribution of sequences capable of functioning in the presence of the mineral, and the transition to new environments depends upon the overlap between sequences that evolve on the mineral surface and sequences that can perform the same functions in the mineral's absence. To examine these processes, we evolved self-cleaving ribozymes in vitro in the presence or absence of Na-saturated montmorillonite clay mineral particles. Starting from a shared population of random sequences, RNA populations were evolved in parallel, along separate evolutionary trajectories. Comparative sequence analysis and activity assays show that the impact of this clay mineral on functional structure selection was minimal; it neither prevented common structures from emerging, nor did it promote the emergence of new structures. This suggests that montmorillonite does not improve RNA's ability to evolve functional structures; however, it also suggests that RNAs that do evolve in contact with montmorillonite retain the same structures in mineral-free environments, potentially facilitating an evolutionary transition away from a mineral-associated niche. PMID:27793980
High Electrical Conductivity of Single Metal-Organic Chains.
Ares, Pablo; Amo-Ochoa, Pilar; Soler, José M; Palacios, Juan José; Gómez-Herrero, Julio; Zamora, Félix
2018-05-01
Molecular wires are essential components for future nanoscale electronics. However, the preparation of individual long conductive molecules is still a challenge. MMX metal-organic polymers are quasi-1D sequences of single halide atoms (X) bridging subunits with two metal ions (MM) connected by organic ligands. They are excellent electrical conductors as bulk macroscopic crystals and as nanoribbons. However, according to theoretical calculations, the electrical conductance found in the experiments should be even higher. Here, a novel and simple drop-casting procedure to isolate bundles of few to single MMX chains is demonstrated. Furthermore, an exponential dependence of the electrical resistance of one or two MMX chains as a function of their length that does not agree with predictions based on their theoretical band structure is reported. This dependence is attributed to strong Anderson localization originated by structural defects. Theoretical modeling confirms that the current is limited by structural defects, mainly vacancies of iodine atoms, through which the current is constrained to flow. Nevertheless, measurable electrical transport along distances beyond 250 nm surpasses that of all other molecular wires reported so far. This work places in perspective the role of defects in 1D wires and their importance for molecular electronics. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Luo, Xiongbiao; Mori, Kensaku
2014-06-01
Endoscope 3-D motion tracking, which seeks to synchronize pre- and intra-operative images in endoscopic interventions, is usually performed as video-volume registration that optimizes the similarity between endoscopic video and pre-operative images. The tracking performance, in turn, depends significantly on whether a similarity measure can successfully characterize the difference between video sequences and volume rendering images driven by pre-operative images. The paper proposes a discriminative structural similarity measure, which uses the degradation of structural information and takes image correlation or structure, luminance, and contrast into consideration, to boost video-volume registration. By applying the proposed similarity measure to endoscope tracking, it was demonstrated to be more accurate and robust than several available similarity measures, e.g., local normalized cross correlation, normalized mutual information, modified mean square error, or normalized sum squared difference. Based on clinical data evaluation, the tracking error was reduced significantly from at least 14.6 mm to 4.5 mm. The processing rate was accelerated to more than 30 frames per second using a graphics processing unit.
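For orientation, the sketch below computes the standard single-window structural similarity index, which combines the luminance, contrast, and structure terms named in the abstract. It is a generic baseline, not the discriminative measure the paper proposes. The constants (0.01 and 0.03 times the dynamic range, squared) are the conventional defaults and are assumed here, and `global_ssim` is a hypothetical name operating on flattened grey-level lists.

```python
def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM between two equally sized grey-level images.

    x and y are flat sequences of pixel intensities. The stabilizing
    constants use the conventional 0.01 and 0.03 factors (an assumption).
    """
    if len(x) != len(y) or not x:
        raise ValueError("images must be non-empty and the same size")
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((v - mu_x) ** 2 for v in x) / n
    var_y = sum((v - mu_y) ** 2 for v in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    return ((2 * mu_x * mu_y + c1) * (2 * cov + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    )


# A tiny 2x2 "video frame" compared with itself and with an inverted copy.
frame = [10.0, 200.0, 30.0, 180.0]
inverted = [255.0 - v for v in frame]
print(global_ssim(frame, frame), global_ssim(frame, inverted))
```

In a registration loop, a score of this kind would be evaluated between each video frame and a rendering of the pre-operative volume at a candidate camera pose, and the pose maximizing the score would be kept.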
Galectin-3 in angiogenesis and metastasis
Funasaka, Tatsuyoshi; Raz, Avraham; Nangia-Makker, Pratima
2014-01-01
Galectin-3 is a member of the family of β-galactoside-binding lectins characterized by evolutionarily conserved sequences defined by structural similarities in their carbohydrate-recognition domains. Galectin-3 is a unique, chimeric protein consisting of three distinct structural motifs: (i) a short NH2 terminal domain containing a serine phosphorylation site; (ii) a repetitive proline-rich collagen-α-like sequence cleavable by matrix metalloproteases; and (iii) a globular COOH-terminal domain containing a carbohydrate-binding motif and an NWGR anti-death motif. It is ubiquitously expressed and has diverse biological functions depending on its subcellular localization. Galectin-3 is mainly found in the cytoplasm, also seen in the nucleus and can be secreted by non-classical, secretory pathways. In general, secreted galectin-3 mediates cell migration, cell adhesion and cell–cell interactions through the binding with high affinity to galactose-containing glycoproteins on the cell surface. Cytoplasmic galectin-3 exhibits anti-apoptotic activity and regulates several signal transduction pathways, whereas nuclear galectin-3 has been associated with pre-mRNA splicing and gene expression. Its unique chimeric structure enables it to interact with a plethora of ligands and modulate diverse functions such as cell growth, adhesion, migration, invasion, angiogenesis, immune function, apoptosis and endocytosis emphasizing its significance in the process of tumor progression. In this review, we have focused on the role of galectin-3 in tumor metastasis with special emphasis on angiogenesis. PMID:25138305
Robust Measurement via A Fused Latent and Graphical Item Response Theory Model.
Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen; Ying, Zhiliang
2018-03-12
Item response theory (IRT) plays an important role in psychological and educational measurement. Unlike the classical testing theory, IRT models aggregate the item level information, yielding more accurate measurements. Most IRT models assume local independence, an assumption not likely to be satisfied in practice, especially when the number of items is large. Results in the literature and simulation studies in this paper reveal that misspecifying the local independence assumption may result in inaccurate measurements and differential item functioning. To provide more robust measurements, we propose an integrated approach by adding a graphical component to a multidimensional IRT model that can offset the effect of unknown local dependence. The new model contains a confirmatory latent variable component, which measures the targeted latent traits, and a graphical component, which captures the local dependence. An efficient proximal algorithm is proposed for the parameter estimation and structure learning of the local dependence. This approach can substantially improve the measurement, given no prior information on the local dependence structure. The model can be applied to measure both a unidimensional latent trait and multidimensional latent traits.
Streamwise-Localized Solutions with natural 1-fold symmetry
NASA Astrophysics Data System (ADS)
Altmeyer, Sebastian; Willis, Ashley; Hof, Björn
2014-11-01
It has been proposed in recent years that turbulence is organized around unstable invariant solutions, which provide the building blocks of the chaotic dynamics. In direct numerical simulations of pipe flow we show that when imposing a minimal symmetry constraint (reflection in an axial plane only) the formation of turbulence can indeed be explained by dynamical systems concepts. The hypersurface separating laminar from turbulent motion, the edge of turbulence, is spanned by the stable manifolds of an exact invariant solution, a periodic orbit of a spatially localized structure. The turbulent states themselves (turbulent puffs in this case) are shown to arise in a bifurcation sequence from a related localized solution (the upper branch orbit). The rather complex bifurcation sequence involves secondary Hopf bifurcations, frequency locking and a period doubling cascade until eventually turbulent puffs arise. In addition we report preliminary results of the transition sequence for pipe flow without symmetry constraints.
2016-01-01
Abstract Molecular recognition by protein mostly occurs in a local region on the protein surface. Thus, an efficient computational method for accurate characterization of protein local structural conservation is necessary to better understand biology and drug design. We present a novel local structure alignment tool, G‐LoSA. G‐LoSA aligns protein local structures in a sequence order independent way and provides a GA‐score, a chemical feature‐based and size‐independent structure similarity score. Our benchmark validation shows the robust performance of G‐LoSA to the local structures of diverse sizes and characteristics, demonstrating its universal applicability to local structure‐centric comparative biology studies. In particular, G‐LoSA is highly effective in detecting conserved local regions on the entire surface of a given protein. In addition, the applications of G‐LoSA to identifying template ligands and predicting ligand and protein binding sites illustrate its strong potential for computer‐aided drug design. We hope that G‐LoSA can be a useful computational method for exploring interesting biological problems through large‐scale comparison of protein local structures and facilitating drug discovery research and development. G‐LoSA is freely available to academic users at http://im.compbio.ku.edu/GLoSA/. PMID:26813336
Pairwise Sequence Alignment Library
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jeff Daily, PNNL
2015-05-20
Vector extensions, such as SSE, have been part of the x86 CPU since the 1990s, with applications in graphics, signal processing, and scientific applications. Although many algorithms and applications can naturally benefit from automatic vectorization techniques, there are still many that are difficult to vectorize due to their dependence on irregular data structures, dense branch operations, or data dependencies. Sequence alignment, one of the most widely used operations in bioinformatics workflows, has a computational footprint that features complex data dependencies. The trend of widening vector registers adversely affects the state-of-the-art sequence alignment algorithm based on striped data layouts. Therefore, a novel SIMD implementation of a parallel scan-based sequence alignment algorithm that can better exploit wider SIMD units was implemented as part of the Parallel Sequence Alignment Library (parasail). Parasail features: reference implementations of all known vectorized sequence alignment approaches; implementations of the Smith-Waterman (SW), semi-global (SG), and Needleman-Wunsch (NW) sequence alignment algorithms; implementations across all modern CPU instruction sets, including AVX2 and KNC; and language interfaces for C/C++ and Python.
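The data dependencies mentioned in the abstract above can be seen in a plain scalar version of Smith-Waterman scoring. The sketch below is only an illustration: it does not use parasail's API, and the function name and the scoring values (match, mismatch, linear gap penalty) are assumptions chosen for the example.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score of a and b (linear gap penalty).

    Scoring values are illustrative assumptions, not parasail defaults.
    """
    prev = [0] * (len(b) + 1)  # previous row of the dynamic programming table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # Each cell needs the diagonal, upper, and left neighbours.
            # The dependence on curr[j - 1] (the cell just computed) is
            # what makes naive vectorization along a row difficult.
            curr[j] = max(0, prev[j - 1] + sub, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

For example, `smith_waterman_score("TTACGTTT", "GGACGTGG")` scores the shared `ACGT` core; only two rows are kept in memory because the score alone, not the alignment path, is returned.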
Surveying unsteady flows by means of movie sequences - A case study
NASA Astrophysics Data System (ADS)
Freymuth, P.; Bank, W.; Finaish, F.
Photographic surveying techniques and their results are presented for vortical pattern development in unsteady two-dimensional flows, which depends on a multitude of parameters that have heretofore hampered broad investigation, in order to delineate the more important parametric dependencies. Samples are given from 100 films representing over 2000 sequences consisting of 400,000 photographic frames. Attention is given to the problems posed by resolution of time and lateral dimensions, spanwise vortical structure, and the dependence of angle of attack on Reynolds number and flow geometry.
Automated antibody structure prediction using Accelrys tools: Results and best practices
Fasnacht, Marc; Butenhof, Ken; Goupil-Lamy, Anne; Hernandez-Guzman, Francisco; Huang, Hongwei; Yan, Lisa
2014-01-01
We describe the methodology and results from our participation in the second Antibody Modeling Assessment experiment. During the experiment we predicted the structure of eleven unpublished antibody Fv fragments. Our prediction methods centered on template-based modeling; potential templates were selected from an antibody database based on their sequence similarity to the target in the framework regions. Depending on the quality of the templates, we constructed models of the antibody framework regions either using a single, chimeric or multiple template approach. The hypervariable loop regions in the initial models were rebuilt by grafting the corresponding regions from suitable templates onto the model. For the H3 loop region, we further refined models using ab initio methods. The final models were subjected to constrained energy minimization to resolve severe local structural problems. The analysis of the models submitted shows that Accelrys tools allow for the construction of quite accurate models for the framework and the canonical CDR regions, with RMSDs to the X-ray structure on average below 1 Å for most of these regions. The results show that accurate prediction of the H3 hypervariable loops remains a challenge. Furthermore, model quality assessment of the submitted models shows that the models are of quite high quality, with local geometry assessment scores similar to those of the target X-ray structures. Proteins 2014; 82:1583–1598. © 2014 The Authors. Proteins published by Wiley Periodicals, Inc. PMID:24833271
Ding, Jiarui; Condon, Anne; Shah, Sohrab P
2018-05-21
Single-cell RNA-sequencing has great potential to discover cell types, identify cell states, trace development lineages, and reconstruct the spatial organization of cells. However, dimension reduction to interpret structure in single-cell sequencing data remains a challenge. Existing algorithms are either not able to uncover the clustering structures in the data or lose global information such as groups of clusters that are close to each other. We present a robust statistical model, scvis, to capture and visualize the low-dimensional structures in single-cell gene expression data. Simulation results demonstrate that low-dimensional representations learned by scvis preserve both the local and global neighbor structures in the data. In addition, scvis is robust to the number of data points and learns a probabilistic parametric mapping function to add new data points to an existing embedding. We then use scvis to analyze four single-cell RNA-sequencing datasets, exemplifying interpretable two-dimensional representations of the high-dimensional single-cell RNA-sequencing data.
2010-01-01
Background Comparative genomics methods such as phylogenetic profiling can mine powerful inferences from inherently noisy biological data sets. We introduce Sites Inferred by Metabolic Background Assertion Labeling (SIMBAL), a method that applies the Partial Phylogenetic Profiling (PPP) approach locally within a protein sequence to discover short sequence signatures associated with functional sites. The approach is based on the basic scoring mechanism employed by PPP, namely the use of binomial distribution statistics to optimize sequence similarity cutoffs during searches of partitioned training sets. Results Here we illustrate and validate the ability of the SIMBAL method to find functionally relevant short sequence signatures by application to two well-characterized protein families. In the first example, we partitioned a family of ABC permeases using a metabolic background property (urea utilization). Thus, the TRUE set for this family comprised members whose genome of origin encoded a urea utilization system. By moving a sliding window across the sequence of a permease, and searching each subsequence in turn against the full set of partitioned proteins, the method found which local sequence signatures best correlated with the urea utilization trait. Mapping of SIMBAL "hot spots" onto crystal structures of homologous permeases reveals that the significant sites are gating determinants on the cytosolic face rather than, say, docking sites for the substrate-binding protein on the extracellular face. In the second example, we partitioned a protein methyltransferase family using gene proximity as a criterion. In this case, the TRUE set comprised those methyltransferases encoded near the gene for the substrate RF-1. SIMBAL identifies sequence regions that map onto the substrate-binding interface while ignoring regions involved in the methyltransferase reaction mechanism in general. 
Neither method for training set construction requires any prior experimental characterization. Conclusions SIMBAL shows that, in functionally divergent protein families, selected short sequences often significantly outperform their full-length parent sequence for making functional predictions by sequence similarity, suggesting avenues for improved functional classifiers. When combined with structural data, SIMBAL affords the ability to localize and model functional sites. PMID:20102603
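The sliding-window search and binomial scoring described in the SIMBAL abstract above can be sketched roughly as follows. This is a simplified reading rather than the authors' implementation: the function names, the one-sided tail probability, and the way the cutoff depth is scanned are all assumptions made for illustration.

```python
from math import comb


def binomial_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def sliding_windows(seq, width):
    """Yield (start, subsequence) for every full-width window of seq."""
    for start in range(len(seq) - width + 1):
        yield start, seq[start:start + width]


def best_cutoff(ranked_labels, background_rate):
    """Scan cutoff depths down a hit list ranked by similarity.

    ranked_labels: booleans, True if the hit belongs to the TRUE partition.
    background_rate: assumed fraction of TRUE members in the whole set.
    Returns (p_value, depth) for the most significant cutoff.
    """
    best = (1.0, 0)
    hits = 0
    for depth, is_true in enumerate(ranked_labels, start=1):
        hits += is_true
        p_value = binomial_tail(hits, depth, background_rate)
        if p_value < best[0]:
            best = (p_value, depth)
    return best
```

In such a scheme each window from `sliding_windows` would be searched against the partitioned protein set, and the window whose ranked hits give the smallest `best_cutoff` probability would mark a candidate "hot spot".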
Ganesan, K; Parthasarathy, S
2011-12-01
Annotation of any newly determined protein sequence depends on the pairwise sequence identity with known sequences. However, for the twilight zone sequences which have only 15-25% identity, the pair-wise comparison methods are inadequate and the annotation becomes a challenging task. Such sequences can be annotated by using methods that recognize their fold. Bowie et al. described a 3D1D profile method in which the amino acid sequences that fold into a known 3D structure are identified by their compatibility to that known 3D structure. We have improved the above method by using the predicted secondary structure information and employ it for fold recognition from the twilight zone sequences. In our Protein Secondary Structure 3D1D (PSS-3D1D) method, a score (w) for the predicted secondary structure of the query sequence is included in finding the compatibility of the query sequence to the known fold 3D structures. In the benchmarks, the PSS-3D1D method shows a maximum of 21% improvement in predicting correctly the α + β class of folds from the sequences with twilight zone level of identity, when compared with the 3D1D profile method. Hence, the PSS-3D1D method could offer more clues than the 3D1D method for the annotation of twilight zone sequences. The web based PSS-3D1D method is freely available in the PredictFold server at http://bioinfo.bdu.ac.in/servers/ .
A palindrome-mediated mechanism distinguishes translocations involving LCR-B of chromosome 22q11.2.
Gotter, Anthony L; Shaikh, Tamim H; Budarf, Marcia L; Rhodes, C Harker; Emanuel, Beverly S
2004-01-01
Two known recurrent constitutional translocations, t(11;22) and t(17;22), as well as a non-recurrent t(4;22), display derivative chromosomes that have joined to a common site within the low copy repeat B (LCR-B) region of 22q11.2. This breakpoint is located between two AT-rich inverted repeats that form a nearly perfect palindrome. Breakpoints within the 11q23, 17q11 and 4q35 partner chromosomes also fall near the center of palindromic sequences. In the present work the breakpoints of a fourth translocation involving LCR-B, a balanced ependymoma-associated t(1;22), were characterized not only to localize this junction relative to known genes, but also to further understand the mechanism underlying these rearrangements. FISH mapping was used to localize the 22q11.2 breakpoint to LCR-B and the 1p21 breakpoint to single BAC clones. STS mapping narrowed the 1p21.2 breakpoint to a 1990 bp AT-rich region, and junction fragments were amplified by nested PCR. Junction fragment-derived sequence indicates that the 1p21.2 breakpoint splits a 278 nt palindrome capable of forming stem-loop secondary structure. In contrast, the 1p21.2 reference genomic sequence from clones in the database does not exhibit this configuration, suggesting a predisposition for regional genomic instability perhaps etiologic for this rearrangement. Given its similarity to known chromosomal fragile site (FRA) sequences, this polymorphic 1p21.2 sequence may represent one of the FRA1 loci. Comparative analysis of the secondary structure of sequences surrounding translocation breakpoints that involve LCR-B with those not involving this region indicate a unique ability of the former to form stem-loop structures. The relative likelihood of forming these configurations appears to be related to the rate of translocation occurrence. Further analysis suggests that constitutional translocations in general occur between sequences of similar melting temperature and propensity for secondary structure.
A palindrome-mediated mechanism distinguishes translocations involving LCR-B of chromosome 22q11.2
Gotter, Anthony L.; Shaikh, Tamim H.; Budarf, Marcia L.; Rhodes, C. Harker; Emanuel, Beverly S.
2010-01-01
Two known recurrent constitutional translocations, t(11;22) and t(17;22), as well as a non-recurrent t(4;22), display derivative chromosomes that have joined to a common site within the low copy repeat B (LCR-B) region of 22q11.2. This breakpoint is located between two AT-rich inverted repeats that form a nearly perfect palindrome. Breakpoints within the 11q23, 17q11 and 4q35 partner chromosomes also fall near the center of palindromic sequences. In the present work the breakpoints of a fourth translocation involving LCR-B, a balanced ependymoma-associated t(1;22), were characterized not only to localize this junction relative to known genes, but also to further understand the mechanism underlying these rearrangements. FISH mapping was used to localize the 22q11.2 breakpoint to LCR-B and the 1p21 breakpoint to single BAC clones. STS mapping narrowed the 1p21.2 breakpoint to a 1990 bp AT-rich region, and junction fragments were amplified by nested PCR. Junction fragment-derived sequence indicates that the 1p21.2 breakpoint splits a 278 nt palindrome capable of forming stem–loop secondary structure. In contrast, the 1p21.2 reference genomic sequence from clones in the database does not exhibit this configuration, suggesting a predisposition for regional genomic instability perhaps etiologic for this rearrangement. Given its similarity to known chromosomal fragile site (FRA) sequences, this polymorphic 1p21.2 sequence may represent one of the FRA1 loci. Comparative analysis of the secondary structure of sequences surrounding translocation breakpoints that involve LCR-B with those not involving this region indicate a unique ability of the former to form stem–loop structures. The relative likelihood of forming these configurations appears to be related to the rate of translocation occurrence. Further analysis suggests that constitutional translocations in general occur between sequences of similar melting temperature and propensity for secondary structure. PMID:14613967
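In DNA, a palindrome such as those discussed in the abstract above is a sequence that equals its own reverse complement, which is what allows a single strand to fold back into a stem-loop. A minimal sketch of that check follows; the function names are hypothetical, and only perfect base pairing of A, C, G and T is considered.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")
_PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_palindrome(seq):
    """True if the sequence equals its own reverse complement."""
    s = seq.upper()
    return s == reverse_complement(s)


def palindrome_arm_length(seq, center):
    """Length of the perfect inverted repeat around a candidate center.

    center is the index of the first base to the right of the symmetry
    axis; bases are paired outward from the axis until a mismatch or
    the end of the sequence is reached.
    """
    s = seq.upper()
    arm = 0
    while (
        center - 1 - arm >= 0
        and center + arm < len(s)
        and _PAIRS.get(s[center - 1 - arm]) == s[center + arm]
    ):
        arm += 1
    return arm
```

For instance, `is_palindrome("GAATTC")` holds for the EcoRI recognition site. A real analysis of stem-loop propensity would also need to allow spacers and mismatches and to weigh thermodynamic stability, which this sketch ignores.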
De Novo Protein Structure Prediction
NASA Astrophysics Data System (ADS)
Hung, Ling-Hong; Ngan, Shing-Chung; Samudrala, Ram
An unparalleled amount of sequence data is being made available from large-scale genome sequencing efforts. The data provide a shortcut to the determination of the function of a gene of interest, as long as there is an existing sequenced gene with similar sequence and of known function. This has spurred structural genomic initiatives with the goal of determining as many protein folds as possible (Brenner and Levitt, 2000; Burley, 2000; Brenner, 2001; Heinemann et al., 2001). The purpose of this is twofold: First, the structure of a gene product can often lead to direct inference of its function. Second, since the function of a protein is dependent on its structure, direct comparison of the structures of gene products can be more sensitive than the comparison of sequences of genes for detecting homology. Presently, structural determination by crystallography and NMR techniques is still slow and expensive in terms of manpower and resources, despite attempts to automate the processes. Computer structure prediction algorithms, while not providing the accuracy of the traditional techniques, are extremely quick and inexpensive and can provide useful low-resolution data for structure comparisons (Bonneau and Baker, 2001). Given the immense number of structures which the structural genomic projects are attempting to solve, there would be a considerable gain even if the computer structure prediction approach were applicable to a subset of proteins.
Muro-Pastor, Alicia M.; Valladares, Ana; Flores, Enrique; Herrero, Antonia
1999-01-01
The heterocyst is the site of nitrogen fixation in aerobically grown cultures of some filamentous cyanobacteria. Heterocyst development in Anabaena sp. strain PCC 7120 is dependent on the global nitrogen regulator NtcA and requires, among others, the products of the hetR and hetC genes. Expression of hetC, tested by RNA-DNA hybridization, was impaired in an ntcA mutant. A nitrogen-regulated, NtcA-dependent putative transcription start point was localized at nucleotide −571 with respect to the hetC translational start. Sequences upstream from this transcription start point exhibit the structure of the canonical cyanobacterial promoter activated by NtcA, and purified NtcA protein specifically bound to a DNA fragment containing this promoter. Activation of expression of hetC during heterocyst development appears thus to be directly operated by NtcA. NtcA-mediated activation of hetR expression was not impaired in a hetC mutant, indicating that HetC is not an NtcA-dependent element required for hetR induction. PMID:10542167
Bonham, Andrew J.; Wenta, Nikola; Osslund, Leah M.; Prussin, Aaron J.; Vinkemeier, Uwe; Reich, Norbert O.
2013-01-01
The DNA-binding specificity and affinity of the dimeric human transcription factor (TF) STAT1, were assessed by total internal reflectance fluorescence protein-binding microarrays (TIRF-PBM) to evaluate the effects of protein phosphorylation, higher-order polymerization and small-molecule inhibition. Active, phosphorylated STAT1 showed binding preferences consistent with prior characterization, whereas unphosphorylated STAT1 showed a weak-binding preference for one-half of the GAS consensus site, consistent with recent models of STAT1 structure and function in response to phosphorylation. This altered-binding preference was further tested by use of the inhibitor LLL3, which we show to disrupt STAT1 binding in a sequence-dependent fashion. To determine if this sequence-dependence is specific to STAT1 and not a general feature of human TF biology, the TF Myc/Max was analysed and tested with the inhibitor Mycro3. Myc/Max inhibition by Mycro3 is sequence independent, suggesting that the sequence-dependent inhibition of STAT1 may be specific to this system and a useful target for future inhibitor design. PMID:23180800
Patterns of Post-Glacial Genetic Differentiation in Marginal Populations of a Marine Microalga
Tahvanainen, Pia; Alpermann, Tilman J.; Figueroa, Rosa Isabel; John, Uwe; Hakanen, Päivi; Nagai, Satoshi; Blomster, Jaanika; Kremp, Anke
2012-01-01
This study investigates the genetic structure of a eukaryotic microorganism, the toxic dinoflagellate Alexandrium ostenfeldii, from the Baltic Sea, a geologically young and ecologically marginal brackish water estuary which is predicted to support evolution of distinct, genetically impoverished lineages of marine macroorganisms. Analyses of the internal transcribed spacer (ITS) sequences and Amplified Fragment Length Polymorphism (AFLP) of 84 A. ostenfeldii isolates from five different Baltic locations and multiple external sites revealed that Baltic A. ostenfeldii is phylogenetically differentiated from other lineages of the species and micro-geographically fragmented within the Baltic Sea. Significant genetic differentiation (F_ST) between northern and southern locations was correlated to geographical distance. However, instead of discrete genetic units or continuous genetic differentiation, the analysis of population structure suggests a complex and partially hierarchic pattern of genetic differentiation. The observed pattern suggests that initial colonization was followed by local differentiation and varying degrees of dispersal, most likely depending on local habitat conditions and prevailing current systems separating the Baltic Sea populations. Local subpopulations generally exhibited low levels of overall gene diversity. Association analysis suggests predominately asexual reproduction most likely accompanied by frequency shifts of clonal lineages during planktonic growth. Our results indicate that the general pattern of genetic differentiation and reduced genetic diversity of Baltic populations found in large organisms also applies to microscopic eukaryotic organisms. PMID:23300940
Patterns of post-glacial genetic differentiation in marginal populations of a marine microalga.
Tahvanainen, Pia; Alpermann, Tilman J; Figueroa, Rosa Isabel; John, Uwe; Hakanen, Päivi; Nagai, Satoshi; Blomster, Jaanika; Kremp, Anke
2012-01-01
This study investigates the genetic structure of a eukaryotic microorganism, the toxic dinoflagellate Alexandrium ostenfeldii, from the Baltic Sea, a geologically young and ecologically marginal brackish water estuary which is predicted to support evolution of distinct, genetically impoverished lineages of marine macroorganisms. Analyses of the internal transcribed spacer (ITS) sequences and Amplified Fragment Length Polymorphism (AFLP) of 84 A. ostenfeldii isolates from five different Baltic locations and multiple external sites revealed that Baltic A. ostenfeldii is phylogenetically differentiated from other lineages of the species and micro-geographically fragmented within the Baltic Sea. Significant genetic differentiation (F_ST) between northern and southern locations was correlated to geographical distance. However, instead of discrete genetic units or continuous genetic differentiation, the analysis of population structure suggests a complex and partially hierarchic pattern of genetic differentiation. The observed pattern suggests that initial colonization was followed by local differentiation and varying degrees of dispersal, most likely depending on local habitat conditions and prevailing current systems separating the Baltic Sea populations. Local subpopulations generally exhibited low levels of overall gene diversity. Association analysis suggests predominately asexual reproduction most likely accompanied by frequency shifts of clonal lineages during planktonic growth. Our results indicate that the general pattern of genetic differentiation and reduced genetic diversity of Baltic populations found in large organisms also applies to microscopic eukaryotic organisms.
Recognition of coarse-grained protein tertiary structure.
Lezon, Timothy; Banavar, Jayanth R; Maritan, Amos
2004-05-15
A model of the protein backbone is considered in which each residue is characterized by the location of its Cα atom and one of a discrete set of conformal (φ, ψ) states. We investigate the key differences between a description that offers a locally precise fit to known backbone structures and one that provides a globally accurate fit to protein structures. Using a statistical scoring scheme and threading, a protein's local best-fit conformation is highly recognizable, but its global structure cannot be directly determined from an amino acid sequence. The incorporation of information about the conformal states of neighboring residues along the chain allows one to accurately translate the local structure into a global structure. We present a two-step algorithm, which recognizes up to 95% of the tested protein native-state structures to within a 2.5 Å root mean square deviation. Copyright 2004 Wiley-Liss, Inc.
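The root mean square deviation quoted in the abstract above measures the average distance between corresponding atoms of two structures. A minimal sketch follows, assuming the two sets of alpha-carbon coordinates are already optimally superimposed and listed in the same residue order; the function name and the tuple-based input format are illustrative choices, not part of the cited work.

```python
from math import sqrt


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equal-length lists of (x, y, z).

    Assumes the structures are already superimposed; no rotation or
    translation fitting is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return sqrt(total / len(coords_a))
```

With coordinates in ångströms, a predicted structure would count as recognized under the abstract's criterion when the value after superposition is at most 2.5.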
Contingency Table Browser - prediction of early stage protein structure.
Kalinowska, Barbara; Krzykalski, Artur; Roterman, Irena
2015-01-01
The Early Stage (ES) intermediate represents the starting structure in protein folding simulations based on the Fuzzy Oil Drop (FOD) model. The accuracy of FOD predictions is greatly dependent on the accuracy of the chosen intermediate. A suitable intermediate can be constructed using the sequence-structure relationship information contained in the so-called contingency table; this table expresses the likelihood of encountering various structural motifs for each tetrapeptide fragment in the amino acid sequence. The limited accuracy with which such structures could previously be predicted provided the motivation for a more in-depth study of the contingency table itself. The Contingency Table Browser is a tool which can visualize, search and analyze the table. Our work presents possible applications of the Contingency Table Browser, among them the analysis of specific protein sequences from the point of view of their structural ambiguity.
Cdc6 localizes to S- and G2-phase centrosomes in a cell cycle-dependent manner
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Gwang Su; Kang, Jeeheon; Bang, Sung Woong
2015-01-16
Highlights: • Cdc6 protein is a component of the pre-replicative complex required for chromosomal replication initiation. • Cdc6 localized to centrosomes of S and G2 phases in a cell cycle-dependent manner. • The centrosomal localization was governed by centrosomal localization signal sequences of Cdc6. • Deletions or substitution mutations on the centrosomal localization signal interfered with centrosomal localization of the Cdc6 proteins. - Abstract: The Cdc6 protein has been primarily investigated as a component of the pre-replicative complex for the initiation of chromosome replication, which contributes to maintenance of chromosomal integrity. Here, we show that Cdc6 localized to the centrosomes during S and G2 phases of the cell cycle. The centrosomal localization was mediated by Cdc6 amino acid residues 311–366, which are conserved within other Cdc6 homologues and contain a putative nuclear export signal. Deletions or substitutions of the amino acid residues did not allow the proteins to localize to centrosomes. In contrast, DsRed tag fused to the amino acid residues localized to centrosomes. These results indicated that a centrosome localization signal is contained within amino acid residues 311–366. The cell cycle-dependent centrosomal localization of Cdc6 in S and G2 phases suggests a novel function of Cdc6 in centrosomes.
Iterative refinement of structure-based sequence alignments by Seed Extension
Kim, Changhoon; Tai, Chin-Hsien; Lee, Byungkook
2009-01-01
Background Accurate sequence alignment is required in many bioinformatics applications but, when sequence similarity is low, it is difficult to obtain accurate alignments based on sequence similarity alone. The accuracy improves when the structures are available, but current structure-based sequence alignment procedures still mis-align substantial numbers of residues. In order to correct such errors, we previously explored the possibility of replacing the residue-based dynamic programming algorithm in structure alignment procedures with the Seed Extension algorithm, which does not use a gap penalty. Here, we describe a new procedure called RSE (Refinement with Seed Extension) that iteratively refines a structure-based sequence alignment. Results RSE uses SE (Seed Extension) in its core, which is an algorithm that we reported recently for obtaining a sequence alignment from two superimposed structures. The RSE procedure was evaluated by comparing the correctly aligned fractions of residues before and after the refinement of the structure-based sequence alignments produced by popular programs. CE, DaliLite, FAST, LOCK2, MATRAS, MATT, TM-align, SHEBA and VAST were included in this analysis and the NCBI's CDD root node set was used as the reference alignments. RSE improved the average accuracy of sequence alignments for all programs tested when no shift error was allowed. The amount of improvement varied depending on the program. The average improvements were small for DaliLite and MATRAS but about 5% for CE and VAST. More substantial improvements have been seen in many individual cases. The additional computation times required for the refinements were negligible compared to the times taken by the structure alignment programs. Conclusion RSE is a computationally inexpensive way of improving the accuracy of a structure-based sequence alignment. 
It can be used as a standalone procedure following a regular structure-based sequence alignment or to replace the traditional iterative refinement procedures based on residue-level dynamic programming algorithms in many structure alignment programs. PMID:19589133
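The evaluation metric mentioned in this abstract (the fraction of correctly aligned residues, with or without an allowed shift error) can be illustrated with a minimal Python sketch. The abstract does not give the exact definition, so the one below is an assumption: an alignment is a list of (i, j) residue-index pairs, and a reference pair counts as correct when the test alignment pairs residue i with a partner within `max_shift` positions of the reference partner. The function name and the toy data are hypothetical.

```python
def aligned_fraction(test_pairs, ref_pairs, max_shift=0):
    """Fraction of reference pairs reproduced by the test alignment.

    Assumed definition (not taken from the paper): a reference pair (i, j_ref)
    is correct if the test alignment pairs i with some j_test such that
    |j_test - j_ref| <= max_shift.
    """
    if not ref_pairs:
        return 0.0
    test_map = dict(test_pairs)  # residue in structure 1 -> partner in structure 2
    correct = 0
    for i, j_ref in ref_pairs:
        j_test = test_map.get(i)
        if j_test is not None and abs(j_test - j_ref) <= max_shift:
            correct += 1
    return correct / len(ref_pairs)


# Toy example: residue 1 is shifted by one position, residue 3 by two.
reference = [(0, 0), (1, 1), (2, 2), (3, 3)]
candidate = [(0, 0), (1, 2), (2, 2), (3, 5)]
strict = aligned_fraction(candidate, reference, max_shift=0)
relaxed = aligned_fraction(candidate, reference, max_shift=1)
```

Comparing such a score before and after refinement is one way to express the kind of improvement the abstract reports.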
Johnson, Christopher M; Chen, Yuqing; Lee, Heejin; Ke, Ailong; Weaver, Keith E; Dunny, Gary M
2014-03-04
Anti-Q is a small RNA encoded on pCF10, an antibiotic resistance plasmid of Enterococcus faecalis, which negatively regulates conjugation of the plasmid. In this study we sought to understand how Anti-Q is generated relative to larger transcripts of the same operon. We found that Anti-Q folds into a branched structure that functions as a factor-independent terminator. In vitro and in vivo, termination is dependent on the integrity of this structure as well as the presence of a 3' polyuridine tract, but is not dependent on other downstream sequences. In vitro, terminated transcripts are released from RNA polymerase after synthesis. In vivo, a mutant with reduced termination efficiency demonstrated loss of tight control of conjugation function. A search of bacterial genomes revealed the presence of sequences that encode Anti-Q-like RNA structures. In vitro and in vivo experiments demonstrated that one of these functions as a terminator. This work reveals a previously unappreciated flexibility in the structure of factor-independent terminators and identifies a mechanism for generation of functional small RNAs; it should also inform annotation of bacterial sequence features, such as terminators, functional sRNAs, and operons.
Johnson, Christopher M.; Chen, Yuqing; Lee, Heejin; Ke, Ailong; Weaver, Keith E.; Dunny, Gary M.
2014-01-01
Anti-Q is a small RNA encoded on pCF10, an antibiotic resistance plasmid of Enterococcus faecalis, which negatively regulates conjugation of the plasmid. In this study we sought to understand how Anti-Q is generated relative to larger transcripts of the same operon. We found that Anti-Q folds into a branched structure that functions as a factor-independent terminator. In vitro and in vivo, termination is dependent on the integrity of this structure as well as the presence of a 3′ polyuridine tract, but is not dependent on other downstream sequences. In vitro, terminated transcripts are released from RNA polymerase after synthesis. In vivo, a mutant with reduced termination efficiency demonstrated loss of tight control of conjugation function. A search of bacterial genomes revealed the presence of sequences that encode Anti-Q–like RNA structures. In vitro and in vivo experiments demonstrated that one of these functions as a terminator. This work reveals a previously unappreciated flexibility in the structure of factor-independent terminators and identifies a mechanism for generation of functional small RNAs; it should also inform annotation of bacterial sequence features, such as terminators, functional sRNAs, and operons. PMID:24550474
Turci, Marco; Romanelli, Maria Grazia; Lorenzi, Pamela; Righi, Paola; Bertazzoni, Umberto
2006-01-03
Human T-cell lymphotropic viruses (HTLV) types I and II are closely related oncogenic retroviruses that have been associated with lymphoproliferative and neurological disorders. The proviral genome encodes a trans-regulatory Tax protein that activates viral genes and upregulates various cellular genes involved in both cell growth and transformation. Tax proteins of HTLV-I (Tax-I) and HTLV-II (Tax-II) exhibit more than 77% aa homology and expression of either Tax-I or Tax-II is sufficient for immortalization of cultured T lymphocytes. Tax-I shuttles from the nucleus to the cytoplasm and accumulates within the nucleus, whereas Tax-II is found mainly in the cytoplasm. In the present study we have used recombinant vectors to analyze the size and structure of the nuclear localization domain within the Tax-II protein sequence. The Tax-II protein was expressed in HeLa cells either as the complete protein, or regions thereof, that were individually fused to the green fluorescent protein (GFP). Immunoblot analysis of the fused Tax-II products confirmed their expression and size. Fluorescence microscopy studies indicated that the complete Tax-II as well as N-truncated forms presented a punctate cytoplasmic distribution and that a nuclear localization determinant is confined to within the first 60 aa of Tax-II. Accordingly, site-directed mutagenesis and deletion of specific sequences within the first 60 aa showed that the nuclear determinant lies within the first 41 residues of Tax-II. These results point to a direct involvement of the amino-terminal residues of Tax-II protein in determining its nuclear functionality.
Bao, Yunhe; White, Cindy L; Luger, Karolin
2006-08-25
Poly(dA.dT) DNA sequence elements are thought to promote transcription by either excluding nucleosomes or by altering their structural or dynamic properties. Here, the stability and structure of a defined nucleosome core particle containing a 16 base-pair poly(dA.dT) element (A16 NCP) was investigated. The A16 NCP requires a significantly higher temperature for histone octamer sliding in vitro compared to comparable nucleosomes that do not contain a poly(dA.dT) element. Fluorescence resonance energy transfer showed that the interactions between the nucleosomal DNA ends and the histone octamer were destabilized in A16 NCP. The crystal structure of A16 NCP was determined to a resolution of 3.2 Å. The overall structure was maintained except for local deviations in DNA conformation. These results are consistent with previous in vivo and in vitro observations that poly(dA.dT) elements cause only modest changes in DNA accessibility and modest increases in steady-state transcription levels.
Role of Sequence and Structural Polymorphism on the Mechanical Properties of Amyloid Fibrils
Kim, Jae In; Na, Sungsoo; Eom, Kilho
2014-01-01
Amyloid fibrils, which play a critical role in disease expression, have recently been found to exhibit excellent mechanical properties, such as an elastic modulus on the order of 10 GPa, which is comparable to that of other mechanical proteins such as microtubules, actin filaments, and spider silk. These remarkable mechanical properties of amyloid fibrils are correlated with their functional role in disease expression. This suggests the importance of understanding how these excellent mechanical properties originate through a self-assembly process that may depend on the amino acid sequence. However, the sequence-structure-property relationship of amyloid fibrils is not yet fully understood. In this work, we characterize the mechanical properties of human islet amyloid polypeptide (hIAPP) fibrils with respect to their molecular structures as well as their amino acid sequence by using all-atom explicit water molecular dynamics (MD) simulation. The simulation result suggests that the remarkable bending rigidity of amyloid fibrils can be achieved through a specific self-aggregation pattern such as antiparallel stacking of β strands (peptide chain). Moreover, we have shown that a single point mutation of the hIAPP chain constituting a hIAPP fibril significantly affects the thermodynamic stability of hIAPP fibrils formed by parallel stacking of peptide chains, and that a single point mutation results in a significant change in the bending rigidity of hIAPP fibrils formed by antiparallel stacking of β strands. This clearly elucidates the role of amino acid sequence in not only the equilibrium conformations of amyloid fibrils but also their mechanical properties. Our study sheds light on sequence-structure-property relationships of amyloid fibrils, which suggests that the mechanical properties of amyloid fibrils are encoded in their sequence-dependent molecular architecture. PMID:24551113
NoFold: RNA structure clustering without folding or alignment.
Middleton, Sarah A; Kim, Junhyong
2014-11-01
Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function, for example, recruiting a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, which results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
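The central idea in this abstract, comparing two objects through their respective scores against a shared collection of models, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the model identifiers and scores below are made-up numbers (the real method scores sequences against Rfam covariance models), the helper names are hypothetical, and any normalization NoFold applies is not described in the abstract and is omitted here.

```python
import math


def embed(scores_by_model, model_ids):
    """Turn a {model_id: score} mapping into a fixed-order feature vector.

    Missing models default to 0.0 (an assumption made for this sketch).
    """
    return [scores_by_model.get(m, 0.0) for m in model_ids]


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Hypothetical scores of three sequences against two reference models.
models = ["model_1", "model_2"]
seq_a = embed({"model_1": 3.0, "model_2": 0.0}, models)
seq_b = embed({"model_1": 0.0, "model_2": 4.0}, models)
seq_c = embed({"model_1": 3.0, "model_2": 1.0}, models)

dist_ab = euclidean(seq_a, seq_b)  # dissimilar score profiles
dist_ac = euclidean(seq_a, seq_c)  # similar score profiles
```

Sequences with similar score profiles end up close together in this space, so a standard clustering step can group them without folding or aligning them pairwise.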
INFO-RNA--a fast approach to inverse RNA folding.
Busch, Anke; Backofen, Rolf
2006-08-01
The structure of RNA molecules is often crucial for their function. Therefore, secondary structure prediction has gained much interest. Here, we consider the inverse RNA folding problem, which means designing RNA sequences that fold into a given structure. We introduce a new algorithm for the inverse folding problem (INFO-RNA) that consists of two parts: a dynamic programming method for good initial sequences and a subsequent improved stochastic local search that uses an effective neighbor selection method. During the initialization, we design a sequence that among all sequences adopts the given structure with the lowest possible energy. For the selection of neighbors during the search, we use a kind of look-ahead of one selection step applying an additional energy-based criterion. Afterwards, the pre-ordered neighbors are tested using the actual optimization criterion of minimizing the structure distance between the target structure and the mfe structure of the considered neighbor. We compared our algorithm to RNAinverse and RNA-SSD for artificial and biological test sets. Using INFO-RNA, we performed better than RNAinverse and, in most cases, we gained better results than RNA-SSD, probably the best inverse RNA folding tool available. www.bioinf.uni-freiburg.de/Subpages/software.html.
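The optimization criterion named in this abstract is a structure distance between the target structure and the predicted structure of a candidate sequence. The abstract does not say which distance is used; the Python sketch below assumes the common base-pair distance on dot-bracket strings (the number of base pairs present in exactly one of the two structures). The function names are hypothetical and pseudoknots are not handled.

```python
def base_pairs(dot_bracket):
    """Return the set of (i, j) base pairs encoded by a dot-bracket string."""
    stack, pairs = [], set()
    for i, ch in enumerate(dot_bracket):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced ')' at position %d" % i)
            pairs.add((stack.pop(), i))
    if stack:
        raise ValueError("unbalanced '(' in structure")
    return pairs


def base_pair_distance(struct_a, struct_b):
    """Count base pairs that occur in exactly one of the two structures."""
    if len(struct_a) != len(struct_b):
        raise ValueError("structures must have equal length")
    return len(base_pairs(struct_a) ^ base_pairs(struct_b))


target = "((..))"
d_close = base_pair_distance(target, "(....)")  # one pair missing
d_open = base_pair_distance(target, "......")   # both pairs missing
```

A local search of the kind described would accept a candidate mutation when it lowers this distance, stopping when the distance reaches zero.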
NASA Astrophysics Data System (ADS)
Lazo, Edmundo; Saavedra, Eduardo; Humire, Fernando; Castro, Cristobal; Cortés-Cortés, Francisco
2015-09-01
We study the localization properties of direct transmission lines when we distribute two values of inductances L_A and L_B according to a generalized Thue-Morse aperiodic sequence generated by the inflation rule: A → AB^(m-1), B → BA^(m-1), with m ≥ 2 an integer. We regain the usual Thue-Morse sequence for m = 2. We numerically study the changes produced in the localization properties of the I(ω) electric current function with increasing m values. We demonstrate that the m = 2 case does not belong to the family m ≥ 3, because when m changes from m = 2 to m = 3, the number of extended states decreases significantly. However, for m ≫ 3, the localization properties become similar to the m = 2 case. Also, the
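The inflation rule quoted in this abstract (A is replaced by A followed by m-1 copies of B, and B by B followed by m-1 copies of A) is easy to make concrete. The Python sketch below generates the symbol sequence; the function name, the choice of "A" as the starting symbol, and the number of generations are assumptions made for illustration.

```python
def generalized_thue_morse(m, generations, seed="A"):
    """Apply the inflation rule A -> A B^(m-1), B -> B A^(m-1) repeatedly."""
    if m < 2:
        raise ValueError("m must be an integer >= 2")
    rules = {"A": "A" + "B" * (m - 1), "B": "B" + "A" * (m - 1)}
    word = seed
    for _ in range(generations):
        # Replace every symbol simultaneously, as an inflation rule requires.
        word = "".join(rules[symbol] for symbol in word)
    return word


usual = generalized_thue_morse(2, 3)   # m = 2 gives the usual Thue-Morse word
triple = generalized_thue_morse(3, 2)  # first member of the m >= 3 family
```

Each generation multiplies the length by m, so the word has m**generations symbols; mapping A and B to the two inductance values gives the arrangement along the line.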
Using chaos to generate variations on movement sequences
NASA Astrophysics Data System (ADS)
Bradley, Elizabeth; Stuart, Joshua
1998-12-01
We describe a method for introducing variations into predefined motion sequences using a chaotic symbol-sequence reordering technique. A progression of symbols representing the body positions in a dance piece, martial arts form, or other motion sequence is mapped onto a chaotic trajectory, establishing a symbolic dynamics that links the movement sequence and the attractor structure. A variation on the original piece is created by generating a trajectory with slightly different initial conditions, inverting the mapping, and using special corpus-based graph-theoretic interpolation schemes to smooth any abrupt transitions. Sensitive dependence guarantees that the variation is different from the original; the attractor structure and the symbolic dynamics guarantee that the two resemble one another in both aesthetic and mathematical senses.
Ishikawa, Kazuki; Matsuoka, Satoshi; Hara, Hiroshi; Matsumoto, Kouji
2017-10-18
The Min system, which inhibits assembly of the cytokinetic protein FtsZ, is largely responsible for positioning the division site in rod-shaped bacteria. It has been reported that MinJ, which bridges DivIVA and MinD, is targeted to the cell poles by an interaction with DivIVA, and that MinJ in turn recruits MinCD to the cell poles. MinC, however, is located primarily at active division sites at mid-cell when expressed from its native promoter. Surprisingly, we found that Bacillus subtilis MinD is located at nascent septal membranes and at an asymmetric site on lateral membranes between nascent septal membranes in filamentous cells lacking MinJ or DivIVA. Bacillus subtilis MinD has two amphipathic α-helices rich in basic amino acid residues at its C-terminus; one of these, named MTS1 here, is the counterpart of the membrane targeting sequence (MTS) in Escherichia coli MinD while the other, named MTS-like sequence (MTSL), is the nearest helix to MTS1. These amphipathic helices were located independently at nascent septal membranes in cells lacking MinJ or DivIVA, whereas elimination of the helices from the wild type protein reduced its localization considerably. MinD variants with altered MTS1 and MTSL, in which basic amino acid residues were replaced with proline or acidic residues, were not located at nascent septal membranes, indicating that the binding to the nascent septal membranes requires basic residues and a helical structure. The septal localization of MTSL, but not of MTS1, was dependent on host cell MinD. These results suggest that MinD is targeted to nascent septal membranes via its C-terminal amphipathic α-helices in B. subtilis cells lacking MinJ or DivIVA. Moreover, the diffuse distribution of MinD lacking both MTSs suggests that only a small fraction of MinD depends on MinJ for its localization to nascent septal membranes.
Rtools: a web server for various secondary structural analyses on single RNA sequences.
Hamada, Michiaki; Ono, Yukiteru; Kiryu, Hisanori; Sato, Kengo; Kato, Yuki; Fukunaga, Tsukasa; Mori, Ryota; Asai, Kiyoshi
2016-07-08
The secondary structures, as well as the nucleotide sequences, are important features of RNA molecules for characterizing their functions. According to the thermodynamic model, however, the probability of any secondary structure is very small. As a consequence, any tool to predict the secondary structures of RNAs has limited accuracy. On the other hand, there are a few tools that compensate for the imperfect predictions by calculating and visualizing the secondary structural information from RNA sequences. It is desirable to obtain the rich information from those tools through a friendly interface. We implemented a web server of the tools to predict secondary structures and to calculate various structural features based on the energy models of secondary structures. By just giving an RNA sequence to the web server, the user can get the different types of solutions of the secondary structures, the marginal probabilities such as base-pairing probabilities, loop probabilities and accessibilities of the local bases, the energy changes by arbitrary base mutations as well as the measures for validations of the predicted secondary structures. The web server is available at http://rtools.cbrc.jp, which integrates software tools, CentroidFold, CentroidHomfold, IPKnot, CapR, Raccess, Rchange and RintD. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Scoring-and-unfolding trimmed tree assembler: concepts, constructs and comparisons.
Narzisi, Giuseppe; Mishra, Bud
2011-01-15
Mired by its connection to a well-known NP-complete combinatorial optimization problem, namely the Shortest Common Superstring Problem (SCSP), the whole-genome sequence assembly (WGSA) problem has historically been assumed to be amenable only to greedy and heuristic methods. By placing efficiency as their first priority, these methods opted to rely only on local searches, and are thus inherently approximate, ambiguous or error prone, especially for genomes with complex structures. Furthermore, since choice of the best heuristics depended critically on the properties of (e.g. errors in) the input data and the available long range information, these approaches hindered designing an error free WGSA pipeline. We dispense with the idea of limiting the solutions to just the approximated ones, and instead favor an approach that could potentially lead to an exhaustive (exponential-time) search of all possible layouts. Its computational complexity thus must be tamed through a constrained search (Branch-and-Bound) and quick identification and pruning of implausible overlays. For this purpose, such a method necessarily relies on a set of score functions (oracles) that can combine different structural properties (e.g. transitivity, coverage, physical maps, etc.). We give a detailed description of this novel assembly framework, referred to as Scoring-and-Unfolding Trimmed Tree Assembler (SUTTA), and present experimental results on several bacterial genomes using next-generation sequencing technology data. We also report experimental evidence that the assembly quality strongly depends on the choice of the minimum overlap parameter k. SUTTA's binaries are freely available to non-profit institutions for research and educational purposes at http://www.bioinformatics.nyu.edu.
Tu, Z; Hagedorn, H H
1997-02-01
Pyruvate carboxylase (PC, pyruvate: carbon dioxide ligase [ADP-forming], EC 6.4.1.1) was purified from the yellow fever mosquito, Aedes aegypti. The purified PC showed two polypeptides of similar M(r) (133 and 128 k). The N-terminal sequences of both polypeptides were shown to be very similar, if not identical. A polyclonal antiserum against the 133 kDa polypeptide cross-reacted strongly with the 128 kDa polypeptide. PC was found in all tissues examined. Using a semi-quantitative Western blot assay, PC was shown to be concentrated in the indirect flight muscles and fat body preparations. The ratios of the 133 to 128 kDa polypeptides were shown to differ in various tissues and an Aedes albopictus cell line. The indirect flight muscle was the only tissue in which the 128 kDa polypeptide was more abundant, while both the midgut and the cell line showed almost exclusively the 133 kDa polypeptide. Both peptides were present in varying amounts in brain, malpighian tubule, ovary and fat body preparation. The two isoforms of PC could play different roles in the flight muscle and other tissues. Clones covering a complete cDNA of PC of A. aegypti were obtained using a directional approach. The 3952 bp nucleotide sequence, including a 3585 bp coding region, was determined from these cDNA clones. The deduced 1195 amino acid sequence has a calculated M(r) of 132,200. A putative mitochondrial targeting sequence was determined by comparing the deduced amino acid sequence to the N-terminal sequences of the mature protein. The presence of a mitochondrial targeting sequence indicates that the mosquito PC encoded by the cloned cDNA may be localized in the mitochondria. After the targeting sequence, three functional domains were identified in the following order; biotin carboxylase (BC), carboxyltransferase (CT) and biotin carboxyl carrier protein (BCCP). The mosquito PC showed very high similarity to PCs from other sources (55.1-75.2% identity). 
Genomic Southern analysis indicated that there could be two similar PC genes or a single PC gene with allelic polymorphism in the A. aegypti genome. The evolutionary relationship of PCs among different organisms was consistent with the accepted evolutionary relationship of their host organisms. The evolution of the domain structures of the biotin-dependent carboxylases including PC was also investigated. This analysis indicates that biotin-dependent carboxylases evolved from a common origin. The analysis also provides evidence for early gene duplication events that shaped the family of biotin-dependent carboxylases. Clear evidence for the coevolution of BC and BCCP domains is presented, although they are associated with very different CT domains and the relative position of the three functional domains varies between members of the biotin-dependent carboxylases.
Detection of nucleic acids by multiple sequential invasive cleavages
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann; Kwiatkowski, Robert W.; Vavra, Stephanie H.
2005-03-29
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of nucleic acid from various viruses in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.
2002-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages
Hall, Jeff G; Lyamichev, Victor I; Mast, Andrea L; Brow, Mary Ann D
2012-10-16
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi
2013-11-20
With the remarkable increase of genomic sequence data of microorganisms, novel tools are needed for comprehensive analyses of the big sequence data available. The self-organizing map (SOM) is an effective tool for clustering and visualizing high-dimensional data, such as oligonucleotide composition on one map. By modifying the conventional SOM, we developed batch-learning SOM (BLSOM), which allowed classification of sequence fragments (e.g., 1 kb) according to phylotypes, solely depending on oligonucleotide composition. Metagenomics studies of uncultivable microorganisms in clinical and environmental samples should allow extensive surveys of genes important in life sciences. BLSOM is most suitable for phylogenetic assignment of metagenomic sequences, because fragmental sequences can be clustered according to phylotypes, solely depending on oligonucleotide composition. We first constructed oligonucleotide BLSOMs for all available sequences from genomes of known species, and by mapping metagenomic sequences on these large-scale BLSOMs, we can predict phylotypes of individual metagenomic sequences, revealing a microbial community structure of uncultured microorganisms, including viruses. BLSOM has shown that influenza viruses isolated from humans and birds clearly differ in oligonucleotide composition. Based on this host-dependent oligonucleotide composition, we have proposed strategies for predicting directional changes of virus sequences and for surveilling potentially hazardous strains when introduced into humans from non-human sources.
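The input to the clustering described in this abstract is the oligonucleotide composition of each sequence fragment. A minimal Python sketch of that feature vector follows; it is not the BLSOM algorithm itself. The function name, the default word length k = 4, and the decision to skip windows containing ambiguous bases are assumptions, and any additional processing the authors apply (for example, treating a word and its reverse complement as one feature) is not reproduced here.

```python
from itertools import product


def oligo_composition(sequence, k=4):
    """Relative frequency of every length-k word over the alphabet ACGT.

    Windows containing other characters (e.g. N) are skipped. The vector is
    ordered lexicographically: AA..A, AA..C, and so on.
    """
    sequence = sequence.upper()
    words = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(words, 0)
    total = 0
    for start in range(len(sequence) - k + 1):
        word = sequence[start:start + k]
        if word in counts:
            counts[word] += 1
            total += 1
    return [counts[w] / total if total else 0.0 for w in words]


# Dinucleotide composition of a short toy fragment.
vector = oligo_composition("ACGTAC", k=2)
```

Vectors of this kind, computed for fragments of roughly equal length, are what a self-organizing map would then arrange on a two-dimensional grid.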
Geologic Map of the Gold Creek Gold District, Elko County, Nevada
Ketner, Keith B.
2007-01-01
The Gold Creek, Nev. area displays important stratigraphic and structural relationships between Paleozoic and early Tertiary sedimentary strata in an area dominated by large intrusive bodies of Mesozoic age and extensive volcanic fields of middle to late Tertiary age. An autochthonous sequence includes the Cambrian and Proterozoic(?) Prospect Mountain Quartzite and the overlying Cambrian and Ordovician Tennessee Mountain Formation. This autochthon is overlain by three allochthonous plates each composed of a distinctive sequence of strata and having a distinctive internal structure. The structurally lowest plate is composed of the Havallah sequence, locally of Mississippian and Pennsylvanian age, which is folded on north-south trending axes. The next higher plate is composed of somewhat younger Pennsylvanian and Permian strata cut by east-west trending low-angle faults. The highest plate is composed of early Tertiary non-marine sedimentary and igneous rocks folded on varied but mainly north-south trending axes. The question of whether the allochthonous plates were emplaced by contractional or extensional forces is indeterminate from the local evidence. Mineral deposits include gold placers of moderate size and small pockets of base metals, none of which is currently being exploited.
Localized structural frustration for evaluating the impact of sequence variants
Kumar, Sushant; Clarke, Declan; Gerstein, Mark
2016-01-01
Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype–genotype associations. Protein structures provide a way of addressing this challenge. Previous efforts have focused on globally quantifying the impact of SNVs on protein stability. However, local perturbations may severely impact protein functionality without strongly disrupting global stability (e.g. in relation to catalysis or allostery). Here, we describe a workflow in which localized frustration, quantifying unfavorable local interactions, is employed as a metric to investigate such effects. Using this workflow on the Protein Databank, we find that frustration produces many immediately intuitive results: for instance, disease-related SNVs create stronger changes in localized frustration than non-disease related variants, and rare SNVs tend to disrupt local interactions to a larger extent than common variants. Less obviously, we observe that somatic SNVs associated with oncogenes and tumor suppressor genes (TSGs) induce very different changes in frustration. In particular, those associated with TSGs change the frustration more in the core than the surface (by introducing loss-of-function events), whereas those associated with oncogenes manifest the opposite pattern, creating gain-of-function events. PMID:27915290
In vitro fluorescence studies of transcription factor IIB-DNA interaction.
Górecki, Andrzej; Figiel, Małgorzata; Dziedzicka-Wasylewska, Marta
2015-01-01
General transcription factor TFIIB is one of the basal constituents of the preinitiation complex of eukaryotic RNA polymerase II, acting as a bridge between the preinitiation complex and the polymerase, and binding promoter DNA in an asymmetric manner, thereby defining the direction of transcription. Methods of fluorescence spectroscopy together with circular dichroism spectroscopy were used to observe conformational changes in the structure of recombinant human TFIIB after binding to a specific DNA sequence. To facilitate the exploration of the structural changes, several site-directed mutations have been introduced altering the fluorescence properties of the protein. Our observations showed that binding of specific DNA sequences changed the protein structure and dynamics, and TFIIB may exist in two conformational states, which can be described by a different microenvironment of W52. Fluorescence studies using both intrinsic and exogenous fluorophores showed that these changes significantly depended on the recognition sequence and concerned various regions of the protein, including those interacting with other transcription factors and RNA polymerase II. DNA binding can cause rearrangements in regions of proteins interacting with the polymerase in a manner dependent on the recognized sequences, and can therefore influence gene expression.
Time-dependent local and average structural evolution of δ-phase 239Pu-Ga alloys
Smith, Alice I.; Page, Katharine L.; Siewenie, Joan E.; ...
2016-08-05
Plutonium metal is a very unusual element, exhibiting six allotropes at ambient pressure between room temperature and its melting point, a complicated phase diagram, and a complex electronic structure. Many phases of plutonium metal are unstable with changes in temperature, pressure, chemical additions, or time. This instability strongly affects structure and properties and becomes highly important, particularly when considering effects on structural integrity over long periods of time [1]. This paper presents a time-dependent neutron total scattering study of the local and average structure of naturally aging δ-phase 239Pu-Ga alloys, together with preliminary results on neutron tomography characterization.
de Borba, Luana; Villordo, Sergio M; Iglesias, Nestor G; Filomatori, Claudia V; Gebhard, Leopoldo G; Gamarnik, Andrea V
2015-03-01
The dengue virus genome is a dynamic molecule that adopts different conformations in the infected cell. Here, using RNA folding predictions, chemical probing analysis, RNA binding assays, and functional studies, we identified new cis-acting elements present in the capsid coding sequence that facilitate cyclization of the viral RNA by hybridization with a sequence involved in a local dumbbell structure at the viral 3' untranslated region (UTR). The identified interaction differentially enhances viral replication in mosquito and mammalian cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Fast online and index-based algorithms for approximate search of RNA sequence-structure patterns
2013-01-01
Background: It is well known that the search for homologous RNAs is more effective if both sequence and structure information is incorporated into the search. However, current tools for searching with RNA sequence-structure patterns cannot fully handle mutations occurring on both these levels or are simply not fast enough for searching large sequence databases because of the high computational costs of the underlying sequence-structure alignment problem. Results: We present new fast index-based and online algorithms for approximate matching of RNA sequence-structure patterns supporting a full set of edit operations on single bases and base pairs. Our methods efficiently compute semi-global alignments of structural RNA patterns and substrings of the target sequence whose costs satisfy a user-defined sequence-structure edit distance threshold. For this purpose, we introduce a new computing scheme to optimally reuse the entries of the required dynamic programming matrices for all substrings and combine it with a technique for avoiding the alignment computation of non-matching substrings. Our new index-based methods exploit suffix arrays preprocessed from the target database and achieve running times that are sublinear in the size of the searched sequences. To support the description of RNA molecules that fold into complex secondary structures with multiple ordered sequence-structure patterns, we use fast algorithms for the local or global chaining of approximate sequence-structure pattern matches. The chaining step removes spurious matches from the set of intermediate results, in particular of patterns with little specificity. In benchmark experiments on the Rfam database, our improved online algorithm is faster than the best previous method by up to a factor of 45. Our best new index-based algorithm achieves a speedup of a factor of 560. Conclusions: The presented methods achieve considerable speedups compared to the best previous method.
This, together with the expected sublinear running time of the presented index-based algorithms, allows for the first time approximate matching of RNA sequence-structure patterns in large sequence databases. Beyond the algorithmic contributions, we provide with RaligNAtor a robust and well documented open-source software package implementing the algorithms presented in this manuscript. The RaligNAtor software is available at http://www.zbh.uni-hamburg.de/ralignator. PMID:23865810
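The semi-global matching idea described above (align the whole pattern against any substring of the target, and report hits whose cost stays under a threshold) can be illustrated with a much simpler, sequence-only sketch. The code below is not the RaligNAtor algorithm: it ignores base-pair edit operations and uses no suffix array. It is the classic Sellers dynamic program with unit edit costs, and the function name `approx_matches` is a hypothetical one chosen for this illustration.

```python
def approx_matches(pattern, text, k):
    """Return 1-based end positions in `text` where some substring ending
    there matches `pattern` with unit-cost edit distance <= k.

    Sequence-only illustration (Sellers' algorithm); structure-aware
    base-pair operations are deliberately left out.
    """
    m = len(pattern)
    # col[i] = best edit distance between pattern[:i] and any substring
    # of the text ending at the current text position.
    col = list(range(m + 1))
    ends = []
    for j, c in enumerate(text, start=1):
        prev_diag = col[0]      # col[i-1] from the previous column
        col[0] = 0              # a match may start anywhere in the text
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == c else 1
            new = min(prev_diag + cost,   # match or substitution
                      col[i] + 1,         # extra character in the text
                      col[i - 1] + 1)     # pattern character left unmatched
            prev_diag = col[i]
            col[i] = new
        if col[m] <= k:
            ends.append(j)
    return ends


if __name__ == "__main__":
    print(approx_matches("GCAU", "AAGCAUAA", 1))
```

Only one column of the dynamic programming matrix is kept, so memory is linear in the pattern length; the running time remains proportional to pattern length times target length, which is the cost the index-based methods in the abstract aim to avoid.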
Characterization of the UGA-recoding and SECIS-binding activities of SECIS-binding protein 2.
Bubenik, Jodi L; Miniard, Angela C; Driscoll, Donna M
2014-01-01
Selenium, a micronutrient, is primarily incorporated into human physiology as selenocysteine (Sec). The 25 Sec-containing proteins in humans are known as selenoproteins. Their synthesis depends on the translational recoding of the UGA stop codon to allow Sec insertion. This requires a stem-loop structure in the 3' untranslated region of eukaryotic mRNAs known as the Selenocysteine Insertion Sequence (SECIS). The SECIS is recognized by SECIS-binding protein 2 (SBP2) and this RNA:protein interaction is essential for UGA recoding to occur. Genetic mutations cause SBP2 deficiency in humans, resulting in a broad set of symptoms due to differential effects on individual selenoproteins. Progress on understanding the different phenotypes requires developing robust tools to investigate SBP2 structure and function. In this study we demonstrate that SBP2 protein produced by in vitro translation discriminates among SECIS elements in a competitive UGA recoding assay and has a much higher specific activity than bacterially expressed protein. We also show that a purified recombinant protein encompassing amino acids 517-777 of SBP2 binds to SECIS elements with high affinity and selectivity. The affinity of the SBP2:SECIS interaction correlated with the ability of a SECIS to compete for UGA recoding activity in vitro. The identification of a 250 amino acid sequence that mediates specific, selective SECIS-binding will facilitate future structural studies of the SBP2:SECIS complex. Finally, we identify an evolutionarily conserved core cysteine signature in SBP2 sequences from the vertebrate lineage. Mutation of multiple, but not single, cysteines impaired SECIS-binding but did not affect protein localization in cells.
Ferrocene-oligonucleotide conjugates for electrochemical probing of DNA.
Ihara, T; Maruo, Y; Takenaka, S; Takagi, M
1996-01-01
Toward the development of a universal, sensitive and convenient method of DNA (or RNA) detection, electrochemically active oligonucleotides were prepared by covalent linkage of a ferrocenyl group to 5'-aminohexyl-terminated synthetic oligonucleotides. Using these electrochemically active probes, we have been able to demonstrate the detection of DNA and RNA at femtomole levels by HPLC equipped with an ordinary electrochemical detector (ECD) [Takenaka,S., Uto,Y., Kondo,H., Ihara,T. and Takagi,M. (1994) Anal. Biochem., 218, 436-443]. Thermodynamic and electrochemical studies of the interaction between the probes and the targets are presented here. The thermodynamic data revealed that the conjugation stabilizes the triple-helix complexes by 2-3 kcal mol-1 (an increase of 1-2 orders of magnitude in the binding constant) at 298 K, which corresponds to the effect of elongating the complex by several additional base triplets. The main cause of this thermodynamic stabilization by the conjugation is likely to be the overall conformational change of the whole structure of the conjugate rather than an additional local interaction. The redox potential of the probe was independent of the target structure, whether single- or double-stranded. However, the potential is slightly dependent (with a 10-30 mV negative shift on complexation) on the extra sequence in the target, probably because each sequence is capable of contacting or interacting with the ferrocenyl group in a slightly different way. This small potential shift itself, however, does not cause any inconvenience in practical applications when detecting the probes with ECD. These results lead to the conclusion that the redox-active probes are very useful for the microanalysis of nucleic acids due to the stability of the complexes, high detection sensitivity and wide applicability to the target structures (DNA and RNA; single and double strands) and the sequences. PMID:8932383
NASA Astrophysics Data System (ADS)
Snyder, J. E.; Harris, V. G.; Koon, N. C.; Sui, X.; Kryder, M. H.
1996-10-01
Anisotropic local structure has been observed around both the Fe and Ba ions in the amorphous precursor to Ba-hexaferrite thin films, using polarization-dependent extended x-ray-absorption fine structure. This anisotropic local structure, consisting mainly of a network of Fe-O octahedra, determines the orientation of the fast-growing basal planes during crystallization, and thus the directions of the c axes and the resulting magnetic anisotropy.
Power law tails in phylogenetic systems.
Qin, Chongli; Colwell, Lucy J
2018-01-23
Covariance analysis of protein sequence alignments uses coevolving pairs of sequence positions to predict features of protein structure and function. However, current methods ignore the phylogenetic relationships between sequences, potentially corrupting the identification of covarying positions. Here, we use random matrix theory to demonstrate the existence of a power law tail that distinguishes the spectrum of covariance caused by phylogeny from that caused by structural interactions. The power law is essentially independent of the phylogenetic tree topology, depending on just two parameters: the sequence length and the average branch length. We demonstrate that these power law tails are ubiquitous in the large protein sequence alignments used to predict contacts in 3D structure, as predicted by our theory. This suggests that to decouple phylogenetic effects from the interactions between sequence-distal sites that control biological function, it is necessary to remove or down-weight the eigenvectors of the covariance matrix with largest eigenvalues. We confirm that truncating these eigenvectors improves contact prediction.
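As a rough illustration of what "removing the eigenvectors with largest eigenvalues" means for a covariance matrix, the sketch below deflates the leading eigen-components of a small symmetric matrix using plain power iteration. It is a toy under stated assumptions, not the authors' pipeline: the function names are hypothetical, the matrix is assumed symmetric positive semidefinite and non-zero, and a real analysis would use a numerical linear algebra library rather than pure Python.

```python
import math


def top_eigenpair(matrix, iterations=500):
    """Dominant eigenvalue and unit eigenvector of a symmetric PSD matrix
    (list of lists), found by power iteration."""
    n = len(matrix)
    # Non-uniform start vector; the method only fails if this vector is
    # exactly orthogonal to the dominant eigenvector.
    v = [1.0 + i / n for i in range(n)]
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    mv = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
    eigenvalue = sum(v[i] * mv[i] for i in range(n))  # Rayleigh quotient
    return eigenvalue, v


def remove_top_modes(matrix, n_modes=1):
    """Return a copy of `matrix` with its n_modes largest eigen-components
    subtracted (C - sum of lambda * v * v^T)."""
    n = len(matrix)
    out = [row[:] for row in matrix]
    for _ in range(n_modes):
        lam, v = top_eigenpair(out)
        for i in range(n):
            for j in range(n):
                out[i][j] -= lam * v[i] * v[j]
    return out


if __name__ == "__main__":
    cov = [[2.0, 1.0], [1.0, 2.0]]
    print(remove_top_modes(cov, n_modes=1))
```

How many modes to remove, or how to down-weight them instead of deleting them, is a modelling choice that the abstract does not specify.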
Dynamic effects of memory in a cobweb model with competing technologies
NASA Astrophysics Data System (ADS)
Agliari, Anna; Naimzada, Ahmad; Pecora, Nicolò
2017-02-01
We analyze a simple model based on the cobweb demand-supply framework with costly innovators and free imitators and study the endogenous dynamics of price and firms' fractions in a homogeneous good market. The evolutionary selection between technologies depends on a performance measure in which a memory parameter is introduced. The resulting dynamics is then described by a two-dimensional map. In addition to the locally stabilizing effect due to the presence of memory, we show the existence of a double stability threshold, which entails different dynamic scenarios when the memory parameter takes extreme values (i.e. when consideration of the last profit realization prevails or is neglected too much). The possibility of different coexisting attractors, as well as the structure of the basins of attraction that characterizes the path dependence property of the model with memory, is shown. In particular, through global analysis we also illustrate particular bifurcation sequences that may increase the complexity of the related basins of attraction.
Martin, Juliette; Regad, Leslie; Etchebest, Catherine; Camproux, Anne-Claude
2008-11-15
Interresidue contacts in protein structures and at protein-protein interfaces are classically described by the amino acid types of the interacting residues, and the local structural context of the contact, if any, is described using secondary structures. In this study, we present an alternate analysis of interresidue contacts using local structures defined by the structural alphabet introduced by Camproux et al. This structural alphabet allows a 3D structure to be described as a sequence of prototype fragments, called structural letters, of 27 different types. Each residue can then be assigned to a particular local structure, even in loop regions. The analysis of interresidue contacts within protein structures defined using Voronoï tessellations reveals that pairwise contact specificity is greater in terms of structural letters than amino acids. Using a simple heuristic based on specificity score comparison, we find that 74% of the long-range contacts within protein structures are better described using structural letters than amino acid types. The investigation is extended to a set of protein-protein complexes, showing that similar global rules apply as for intraprotein contacts, with 64% of the interprotein contacts best described by local structures. We then present an evaluation of pairing functions integrating structural letters into decoy scoring and show that some complexes could benefit from the use of structural letter-based pairing functions.
Polytypism in the ground state structure of the Lennard-Jonesium.
Pártay, Lívia B; Ortner, Christoph; Bartók, Albert P; Pickard, Chris J; Csányi, Gábor
2017-07-26
We present a systematic study of the stability of nineteen different periodic structures using the finite range Lennard-Jones potential model discussing the effects of pressure, potential truncation, cutoff distance and Lennard-Jones exponents. The structures considered are the hexagonal close packed (hcp), face centred cubic (fcc) and seventeen other polytype stacking sequences, such as dhcp and 9R. We found that at certain pressure and cutoff distance values, neither fcc nor hcp is the ground state structure as previously documented, but different polytypic sequences. This behaviour shows a strong dependence on the way the tail of the potential is truncated.
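Because the result above hinges on how the tail of the potential is truncated, a short sketch of two common truncation schemes may help. It uses the standard 12-6 Lennard-Jones form in reduced units; the function names and the default parameters are assumptions made for this illustration, and the generalized exponents studied in the paper are not covered.

```python
def lennard_jones(r, epsilon=1.0, sigma=1.0):
    """Standard 12-6 Lennard-Jones pair energy at separation r."""
    sr6 = (sigma / r) ** 6
    return 4.0 * epsilon * (sr6 * sr6 - sr6)


def lj_truncated(r, r_cut, epsilon=1.0, sigma=1.0):
    """Plain truncation: the energy jumps to zero at r_cut."""
    return lennard_jones(r, epsilon, sigma) if r < r_cut else 0.0


def lj_truncated_shifted(r, r_cut, epsilon=1.0, sigma=1.0):
    """Truncated and shifted: the energy goes continuously to zero at r_cut."""
    if r >= r_cut:
        return 0.0
    return lennard_jones(r, epsilon, sigma) - lennard_jones(r_cut, epsilon, sigma)


if __name__ == "__main__":
    r_min = 2 ** (1 / 6)  # location of the minimum for sigma = 1
    for name, f in [("truncated", lj_truncated), ("shifted", lj_truncated_shifted)]:
        print(name, f(r_min, 2.5), f(2.4, 2.5))
```

The two schemes assign different energies to every neighbour shell inside the cutoff, which is one way small energy differences between close-packed stacking sequences can change sign.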
Learning Temporal Statistics for Sensory Predictions in Aging.
Luft, Caroline Di Bernardi; Baker, Rosalind; Goldstone, Aimee; Zhang, Yang; Kourtzi, Zoe
2016-03-01
Predicting future events based on previous knowledge about the environment is critical for successful everyday interactions. Here, we ask which brain regions support our ability to predict the future based on implicit knowledge about the past in young and older age. Combining behavioral and fMRI measurements, we test whether training on structured temporal sequences improves the ability to predict upcoming sensory events; we then compare brain regions involved in learning predictive structures between young and older adults. Our behavioral results demonstrate that exposure to temporal sequences without feedback facilitates the ability of young and older adults to predict the orientation of an upcoming stimulus. Our fMRI results provide evidence for the involvement of corticostriatal regions in learning predictive structures in both young and older learners. In particular, we showed learning-dependent fMRI responses for structured sequences in frontoparietal regions and the striatum (putamen) for young adults. However, for older adults, learning-dependent activations were observed mainly in subcortical (putamen, thalamus) regions but were weaker in frontoparietal regions. Significant correlations of learning-dependent behavioral and fMRI changes in these regions suggest a strong link between brain activations and behavioral improvement rather than general overactivation. Thus, our findings suggest that predicting future events based on knowledge of temporal statistics engages brain regions involved in implicit learning in both young and older adults.
Dover, James H.; Tailleur, Irvin L.; Dumoulin, Julie A.
2004-01-01
The map depicts the field distribution and contact relations between stratigraphic units, the tectonic relations between major stratigraphic sequences, and the detailed internal structure of these sequences. The stratigraphic sequences formed in a variety of continental margin depositional environments, and subsequently underwent a complex deformational history of imbricate thrust faulting and folding. A compilation of micro and macro fossil identifications is included in this data set.
The problem and promise of scale dependency in community phylogenetics.
Swenson, Nathan G; Enquist, Brian J; Pither, Jason; Thompson, Jill; Zimmerman, Jess K
2006-10-01
The problem of scale dependency is widespread in investigations of ecological communities. Null model investigations of community assembly exemplify the challenges involved because they typically include subjectively defined "regional species pools." The burgeoning field of community phylogenetics appears poised to face similar challenges. Our objective is to quantify the scope of the problem of scale dependency by comparing the phylogenetic structure of assemblages across contrasting geographic and taxonomic scales. We conduct phylogenetic analyses on communities within three tropical forests, and perform a sensitivity analysis with respect to two scaleable inputs: taxonomy and species pool size. We show that (1) estimates of phylogenetic overdispersion within local assemblages depend strongly on the taxonomic makeup of the local assemblage and (2) comparing the phylogenetic structure of a local assemblage to a species pool drawn from increasingly larger geographic scales results in an increased signal of phylogenetic clustering. We argue that, rather than posing a problem, "scale sensitivities" are likely to reveal general patterns of diversity that could help identify critical scales at which local or regional influences gain primacy for the structuring of communities. In this way, community phylogenetics promises to fill an important gap in community ecology and biogeography research.
iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.
Chen, Wei; Feng, Peng-Mian; Lin, Hao; Chou, Kuo-Chen
2014-01-01
In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called "iSS-PseDNC" was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called "pseudo dinucleotide composition" (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
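The starting point for a feature vector of this kind is the plain dinucleotide composition, that is, the normalized frequency of each of the 16 dinucleotides in a sequence window. The sketch below computes only that part. The pseudo components that fold in the six DNA local structural properties are not reproduced here, because their numerical values and weighting are not given in the abstract; the function name is a hypothetical one.

```python
from itertools import product


def dinucleotide_composition(seq):
    """Return the 16 dinucleotide frequencies of `seq`, ordered AA, AC, ..., TT.

    Windows containing characters other than A, C, G, T (for example N)
    are skipped.
    """
    seq = seq.upper()
    pairs = ["".join(p) for p in product("ACGT", repeat=2)]
    counts = dict.fromkeys(pairs, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(pairs)
    return [counts[p] / total for p in pairs]


if __name__ == "__main__":
    print(dinucleotide_composition("GTAAGTACGT"))
```

A vector like this, computed on a fixed window around each candidate donor or acceptor site, could then be passed to any standard classifier.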
COMPUTER SIMULATION STUDY OF AMYLOID FIBRIL FORMATION BY PALINDROMIC SEQUENCES IN PRION PEPTIDES
Wagoner, Victoria; Cheon, Mookyung; Chang, Iksoo; Hall, Carol
2011-01-01
We simulate the aggregation of large systems containing palindromic peptides from the Syrian hamster prion protein SHaPrP 113–120 (AGAAAAGA) and the mouse prion protein MoPrP 111–120 (VAGAAAAGAV) and eight sequence variations: GAAAAAAG, (AG)4, A8, GAAAGAAA, A10, V10, GAVAAAAVAG, and VAVAAAAVAV. The first two peptides are thought to act as the Velcro that holds the parent prion proteins together in amyloid structures and can form fibrils themselves. Kinetic events along the fibrillization pathway influence the types of structures that occur and variations in the sequence affect aggregation kinetics and fibrillar structure. Discontinuous molecular dynamics simulations using the PRIME20 force field are performed on systems containing 48 peptides starting from a random coil configuration. Depending on the sequence, fibrillar structures form spontaneously over a range of temperatures, below which amorphous aggregates form and above which no aggregation occurs. AGAAAAGA forms well organized fibrillar structures whereas VAGAAAAGAV forms less well organized structures that are partially fibrillar and partially amorphous. The degree of order in the fibrillar structure stems in part from the types of kinetic events leading up to its formation, with AGAAAAGA forming less amorphous structures early in the simulation than VAGAAAAGAV. The ability to form fibrils increases as the chain length and the length of the stretch of hydrophobic residues increase. However as the hydrophobicity of the sequence increases, the ability to form well-ordered structures decreases. Thus, longer hydrophobic sequences form slightly disordered aggregates that are partially fibrillar and partially amorphous. Subtle changes in sequence result in slightly different fibril structures. PMID:21557317
Structural basis of toxicity and immunity in contact-dependent growth inhibition (CDI) systems.
Morse, Robert P; Nikolakakis, Kiel C; Willett, Julia L E; Gerrick, Elias; Low, David A; Hayes, Christopher S; Goulding, Celia W
2012-12-26
Contact-dependent growth inhibition (CDI) systems encode polymorphic toxin/immunity proteins that mediate competition between neighboring bacterial cells. We present crystal structures of CDI toxin/immunity complexes from Escherichia coli EC869 and Burkholderia pseudomallei 1026b. Despite sharing little sequence identity, the toxin domains are structurally similar and have homology to endonucleases. The EC869 toxin is a Zn(2+)-dependent DNase capable of completely degrading the genomes of target cells, whereas the Bp1026b toxin cleaves the aminoacyl acceptor stems of tRNA molecules. Each immunity protein binds and inactivates its cognate toxin in a unique manner. The EC869 toxin/immunity complex is stabilized through an unusual β-augmentation interaction. In contrast, the Bp1026b immunity protein exploits shape and charge complementarity to occlude the toxin active site. These structures represent the initial glimpse into the CDI toxin/immunity network, illustrating how sequence-diverse toxins adopt convergent folds yet retain distinct binding interactions with cognate immunity proteins. Moreover, we present visual demonstration of CDI toxin delivery into a target cell.
Yamashita, Yuichi; Tani, Jun
2008-01-01
It is generally thought that skilled behavior in human beings results from a functional hierarchy of the motor control system, within which reusable motor primitives are flexibly integrated into various sensori-motor sequence patterns. The underlying neural mechanisms governing the way in which continuous sensori-motor flows are segmented into primitives and the way in which series of primitives are integrated into various behavior sequences have, however, not yet been clarified. In earlier studies, this functional hierarchy has been realized through the use of explicit hierarchical structure, with local modules representing motor primitives in the lower level and a higher module representing sequences of primitives switched via additional mechanisms such as gate-selecting. When sequences contain similarities and overlap, however, a conflict arises in such earlier models between generalization and segmentation, induced by this separated modular structure. To address this issue, we propose a different type of neural network model. The current model neither makes use of separate local modules to represent primitives nor introduces explicit hierarchical structure. Rather than forcing architectural hierarchy onto the system, functional hierarchy emerges through a form of self-organization that is based on two distinct types of neurons, each with different time properties (“multiple timescales”). Through the introduction of multiple timescales, continuous sequences of behavior are segmented into reusable primitives, and the primitives, in turn, are flexibly integrated into novel sequences. In experiments, the proposed network model, coordinating the physical body of a humanoid robot through high-dimensional sensori-motor control, also successfully situated itself within a physical environment. 
Our results suggest that it is not only the spatial connections between neurons but also the timescales of neural activity that act as important mechanisms leading to functional hierarchy in neural systems. PMID:18989398
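The role of different time properties can be illustrated with a single leaky-integrator unit driven by the same input at two time constants. This is only a sketch: the abstract does not state the network equations, so the update rule, the function name `leaky_response`, and the values of the time constant `tau` below are all assumptions made for illustration.

```python
def leaky_response(inputs, tau, u0=0.0):
    """Internal state of one leaky-integrator unit over time.

    Assumed update rule: u <- (1 - 1/tau) * u + (1/tau) * x.
    A small tau follows the input quickly; a large tau changes slowly.
    """
    u = u0
    trace = []
    for x in inputs:
        u = (1.0 - 1.0 / tau) * u + (1.0 / tau) * x
        trace.append(u)
    return trace


if __name__ == "__main__":
    step = [1.0] * 10                       # constant input switched on at t = 0
    fast = leaky_response(step, tau=2.0)    # hypothetical fast unit
    slow = leaky_response(step, tau=50.0)   # hypothetical slow unit
    for t, (f, s) in enumerate(zip(fast, slow), start=1):
        print(t, round(f, 4), round(s, 4))
```

In this toy setting the fast unit tracks moment-to-moment changes while the slow unit integrates over many steps, which is the kind of separation that could let slow units encode the order of primitives carried by fast units.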
Maximum-Likelihood Detection Of Noncoherent CPM
NASA Technical Reports Server (NTRS)
Divsalar, Dariush; Simon, Marvin K.
1993-01-01
Simplified detectors proposed for use in maximum-likelihood-sequence detection of symbols in alphabet of size M transmitted by uncoded, full-response continuous phase modulation over radio channel with additive white Gaussian noise. Structures of receivers derived from particular interpretation of maximum-likelihood metrics. Receivers include front ends, structures of which depend only on M, analogous to those in receivers of coherent CPM. Parts of receivers following front ends have structures, complexity of which would depend on N.
Learning multimodal dictionaries.
Monaci, Gianluca; Jost, Philippe; Vandergheynst, Pierre; Mailhé, Boris; Lesage, Sylvain; Gribonval, Rémi
2007-09-01
Real-world phenomena involve complex interactions between multiple signal modalities. As a consequence, humans are used to integrating, at each instant, perceptions from all their senses in order to enrich their understanding of the surrounding world. This paradigm can also be extremely useful in many signal processing and computer vision problems involving mutually related signals. The simultaneous processing of multimodal data can, in fact, reveal information that is otherwise hidden when considering the signals independently. However, in natural multimodal signals, the statistical dependencies between modalities are in general not obvious. Learning fundamental multimodal patterns could offer deep insight into the structure of such signals. In this paper, we present a novel model of multimodal signals based on their sparse decomposition over a dictionary of multimodal structures. An algorithm for iteratively learning multimodal generating functions that can be shifted to all positions in the signal is also proposed. The learning is defined in such a way that it can be accomplished by iteratively solving a generalized eigenvector problem, which makes the algorithm fast, flexible, and free of user-defined parameters. The proposed algorithm is applied to audiovisual sequences and is able to discover underlying structures in the data. The detection of such audio-video patterns in audiovisual clips makes it possible to effectively localize the sound source on the video in the presence of substantial acoustic and visual distractors, outperforming state-of-the-art audiovisual localization algorithms.
Jenjaroenpun, Piroon; Chew, Chee Siang; Yong, Tai Pang; Choowongkomon, Kiattawee; Thammasorn, Wimada; Kuznetsov, Vladimir A
2015-01-01
A triplex target DNA site (TTS), a stretch of DNA that is composed of polypurines, is able to form a triple-helix (triplex) structure with triplex-forming oligonucleotides (TFOs) and is able to influence the site-specific modulation of gene expression and/or the modification of genomic DNA. The co-localization of a genomic TTS with gene regulatory signals and functional genome structures suggests that TFOs could potentially be exploited in antigene strategies for the therapy of cancers and other genetic diseases. Here, we present the TTS Mapping and Integration (TTSMI; http://ttsmi.bii.a-star.edu.sg) database, which provides a catalog of unique TTS locations in the human genome and tools for analyzing the co-localization of TTSs with genomic regulatory sequences and signals that were identified using next-generation sequencing techniques and/or predicted by computational models. TTSMI was designed as a user-friendly tool that facilitates (i) fast searching/filtering of TTSs using several search terms and criteria associated with sequence stability and specificity, (ii) interactive filtering of TTSs that co-localize with gene regulatory signals and non-B DNA structures, (iii) exploration of dynamic combinations of the biological signals of specific TTSs and (iv) visualization of a TTS simultaneously with diverse annotation tracks via the UCSC genome browser. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
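Since a TTS is described above as a stretch of polypurines, the most basic scanning step can be sketched as a search for uninterrupted runs of A and G on one strand. This is a simplified illustration and not how the TTSMI database was built: the minimum length is an assumed parameter, interruptions and stability or specificity scoring are ignored, and the function name is hypothetical.

```python
import re


def find_polypurine_runs(seq, min_len=15):
    """Return (start, end, run) for each uninterrupted A/G run of at least
    `min_len` bases on the given strand; start is 0-based, end is exclusive."""
    pattern = re.compile(r"[AG]{%d,}" % min_len)
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(seq.upper())]


if __name__ == "__main__":
    example = "TTAGGAAGGAGATTCAGT"   # made-up sequence for demonstration
    print(find_polypurine_runs(example, min_len=8))
```

To cover double-stranded DNA, the same scan would also be run on the reverse complement, where a polypyrimidine run on the given strand appears as a polypurine run.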
Normanno, Davide; Vanzi, Francesco; Pavone, Francesco Saverio
2008-01-01
Gene expression regulation is a fundamental biological process which deploys specific sets of genomic information depending on physiological or environmental conditions. Several transcription factors (including lac repressor, LacI) are present in the cell at very low copy number and increase their local concentration by binding to multiple sites on DNA and looping the intervening sequence. In this work, we employ single-molecule manipulation to experimentally address the role of DNA supercoiling in the dynamics and stability of LacI-mediated DNA looping. We performed measurements over a range of degrees of supercoiling between −0.026 and +0.026, in the absence of axial stretching forces. A supercoiling-dependent modulation of the lifetimes of both the looped and unlooped states was observed. Our experiments also provide evidence for multiple structural conformations of the LacI–DNA complex, depending on torsional constraints. The supercoiling-dependent modulation demonstrated here adds an important element to the model of the lac operon. In fact, the complex network of proteins acting on the DNA in a living cell constantly modifies its topological and mechanical properties: our observations demonstrate the possibility of establishing a signaling pathway from factors affecting DNA supercoiling to transcription factors responsible for the regulation of specific sets of genes. PMID:18310101
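The degree of supercoiling quoted above is conventionally expressed as the supercoiling density, sigma = (Lk - Lk0) / Lk0, where Lk0 is the linking number of the relaxed molecule. The sketch below converts between imposed turns and sigma. The helical repeat of 10.5 bp per turn is the usual textbook value for B-DNA, and the 10,500 bp tether length is a made-up example; neither is taken from the abstract.

```python
def relaxed_linking_number(n_bp, bp_per_turn=10.5):
    """Lk0: number of helical turns in a torsionally relaxed molecule."""
    return n_bp / bp_per_turn


def supercoiling_density(delta_lk, n_bp, bp_per_turn=10.5):
    """sigma = delta_Lk / Lk0 for a molecule of n_bp base pairs."""
    return delta_lk / relaxed_linking_number(n_bp, bp_per_turn)


def turns_for_density(sigma, n_bp, bp_per_turn=10.5):
    """Number of turns to add (or remove, if negative) to reach sigma."""
    return sigma * relaxed_linking_number(n_bp, bp_per_turn)


if __name__ == "__main__":
    n_bp = 10500  # hypothetical tether length
    for sigma in (-0.026, 0.0, 0.026):
        print(sigma, turns_for_density(sigma, n_bp))
```

For this hypothetical tether, the range of sigma explored in the experiments corresponds to a few tens of turns in either direction.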
Luckow, H.G.; Pavlis, T.L.; Serpa, L.F.; Guest, B.; Wagner, D.L.; Snee, L.; Hensley, T.M.; Korjenkov, A.
2005-01-01
New 1:24,000 scale mapping, geochemical analyses of volcanic rocks, and Ar/Ar and tephrochronology analyses of the Wingate Wash, northern Owlshead Mountain and Southern Panamint Mountain region document a complex structural history constrained by syntectonic volcanism and sedimentation. In this study, the region is divided into five structural domains with distinct, but related, histories: (1) The southern Panamint domain is a structurally intact, gently south-tilted block dominated by a middle Miocene volcanic center recognized as localized hypabyssal intrusives surrounded by proximal facies pyroclastic rocks. This Miocene volcanic sequence is an unusual alkaline volcanic assemblage ranging from trachybasalt to rhyolite, but dominated by trachyandesite. The volcanic rocks are overlain in the southwestern Panamint Mountains by a younger (Late Miocene?) fanglomerate sequence. (2) An upper Wingate Wash domain is characterized by large areas of Quaternary cover and complex overprinting of older structure by Quaternary deformation. Quaternary structures record ~N-S shortening concurrent with ~E-W extension accommodated by systems of strike-slip and thrust faults. (3) A central Wingate Wash domain contains a complex structural history that is closely tied to the stratigraphic evolution. In this domain, a middle Miocene volcanic package contains two distinct assemblages; a lower sequence dominated by alkaline pyroclastic rocks similar to the southern Panamint sequence and an upper basaltic sequence of alkaline basalt and basanites. This volcanic sequence is in turn overlain by a coarse clastic sedimentary sequence that records the unroofing of adjacent ranges and development of ~N-S trending, west-tilted fault blocks. We refer to this sedimentary sequence as the Lost Lake assemblage.
(4) The lower Wingate Wash/northern Owlshead domain is characterized by a gently north-dipping stratigraphic sequence with an irregular unconformity at the base developed on granitic basement. The unconformity is locally overlain by channelized deposits of older Tertiary(?) red conglomerate, some of which predate the onset of extensive volcanism, but in most of the area is overlain by a moderately thick package of Middle Miocene trachybasalt, trachyandesitic, ash flows, lithic tuff, basaltic cinder, basanites, and dacitic pyroclastic, debris, and lahar flows with localized exposures of sedimentary rocks. The upper part of the Miocene stratigraphic sequence in this domain is composed of coarse-grained clastic sediments that are apparently middle Miocene based on Ar/Ar dating of interbedded volcanic rocks. This sedimentary sequence, however, is lithologically indistinguishable from the structurally adjacent Late Miocene Lost Lake assemblage and a stratigraphically overlying Plio-Pleistocene alluvial fan, a relationship that handicaps tracing structures through this domain. This domain is also structurally complex and deformed by a series of northwest-southeast-striking, east-dipping, high-angle oblique, sinistral, normal faults that are cut by left-lateral strike-slip faults. The contact between the southern Panamint domain and the adjacent domains is a complex fault system that we interpret as a zone of Late Miocene distributed sinistral slip that is variably overprinted in different portions of the mapped area. The net sinistral slip across the Wingate Wash fault system is estimated at 7-9 km, based on offset of Proterozoic Crystal Springs Formation beneath the middle Miocene unconformity to as much as 15 km based on offset volcanic facies in Middle Miocene rocks.
To the south of Wingate Wash, the northern Owlshead Mountains are also cut by a sinistral, northwest-dipping, oblique normal fault, (referred to as the Filtonny Fault) with significant slip that separates the Lower Wingate Wash and central Owlshead domains. The Filtonny Fault may represent a young conjugate fault to the dextral Southern Death Valley fault system and may be the northwest
Ozmutlu, H. Cenk
2014-01-01
We developed mixed integer programming (MIP) models and hybrid genetic-local search algorithms for the scheduling problem of unrelated parallel machines with job sequence- and machine-dependent setup times and with the job splitting property. The first contribution of this paper is to introduce novel algorithms that perform splitting and scheduling simultaneously with a variable number of subjobs. We propose a simple chromosome structure constituted by random key numbers in a hybrid genetic-local search algorithm (GAspLA). Random key numbers are used frequently in genetic algorithms, but they create additional difficulty when hybrid factors in local search are implemented. We developed algorithms that adapt the results of local search into the genetic algorithm with a minimum number of relocation operations on the genes' random key numbers. This is the second contribution of the paper. The third contribution is three new MIP models that perform splitting and scheduling simultaneously. The fourth contribution is the implementation of GAspLAMIP, which let us verify the optimality of GAspLA for the studied combinations. The proposed methods are tested on a set of problems taken from the literature, and the results validate the effectiveness of the proposed algorithms. PMID:24977204
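As a rough illustration of the random-key encoding mentioned in this abstract, the sketch below decodes a chromosome of random keys into per-machine job sequences. The decoding rule (integer part selects the machine, fractional part orders the jobs) is a common random-key convention assumed here for illustration; it is not taken from the paper, and the function name `decode_random_keys` is hypothetical. Job splitting and setup times are not modeled.

```python
from collections import defaultdict

def decode_random_keys(keys):
    """Decode one random key per job into per-machine job sequences.

    Assumed convention (not from the paper): key = machine index + fraction,
    and jobs on the same machine are ordered by the fractional part.
    """
    per_machine = defaultdict(list)
    for job, key in enumerate(keys):
        machine = int(key)                       # integer part: machine index
        per_machine[machine].append((key - machine, job))
    # Sort each machine's jobs by the fractional part of their keys.
    return {m: [job for _, job in sorted(jobs)]
            for m, jobs in sorted(per_machine.items())}

# Five jobs on two machines (keys are made-up values).
keys = [0.71, 1.20, 0.15, 1.05, 0.40]
print(decode_random_keys(keys))
```

Because any vector of real-valued keys decodes to a feasible assignment, crossover and mutation never need repair; the difficulty the abstract alludes to arises when a local-search move changes the schedule and the keys must be rewritten to match.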
Yoo, Soonmoon; Kim, Hak H; Kim, Paul; Donnelly, Christopher J; Kalinski, Ashley L; Vuppalanchi, Deepika; Park, Michael; Lee, Seung J; Merianda, Tanuja T; Perrone-Bizzozero, Nora I; Twiss, Jeffery L
2013-09-01
Localized translation of axonal mRNAs contributes to developmental and regenerative axon growth. Although untranslated regions (UTRs) of many different axonal mRNAs appear to drive their localization, there has been no consensus RNA structure responsible for this localization. We recently showed that limited expression of ZBP1 protein restricts axonal localization of both β-actin and GAP-43 mRNAs. β-actin 3'UTR has a defined element for interaction with ZBP1, but GAP-43 mRNA shows no homology to this RNA sequence. Here, we show that an AU-rich regulatory element (ARE) in GAP-43's 3'UTR is necessary and sufficient for its axonal localization. Axonal GAP-43 mRNA levels increase after in vivo injury, and GAP-43 mRNA shows an increased half-life in regenerating axons. GAP-43 mRNA interacts with both HuD and ZBP1, and HuD and ZBP1 co-immunoprecipitate in an RNA-dependent fashion. Reporter mRNA with the GAP-43 ARE competes with endogenous β-actin mRNA for axonal localization and decreases axon length and branching similar to the β-actin 3'UTR competing with endogenous GAP-43 mRNA. Conversely, over-expressing GAP-43 coding sequence with its 3'UTR ARE increases axonal elongation and this effect is lost when just the ARE is deleted from GAP-43's 3'UTR. We have recently found that over-expression of GAP-43 using an axonally targeted construct with the 3'UTRs of GAP-43 promoted elongating growth of axons, while restricting the mRNA to the cell body with the 3'UTR of γ-actin had minimal effect on axon length. In this study, we show that the ARE in GAP-43's 3'UTR is responsible for localization of GAP-43 mRNA into axons and is sufficient for GAP-43 protein's role in elongating axonal growth. © 2013 International Society for Neurochemistry.
Yu, Ning; Wei, Yu-Long; Zhang, Xin; Zhu, Ning; Wang, Yan-Li; Zhu, Yue; Zhang, Hai-Ping; Li, Fen-Mei; Yang, Lan; Sun, Jia-Qi; Sun, Ai-Dong
2017-07-11
Trachelospermum jasminoides is commonly used in traditional Chinese medicine. However, the use of the plant's local alternatives is frequent, causing potential clinical problems. The T. jasminoides sold in the medicine market is commonly dried and sliced, making traditional identification methods difficult. In this study, the ITS2 region was evaluated on 127 sequences representing T. jasminoides and its local alternatives according to PCR and sequencing rates, intra- and inter-specific divergences, secondary structure, and discrimination capacity. Results indicated 100% success rates of PCR and sequencing and the obvious presence of a barcoding gap. Results of BLAST 1, nearest distance and neighbor-joining tree methods showed that barcode ITS2 could successfully identify all the tested samples. The secondary structures of the ITS2 region provided another dimension for species identification. Two-dimensional images were obtained for better and easier identification. Previous studies on DNA barcoding concentrated more on the same family, genus, or species. However, an ideal barcode should be variable enough to identify closely related species. Meanwhile, the barcodes should also be conservative in identifying distantly related species. This study highlights the application of barcode ITS2 in solving practical problems in the distantly related local alternatives of medicinal plants.
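The notion of a barcoding gap used in this abstract can be sketched as follows: the smallest inter-specific distance should exceed the largest intra-specific distance. The example uses a plain p-distance on pre-aligned, equal-length toy sequences; this is a simplifying assumption (barcoding studies often use model-based distances), and the sequences and function names are invented for illustration.

```python
from itertools import combinations

def p_distance(a, b):
    """Fraction of aligned positions that differ (sequences must be equal length)."""
    return sum(x != y for x, y in zip(a, b)) / len(a)

def barcoding_gap(records):
    """Return min(inter-specific distance) - max(intra-specific distance).

    `records` is a list of (species_label, aligned_sequence) pairs.
    A positive value indicates a barcoding gap.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    return min(inter) - max(intra)

# Toy, made-up aligned fragments (not real ITS2 data).
records = [
    ("T. jasminoides", "ACGTACGTAC"),
    ("T. jasminoides", "ACGTACGTAT"),
    ("alternative",    "ACCTTCGAAC"),
    ("alternative",    "ACCTTCGAAC"),
]
print(f"barcoding gap: {barcoding_gap(records):.2f}")
```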
Sánchez-Quitian, Zilpa A; Schneider, Cristopher Z; Ducati, Rodrigo G; de Azevedo, Walter F; Bloch, Carlos; Basso, Luiz A; Santos, Diógenes S
2010-03-01
The emergence of drug-resistant strains of Mycobacterium tuberculosis, the causative agent of tuberculosis, has exacerbated the treatment and control of this disease. Cytidine deaminase (CDA) is a pyrimidine salvage pathway enzyme that recycles cytidine and 2'-deoxycytidine for uridine and 2'-deoxyuridine synthesis, respectively. A probable M. tuberculosis CDA-coding sequence (cdd, Rv3315c) was cloned, sequenced, expressed in Escherichia coli BL21(DE3), and purified to homogeneity. Mass spectrometry, N-terminal amino acid sequencing, gel filtration chromatography, and metal analysis of M. tuberculosis CDA (MtCDA) were carried out. These results and multiple sequence alignment demonstrate that MtCDA is a homotetrameric Zn(2+)-dependent metalloenzyme. Steady-state kinetic measurements yielded the following parameters: K(m)=1004 microM and k(cat)=4.8s(-1) for cytidine, and K(m)=1059 microM and k(cat)=3.5s(-1) for 2'-deoxycytidine. The pH dependence of k(cat) and k(cat)/K(M) for cytidine indicate that protonation of a single ionizable group with apparent pK(a) value of 4.3 abolishes activity, and protonation of a group with pK(a) value of 4.7 reduces binding. MtCDA was crystallized and crystal diffracted at 2.0 A resolution. Analysis of the crystallographic structure indicated the presence of a Zn(2+) coordinated by three conserved cysteines and the structure exhibits the canonical cytidine deaminase fold. (c) 2009 Elsevier Inc. All rights reserved.
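The steady-state parameters reported above can be turned into catalytic efficiencies (k_cat/K_m) with a one-line calculation. The sketch below only does the unit conversion from microM to M and the division; the function name is hypothetical, and the input values are the ones quoted in the abstract.

```python
def catalytic_efficiency(kcat_per_s, km_micromolar):
    """Return k_cat / K_m in M^-1 s^-1, converting K_m from microM to M."""
    return kcat_per_s / (km_micromolar * 1e-6)

# Values as quoted in the abstract for MtCDA.
substrates = [("cytidine", 4.8, 1004), ("2'-deoxycytidine", 3.5, 1059)]
for name, kcat, km in substrates:
    print(f"{name}: k_cat/K_m = {catalytic_efficiency(kcat, km):.0f} M^-1 s^-1")
```

On these numbers, cytidine comes out as the somewhat better substrate, with an efficiency of roughly 4.8 x 10^3 M^-1 s^-1 against roughly 3.3 x 10^3 M^-1 s^-1 for 2'-deoxycytidine.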
Menzies, Georgina E.; Reed, Simon H.; Brancale, Andrea; Lewis, Paul D.
2015-01-01
The mutational pattern for the TP53 tumour suppressor gene in lung tumours differs from that of other cancer types by having a higher frequency of G:C>T:A transversions. The aetiology of this differing mutation pattern is still unknown. Benzo[a]pyrene diol epoxide (BPDE) is a potent cigarette smoke carcinogen that forms guanine adducts at TP53 CpG mutation hotspot sites including codons 157, 158, 245, 248 and 273. We performed molecular modelling of BPDE-adducted TP53 duplex sequences to determine the degree of local distortion caused by adducts, which could influence the efficiency of nucleotide excision repair. We show that BPDE-adducted codon 157 has greater structural distortion than other TP53 G:C>T:A hotspot sites and that sequence context more distal to adjacent bases must influence local distortion. Using TP53 trinucleotide mutation signatures for lung cancer in smokers and non-smokers, we further show that codons 157 and 273 have the highest mutation probability in smokers. Combining this information with adduct structural data, we predict that G:C>T:A mutations at codon 157 in lung tumours of smokers are predominantly caused by BPDE. Our results provide insight into how different DNA sequence contexts show variability in DNA distortion at mutagen adduct sites that could compromise DNA repair at well characterized cancer related mutation hotspots. PMID:26400171
Saini, Harsh; Raicar, Gaurav; Dehzangi, Abdollah; Lal, Sunil; Sharma, Alok
2015-12-07
Protein subcellular localization is an important topic in proteomics since it is related to a protein's overall function, helps in the understanding of metabolic pathways, and aids drug design and discovery. In this paper, a basic approximation technique from natural language processing called the linear interpolation smoothing model is applied for predicting protein subcellular localizations. The proposed approach extracts features from syntactical information in protein sequences to build probabilistic profiles using dependency models, which are used in linear interpolation to determine how likely a sequence is to belong to a particular subcellular location. This technique builds a statistical model based on maximum likelihood. It is able to deal effectively with the high dimensionality that hinders other traditional classifiers such as Support Vector Machines or k-Nearest Neighbours, without sacrificing performance. This approach has been evaluated by predicting subcellular localizations of Gram positive and Gram negative bacterial proteins. Copyright © 2015 Elsevier Ltd. All rights reserved.
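A minimal sketch of linear interpolation smoothing, in the NLP sense referred to above, is given below: a bigram probability is mixed with a unigram probability using a weight lambda, and a sequence is assigned to the class whose model gives the highest log-score. The choice of unigram/bigram features, the weight 0.7, the add-one term, the toy training sequences and all function names are assumptions for illustration; the paper's actual feature set and dependency models are not reproduced.

```python
import math
from collections import Counter

def train(sequences):
    """Count residue unigrams and adjacent-residue bigrams for one class."""
    unigrams, bigrams = Counter(), Counter()
    for seq in sequences:
        unigrams.update(seq)
        bigrams.update(zip(seq, seq[1:]))
    return unigrams, bigrams

def log_score(seq, model, lam=0.7, alphabet_size=20):
    """Log-likelihood of seq under P = lam * P_bigram + (1 - lam) * P_unigram."""
    unigrams, bigrams = model
    total = sum(unigrams.values())
    score = 0.0
    for prev, cur in zip(seq, seq[1:]):
        # Add-one unigram estimate keeps the mixture strictly positive.
        p_uni = (unigrams[cur] + 1) / (total + alphabet_size)
        # Maximum-likelihood bigram estimate; 0 if the context was never seen.
        p_bi = bigrams[(prev, cur)] / unigrams[prev] if unigrams[prev] else 0.0
        score += math.log(lam * p_bi + (1 - lam) * p_uni)
    return score

def predict(seq, models):
    """Return the class label whose model scores the sequence highest."""
    return max(models, key=lambda label: log_score(seq, models[label]))

# Toy, made-up training data for two hypothetical locations.
models = {
    "cytoplasm": train(["MKKLLA", "MKKLAA"]),
    "membrane": train(["MLLVVG", "MLVVGG"]),
}
print(predict("MKKLA", models))
```

The interpolation is what lets the model back off gracefully: an unseen residue pair contributes the small unigram term instead of driving the whole likelihood to zero.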
Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K
2017-04-01
There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Fortin, Connor H; Schulze, Katharina V; Babbitt, Gregory A
2015-01-01
It is now widely accepted that DNA sequences defining DNA-protein interactions functionally depend upon local biophysical features of the DNA backbone that are important in defining sites of binding interaction in the genome (e.g. DNA shape, charge and intrinsic dynamics). However, these physical features of the DNA polymer are not directly apparent when analyzing and viewing Shannon information content calculated at single nucleobases in a traditional sequence logo plot. Thus, sequence logo plots are severely limited in that they convey no explicit information regarding the structural dynamics of the DNA backbone, a feature often critical to binding specificity. We present TRX-LOGOS, an R software package and Perl wrapper code that interfaces with the JASPAR database for computational regulatory genomics. TRX-LOGOS extends the traditional sequence logo plot to include Shannon information content calculated with regard to the dinucleotide-based BI-BII conformation shifts in phosphate linkages on the DNA backbone, thereby adding a visual measure of intrinsic DNA flexibility that can be critical for many DNA-protein interactions. TRX-LOGOS is available as an R graphics module offered at both SourceForge and as a download supplement at this journal. To demonstrate the general utility of TRX logo plots, we first calculated the information content for 416 Saccharomyces cerevisiae transcription factor binding sites functionally confirmed in the Yeastract database and matched to previously published yeast genomic alignments. We discovered that flanking regions contain significantly more information content at phosphate linkages than can be observed at nucleobases. We also examined broader transcription factor classifications defined by the JASPAR database, and discovered that many general signatures of transcription factor binding are locally more information rich at the level of DNA backbone dynamics than nucleobase sequence. 
We used TRX-logos in combination with MEGA 6.0 software for molecular evolutionary genetics analysis to visually compare the human Forkhead box/FOX protein evolution to its binding site evolution. We also compared the DNA binding signatures of human TP53 tumor suppressor determined by two different laboratory methods (SELEX and ChIP-seq). Further analysis of the entire yeast genome, center aligned at the start codon, also revealed a distinct sequence-independent 3 bp periodic pattern in information content, present only in coding region, and perhaps indicative of the non-random organization of the genetic code. TRX-LOGOS is useful in any situation in which important information content in DNA can be better visualized at the positions of phosphate linkages (i.e. dinucleotides) where the dynamic properties of the DNA backbone functions to facilitate DNA-protein interaction.
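The per-position Shannon information content that underlies a sequence logo can be sketched as below: for a column of aligned symbols, information = log2(alphabet size) minus the column entropy. The same function is applied to single nucleobases (alphabet of 4) and to dinucleotide steps (alphabet of 16) to mirror the nucleobase versus phosphate-linkage comparison. This is a simplified illustration: it omits small-sample correction and does not use the TRX (BI-BII) scale of TRX-LOGOS; the binding sites and function names are invented.

```python
import math
from collections import Counter

def column_information(column, alphabet_size=4):
    """Information content (bits) of one alignment column: log2(N) - entropy."""
    counts = Counter(column)
    n = len(column)
    entropy = -sum((c / n) * math.log2(c / n) for c in counts.values())
    return math.log2(alphabet_size) - entropy

def logo_information(sites, step=1):
    """Per-position information for aligned sites.

    step=1 scores single nucleobases (4 symbols);
    step=2 scores overlapping dinucleotide steps (16 symbols).
    """
    width = len(sites[0]) - step + 1
    return [column_information([s[i:i + step] for s in sites], 4 ** step)
            for i in range(width)]

# Toy, made-up aligned binding sites.
sites = ["TGACTC", "TGAGTC", "TGACTA", "TGACTC"]
print([round(x, 2) for x in logo_information(sites, step=1)])
print([round(x, 2) for x in logo_information(sites, step=2)])
```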
Applying Agrep to r-NSA to solve multiple sequences approximate matching.
Ni, Bing; Wong, Man-Hon; Lam, Chi-Fai David; Leung, Kwong-Sak
2014-01-01
This paper addresses the approximate matching problem in a database consisting of multiple DNA sequences, where the proposed approach applies Agrep to a new truncated suffix array, r-NSA. The construction time of the structure is linear in the database size, and indexing a substring in the structure takes constant time. The number of characters processed in applying Agrep is analysed theoretically, and the theoretical upper bound closely approximates the empirical number of characters, which is obtained by enumerating the characters in the actual structure built. Experiments are carried out using (synthetic) random DNA sequences, as well as (real) genome sequences including Hepatitis-B Virus and X-chromosome. Experimental results show that, compared to the straightforward approach that applies Agrep to multiple sequences individually, the proposed approach solves the matching problem in much shorter time. The speed-up of our approach depends on the sequence patterns, and for highly similar homologous genome sequences, which are the common cases in real-life genomes, it can be up to several orders of magnitude.
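For readers unfamiliar with Agrep, the sketch below shows the bit-parallel approximate matching idea it is built on (Wu-Manber style Shift-And with up to k edit errors), applied to a single plain string. It does not implement r-NSA or the paper's multi-sequence indexing; the function name `agrep_ends` is hypothetical, and a non-empty pattern is assumed.

```python
def agrep_ends(text, pattern, k):
    """Return end indices in `text` where `pattern` matches with <= k edit errors.

    Bit j of R[d] is set when pattern[:j+1] matches some suffix of the text
    read so far with at most d errors. Assumes a non-empty pattern.
    """
    m = len(pattern)
    full = (1 << m) - 1
    masks = {}
    for j, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, 0) | (1 << j)
    # Before reading any text, a prefix of length j needs j deletions.
    R = [((1 << d) - 1) & full for d in range(k + 1)]
    accept = 1 << (m - 1)
    ends = []
    for i, ch in enumerate(text):
        mask = masks.get(ch, 0)
        old = R[:]
        R[0] = ((old[0] << 1) | 1) & mask
        for d in range(1, k + 1):
            R[d] = ((((old[d] << 1) | 1) & mask)  # characters match
                    | old[d - 1]                  # extra character in text
                    | (old[d - 1] << 1)           # substitution
                    | (R[d - 1] << 1)             # character missing from text
                    | 1) & full
        if R[k] & accept:
            ends.append(i)
    return ends

# "GATAACA" differs from the pattern by one substitution.
print(agrep_ends("CCGATAACACC", "GATTACA", 1))
```

Each text character costs O(k) word operations when the pattern fits in a machine word, which is why reducing the number of characters that must be fed to the matcher, as the abstract describes, translates directly into running time.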
Prediction of protein secondary structure content for the twilight zone sequences.
Homaeian, Leila; Kurgan, Lukasz A; Ruan, Jishou; Cios, Krzysztof J; Chen, Ke
2007-11-15
Secondary protein structure carries information about local structural arrangements, which include three major conformations: alpha-helices, beta-strands, and coils. A significant majority of successful methods for prediction of the secondary structure are based on multiple sequence alignment. However, multiple alignment fails to provide accurate results when a sequence comes from the twilight zone, that is, it is characterized by low (<30%) homology. To this end, we propose a novel method for prediction of secondary structure content through comprehensive sequence representation, called PSSC-core. The method uses a multiple linear regression model and introduces a comprehensive feature-based sequence representation to predict the amount of helices and strands for sequences from the twilight zone. The PSSC-core method was tested and compared with two other state-of-the-art prediction methods on a set of 2187 twilight zone sequences. The results indicate that our method provides better predictions for both helix and strand content. The PSSC-core is shown to provide statistically significantly better results when compared with the competing methods, reducing the prediction error by 5-7% for helix and 7-9% for strand content predictions. The proposed feature-based sequence representation uses a comprehensive set of physicochemical properties that are custom-designed for each of the helix and strand content predictions. It includes composition and composition moment vectors, frequency of tetra-peptides associated with helical and strand conformations, various property-based groups like exchange groups, chemical groups of the side chains and hydrophobic group, auto-correlations based on hydrophobicity, side-chain masses, hydropathy, and conformational patterns for beta-sheets. The PSSC-core method provides an alternative for predicting the secondary structure content that can be used to validate and constrain results of other structure prediction methods. 
At the same time, it also provides useful insight into design of successful protein sequence representations that can be used in developing new methods related to prediction of different aspects of the secondary protein structure. (c) 2007 Wiley-Liss, Inc.
Beam’s-eye-view imaging during non-coplanar lung SBRT
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yip, Stephen S. F., E-mail: syip@lroc.harvard.edu; Rottmann, Joerg; Berbeco, Ross I.
Purpose: Beam’s-eye-view (BEV) imaging with an electronic portal imaging device (EPID) can be performed during lung stereotactic body radiation therapy (SBRT) to monitor the tumor location in real-time. Image quality for each patient and treatment field depends on several factors including the patient anatomy and the gantry and couch angles. The authors investigated the angular dependence of automatic tumor localization during non-coplanar lung SBRT delivery. Methods: All images were acquired at a frame rate of 12 Hz with an amorphous silicon EPID. A previously validated markerless lung tumor localization algorithm was employed with manual localization as the reference. From ten SBRT patients, 12 987 image frames of 123 image sequences acquired at 48 different gantry–couch rotations were analyzed. δ was defined by the position difference of the automatic and manual localization. Results: Regardless of the couch angle, the best tracking performance was found in image sequences with a gantry angle within 20° of 250° (δ = 1.40 mm). Image sequences acquired with gantry angles of 150°, 210°, and 350° also led to good tracking performances with δ = 1.77–2.00 mm. Overall, the couch angle was not correlated with the tracking results. Among all the gantry–couch combinations, image sequences acquired at (θ = 30°, ϕ = 330°), (θ = 210°, ϕ = 10°), and (θ = 250°, ϕ = 30°) led to the best tracking results with δ = 1.19–1.82 mm. The worst performing combinations were (θ = 90° and 230°, ϕ = 10°) and (θ = 270°, ϕ = 30°) with δ > 3.5 mm. However, 35% (17/48) of the gantry–couch rotations demonstrated substantial variability in tracking performances between patients. For example, the field angle (θ = 70°, ϕ = 10°) was acquired for five patients. While the tracking errors were ≤1.98 mm for three patients, poor performance was found for the other two patients with δ ≥ 2.18 mm, leading to average tracking error of 2.70 mm. 
Only one image sequence was acquired for all other gantry–couch rotations (δ = 1.18–10.29 mm). Conclusions: Non-coplanar beams with gantry–couch rotation of (θ = 30°, ϕ = 330°), (θ = 210°, ϕ = 10°), and (θ = 250°, ϕ = 30°) have the highest accuracy for BEV lung tumor localization. Additionally, gantry angles of 150°, 210°, 250°, and 350° also offer good tracking performance. The beam geometries (θ = 90° and 230°, ϕ = 10°) and (θ = 270°, ϕ = 30°) are associated with substantial automatic localization errors. Overall, lung tumor visibility and tracking performance were patient dependent for a substantial number of the gantry–couch angle combinations studied.
Park, Suehyun; Joo, Heesun; Kim, Jun Soo
2018-01-31
Directing the motion of molecules/colloids in any specific direction is of great interest in many applications of chemistry, physics, and biological sciences, where regulated positioning or transportation of materials is highly desired. Using Brownian dynamics simulations of coarse-grained models of a long, double-stranded DNA molecule and positively charged nanoparticles, we observed that the motion of a single nanoparticle bound to and wrapped by the DNA molecule can be directed along a gradient of DNA local flexibility. The flexibility gradient is constructed along a 0.8 kilobase-pair DNA molecule such that local persistence length decreases gradually from 50 nm to 40 nm, mimicking a gradual change in sequence-dependent flexibility. Nanoparticles roll over a long DNA molecule from less flexible regions towards more flexible ones as a result of the decreasing energetic cost of DNA bending and wrapping. In addition, the rolling becomes slightly accelerated as the positive charge of nanoparticles decreases due to a lower free energy barrier of DNA detachment from charged nanoparticle for processive rolling. This study suggests that the variation in DNA local flexibility can be utilized in constructing and manipulating supramolecular assemblies of DNA molecules and nanoparticles in structural DNA nanotechnology.
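The energetic argument in this abstract (wrapping is cheaper where DNA is more flexible) can be illustrated with the standard worm-like-chain estimate for bending a length L of DNA uniformly around a radius R: E / kT = Lp * L / (2 * R^2), where Lp is the persistence length. The sketch below evaluates this at the two persistence lengths quoted above. The wrapped length (20 nm) and radius (5 nm) are arbitrary illustrative values, not parameters from the study, and the estimate ignores electrostatics and everything else in the coarse-grained model.

```python
def bending_energy_kT(persistence_nm, wrapped_nm, radius_nm):
    """Worm-like-chain bending energy (in units of kT) for uniform curvature 1/R."""
    return persistence_nm * wrapped_nm / (2.0 * radius_nm ** 2)

# Assumed geometry for illustration only: 20 nm of DNA wrapped at a 5 nm radius.
WRAPPED_NM, RADIUS_NM = 20.0, 5.0
for lp in (50.0, 40.0):  # persistence lengths at the two ends of the gradient
    e = bending_energy_kT(lp, WRAPPED_NM, RADIUS_NM)
    print(f"Lp = {lp:.0f} nm -> bending cost ~ {e:.0f} kT")
```

Under these assumed numbers the bending cost falls by about a fifth between the stiff and flexible ends, which gives a sense of the kind of bias that could drive the rolling described above.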
Scanpath memory binding: multiple read-out experiments
NASA Astrophysics Data System (ADS)
Stark, Lawrence W.; Privitera, Claudio M.; Yang, Huiyang; Azzariti, Michela; Ho, Yeuk F.; Chan, Angie; Krischer, Christof; Weinberger, Adam
1999-05-01
The scanpath theory proposed that an internal spatial-cognitive model controls perception and the active looking eye movements, EMs, of the scanpath sequence. Evidence for this came from new quantitative methods, experiments with ambiguous figures and visual imagery and from MRI studies, all on cooperating human subjects. Besides recording EMs, we introduce other experimental techniques wherein the subject must depend upon memory bindings as in visual imagery, but may call upon other motor behaviors than EMs to read-out the remembered patterns. How is the internal model distributed and operationally assembled? The concept of binding speaks to the assigning of values for the model and its execution in various parts of the brain. Current neurological information helps to localize different aspects of the spatial-cognitive model in the brain. We suppose that there are several levels of 'binding' -- semantic or symbolic binding, structural binding for the spatial locations of the regions-of-interest and sequential binding for the dynamic execution program that yields the sequence of EMs. Our aim is to dissect out respective contributions of these different forms of binding.
Thomsen, Rune; Pallesen, Jonatan; Daugaard, Tina F; Børglum, Anders D; Nielsen, Anders L
2013-11-01
Subcellular RNA localization plays an important role in development, cell differentiation, and cell migration. For a comprehensive description of the population of protrusion-localized mRNAs in astrocytes, we separated protrusions from cell bodies in a Boyden chamber and performed high-throughput direct RNA sequencing. The mRNAs with localization in astrocyte protrusions encode proteins belonging to a variety of functional groups, indicating involvement of RNA localization in a palette of cellular functions. The mRNA encoding the intermediate filament protein Nestin was among the identified mRNAs. By RT-qPCR and RNA FISH analysis we confirmed Nestin mRNA localization in cell protrusions and also protrusion localization of Nestin protein. Nestin mRNA localization was dependent on Fragile X mental retardation syndrome proteins Fmrp and Fxr1, and the Nestin 3'-UTR was sufficient to mediate protrusion mRNA localization. The mRNAs for two other intermediate filament proteins in astrocytes, Gfap and Vimentin, have moderate and no protrusion localization, respectively, showing that individual intermediate filament components have different localization mechanisms. The correlated localization of Nestin mRNA with Nestin protein in cell protrusions indicates the presence of a regulatory mechanism at the mRNA localization level for the Nestin intermediate filament protein with potential importance for astrocyte functions during brain development and maintenance. Copyright © 2013 Wiley Periodicals, Inc.
Comprehensive analysis of the dynamic structure of nuclear localization signals.
Yamagishi, Ryosuke; Okuyama, Takahide; Oba, Shuntaro; Shimada, Jiro; Chaen, Shigeru; Kaneko, Hiroki
2015-12-01
Most transcription and epigenetic factors in eukaryotic cells have nuclear localization signals (NLSs) and are transported to the nucleus by nuclear transport proteins. Understanding the features of NLSs and the mechanisms of nuclear transport might help in understanding gene expression regulation and somatic cell reprogramming, and thus lead to the treatment of diseases associated with abnormal gene expression. Although many studies analyzed the amino acid sequence of NLSs, few studies investigated their three-dimensional structure. Therefore, we conducted a statistical investigation of the dynamic structure of NLSs by extracting the conformation of these sequences from proteins examined by X-ray crystallography and using a quantity defined as conformational determination rate (a ratio between the number of amino acids determining the conformation and the number of all amino acids included in a certain region). We found that determining the conformation of NLSs is more difficult than determining the conformation of other regions and that NLSs may tend to form more heteropolymers than monomers. Therefore, these findings strongly suggest that NLSs are intrinsically disordered regions.
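The conformational determination rate defined in this abstract is a simple ratio, sketched below under one plausible reading: the fraction of residues in a region whose coordinates are present in a crystal structure. The function name, the residue numbering and the toy data are assumptions for illustration; how the authors extracted resolved residues from structure files is not specified here.

```python
def determination_rate(resolved_residues, start, end):
    """Fraction of residues in [start, end] with a determined conformation.

    `resolved_residues` is the set of residue numbers that have coordinates
    in the crystal structure (an assumed input format).
    """
    region = range(start, end + 1)
    return sum(1 for r in region if r in resolved_residues) / len(region)

# Toy example: a 100-residue protein with residues 40-45 missing from the model.
resolved = set(range(1, 101)) - set(range(40, 46))
print("hypothetical NLS (38-47):", determination_rate(resolved, 38, 47))
print("whole protein (1-100):   ", determination_rate(resolved, 1, 100))
```

A region whose rate falls well below that of the rest of the protein is one where the crystal gave little or no interpretable density, which is the signature the abstract associates with intrinsic disorder.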
Rigoutsos, Isidore; Riek, Peter; Graham, Robert M; Novotny, Jiri
2003-08-01
One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular alpha-helical character (i.e. pi-helices, 3(10)-helices and kinks). A 'search engine' derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above 'non-canonical' helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from alpha-helicity are encoded locally in sequence patterns only about 7-9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure-function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html.
Perceptions of randomness in binary sequences: Normative, heuristic, or both?
Reimers, Stian; Donkin, Chris; Le Pelley, Mike E
2018-03-01
When people consider a series of random binary events, such as tossing an unbiased coin and recording the sequence of heads (H) and tails (T), they tend to erroneously rate sequences with less internal structure or order (such as HTTHT) as more probable than sequences containing more structure or order (such as HHHHH). This is traditionally explained as a local representativeness effect: Participants assume that the properties of long sequences of random outcomes-such as an equal proportion of heads and tails, and little internal structure-should also apply to short sequences. However, recent theoretical work has noted that the probability of a particular sequence of say, heads and tails of length n, occurring within a larger (>n) sequence of coin flips actually differs by sequence, so P(HHHHH)
Sequence Dependent Interactions Between DNA and Single-Walled Carbon Nanotubes
NASA Astrophysics Data System (ADS)
Roxbury, Daniel
It is known that single-stranded DNA adopts a helical wrap around a single-walled carbon nanotube (SWCNT), forming a water-dispersible hybrid molecule. The ability to sort mixtures of SWCNTs based on chirality (electronic species) has recently been demonstrated using special short DNA sequences that recognize certain matching SWCNTs of specific chirality. This thesis investigates the intricacies of DNA-SWCNT sequence-specific interactions through both experimental and molecular simulation studies. The DNA-SWCNT binding strengths were experimentally quantified by studying the kinetics of DNA replacement by a surfactant on the surface of particular SWCNTs. Recognition ability was found to correlate strongly with measured binding strength, e.g. DNA sequence (TAT)4 was found to bind 20 times stronger to the (6,5)-SWCNT than sequence (TAT)4T. Next, using replica exchange molecular dynamics (REMD) simulations, equilibrium structures formed by (a) single-strands and (b) multiple-strands of 12-mer oligonucleotides adsorbed on various SWCNTs were explored. A number of structural motifs were discovered in which the DNA strand wraps around the SWCNT and 'stitches' to itself via hydrogen bonding. Great variability among equilibrium structures was observed and shown to be directly influenced by DNA sequence and SWCNT type. For example, the (6,5)-SWCNT DNA recognition sequence, (TAT)4, was found to wrap in a tight single-stranded right-handed helical conformation. In contrast, DNA sequence T12 forms a beta-barrel left-handed structure on the same SWCNT. These are the first theoretical indications that DNA-based SWCNT selectivity can arise on a molecular level. In a biomedical collaboration with the Mayo Clinic, pathways for DNA-SWCNT internalization into healthy human endothelial cells were explored. 
Through absorbance spectroscopy, TEM imaging, and confocal fluorescence microscopy, we showed that intracellular concentrations of SWCNTs far exceeded those of the incubation solution, which suggested an energy-dependent pathway. Additionally, by means of pharmacological inhibition and vector-induced gene knockout studies, the DNA-SWCNTs were shown to enter the cells via Rac1-mediated macropinocytosis.
Kanamori, Hiroshi; Yuhashi, Kazuhito; Ohnishi, Shin; Koike, Kazuhiko; Kodama, Tatsuhiko
2010-05-01
The hepatitis C virus NS5B RNA-dependent RNA polymerase (RdRp) is a key enzyme involved in viral replication. Interaction between NS5B RdRp and the viral RNA sequence is likely to be an important step in viral RNA replication. The C-terminal half of the NS5B-coding sequence, which contains the important cis-acting replication element, has been identified as an NS5B-binding sequence. In the present study, we confirm the specific binding of NS5B to one of the RNA stem-loop structures in the region, 5BSL3.2. In addition, we show that NS5B binds to the complementary strand of 5BSL3.2 (5BSL3.2N). The bulge structure of 5BSL3.2N was shown to be indispensable for tight binding to NS5B. In vitro RdRp activity was inhibited by 5BSL3.2N, indicating the importance of the RNA element in the polymerization by RdRp. These results suggest the involvement of the RNA stem-loop structure of the negative strand in the replication process.
Waldispühl, Jérôme; Ponty, Yann
2011-11-01
The analysis of the relationship between sequences and structures (i.e., how mutations affect structures and reciprocally how structures influence mutations) is essential to decipher the principles driving molecular evolution, to infer the origins of genetic diseases, and to develop bioengineering applications such as the design of artificial molecules. Because their structures can be predicted from the sequence data only, RNA molecules provide a good framework to study this sequence-structure relationship. We recently introduced a suite of algorithms called RNAmutants which allows a complete exploration of RNA sequence-structure maps in polynomial time and space. Formally, RNAmutants takes an input sequence (or seed) to compute the Boltzmann-weighted ensembles of mutants with exactly k mutations, and samples mutations from these ensembles. However, this approach suffers from major limitations. Indeed, since the Boltzmann probabilities of the mutations depend on the free energy of the structures, RNAmutants has difficulty sampling mutant sequences with low G+C-contents. In this article, we introduce an unbiased adaptive sampling algorithm that enables RNAmutants to sample regions of the mutational landscape poorly covered by classical algorithms. We applied these methods to sample mutations with low G+C-contents. These adaptive sampling techniques can be easily adapted to explore other regions of the sequence and structural landscapes which are difficult to sample. Importantly, these algorithms come at a minimal computational cost. We demonstrate the insights offered by these techniques on studies of complete RNA sequence-structure maps of sizes up to 40 nucleotides. Our results indicate that the G+C-content has a strong influence on the size and shape of the evolutionarily accessible sequence and structural spaces. In particular, we show that low G+C-contents favor the appearance of internal loops and thus possibly the synthesis of tertiary structure motifs. 
On the other hand, high G+C-contents significantly reduce the size of the evolutionarily accessible mutational landscapes.
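The sampling bias described in this abstract follows from how Boltzmann weights reward low free energies. The sketch below is not RNAmutants code; it is a minimal illustration in which the function names, the 310.15 K default temperature and the example energies are all assumptions chosen for the demonstration.

```python
import math

GAS_CONSTANT_KCAL = 0.0019872  # kcal / (mol K)


def gc_content(sequence):
    """Fraction of G and C nucleotides in an RNA or DNA sequence."""
    sequence = sequence.upper()
    return sum(base in "GC" for base in sequence) / len(sequence)


def boltzmann_probabilities(energies_kcal, temperature_k=310.15):
    """Normalized Boltzmann probabilities for a list of free energies.

    Energies are shifted by their minimum before exponentiation so that
    very negative values do not overflow.
    """
    rt = GAS_CONSTANT_KCAL * temperature_k
    e_min = min(energies_kcal)
    weights = [math.exp(-(e - e_min) / rt) for e in energies_kcal]
    partition = sum(weights)
    return [w / partition for w in weights]


# Hypothetical mutants: the G+C-rich one is assumed to fold more stably.
mutants = {"GGCGCC": -5.0, "AAUAUU": -2.0}
probs = boltzmann_probabilities(list(mutants.values()))
for (seq, energy), p in zip(mutants.items(), probs):
    print(seq, round(gc_content(seq), 2), energy, round(p, 4))
```

With these illustrative numbers the more stable mutant absorbs almost all of the probability mass, which is why an unweighted sampler rarely visits low G+C sequences.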
The rRNA evolution and procaryotic phylogeny
NASA Technical Reports Server (NTRS)
Fox, G. E.
1986-01-01
Studies of ribosomal RNA primary structure allow reconstruction of phylogenetic trees for prokaryotic organisms. Such studies reveal a major dichotomy among the bacteria that separates them into eubacteria and archaebacteria. Both groupings are further segmented into several major divisions. The results obtained from 5S rRNA sequences are essentially the same as those obtained with the 16S rRNA data. In the case of Gram-negative bacteria, the ribosomal RNA sequencing results can also be directly compared with hybridization studies and cytochrome c sequencing studies. There is again excellent agreement among the several methods. It seems likely then that the overall picture of microbial phylogeny that is emerging from the RNA sequence studies is a good approximation of the true history of these organisms. The RNA data allow examination of the evolutionary process in a semi-quantitative way. The secondary structures of these RNAs are largely established. As a result it is possible to recognize examples of local structural evolution. Evolutionary pathways accounting for these events can be proposed and their probability can be assessed.
The amyloid fold of Gad m 1 epitopes governs IgE binding
Sánchez, Rosa; Martínez, Javier; Castro, Ana; Pedrosa, María; Quirce, Santiago; Rodríguez-Pérez, Rosa; Gasset, María
2016-01-01
Amyloids are polymeric structural states formed from locally or totally unfolded protein chains that permit surface reorganizations, stability enhancements and interaction properties that are absent in the precursor monomers. β-Parvalbumin, the major allergen in fish allergy, forms amyloids that are recognized by IgE in the patient sera, suggesting a yet unknown pathological role for these assemblies. We used Gad m 1 as the fish β-parvalbumin model and a combination of approaches, including peptide arrays, recombinant wt and mutant chains, biophysical characterizations, protease digestions, mass spectrometry, dot-blot and ELISA assays to gain insights into the role of amyloids in the IgE interaction. We found that Gad m 1 immunoreactive regions behave as sequence-dependent conformational epitopes that provide a 1000-fold increase in affinity and the structural repetitiveness required for optimal IgE binding and cross-linking upon folding into amyloids. These findings support the amyloid state as a key entity in type I food allergy. PMID:27597317
1996-01-01
Mutations in the Caenorhabditis elegans gene unc-89 result in nematodes having disorganized muscle structure in which thick filaments are not organized into A-bands, and there are no M-lines. Beginning with a partial cDNA from the C. elegans sequencing project, we have cloned and sequenced the unc-89 gene. An unc-89 allele, st515, was found to contain an 84-bp deletion and a 10-bp duplication, resulting in an in-frame stop codon within predicted unc-89 coding sequence. Analysis of the complete coding sequence for unc-89 predicts a novel 6,632 amino acid polypeptide consisting of sequence motifs which have been implicated in protein-protein interactions. UNC-89 begins with 67 residues of unique sequences, SH3, dbl/CDC24, and PH domains, 7 immunoglobulin (Ig) domains, a putative KSP-containing multiphosphorylation domain, and ends with 46 Ig domains. A polyclonal antiserum raised to a portion of unc-89 encoded sequence reacts to a twitchin-sized polypeptide from wild type, but truncated polypeptides from st515 and from the amber allele e2338. By immunofluorescent microscopy, this antiserum localizes to the middle of A-bands, consistent with UNC-89 being a structural component of the M-line. Previous studies indicate that myofilament lattice assembly begins with positional cues laid down in the basement membrane and muscle cell membrane. We propose that the intracellular protein UNC-89 responds to these signals, localizes, and then participates in assembling an M-line. PMID:8603916
Reduced extinction of hippocampal-dependent memories in CPEB knockout mice.
Berger-Sweeney, Joanne; Zearfoss, N Ruth; Richter, Joel D
2006-01-01
CPEB is a sequence-specific RNA binding protein that regulates translation at synapses. In neurons of CPEB knockout mice, synaptic efficacy is reduced. Here, we have performed a battery of behavioral tests and find that relative to wild-type animals, CPEB knockout mice, although similar on many baseline behaviors, have reduced extinction of memories on two hippocampal-dependent tasks. A corresponding microarray analysis reveals that about 0.14% of hippocampal genes have an altered expression in the CPEB knockout mouse. These data suggest that CPEB-dependent local protein synthesis may be an important cellular mechanism underlying extinction of hippocampal-dependent memories.
Evolutionary profiles from the QR factorization of multiple sequence alignments
Sethi, Anurag; O'Donoghue, Patrick; Luthey-Schulten, Zaida
2005-01-01
We present an algorithm to generate complete evolutionary profiles that represent the topology of the molecular phylogenetic tree of the homologous group. The method, based on the multidimensional QR factorization of numerically encoded multiple sequence alignments, removes redundancy from the alignments and orders the protein sequences by increasing linear dependence, resulting in the identification of a minimal basis set of sequences that spans the evolutionary space of the homologous group of proteins. We observe a general trend that these smaller, more evolutionarily balanced profiles have comparable and, in many cases, better performance in database searches than conventional profiles containing hundreds of sequences, constructed in an iterative and computationally intensive procedure. For more diverse families or superfamilies, with sequence identity <30%, structural alignments, based purely on the geometry of the protein structures, provide better alignments than pure sequence-based methods. Merging the structure and sequence information allows the construction of accurate profiles for distantly related groups. These structure-based profiles outperformed other sequence-based methods for finding distant homologs and were used to identify a putative class II cysteinyl-tRNA synthetase (CysRS) in several archaea that eluded previous annotation studies. Phylogenetic analysis showed the putative class II CysRSs to be a monophyletic group and homology modeling revealed a constellation of active site residues similar to that in the known class I CysRS. PMID:15741270
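The idea of ordering aligned sequences by increasing linear dependence can be illustrated with a column-pivoted orthogonalization. The sketch below is not the authors' implementation: the one-hot encoding, the 21-letter alphabet and the function names are assumptions, and a greedy pivoted Gram-Schmidt procedure stands in for the multidimensional QR factorization described in the abstract.

```python
import math

ALPHABET = "ACDEFGHIKLMNPQRSTVWY-"  # 20 amino acids plus gap (assumed encoding)


def one_hot(aligned_sequence):
    """Encode an aligned sequence as a flat one-hot vector."""
    vector = []
    for residue in aligned_sequence:
        vector.extend(1.0 if residue == letter else 0.0 for letter in ALPHABET)
    return vector


def pivot_order(aligned_sequences):
    """Order sequences by pivoted Gram-Schmidt orthogonalization.

    At each step the sequence with the largest residual norm (the one least
    explained by those already chosen) is selected, and its direction is
    projected out of the remaining sequences. Redundant sequences therefore
    end up last.
    """
    residuals = [one_hot(s) for s in aligned_sequences]
    remaining = list(range(len(aligned_sequences)))
    order = []
    while remaining:
        norms = {i: math.sqrt(sum(x * x for x in residuals[i])) for i in remaining}
        best = max(remaining, key=lambda i: norms[i])
        order.append(best)
        remaining.remove(best)
        if norms[best] > 1e-12:
            q = [x / norms[best] for x in residuals[best]]
            for i in remaining:
                dot = sum(a * b for a, b in zip(q, residuals[i]))
                residuals[i] = [a - dot * b for a, b in zip(residuals[i], q)]
    return order


# The duplicate of the first sequence carries no new information and is ranked last.
print(pivot_order(["ACDE", "ACDE", "WYWY"]))
```

Truncating the returned order after the first few indices gives a small, non-redundant subset in the spirit of the minimal basis set the abstract describes.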
Bor, Daniel; Billington, Jac; Baron-Cohen, Simon
2007-10-01
SINGLE CASE: DT is a savant with exceptional abilities in numerical memory and mathematical calculations. DT also has an elaborate form of synaesthesia for visually presented digits. Furthermore, DT also has Asperger syndrome (AS). We carried out two preliminary investigations to establish whether these conditions may contribute to his savant abilities. In an fMRI digit span study, DT showed hyperactivity in lateral prefrontal cortex when encoding digits, compared with controls. In addition, while controls showed raised lateral prefrontal activation in response to structured (compared to unstructured) sequences of digits, DT's neural activity did not differ between these two conditions. In addition, controls showed a significant performance advantage for structured, compared with unstructured sequences whereas no such pattern was found for DT. We suggest that this performance pattern reflects that DT focuses less on external mathematical structure, since for him all digit sequences have internal structure linked to his synaesthesia. Finally, DT did not activate extra-striate regions normally associated with synaesthesia, suggesting that he has an unusual and more abstract and conceptual form of synaesthesia. This appears to generate structured, highly-chunked content that enhances encoding of digits and aids both recall and calculation. People with AS preferentially attend to local features of stimuli. To test this in DT, we administered the Navon task. Relative to controls, DT was faster at finding a target at the local level, and was less distracted by interference from the global level. The propensity to focus on local detail, in concert with a form of synaesthesia that provides structure to all digits, may account for DT's exceptional numerical memory and calculation ability. This neural and cognitive pattern needs to be tested in a series of similar cases, and with more constrained control groups, to confirm the significance of this association.
Time fluctuation analysis of forest fire sequences
NASA Astrophysics Data System (ADS)
Vega Orozco, Carmen D.; Kanevski, Mikhaïl; Tonini, Marj; Golay, Jean; Pereira, Mário J. G.
2013-04-01
Forest fires are complex events involving both space and time fluctuations. Understanding their dynamics and pattern distribution is of great importance in order to improve resource allocation and support fire management actions at local and global levels. This study aims at characterizing the temporal fluctuations of forest fire sequences observed in Portugal, which is the country that holds the largest wildfire land dataset in Europe. This research applies several exploratory data analysis measures to 302,000 forest fires that occurred from 1980 to 2007. The applied clustering measures are: Morisita clustering index, fractal and multifractal dimensions (box-counting), Ripley's K-function, Allan Factor, and variography. These algorithms enable a global time structural analysis describing the degree of clustering of a point pattern and defining whether the observed events occur randomly, in clusters or in a regular pattern. The considered methods are of general importance and can be used for other spatio-temporal events (e.g. crime, epidemiology, biodiversity, geomarketing). An important contribution of this research deals with the analysis and estimation of local measures of clustering that help in understanding their temporal structure. Each measure is described and executed for the raw data (forest fires geo-database) and results are compared to reference patterns generated under the null hypothesis of randomness (Poisson processes) embedded in the same time period of the raw data. This comparison enables estimating the degree of the deviation of the real data from a Poisson process. Generalizations to functional measures of these clustering methods, taking into account the phenomena, were also applied and adapted to detect time dependences in a measured variable (i.e. burned area). The time clustering of the raw data is compared several times with the Poisson processes at different thresholds of the measured function. 
Then, the clustering measure value depends on the threshold which helps to understand the time pattern of the studied events. Our findings detected the presence of overdensity of events in particular time periods and showed that the forest fire sequences in Portugal can be considered as a multifractal process with a degree of time-clustering of the events. Key words: time sequences, Morisita index, fractals, multifractals, box-counting, Ripley's K-function, Allan Factor, variography, forest fires, point process. Acknowledgements This work was partly supported by the SNFS Project No. 200021-140658, "Analysis and Modelling of Space-Time Patterns in Complex Regions". References - Kanevski M. (Editor). 2008. Advanced Mapping of Environmental Data: Geostatistics, Machine Learning and Bayesian Maximum Entropy. London / Hoboken: iSTE / Wiley. - Telesca L. and Pereira M.G. 2010. Time-clustering investigation of fire temporal fluctuations in Portugal, Nat. Hazards Earth Syst. Sci., vol. 10(4): 661-666. - Vega Orozco C., Tonini M., Conedera M., Kanevski M. (2012) Cluster recognition in spatial-temporal sequences: the case of forest fires, Geoinformatica, vol. 16(4): 653-673.
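Of the clustering measures listed in this abstract, the Allan Factor is the simplest to state: for counts N_k in consecutive windows of length T, AF(T) = <(N_{k+1} - N_k)^2> / (2 <N_k>), which is close to 1 for a Poisson process and grows for clustered events. The sketch below is a minimal illustration under that standard definition; the function name, arguments and toy event times are assumptions and are not taken from the study.

```python
def allan_factor(event_times, window, t_start, t_end):
    """Allan Factor of a point process for one window length.

    Events are counted in consecutive, non-overlapping windows; the mean
    squared difference of adjacent counts is divided by twice the mean count.
    """
    n_windows = int((t_end - t_start) // window)
    if n_windows < 2:
        raise ValueError("need at least two windows")
    counts = [0] * n_windows
    for t in event_times:
        k = int((t - t_start) // window)
        if 0 <= k < n_windows:
            counts[k] += 1
    mean_count = sum(counts) / n_windows
    if mean_count == 0:
        raise ValueError("no events inside the observation period")
    squared_diffs = [(counts[k + 1] - counts[k]) ** 2 for k in range(n_windows - 1)]
    return (sum(squared_diffs) / len(squared_diffs)) / (2 * mean_count)


# Toy data: one event per window (regular) versus bursts in alternating windows.
regular = [0.5, 1.5, 2.5, 3.5]
clustered = [0.1, 0.2, 0.3, 0.4, 2.1, 2.2, 2.3, 2.4]
print(allan_factor(regular, 1.0, 0.0, 4.0))
print(allan_factor(clustered, 1.0, 0.0, 4.0))
```

Repeating the calculation over a range of window lengths and comparing against simulated Poisson sequences of the same intensity is one way to reproduce the kind of comparison the abstract describes.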
Structure optimisation by thermal cycling for the hydrophobic-polar lattice model of protein folding
NASA Astrophysics Data System (ADS)
Günther, Florian; Möbius, Arnulf; Schreiber, Michael
2017-03-01
The function of a protein depends strongly on its spatial structure. Therefore the transition from an unfolded stage to the functional fold is one of the most important problems in computational molecular biology. Since the corresponding free energy landscapes exhibit huge numbers of local minima, the search for the lowest-energy configurations is very demanding. Because of that, efficient heuristic algorithms are of high value. In the present work, we investigate whether and how the thermal cycling (TC) approach can be applied to the hydrophobic-polar (HP) lattice model of protein folding. Evaluating the efficiency of TC for a set of two- and three-dimensional examples, we compare the performance of this strategy with that of multi-start local search (MSLS) procedures and that of simulated annealing (SA). For this aim, we incorporated several simple but rather efficient modifications into the standard procedures: in particular, a strong improvement was achieved by also allowing energy conserving state modifications. Furthermore, the consideration of ensembles instead of single samples was found to greatly improve the efficiency of TC. In the framework of different benchmarks, for all considered HP sequences, we found TC to be far superior to SA, and to be faster than Wang-Landau sampling.
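In the HP lattice model, the energy that heuristics such as thermal cycling or simulated annealing try to minimize is commonly defined as -1 for every pair of hydrophobic (H) residues that sit on adjacent lattice sites without being neighbors along the chain. The sketch below evaluates that energy on a two-dimensional square lattice; the function name and the coordinate representation are assumptions made for illustration, not the authors' code.

```python
def hp_energy(sequence, coords):
    """Energy of a 2D HP lattice conformation.

    sequence: string of 'H' and 'P'; coords: list of (x, y) lattice sites,
    one per residue, in chain order. Each non-bonded H-H contact counts -1.
    """
    if len(sequence) != len(coords):
        raise ValueError("sequence and coordinates differ in length")
    if len(set(coords)) != len(coords):
        raise ValueError("conformation is not self-avoiding")
    index_at = {site: i for i, site in enumerate(coords)}
    energy = 0
    for i, (x, y) in enumerate(coords):
        if sequence[i] != "H":
            continue
        # Look only right and up so that every lattice contact is counted once.
        for dx, dy in ((1, 0), (0, 1)):
            j = index_at.get((x + dx, y + dy))
            if j is not None and sequence[j] == "H" and abs(i - j) > 1:
                energy -= 1
    return energy


square = [(0, 0), (1, 0), (1, 1), (0, 1)]
print(hp_energy("HPPH", square))  # the two chain ends touch
print(hp_energy("HPHP", square))  # no H-H contact
```

A search heuristic would call such a function on each proposed move; the energy-conserving moves mentioned in the abstract are those that leave this value unchanged.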
Localized structural frustration for evaluating the impact of sequence variants.
Kumar, Sushant; Clarke, Declan; Gerstein, Mark
2016-12-01
Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype-genotype associations. Protein structures provide a way of addressing this challenge. Previous efforts have focused on globally quantifying the impact of SNVs on protein stability. However, local perturbations may severely impact protein functionality without strongly disrupting global stability (e.g. in relation to catalysis or allostery). Here, we describe a workflow in which localized frustration, quantifying unfavorable local interactions, is employed as a metric to investigate such effects. Using this workflow on the Protein Data Bank, we find that frustration produces many immediately intuitive results: for instance, disease-related SNVs create stronger changes in localized frustration than non-disease related variants, and rare SNVs tend to disrupt local interactions to a larger extent than common variants. Less obviously, we observe that somatic SNVs associated with oncogenes and tumor suppressor genes (TSGs) induce very different changes in frustration. In particular, those associated with TSGs change the frustration more in the core than the surface (by introducing loss-of-function events), whereas those associated with oncogenes manifest the opposite pattern, creating gain-of-function events. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Tron, Adriana E; Comelli, Raúl N; Gonzalez, Daniel H
2005-12-27
Homeodomain-leucine zipper (HD-Zip) proteins, unlike most homeodomain proteins, bind a pseudopalindromic DNA sequence as dimers. We have investigated the structure of the DNA complexes formed by two HD-Zip proteins with different nucleotide preferences at the central position of the binding site using footprinting and interference methods. The results indicate that the respective complexes are not symmetric, with the strand bearing a central purine (top strand) showing higher protection around the central region and the bottom strand protected toward the 3' end. Binding to a sequence with a nonpreferred central base pair produces a decrease in protection in either the top or the bottom strand, depending upon the protein. Modeling studies derived from the complex formed by the monomeric Antennapedia homeodomain with DNA indicate that in the HD-Zip/DNA complex the recognition helix of one of the monomers is displaced within the major groove respective to the other one. This monomer seems to lose contacts with a part of the recognition sequence upon binding to the nonpreferred site. The results show that the structure of the complex formed by HD-Zip proteins with DNA is dependent upon both protein intrinsic characteristics and the nucleotides present at the central position of the recognition sequence.
Tome, Jacob M; Ozer, Abdullah; Pagano, John M; Gheba, Dan; Schroth, Gary P; Lis, John T
2014-06-01
RNA-protein interactions play critical roles in gene regulation, but methods to quantitatively analyze these interactions at a large scale are lacking. We have developed a high-throughput sequencing-RNA affinity profiling (HiTS-RAP) assay by adapting a high-throughput DNA sequencer to quantify the binding of fluorescently labeled protein to millions of RNAs anchored to sequenced cDNA templates. Using HiTS-RAP, we measured the affinity of mutagenized libraries of GFP-binding and NELF-E-binding aptamers to their respective targets and identified critical regions of interaction. Mutations additively affected the affinity of the NELF-E-binding aptamer, whose interaction depended mainly on a single-stranded RNA motif, but not that of the GFP aptamer, whose interaction depended primarily on secondary structure.
Alternative DNA structure formation in the mutagenic human c-MYC promoter
del Mundo, Imee Marie A.; Zewail-Foote, Maha; Kerwin, Sean M.
2017-01-01
Mutation ‘hotspot’ regions in the genome are susceptible to genetic instability, implicating them in diseases. These hotspots are not random and often co-localize with DNA sequences potentially capable of adopting alternative DNA structures (non-B DNA, e.g. H-DNA and G4-DNA), which have been identified as endogenous sources of genomic instability. There are regions that contain overlapping sequences that may form more than one non-B DNA structure. The extent to which one structure impacts the formation/stability of another, within the sequence, is not fully understood. To address this issue, we investigated the folding preferences of oligonucleotides from a chromosomal breakpoint hotspot in the human c-MYC oncogene containing both potential G4-forming and H-DNA-forming elements. We characterized the structures formed in the presence of G4-DNA-stabilizing K+ ions or H-DNA-stabilizing Mg2+ ions using multiple techniques. We found that under conditions favorable for H-DNA formation, a stable intramolecular triplex DNA structure predominated; whereas, under K+-rich, G4-DNA-forming conditions, a plurality of unfolded and folded species were present. Thus, within a limited region containing sequences with the potential to adopt multiple structures, only one structure predominates under a given condition. The predominance of H-DNA implicates this structure in the instability associated with the human c-MYC oncogene. PMID:28334873
Evaluating the accuracy of SHAPE-directed RNA secondary structure predictions
Sükösd, Zsuzsanna; Swenson, M. Shel; Kjems, Jørgen; Heitsch, Christine E.
2013-01-01
Recent advances in RNA structure determination include using data from high-throughput probing experiments to improve thermodynamic prediction accuracy. We evaluate the extent and nature of improvements in data-directed predictions for a diverse set of 16S/18S ribosomal sequences using a stochastic model of experimental SHAPE data. The average accuracy for 1000 data-directed predictions always improves over the original minimum free energy (MFE) structure. However, the amount of improvement varies with the sequence, exhibiting a correlation with MFE accuracy. Further analysis of this correlation shows that accurate MFE base pairs are typically preserved in a data-directed prediction, whereas inaccurate ones are not. Thus, the positive predictive value of common base pairs is consistently higher than the directed prediction accuracy. Finally, we confirm sequence dependencies in the directability of thermodynamic predictions and investigate the potential for greater accuracy improvements in the worst performing test sequence. PMID:23325843
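Accuracy figures such as the positive predictive value mentioned in this abstract are usually computed by comparing predicted base pairs with a reference structure. The sketch below shows that bookkeeping; the function name and the toy base-pair sets are assumptions, and exact pair matching is used, whereas published benchmarks sometimes also accept pairs shifted by one position.

```python
def base_pair_accuracy(predicted_pairs, reference_pairs):
    """Return (sensitivity, positive predictive value) for base-pair sets.

    Pairs are (i, j) index tuples; orientation is normalized so that
    (j, i) and (i, j) are treated as the same pair.
    """
    predicted = {tuple(sorted(pair)) for pair in predicted_pairs}
    reference = {tuple(sorted(pair)) for pair in reference_pairs}
    if not predicted or not reference:
        raise ValueError("both structures must contain at least one base pair")
    true_positives = len(predicted & reference)
    sensitivity = true_positives / len(reference)
    ppv = true_positives / len(predicted)
    return sensitivity, ppv


# Toy example: three of four predicted pairs are in a five-pair reference.
reference = [(1, 20), (2, 19), (3, 18), (4, 17), (5, 16)]
predicted = [(1, 20), (2, 19), (3, 18), (7, 12)]
print(base_pair_accuracy(predicted, reference))
```

Restricting the predicted set to pairs shared by the MFE and data-directed structures, as the abstract discusses, would typically raise the PPV while lowering the sensitivity.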
Jia, Da; Gomez, Timothy S; Metlagel, Zoltan; Umetani, Junko; Otwinowski, Zbyszek; Rosen, Michael K; Billadeau, Daniel D
2010-06-08
We recently showed that the Wiskott-Aldrich syndrome protein (WASP) family member, WASH, localizes to endosomal subdomains and regulates endocytic vesicle scission in an Arp2/3-dependent manner. Mechanisms regulating WASH activity are unknown. Here we show that WASH functions in cells within a 500 kDa core complex containing Strumpellin, FAM21, KIAA1033 (SWIP), and CCDC53. Although recombinant WASH is constitutively active toward the Arp2/3 complex, the reconstituted core assembly is inhibited, suggesting that it functions in cells to regulate actin dynamics through WASH. FAM21 interacts directly with CAPZ and inhibits its actin-capping activity. Four of the five core components show distant (approximately 15% amino acid sequence identity) but significant structural homology to components of a complex that negatively regulates the WASP family member, WAVE. Moreover, biochemical and electron microscopic analyses show that the WASH and WAVE complexes are structurally similar. Thus, these two distantly related WASP family members are controlled by analogous structurally related mechanisms. Strumpellin is mutated in the human disease hereditary spastic paraplegia, and its link to WASH suggests that misregulation of actin dynamics on endosomes may play a role in this disorder.
Forlani, Giuseppe; Makarova, Kira S.; Ruszkowski, Milosz; ...
2015-08-03
Proline plays a crucial role in cell growth and stress responses, and its accumulation is essential for the tolerance of adverse environmental conditions in plants. Two routes are used to biosynthesize proline in plants. The main route uses glutamate as a precursor, while in the other route proline is derived from ornithine. The terminal step of both pathways, the conversion of δ1-pyrroline-5-carboxylate (P5C) to L-proline, is catalyzed by P5C reductase (P5CR) using NADH or NADPH as a cofactor. Since P5CRs are important housekeeping enzymes, they are conserved across all domains of life and appear to be relatively unaffected throughout evolution. However, global analysis of these enzymes unveiled significant functional diversity in the preference for cofactors (NADPH vs. NADH), variation in metal dependence and the differences in the oligomeric state. In our study we investigated evolutionary patterns through phylogenetic and structural analysis of P5CR representatives from all kingdoms of life, with emphasis on the plant species. We attempted to correlate local sequence/structure variation among the functionally and structurally characterized members of the family.
Pappas, Eleftherios P; Seimenis, Ioannis; Dellios, Dimitrios; Kollias, Georgios; Lampropoulos, Kostas I; Karaiskos, Pantelis
2018-06-25
This work focuses on MR-related sequence dependent geometric distortions, which are associated with B0 inhomogeneity and patient-induced distortion (susceptibility differences and chemical shift effects), in MR images used in stereotactic radiosurgery (SRS) applications. Emphasis is put on characterizing distortion at target brain areas identified by gadolinium diethylenetriamine pentaacetic acid (Gd-DTPA) paramagnetic contrast agent uptake. A custom-made phantom for distortion detection was modified to accommodate two small cylindrical inserts, simulating small brain targets. The inserts were filled with Gd-DTPA solutions of various concentrations (0-20 mM). The phantom was scanned on a 1.5 T unit using both the reversed read gradient polarity (to determine the overall distortion as reflected by the inserts' centroid offset) and the field mapping (to determine B0 inhomogeneity related distortion in the vicinity of the inserts) techniques. Post-Gd patient images involving a total of 10 brain metastases/targets were also studied using a similar methodology. For the specific imaging conditions, contrast agent presence was found to evidently affect phantom insert position, with centroid offset extending up to 0.068 mm mM⁻¹ (0.208 ppm mM⁻¹). The Gd-DTPA induced distortion in patient images was of the order of 0.5 mm for the MRI protocol used, in agreement with the phantom results. Total localization uncertainty of metastases-targets in patient images ranged from 0.35 mm to 0.87 mm, depending on target location, with an average value of 0.54 mm (2.24 ppm). This relatively wide range of target localization uncertainty results from the fact that the B0 inhomogeneity distortion vector in a specific location may add to or partly counterbalance Gd-DTPA induced distortion, thus increasing or decreasing, respectively, the total sequence dependent distortion. 
Although relatively small, the sequence dependent distortion in Gd-DTPA enhanced brain images can be easily taken into account for SRS treatment planning and target definition purposes by carefully inspecting both the forward and reversed polarity series.
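The reversed read gradient polarity technique rests on the fact that sequence dependent distortion along the read (frequency-encoding) axis changes sign when the gradient polarity is reversed. Under that assumption, the mean of the two measured centroid positions estimates the undistorted position and half their difference estimates the distortion. The sketch below encodes only this arithmetic; the function name and the example positions are hypothetical, and the calculation is not taken from the paper.

```python
def split_forward_reverse(forward_mm, reverse_mm):
    """Estimate corrected position and distortion along the read axis.

    forward_mm, reverse_mm: centroid coordinates of the same target measured
    with forward and reversed read gradient polarity. Assumes the distortion
    has equal magnitude and opposite sign in the two acquisitions.
    """
    corrected = (forward_mm + reverse_mm) / 2.0
    distortion = (forward_mm - reverse_mm) / 2.0
    return corrected, distortion


# Hypothetical target seen at 10.5 mm (forward) and 9.5 mm (reversed).
print(split_forward_reverse(10.5, 9.5))
```

In this example the centroid offset between the two series is 1.0 mm, so each image misplaces the target by an estimated 0.5 mm in opposite directions.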
Wang, Shih-Ting; Lin, Yiyang; Spencer, Ryan K.; ...
2017-08-03
Determining the structural origins of amyloid fibrillation is essential for understanding both the pathology of amyloidosis and the rational design of inhibitors to prevent or reverse amyloid formation. In this work, the decisive roles of peptide structures on amyloid self-assembly and morphological diversity were investigated by the design of eight amyloidogenic peptides derived from islet amyloid polypeptide. Among the segments, two distinct morphologies were highlighted in the form of twisted and planar (untwisted) ribbons with varied diameters, thicknesses, and lengths. In particular, transformation of amyloid fibrils from twisted ribbons into untwisted structures was triggered by substitution of the C-terminal serine with threonine, where the side chain methyl group was responsible for the distinct morphological change. This effect was confirmed following serine substitution with alanine and valine and was ascribed to the restriction of intersheet torsional strain through the increased hydrophobic interactions and hydrogen bonding. We also studied the variation of fibril morphology (i.e., association and helicity) and peptide aggregation propensity by increasing the hydrophobicity of the peptide side group, capping the N-terminus, and extending sequence length. Lastly, we anticipate that our insights into sequence-dependent fibrillation and morphological diversity will shed light on the structural interpretation of amyloidogenesis and development of structure-specific imaging agents and aggregation inhibitors.
SvABA: genome-wide detection of structural variants and indels by local assembly.
Wala, Jeremiah A; Bandopadhayay, Pratiti; Greenwald, Noah F; O'Rourke, Ryan; Sharpe, Ted; Stewart, Chip; Schumacher, Steve; Li, Yilong; Weischenfeldt, Joachim; Yao, Xiaotong; Nusbaum, Chad; Campbell, Peter; Getz, Gad; Meyerson, Matthew; Zhang, Cheng-Zhong; Imielinski, Marcin; Beroukhim, Rameen
2018-04-01
Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a powerful approach to identifying SVs, but is difficult to apply at scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA's performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (<1000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types and found that short templated-sequence insertions occur in ∼4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized (50-300 bp) SVs. © 2018 Wala et al.; Published by Cold Spring Harbor Laboratory Press.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ramalho, T.O.; Figueira, A.R.; Sotero, A.J.
2014-09-15
The emergence of viruses in coffee (Coffea arabica and Coffea canephora), the most widely traded agricultural commodity in the world, is of critical concern. The RNA1 (6552 nt) of Coffee ringspot virus is organized into five open reading frames (ORFs) capable of encoding the viral nucleocapsid (ORF1p), phosphoprotein (ORF2p), putative cell-to-cell movement protein (ORF3p), matrix protein (ORF4p) and glycoprotein (ORF5p). Each ORF is separated by a conserved intergenic junction. RNA2 (5945 nt), which completes the bipartite genome, encodes a single protein (ORF6p) with homology to RNA-dependent RNA polymerases. Phylogenetic analysis of L protein sequences firmly establishes CoRSV as a member of the recently proposed Dichorhavirus genus. Predictive algorithms, in planta protein expression, and a yeast-based nuclear import assay were used to determine the nucleophillic character of five CoRSV proteins. Finally, the temperature-dependent ability of CoRSV to establish systemic infections in an initially local lesion host was quantified. - Highlights: • We report genome sequence determination for Coffee ringspot virus (CoRSV). • CoRSV should be considered a member of the proposed Dichorhavirus genus. • We report temperature-dependent systemic infection of an initially local lesion host. • We report in planta protein and localization data for five CoRSV proteins. • In silico predictions of the CoRSV proteins were validated using in vivo assays.
Pastor, N; Pardo, L; Weinstein, H
1997-01-01
The binding of the TATA box-binding protein (TBP) to a TATA sequence in DNA is essential for eukaryotic basal transcription. TBP binds in the minor groove of DNA, causing a large distortion of the DNA helix. Given the apparent stereochemical equivalence of AT and TA basepairs in the minor groove, DNA deformability must play a significant role in binding site selection, because not all AT-rich sequences are bound effectively by TBP. To gain insight into the precise role that the properties of the TATA sequence have in determining the specificity of the DNA substrates of TBP, the solution structure and dynamics of seven DNA dodecamers have been studied by using molecular dynamics simulations. The analysis of the structural properties of basepair steps in these TATA sequences suggests a reason for the preference for alternating pyrimidine-purine (YR) sequences, but indicates that these properties cannot be the sole determinant of the sequence specificity of TBP. Rather, recognition depends on the interplay between the inherent deformability of the DNA and steric complementarity at the molecular interface. PMID:9251783
NASA Astrophysics Data System (ADS)
Ding, Jun
Metallic glasses (MGs), discovered five decades ago as a newcomer in the family of glasses, are of current interest because of their unique structures and properties. Many fundamental materials science issues remain unresolved for metallic glasses, as well as for their predecessors above the glass transition temperature, the supercooled liquids. In particular, it is a major challenge to characterize the local structure and unveil the structure-property relationship for these amorphous materials. This thesis presents a systematic study of the local structure of metallic glasses and supercooled liquids via classical and ab initio molecular dynamics simulations. Three typical MG models are chosen as representative candidates: the Cu64Zr36, Pd82Si18 and Mg65Cu25Y10 systems. Full icosahedral short-range order dominates in the first, whereas prism-type short-range order dominates in the latter two. We then unravel the structural signatures underlying several properties of metallic glasses. First, the temperature dependence of specific heat and the liquid fragility of Cu-Zr and Mg-Cu-Y (also Pd-Si) supercooled liquids are quite distinct: gradual versus fast evolution of specific heat and viscosity/relaxation time with undercooling. Their local structural ordering is found to relate to the temperature dependence of specific heat and relaxation time. Second, elastic heterogeneity is studied and correlated with local structure in Cu-Zr MGs. Specifically, this part covers how the degree of elastic deformation correlates with the internal structure at the atomic level, how to quantitatively evaluate the local solidity/liquidity in MGs, and how the network of interpenetrating connections of icosahedra determines the corresponding shear modulus. Finally, we illustrate the structural signature of quasi-localized low-frequency vibrational normal modes, which underlie the intriguing vibrational properties of MGs. Specifically, the local atomic packing structure in a model MG strongly correlates with the corresponding participation fraction in quasi-localized soft modes, with the highest and lowest participation corresponding to geometrically unfavored motifs and icosahedral short-range order (ISRO), respectively. In addition, we clearly demonstrate that quasi-localized low-frequency vibrational modes correlate strongly with fertile sites for shear transformations in an MG.
CircularLogo: A lightweight web application to visualize intra-motif dependencies.
Ye, Zhenqing; Ma, Tao; Kalmbach, Michael T; Dasari, Surendra; Kocher, Jean-Pierre A; Wang, Liguo
2017-05-22
The sequence logo has been widely used to represent DNA or RNA motifs for more than three decades. Despite its intelligibility and intuitiveness, the traditional sequence logo cannot display intra-motif dependencies and is therefore insufficient to fully characterize nucleotide motifs. Many methods have been developed to quantify intra-motif dependencies, but fewer tools are available for visualizing them. We developed CircularLogo, a web-based interactive application that visualizes not only the position-specific nucleotide consensus and diversity but also the intra-motif dependencies. Applying CircularLogo to HNF6 binding sites and tRNA sequences demonstrated its ability to show intra-motif dependencies and intuitively reveal biomolecular structure. CircularLogo is implemented in JavaScript and Python based on the Django web framework. The program's source code and user's manual are freely available at http://circularlogo.sourceforge.net. The CircularLogo web server can be accessed at http://bioinformaticstools.mayo.edu/circularlogo/index.html. CircularLogo is an innovative web application specifically designed to visualize and interactively explore intra-motif dependencies.
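One common way to quantify a dependency between two motif positions is the mutual information between the corresponding alignment columns. The following is a minimal sketch of that idea, not code from CircularLogo; the function name `column_mutual_information` and the toy binding sites are assumptions made for illustration, and the abstract does not state which dependency measure the tool uses.

```python
from collections import Counter
from math import log2


def column_mutual_information(seqs, i, j):
    """Mutual information (in bits) between columns i and j.

    `seqs` is a list of equal-length aligned sequences (hypothetical input).
    """
    n = len(seqs)
    count_i = Counter(s[i] for s in seqs)            # marginal counts, column i
    count_j = Counter(s[j] for s in seqs)            # marginal counts, column j
    count_ij = Counter((s[i], s[j]) for s in seqs)   # joint counts
    mi = 0.0
    for (a, b), c in count_ij.items():
        p_ab = c / n
        p_a = count_i[a] / n
        p_b = count_j[b] / n
        mi += p_ab * log2(p_ab / (p_a * p_b))
    return mi


# Toy aligned sites: columns 0 and 3 always co-vary, columns 1 and 2 never vary.
sites = ["ACGT", "TCGA", "ACGT", "TCGA"]
covarying = column_mutual_information(sites, 0, 3)
invariant = column_mutual_information(sites, 1, 2)
```

In this toy input the first and last columns co-vary perfectly, so they carry one bit of mutual information, while the two invariant middle columns carry none.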
Experimental Observation of Dynamical Localization in Laser-Kicked Molecular Rotors.
Bitter, M; Milner, V
2016-09-30
The periodically kicked rotor is a paradigm system for studying quantum effects on classically chaotic dynamics. The wave function of the quantum rotor localizes in angular momentum space, similarly to Anderson localization of the electronic wave function in disordered solids. Here, we observe dynamical localization in a system of true quantum rotors by subjecting nitrogen molecules to periodic sequences of femtosecond pulses. The exponential distribution of the molecular angular momentum, the hallmark of dynamical localization, is measured directly by means of coherent Raman scattering. We demonstrate the suppressed growth of rotational energy with the number of laser kicks and study the dependence of the localization length on the kick strength. Because the localization is quantum coherent in nature, both timing and amplitude noise are shown to destroy it and revive the diffusive growth of energy.
ERIC Educational Resources Information Center
Dawson, Colin; Gerken, LouAnn
2011-01-01
While many constraints on learning must be relatively experience-independent, past experience provides a rich source of guidance for subsequent learning. Discovering structure in some domain can inform a learner's future hypotheses about that domain. If a general property accounts for particular sub-patterns, a rational learner should not…
Beamer, B A; Negri, C; Yen, C J; Gavrilova, O; Rumberger, J M; Durcan, M J; Yarnall, D P; Hawkins, A L; Griffin, C A; Burns, D K; Roth, J; Reitman, M; Shuldiner, A R
1997-04-28
We determined the chromosomal localization and partial genomic structure of the coding region of the human PPAR gamma gene (hPPAR gamma), a nuclear receptor important for adipocyte differentiation and function. Sequence analysis and long PCR of human genomic DNA with primers that span putative introns revealed that intron positions and sizes of hPPAR gamma are similar to those previously determined for the mouse PPAR gamma gene[13]. Fluorescent in situ hybridization localized hPPAR gamma to chromosome 3, band 3p25. Radiation hybrid mapping with two independent primer pairs was consistent with hPPAR gamma being within 1.5 Mb of marker D3S1263 on 3p25-p24.2. These sequences of the intron/exon junctions of the 6 coding exons shared by hPPAR gamma 1 and hPPAR gamma 2 will facilitate screening for possible mutations. Furthermore, D3S1263 is a suitable polymorphic marker for linkage analysis to evaluate PPAR gamma's potential contribution to genetic susceptibility to obesity, lipoatrophy, insulin resistance, and diabetes.
Hoffman, Brett; Li, Zhubing; Liu, Qiang
2015-08-01
Hepatitis C virus (HCV) non-structural protein 5A (NS5A) is essential for viral replication; however, its effect on HCV RNA translation remains controversial partially due to the use of reporters lacking the 3' UTR, where NS5A binds to the poly(U/UC) sequence. We investigated the role of NS5A in HCV translation using a monocistronic RNA containing a Renilla luciferase gene flanked by the HCV UTRs. We found that NS5A downregulated viral RNA translation in a dose-dependent manner. This downregulation required both the 5' and 3' UTRs of HCV because substitution of either sequence with the 5' and 3' UTRs of enterovirus 71 or a cap structure at the 5' end eliminated the effects of NS5A on translation. Translation of the HCV genomic RNA was also downregulated by NS5A. The inhibition of HCV translation by NS5A required the poly(U/UC) sequence in the 3' UTR as NS5A did not affect translation when it was deleted. In addition, we showed that, whilst the amphipathic α-helix of NS5A has no effect on viral translation, the three domains of NS5A can inhibit translation independently, also dependent on the presence of the poly(U/UC) sequence in the 3' UTR. These results suggested that NS5A downregulated HCV RNA translation through a mechanism involving the poly(U/UC) sequence in the 3' UTR.
FRET Imaging of Diatoms Expressing a Biosilica-Localized Ribose Sensor
Marshall, Kathryn E.; Robinson, Errol W.; Hengel, Shawna M.; Paša-Tolić, Ljiljana; Roesijadi, Guritno
2012-01-01
Future materials are envisioned to include bio-assembled, hybrid, three-dimensional nanosystems that incorporate functional proteins. Diatoms are amenable to genetic modification for localization of recombinant proteins in the biosilica cell wall. However, the full range of protein functionalities that can be accommodated by the modified porous biosilica has yet to be described. Our objective was to functionalize diatom biosilica with a reagent-less sensor dependent on ligand-binding and conformational change to drive FRET-based signaling capabilities. A fusion protein designed to confer such properties included a bacterial periplasmic ribose binding protein (R) flanked by CyPet (C) and YPet (Y), cyan and yellow fluorescent proteins that act as a FRET pair. The structure and function of the CRY recombinant chimeric protein was confirmed by expression in E. coli prior to transformation of the diatom Thalassiosira pseudonana. Mass spectrometry of the recombinant CRY showed 97% identity with the deduced amino acid sequence. CRY with and without an N-terminal Sil3 tag for biosilica localization exhibited characteristic ribose-dependent changes in FRET, with similar dissociation constants of 123.3 µM and 142.8 µM, respectively. The addition of the Sil3 tag did not alter the affinity of CRY for the ribose substrate. Subsequent transformation of T. pseudonana with a vector encoding Sil3-CRY resulted in fluorescence localization in the biosilica and changes in FRET in both living cells and isolated frustules in response to ribose. This work demonstrated that the nano-architecture of the genetically modified biosilica cell wall was able to support the functionality of the relatively complex Sil3-CyPet-RBP-YPet fusion protein with its requirement for ligand-binding and conformational change for FRET-signal generation. PMID:22470473
Xu, Tingting; Zhou, Cong-Zhao; Xiao, Jianxi; Liu, Jinsong
2018-02-20
Naturally occurring interruptions in nonfibrillar collagen play key roles in molecular flexibility, collagen degradation, and ligand binding. The structural features of the interruption sequences and the molecular basis for their functions have not been well studied. Here, we focused on a G5G type natural interruption sequence, G-POALO-G, from human type XIX collagen, a homotrimeric collagen, as this sequence possesses distinct properties compared with those of a similar pathological Gly mutation sequence in collagen mimic peptides. We determined the crystal structures of the host-guest peptide (GPO)3-GPOALO-(GPO)4 to 1.03 Å resolution in two crystal forms. In these structures, the interruption zone brings localized disruptions to the triple helix and introduces a slight 6-8° bend with the same directional preference to the whole molecule, which may correspond structurally to the first physiological kink site in type XIX collagen. Furthermore, at the G5G interruption site, the presence of Ala and Leu residues, both with free N-H groups, allows the formation of more direct and water-mediated interchain hydrogen bonds than in the related Gly → Ala structure. These could partly explain the difference in thermal stability between the different interruptions. In addition, our structures provide a detailed view of the dynamic properties of such an interrupted zone with respect to hydrogen bonding topology, torsion angles, and helical parameters. Our results also identified, for the first time, the binding of zinc to the end of the triple helix. These findings will shed light on how the interruption sequence influences the conformation of the collagen molecule and provide a structural basis for further functional studies.
Genomic Organization of the Drosophila Telomere Retrotransposable Elements
DOE Office of Scientific and Technical Information (OSTI.GOV)
George, J.A.; DeBaryshe, P.G.; Traverse, K.L.
2006-10-16
The emerging sequence of the heterochromatic portion of the Drosophila melanogaster genome, with the most recent update of euchromatic sequence, gives the first genome-wide view of the chromosomal distribution of the telomeric retrotransposons, HeT-A, TART, and Tahre. As expected, these elements are entirely excluded from euchromatin, although sequence fragments of HeT-A and TART 3' untranslated regions are found in nontelomeric heterochromatin on the Y chromosome. The proximal ends of HeT-A/TART arrays appear to be a transition zone because only here do other transposable elements mix in the array. The sharp distinction between the distribution of telomeric elements and that of other transposable elements suggests that chromatin structure is important in telomere element localization. Measurements reported here show (1) D. melanogaster telomeres are very long, in the size range reported for inbred mouse strains (averaging 46 kb per chromosome end in Drosophila stock 2057). As in organisms with telomerase, their length varies depending on genotype. There is also slight under-replication in polytene nuclei. (2) Surprisingly, the relationship between the number of HeT-A and TART elements is not stochastic but is strongly correlated across stocks, supporting the idea that the two elements are interdependent. Although currently assembled portions of the HeT-A/TART arrays are from the most-proximal part of long arrays, ~61% of the total HeT-A sequence in these regions consists of intact, potentially active elements with little evidence of sequence decay, making it likely that the content of the telomere arrays turns over more extensively than has been thought.
On the Impact of Widening Vector Registers on Sequence Alignment
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daily, Jeffrey A.; Kalyanaraman, Anantharaman; Krishnamoorthy, Sriram
2016-09-22
Vector extensions, such as SSE, have been part of the x86 architecture since the 1990s, with applications in graphics, signal processing, and scientific computing. Although many algorithms and applications can naturally benefit from automatic vectorization techniques, there are still many that are difficult to vectorize due to their dependence on irregular data structures, dense branch operations, or data dependencies. Sequence alignment, one of the most widely used operations in bioinformatics workflows, has a computational footprint that features complex data dependencies. In this paper, we demonstrate that the trend of widening vector registers adversely affects the state-of-the-art sequence alignment algorithm based on striped data layouts. We present a practically efficient SIMD implementation of a parallel scan based sequence alignment algorithm that can better exploit wider SIMD units. We conduct comprehensive workload and use case analyses to characterize the relative behavior of the striped and scan approaches and identify the best choice of algorithm based on input length and SIMD width.
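The data dependencies mentioned in this abstract are visible in a plain scalar version of local alignment: each dynamic-programming cell needs its upper, left, and diagonal neighbors. The following is a minimal Smith-Waterman scoring sketch with a linear gap penalty. It is neither the striped nor the scan-based SIMD algorithm from the paper, and the function name and the scoring parameters (`match`, `mismatch`, `gap`) are assumptions chosen for illustration.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of strings a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)   # previous DP row
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Each cell depends on three neighbors; the dependence on
            # curr[j - 1] (same row) is what makes naive vectorization hard.
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap
            left = curr[j - 1] + gap
            curr[j] = max(0, diag, up, left)
            best = max(best, curr[j])
        prev = curr
    return best


identical = smith_waterman_score("ACGT", "ACGT")
shared_core = smith_waterman_score("TTACGTTT", "GGACGGG")
```

With these assumed parameters, two identical 4-mers score 8, and the second pair scores 6 from the shared `ACG` core.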
Crystal structure of Bacillus subtilis YdaF protein: a putative ribosomal N-acetyltransferase.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brunzelle, J. S.; Wu, R.; Korolev, S. V.
2004-12-01
Comparative sequence analysis suggests that the ydaF gene encodes a protein (YdaF) that functions as an N-acetyltransferase, more specifically, a ribosomal N-acetyltransferase. Sequence analysis using the basic local alignment search tool (BLAST) suggests that YdaF belongs to a large family of proteins (199 proteins found in 88 unique species of bacteria, archaea, and eukaryotes). YdaF also belongs to COG1670, which includes the Escherichia coli RimL protein that is known to acetylate ribosomal protein L12. N-acetylation (NAT) has been found in all kingdoms. NAT enzymes catalyze the transfer of an acetyl group from acetyl-CoA (AcCoA) to a primary amino group. For example, NATs can acetylate the N-terminal α-amino group, the ε-amino group of lysine residues, aminoglycoside antibiotics, spermine/spermidine, or arylalkylamines such as serotonin. The crystal structure of the alleged ribosomal NAT protein, YdaF, from Bacillus subtilis presented here was determined as part of the Midwest Center for Structural Genomics. The structure maintains the conserved tertiary structure of other known NATs and a high sequence similarity in the presumed AcCoA binding pocket in spite of a very low overall level of sequence identity to other NATs of known structure.
NASA Astrophysics Data System (ADS)
Gallet, F.; Bolmont, E.; Mathis, S.; Charbonnel, C.; Amard, L.
2017-08-01
Context. Star-planet interactions must be taken into account in stellar models to understand the dynamical evolution of close-in planets. The dependence of the tidal interactions on the structural and rotational evolution of the star is of particular importance and should be correctly treated. Aims: We quantify how tidal dissipation in the convective envelope of rotating low-mass stars evolves from the pre-main sequence up to the red-giant branch depending on the initial stellar mass. We investigate the consequences of this evolution on planetary orbital evolution. Methods: We couple the tidal dissipation formalism previously described to the stellar evolution code STAREVOL and apply this coupling to rotating stars with masses between 0.3 and 1.4 M⊙. As a first step, this formalism assumes a simplified bi-layer stellar structure with corresponding averaged densities for the radiative core and the convective envelope. We use a frequency-averaged treatment of the dissipation of tidal inertial waves in the convection zone (but neglect the dissipation of tidal gravity waves in the radiation zone). In addition, we generalize a recent work by following the orbital evolution of close-in planets using the new tidal dissipation predictions for advanced phases of stellar evolution. Results: On the pre-main sequence the evolution of tidal dissipation is controlled by the evolution of the internal structure of the contracting star. On the main sequence it is strongly driven by the variation of surface rotation that is impacted by magnetized stellar winds braking. The main effect of taking into account the rotational evolution of the stars is to lower the tidal dissipation strength by about four orders of magnitude on the main sequence, compared to a normalized dissipation rate that only takes into account structural changes. Conclusions: The evolution of the dissipation strongly depends on the evolution of the internal structure and rotation of the star. 
From the pre-main sequence up to the tip of the red-giant branch, it varies by several orders of magnitude, with strong consequences for the orbital evolution of close-in massive planets. These effects are the strongest during the pre-main sequence, implying that the planets are mainly sensitive to the star's early history.
NASA Technical Reports Server (NTRS)
Dominiak, P.; Ciszak, Ewa
2004-01-01
Thiamin pyrophosphate (TPP)-dependent enzymes are a divergent family of TPP and metal ion binding proteins that perform a wide range of functions with the common decarboxylation steps of a -(O=)C-C(OH)- fragment of alpha-ketoacids and alpha-hydroxyaldehydes. To determine how structure and catalytic action are conserved in the context of the large sequence differences existing within this family of enzymes, we have carried out an analysis of TPP-dependent enzymes of known structure. The common structure of TPP-dependent enzymes is formed at the interface of four alpha/beta domains from at least two subunits, which provide for two metal and TPP-binding sites. Residues around these catalytic sites are conserved for functional purposes, while those further away from TPP are conserved for structural reasons. Together they provide a network of contacts required for flip-flop catalytic action within TPP-dependent enzymes. Thus, our analysis defines a TPP-action motif that is proposed for annotating TPP-dependent enzymes to advance functional proteomics.
Local atomic and magnetic structure of dilute magnetic semiconductor (Ba,K)(Zn,Mn)2As2
NASA Astrophysics Data System (ADS)
Frandsen, Benjamin A.; Gong, Zizhou; Terban, Maxwell W.; Banerjee, Soham; Chen, Bijuan; Jin, Changqing; Feygenson, Mikhail; Uemura, Yasutomo J.; Billinge, Simon J. L.
2016-09-01
We have studied the atomic and magnetic structure of the dilute ferromagnetic semiconductor system (Ba,K)(Zn,Mn)2As2 through atomic and magnetic pair distribution function analysis of temperature-dependent x-ray and neutron total scattering data. We detected a change in curvature of the temperature-dependent unit cell volume of the average tetragonal crystallographic structure at a temperature coinciding with the onset of ferromagnetic order. We also observed the existence of a well-defined local orthorhombic structure on a short length scale of ≲5 Å, resulting in a rather asymmetrical local environment of the Mn and As ions. Finally, the magnetic PDF revealed ferromagnetic alignment of Mn spins along the crystallographic c axis, with robust nearest-neighbor ferromagnetic correlations that exist even above the ferromagnetic ordering temperature. We discuss these results in the context of other experiments and theoretical studies on this system.
Comparative modeling without implicit sequence alignments.
Kolinski, Andrzej; Gront, Dominik
2007-10-01
The number of known protein sequences is about a thousand times larger than the number of experimentally solved 3D structures. For more than half of the protein sequences a close or distant structural analog could be identified. The key starting point in classical comparative modeling is to generate the best possible sequence alignment with a template or templates. With decreasing sequence similarity, the number of errors in the alignments increases, and these errors are the main cause of the decreasing accuracy of the molecular models generated. Here we propose a new approach to comparative modeling that does not require the implicit alignment: the model building phase explores geometric, evolutionary and physical properties of a template (or templates). The proposed method requires prior identification of a template, although the initial sequence alignment is ignored. The model is built using CABS, a very efficient reduced-representation search engine, to find the best possible superposition of the query protein onto the template represented as a 3D multi-featured scaffold. The criteria used include sequence similarity, predicted secondary structure consistency, local geometric features and hydrophobicity profile. For more difficult cases, the new method qualitatively outperforms existing schemes of comparative modeling. The algorithm unifies de novo modeling, 3D threading and sequence-based methods. The main idea is general and could be easily combined with other efficient modeling tools such as Rosetta, UNRES and others.
Multi-level machine learning prediction of protein-protein interactions in Saccharomyces cerevisiae.
Zubek, Julian; Tatjewski, Marcin; Boniecki, Adam; Mnich, Maciej; Basu, Subhadip; Plewczynski, Dariusz
2015-01-01
Accurate identification of protein-protein interactions (PPI) is the key step in understanding proteins' biological functions, which are typically context-dependent. Many existing PPI predictors rely on aggregated features from protein sequences; however, only a few methods exploit local information about specific residue contacts. In this work we present a two-stage machine learning approach for prediction of protein-protein interactions. We start with carefully filtered data on protein complexes available for Saccharomyces cerevisiae in the Protein Data Bank (PDB). First, we build linear descriptions of interacting and non-interacting sequence segment pairs based on their inter-residue distances. Second, we train machine learning classifiers to predict binary segment interactions for any two short sequence fragments. The final prediction of the protein-protein interaction is made using the 2D matrix representation of all-against-all possible interacting sequence segments of both analysed proteins. The level-I predictor achieves 0.88 AUC for micro-scale, i.e., residue-level prediction. The level-II predictor improves the results further through a more complex learning paradigm. We perform a 30-fold macro-scale, i.e., protein-level, cross-validation experiment. The level-II predictor using PSIPRED-predicted secondary structure reaches 0.70 precision, 0.68 recall, and 0.70 AUC, whereas other popular methods provide results below the 0.6 threshold (recall, precision, AUC). Our results demonstrate that a multi-scale sequence feature aggregation procedure is able to improve the machine learning results by more than 10% as compared to other sequence representations. Prepared datasets and source code for our experimental pipeline are freely available for download from: http://zubekj.github.io/mlppi/ (open source Python implementation, OS independent).
NMRDSP: an accurate prediction of protein shape strings from NMR chemical shifts and sequence data.
Mao, Wusong; Cong, Peisheng; Wang, Zhiheng; Lu, Longjian; Zhu, Zhongliang; Li, Tonghua
2013-01-01
A shape string is a structural sequence and is an extremely important representation of protein backbone conformations. Nuclear magnetic resonance chemical shifts correlate strongly with the local protein structure, and are exploited to predict protein structures in conjunction with computational approaches. Here we demonstrate a novel approach, NMRDSP, which can accurately predict the protein shape string based on nuclear magnetic resonance chemical shifts and structural profiles obtained from sequence data. NMRDSP uses six chemical shifts (HA, H, N, CA, CB and C) and eight elements of structure profiles as features, a non-redundant set (1,003 entries) as the training set, and a conditional random field as the classification algorithm. For an independent testing set (203 entries), we achieved an accuracy of 75.8% for S8 (the eight-state accuracy) and 87.8% for S3 (the three-state accuracy). This is higher than when using only chemical shifts or only sequence data, and confirms that the chemical shift and the structure profile are significant features for shape string prediction and that their combination markedly improves the accuracy of the predictor. We have constructed the NMRDSP web server and believe it could be employed to provide a solid platform to predict other protein structures and functions. The NMRDSP web server is freely available at http://cal.tongji.edu.cn/NMRDSP/index.jsp.
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.
Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832
PredictProtein—an open resource for online prediction of protein structural and functional features
Yachdav, Guy; Kloppmann, Edda; Kajan, Laszlo; Hecht, Maximilian; Goldberg, Tatyana; Hamp, Tobias; Hönigschmid, Peter; Schafferhans, Andrea; Roos, Manfred; Bernhofer, Michael; Richter, Lothar; Ashkenazy, Haim; Punta, Marco; Schlessinger, Avner; Bromberg, Yana; Schneider, Reinhard; Vriend, Gerrit; Sander, Chris; Ben-Tal, Nir; Rost, Burkhard
2014-01-01
PredictProtein is a meta-service for sequence analysis that has been predicting structural and functional features of proteins since 1992. Queried with a protein sequence it returns: multiple sequence alignments, predicted aspects of structure (secondary structure, solvent accessibility, transmembrane helices (TMSEG) and strands, coiled-coil regions, disulfide bonds and disordered regions) and function. The service incorporates analysis methods for the identification of functional regions (ConSurf), homology-based inference of Gene Ontology terms (metastudent), comprehensive subcellular localization prediction (LocTree3), protein–protein binding sites (ISIS2), protein–polynucleotide binding sites (SomeNA) and predictions of the effect of point mutations (non-synonymous SNPs) on protein function (SNAP2). Our goal has always been to develop a system optimized to meet the demands of experimentalists not highly experienced in bioinformatics. To this end, the PredictProtein results are presented as both text and a series of intuitive, interactive and visually appealing figures. The web server and sources are available at http://ppopen.rostlab.org. PMID:24799431
Mosaic organization of DNA nucleotides
NASA Technical Reports Server (NTRS)
Peng, C. K.; Buldyrev, S. V.; Havlin, S.; Simons, M.; Stanley, H. E.; Goldberger, A. L.
1994-01-01
Long-range power-law correlations have been reported recently for DNA sequences containing noncoding regions. We address the question of whether such correlations may be a trivial consequence of the known mosaic structure ("patchiness") of DNA. We analyze two classes of controls consisting of patchy nucleotide sequences generated by different algorithms--one without and one with long-range power-law correlations. Although both types of sequences are highly heterogeneous, they are quantitatively distinguishable by an alternative fluctuation analysis method that differentiates local patchiness from long-range correlations. Application of this analysis to selected DNA sequences demonstrates that patchiness is not sufficient to account for long-range correlation properties.
Lyapunov exponents for one-dimensional aperiodic photonic bandgap structures
NASA Astrophysics Data System (ADS)
Kissel, Glen J.
2011-10-01
Existing in the "gray area" between perfectly periodic and purely randomized photonic bandgap structures are the so-called aperiodic structures whose layers are chosen according to some deterministic rule. We consider here a one-dimensional photonic bandgap structure, a quarter-wave stack, with the layer thickness of one of the bilayers subject to being either thin or thick according to five deterministic sequence rules and binary random selection. To produce these aperiodic structures we examine the following sequences: Fibonacci, Thue-Morse, Period doubling, Rudin-Shapiro, as well as the triadic Cantor sequence. We model these structures numerically with a long chain (approximately 5,000,000) of transfer matrices, and then use the reliable algorithm of Wolf to calculate the (upper) Lyapunov exponent for the long product of matrices. The Lyapunov exponent is the statistically well-behaved variable used to characterize the Anderson localization effect (exponential confinement) when the layers are randomized, so its calculation allows us to more precisely compare the purely randomized structure with its aperiodic counterparts. It is found that the aperiodic photonic systems show much fine structure in their Lyapunov exponents as a function of frequency, and, in a number of cases, the exponents are quite obviously fractal.
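As a rough illustration of how three of the deterministic layer sequences named in this abstract can be generated, the sketch below applies the standard two-letter substitution rules for the Fibonacci, Thue-Morse and period-doubling sequences. The function name, the dictionary, and the reading of "A" as a thin layer and "B" as a thick layer are assumptions made for this example and are not taken from the paper; the Rudin-Shapiro and Cantor sequences, the transfer matrices and the Lyapunov exponent calculation are not covered.

```python
# Standard two-letter substitution rules. Treating "A" as a thin layer and
# "B" as a thick layer is an assumption made only for this illustration.
RULES = {
    "fibonacci": {"A": "AB", "B": "A"},
    "thue_morse": {"A": "AB", "B": "BA"},
    "period_doubling": {"A": "AB", "B": "AA"},
}


def aperiodic_sequence(rule, iterations, seed="A"):
    """Apply the named substitution rule `iterations` times, starting from `seed`."""
    mapping = RULES[rule]
    word = seed
    for _ in range(iterations):
        # Replace every letter simultaneously according to the rule.
        word = "".join(mapping[letter] for letter in word)
    return word


if __name__ == "__main__":
    for name in RULES:
        print(name, aperiodic_sequence(name, 4))
```

Each letter of the resulting word would then select the thickness of one layer in the stack; how the abstract's authors map letters to layers is not stated in the source.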
A Particle Swarm Optimization-Based Approach with Local Search for Predicting Protein Folding.
Yang, Cheng-Hong; Lin, Yu-Shiun; Chuang, Li-Yeh; Chang, Hsueh-Wei
2017-10-01
The hydrophobic-polar (HP) model is commonly used for predicting protein folding structures and hydrophobic interactions. This study developed a particle swarm optimization (PSO)-based algorithm combined with local search algorithms; specifically, the high exploration PSO (HEPSO) algorithm (which can execute global search processes) was combined with three local search algorithms (hill-climbing algorithm, greedy algorithm, and Tabu table), yielding the proposed HE-L-PSO algorithm. By using 20 known protein structures, we evaluated the performance of the HE-L-PSO algorithm in predicting protein folding in the HP model. The proposed HE-L-PSO algorithm exhibited favorable performance in predicting both short and long amino acid sequences with high reproducibility and stability, compared with seven reported algorithms. The HE-L-PSO algorithm yielded optimal solutions for all predicted protein folding structures. All HE-L-PSO-predicted protein folding structures possessed a hydrophobic core that is similar to normal protein folding.
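For readers unfamiliar with the HP model, the following minimal sketch shows the energy function that search algorithms of this kind typically minimize on a 2D square lattice: each pair of hydrophobic (H) residues that are lattice neighbours but not chain neighbours contributes -1. The function name and the choice of a 2D square lattice are assumptions for illustration; the abstract does not state which lattice the HE-L-PSO algorithm uses, and nothing here reproduces that algorithm.

```python
def hp_energy(sequence, coords):
    """Energy of an HP chain on a 2D square lattice.

    sequence: string of "H" and "P" residues.
    coords:   list of (x, y) lattice sites, one per residue, in chain order.
    Returns -1 for every pair of H residues on adjacent sites that are not
    consecutive along the chain.
    """
    if len(sequence) != len(coords):
        raise ValueError("sequence and coords must have the same length")
    if len(set(coords)) != len(coords):
        raise ValueError("conformation is not self-avoiding")

    index_at = {site: i for i, site in enumerate(coords)}
    energy = 0
    for i, (x, y) in enumerate(coords):
        if sequence[i] != "H":
            continue
        # Look only right and up so that each contact is counted once.
        for neighbour in ((x + 1, y), (x, y + 1)):
            j = index_at.get(neighbour)
            if j is not None and sequence[j] == "H" and abs(i - j) > 1:
                energy -= 1
    return energy


if __name__ == "__main__":
    # A four-residue chain folded into a unit square: the two H ends touch.
    print(hp_energy("HPPH", [(0, 0), (1, 0), (1, 1), (0, 1)]))
```

A lower (more negative) energy corresponds to a more compact hydrophobic core, which is the property the abstract reports for the predicted structures.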
Rigoutsos, Isidore; Riek, Peter; Graham, Robert M.; Novotny, Jiri
2003-01-01
One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular α-helical character (i.e. π-helices, 3₁₀-helices and kinks). A ‘search engine’ derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above ‘non-canonical’ helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from α-helicity are encoded locally in sequence patterns only about 7–9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure–function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html. PMID:12888523
DOE Office of Scientific and Technical Information (OSTI.GOV)
Carbonell, Alberto; Martinez de Alba, Angel-Emilio; Flores, Ricardo
2008-02-05
Infection by viroids, non-protein-coding circular RNAs, occurs with the accumulation of 21-24 nt viroid-derived small RNAs (vd-sRNAs) with characteristic properties of small interfering RNAs (siRNAs) associated with RNA silencing. The vd-sRNAs most likely derive from dicer-like (DCL) enzymes acting on viroid-specific dsRNA, the key elicitor of RNA silencing, or on the highly structured genomic RNA. Previously, viral dsRNAs delivered mechanically or agroinoculated have been shown to interfere with virus infection in a sequence-specific manner. Here, we report similar results with members of the two families of nuclear- and chloroplast-replicating viroids. Moreover, homologous vd-sRNAs co-delivered mechanically also interfered with one of the viroids examined. The interference was sequence-specific, temperature-dependent and, in some cases, also dependent on the dose of the co-inoculated dsRNA or vd-sRNAs. The sequence-specific nature of these effects suggests the involvement of the RNA induced silencing complex (RISC), which provides sequence specificity to RNA silencing machinery. Therefore, viroid titer in natural infections might be regulated by the concerted action of DCL and RISC. Viroids could have evolved their secondary structure as a compromise between resistance to DCL and RISC, which act preferentially against RNAs with compact and relaxed secondary structures, respectively. In addition, compartmentation, association with proteins or active replication might also help viroids to elude their host RNA silencing machinery.
A Bayesian Framework for Human Body Pose Tracking from Depth Image Sequences
Zhu, Youding; Fujimura, Kikuo
2010-01-01
This paper addresses the problem of accurate and robust tracking of 3D human body pose from depth image sequences. Recovering the large number of degrees of freedom in human body movements from a depth image sequence is challenging due to the need to resolve the depth ambiguity caused by self-occlusions and the difficulty of recovering from tracking failure. Human body poses could be estimated through model fitting using dense correspondences between depth data and an articulated human model (local optimization method). Although it usually achieves a high accuracy due to dense correspondences, it may fail to recover from tracking failure. Alternatively, human pose may be reconstructed by detecting and tracking human body anatomical landmarks (key-points) based on low-level depth image analysis. While this method (key-point based method) is robust and recovers from tracking failure, its pose estimation accuracy depends solely on image-based localization accuracy of key-points. To address these limitations, we present a flexible Bayesian framework for integrating pose estimation results obtained by methods based on key-points and local optimization. Experimental results are shown and performance comparison is presented to demonstrate the effectiveness of the proposed approach. PMID:22399933
Entropic fluctuations in DNA sequences
NASA Astrophysics Data System (ADS)
Thanos, Dimitrios; Li, Wentian; Provata, Astero
2018-03-01
The Local Shannon Entropy (LSE) in blocks is used as a complexity measure to study the information fluctuations along DNA sequences. The LSE of a DNA block maps the local base arrangement information to a single numerical value. It is shown that despite this reduction of information, LSE allows the extraction of meaningful information related to the detection of repetitive sequences in whole chromosomes and is useful in finding evolutionary differences between organisms. More specifically, large regions of tandem repeats, such as centromeres, can be detected based on their low LSE fluctuations along the chromosome. Furthermore, an empirical investigation of the appropriate block sizes is provided and the relationship of LSE properties with the structure of the underlying repetitive units is revealed by using both computational and mathematical methods. Sequence similarity between the genomic DNA of closely related species also leads to similar LSE values at the orthologous regions. As an application, the LSE covariance function is used to measure the evolutionary distance between several primate genomes.
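A block-wise Shannon entropy of the kind this abstract describes can be sketched as follows. This is an assumption-laden illustration, not the authors' definition: it uses single-base frequencies within non-overlapping blocks and drops any incomplete trailing block, whereas the paper's LSE may be defined over longer words or overlapping windows. The function name is hypothetical.

```python
from collections import Counter
from math import log2


def block_entropies(sequence, block_size):
    """Shannon entropy (in bits) of the base composition of each block.

    The sequence is cut into consecutive non-overlapping blocks of
    `block_size` bases; an incomplete block at the end is ignored.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    entropies = []
    for start in range(0, len(sequence) - block_size + 1, block_size):
        block = sequence[start:start + block_size]
        counts = Counter(block)
        # H = sum_b p_b * log2(1 / p_b) over the bases present in the block.
        entropy = sum(
            (c / block_size) * log2(block_size / c) for c in counts.values()
        )
        entropies.append(entropy)
    return entropies


if __name__ == "__main__":
    # A homopolymer block has zero entropy; a block with all four bases
    # in equal proportion reaches the maximum of 2 bits.
    print(block_entropies("AAAAACGT", 4))
```

Under this simplified definition, a long run of a short tandem repeat gives nearly identical entropy values from block to block, which is consistent with the low LSE fluctuations the abstract associates with centromeric repeats.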
Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank
2013-02-01
Data mining in protein databases, derivatives from more fundamental protein 3D structure and sequence databases, has considerable unearthed potential for the discovery of sequence motif--structural motif--function relationships as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so-called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, an HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Percolation in random-Sierpiński carpets: A real space renormalization group approach
NASA Astrophysics Data System (ADS)
Perreau, Michel; Peiro, Joaquina; Berthier, Serge
1996-11-01
The site percolation transition in random Sierpiński carpets is investigated by real space renormalization. Unlike in regular translationally invariant lattices, the fixed point is not unique but depends on the number k of segmentation steps of the generation process of the fractal. It is shown that, for each scale invariance ratio n, the sequence of fixed points p_{n,k} is increasing with k, and converges when k → ∞ toward a limit p_n strictly less than 1. Moreover, in such scale invariant structures, the percolation threshold does not depend only on the scale invariance ratio n, but also on the scale. The sequence p_{n,k} and p_n are calculated for n = 4, 8, 16, 32, and 64, and for k = 1 to k = 11, and k = ∞. The corresponding thermal exponent sequence ν_{n,k} is calculated for n = 8 and 16, and for k = 1 to k = 5, and k = ∞. Suggestions are made for an experimental test in physical self-similar structures.
Ling, Roger; Firth, Andrew E
2017-08-01
Programmed -1 ribosomal frameshifting is a mechanism of gene expression whereby specific signals within messenger RNAs direct a proportion of ribosomes to shift -1 nt and continue translating in the new reading frame. Such frameshifting normally depends on an RNA structure stimulator 3'-adjacent to a 'slippery' heptanucleotide shift site sequence. Recently we identified an unusual frameshifting mechanism in encephalomyocarditis virus, where the stimulator involves a trans-acting virus protein. Thus, in contrast to other examples of -1 frameshifting, the efficiency of frameshifting in encephalomyocarditis virus is best studied in the context of virus infection. Here we use metabolic labelling to analyse the frameshifting efficiency of wild-type and mutant viruses. Confirming previous results, frameshifting depends on a G_GUU_UUU shift site sequence and a 3'-adjacent stem-loop structure, but is not appreciably affected by the 'StopGo' sequence present ~30 nt upstream. At late timepoints, frameshifting was estimated to be 46-76 % efficient.
NASA Astrophysics Data System (ADS)
Ginsburger, Kévin; Poupon, Fabrice; Beaujoin, Justine; Estournet, Delphine; Matuschke, Felix; Mangin, Jean-François; Axer, Markus; Poupon, Cyril
2018-02-01
White matter is composed of irregularly packed axons leading to a structural disorder in the extra-axonal space. Diffusion MRI experiments using oscillating gradient spin echo sequences have shown that the diffusivity transverse to axons in this extra-axonal space is dependent on the frequency of the employed sequence. In this study, we observe the same frequency-dependence using 3D simulations of the diffusion process in disordered media. We design a novel white matter numerical phantom generation algorithm which constructs biomimicking geometric configurations with few design parameters, and enables control of the level of disorder of the generated phantoms. The influence of various geometrical parameters present in white matter, such as global angular dispersion, tortuosity, presence of Ranvier nodes, and beading, on the extra-cellular perpendicular diffusivity frequency dependence was investigated by simulating the diffusion process in numerical phantoms of increasing complexity and fitting the resulting simulated diffusion MR signal attenuation with an adequate analytical model designed for trapezoidal OGSE sequences. This work suggests that angular dispersion and especially beading have non-negligible effects on these extracellular diffusion metrics, which may be measured using standard OGSE DW-MRI clinical protocols.
Cross cultural differences in unconscious knowledge.
Kiyokawa, Sachiko; Dienes, Zoltán; Tanaka, Daisuke; Yamada, Ayumi; Crowe, Louise
2012-07-01
Previous studies have indicated cross cultural differences in conscious processes, such that Asians have a global preference and Westerners a more analytical one. We investigated whether these biases also apply to unconscious knowledge. In Experiment 1, Japanese and UK participants memorized strings of large (global) letters made out of small (local) letters. The strings constituted one sequence of letters at a global level and a different sequence at a local level. Implicit learning occurred at the global and not the local level for the Japanese but equally at both levels for the English. In Experiment 2, the Japanese preference for global over local processing persisted even when structure existed only at the local but not global level. In Experiment 3, Japanese and UK participants were asked to attend to just one of the levels, global or local. Now the cultural groups performed similarly, indicating that the bias largely reflects preference rather than ability (although the data left room for residual ability differences). In Experiment 4, the greater global advantage of Japanese rather than English participants was confirmed for strings made of Japanese kana rather than Roman letters. That is, the cultural difference is not due to familiarity of the sequence elements. In sum, we show for the first time that cultural biases strongly affect the type of unconscious knowledge people acquire. Copyright © 2012 Elsevier B.V. All rights reserved.
Mapping Simple Repeated DNA Sequences in Heterochromatin of Drosophila Melanogaster
Lohe, A. R.; Hilliker, A. J.; Roberts, P. A.
1993-01-01
Heterochromatin in Drosophila has unusual genetic, cytological and molecular properties. Highly repeated DNA sequences (satellites) are the principal component of heterochromatin. Using probes from cloned satellites, we have constructed a chromosome map of 10 highly repeated, simple DNA sequences in heterochromatin of mitotic chromosomes of Drosophila melanogaster. Despite extensive sequence homology among some satellites, chromosomal locations could be distinguished by stringent in situ hybridizations for each satellite. Only two of the localizations previously determined using gradient-purified bulk satellite probes are correct. Eight new satellite localizations are presented, providing a megabase-level chromosome map of one-quarter of the genome. Five major satellites each exhibit a multichromosome distribution, and five minor satellites hybridize to single sites on the Y chromosome. Satellites closely related in sequence are often located near one another on the same chromosome. About 80% of Y chromosome DNA is composed of nine simple repeated sequences, in particular (AAGAC)(n) (8 Mb), (AAGAG)(n) (7 Mb) and (AATAT)(n) (6 Mb). Similarly, more than 70% of the DNA in chromosome 2 heterochromatin is composed of five simple repeated sequences. We have also generated a high resolution map of satellites in chromosome 2 heterochromatin, using a series of translocation chromosomes whose breakpoints in heterochromatin were ordered by N-banding. Finally, staining and banding patterns of heterochromatic regions are correlated with the locations of specific repeated DNA sequences. The basis for the cytochemical heterogeneity in banding appears to depend exclusively on the different satellite DNAs present in heterochromatin. PMID:8375654
Distribution and Features of the Six Classes of Peroxiredoxins
Poole, Leslie B.; Nelson, Kimberly J.
2016-01-01
Peroxiredoxins are cysteine-dependent peroxide reductases that group into 6 different, structurally discernable classes. In 2011, our research team reported the application of a bioinformatic approach called active site profiling to extract active site-proximal sequence segments from the 29 distinct, structurally-characterized peroxiredoxins available at the time. These extracted sequences were then used to create unique profiles for the six groups which were subsequently used to search GenBank(nr), allowing identification of ∼3500 peroxiredoxin sequences and their respective subgroups. Summarized in this minireview are the features and phylogenetic distributions of each of these peroxiredoxin subgroups; an example is also provided illustrating the use of the web accessible, searchable database known as PREX to identify subfamily-specific peroxiredoxin sequences for the organism Vitis vinifera (grape). PMID:26810075
NASA Astrophysics Data System (ADS)
Gref, Orman; Weizman, Moshe; Rhein, Holger; Gabriel, Onno; Gernert, Ulrich; Schlatmann, Rutger; Boit, Christian; Friedrich, Felice
2016-06-01
A conductive atomic force microscope is used to study the local topography and conductivity of laser-fired aluminum contacts on KOH-structured multicrystalline silicon surfaces. A significant increase in conductivity is observed in the laser-affected area. The area size and spatial uniformity of this enhanced conductivity depends on the laser energy fluence. The laser-affected area shows three ring-shaped regimes of different conductance depending on the local aluminum and oxygen concentration. Finally, it was found that the topographic surface structure determined by the silicon grain orientation does not significantly affect the laser-firing process.
Song, Jiangning; Yuan, Zheng; Tan, Hao; Huber, Thomas; Burrage, Kevin
2007-12-01
Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins that play critical roles in stabilizing the protein structures and are commonly found in extracytoplasmic or secreted proteins. In protein folding prediction, the localization of disulfide bonds can greatly reduce the search in conformational space. Therefore, there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that could have potentially important applications. We have developed a novel method to predict disulfide connectivity patterns from protein primary sequence, using a support vector regression (SVR) approach based on multiple sequence feature vectors and predicted secondary structure by the PSIPRED program. The results indicate that our method could achieve a prediction accuracy of 74.4% and 77.9%, respectively, when averaged on proteins with two to five disulfide bridges using 4-fold cross-validation, measured on the protein and cysteine pair on a well-defined non-homologous dataset. We assessed the effects of different sequence encoding schemes on the prediction performance of disulfide connectivity. It has been shown that the sequence encoding scheme based on multiple sequence feature vectors coupled with predicted secondary structure can significantly improve the prediction accuracy, thus enabling our method to outperform most other currently available predictors. Our work provides a complementary approach to the current algorithms that should be useful in computationally assigning disulfide connectivity patterns and helps in the annotation of protein sequences generated by large-scale whole-genome projects. The prediction web server and Supplementary Material are accessible at http://foo.maths.uq.edu.au/~huber/disulfide
Export requirements of pneumolysin in Streptococcus pneumoniae.
Price, Katherine E; Greene, Neil G; Camilli, Andrew
2012-07-01
Streptococcus pneumoniae is a major causative agent of otitis media, pneumonia, bacteremia, and meningitis. Pneumolysin (Ply), a member of the cholesterol-dependent cytolysins (CDCs), is produced by virtually all clinical isolates of S. pneumoniae, and ply mutant strains are severely attenuated in mouse models of colonization and infection. In contrast to all other known members of the CDC family, Ply lacks a signal peptide for export outside the cell. Instead, Ply has been hypothesized to be released upon autolysis or, alternatively, via a nonautolytic mechanism that remains undefined. We show that an exogenously added signal sequence is not sufficient for Sec-dependent Ply secretion in S. pneumoniae but is sufficient in the surrogate host Bacillus subtilis. Previously, we showed that Ply is localized primarily to the cell wall compartment in the absence of detectable cell lysis. Here we show that Ply released by autolysis cannot reassociate with intact cells, suggesting that there is a Ply export mechanism that is coupled to cell wall localization of the protein. This putative export mechanism is capable of secreting a related CDC without its signal sequence. We show that B. subtilis can export Ply, suggesting that the export pathway is conserved. Finally, through truncation and domain swapping analyses, we show that export is dependent on domain 2 of Ply. PMID:22563048
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sampaio, S.O.; Mei, C.; Butcher, E.C.
The mucosal addressin cell adhesion molecule-1 (MAdCAM-1) is expressed selectively at venular sites of lymphocyte extravasation into mucosal lymphoid tissues and lamina propria, where it directs local lymphocyte trafficking. MAdCAM-1 is a multifunctional type I transmembrane adhesion molecule comprising two distal Ig domains involved in α4β7 integrin binding, a mucin-like region able to display L-selectin-binding carbohydrates, and a membrane-proximal Ig domain homologous to IgA. We show in this work that the MAdCAM-1 gene is located on chromosome 10 and contains five exons. The signal peptide and each one of the three Ig domains are encoded by a distinct exon, whereas the transmembrane, cytoplasmic tail, and 3′-untranslated region of MAdCAM-1 are combined on a single exon. The mucin-like region and the third Ig domain are encoded together on exon 4. An alternatively spliced MAdCAM-1 mRNA is identified that lacks the mucin/IgA-homologous exon 4-encoded sequences. This short variant of MAdCAM-1 may be specialized to support α4β7-dependent adhesion strengthening, independent of carbohydrate-presenting function. Sequences 5′ of the transcription start site include tandem nuclear factor-κB sites; AP-1, AP-2, and signal peptide-1 binding sites; and an estrogen response element. Our findings reinforce the correspondence between the multidomain structure and versatile functions of this vascular addressin, and suggest an additional level of regulation of carbohydrate-presenting capability, and thus of its importance in lectin-mediated vs. α4β7-dependent adhesive events in lymphocyte trafficking. 46 refs., 6 figs., 1 tab.
Neural/Bayes network predictor for inheritable cardiac disease pathogenicity and phenotype.
Burghardt, Thomas P; Ajtai, Katalin
2018-04-11
The cardiac muscle sarcomere contains multiple proteins contributing to contraction energy transduction and its regulation during a heartbeat. Inheritable heart disease mutants affect most of them but none more frequently than the ventricular myosin motor and cardiac myosin binding protein c (mybpc3). These co-localizing proteins have mybpc3 playing a regulatory role to the energy transducing motor. Residue substitution and functional domain assignment of each mutation in the protein sequence decides, under the direction of a sensible disease model, phenotype and pathogenicity. The unknown model mechanism is decided here using a method combining neural and Bayes networks. Missense single nucleotide polymorphisms (SNPs) are clues for the disease mechanism summarized in an extensive database collecting mutant sequence location and residue substitution as independent variables that imply the dependent disease phenotype and pathogenicity characteristics in 4 dimensional data points (4ddps). The SNP database contains entries with the majority having one or both dependent data entries unfulfilled. A neural network relating causes (mutant residue location and substitution) and effects (phenotype and pathogenicity) is trained, validated, and optimized using fulfilled 4ddps. It then predicts unfulfilled 4ddps providing the implicit disease model. A discrete Bayes network interprets fulfilled and predicted 4ddps with conditional probabilities for phenotype and pathogenicity given mutation location and residue substitution thus relating the neural network implicit model to explicit features of the motor and mybpc3 sequence and structural domains. Neural/Bayes network forecasting automates disease mechanism modeling by leveraging the world wide human missense SNP database that is in place and expanding. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.
Structural hot spots for the solubility of globular proteins
Ganesan, Ashok; Siekierska, Aleksandra; Beerten, Jacinte; Brams, Marijke; Van Durme, Joost; De Baets, Greet; Van der Kant, Rob; Gallardo, Rodrigo; Ramakers, Meine; Langenberg, Tobias; Wilkinson, Hannah; De Smet, Frederik; Ulens, Chris; Rousseau, Frederic; Schymkowitz, Joost
2016-01-01
Natural selection shapes protein solubility to physiological requirements, and recombinant applications that require higher protein concentrations are often problematic. This raises the question of whether the solubility of natural protein sequences can be improved. We here show an anti-correlation between the number of aggregation prone regions (APRs) in a protein sequence and its solubility, suggesting that mutational suppression of APRs provides a simple strategy to increase protein solubility. We show that mutations at specific positions within a protein structure can act as APR suppressors without affecting protein stability. These hot spots for protein solubility are both structure and sequence dependent but can be computationally predicted. We demonstrate this by reducing the aggregation of human α-galactosidase and protective antigen of Bacillus anthracis through mutation. Our results indicate that many proteins possess hot spots that allow protein solubility to be adapted independently of structure and function. PMID:26905391
Nanoscale structure in AgSbTe2 determined by diffuse elastic neutron scattering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Specht, Eliot D; Ma, Jie; Delaire, Olivier A
2015-01-01
Diffuse elastic neutron scattering measurements confirm that AgSbTe2 has a hierarchical structure, with defects on length scales from nanometers to microns. While scattering from mesoscale structure is consistent with previously-proposed structures in which Ag and Sb order on a NaCl lattice, more diffuse scattering from nanoscale structure suggests a structural rearrangement in which hexagonal layers form a combination of (ABC), (ABA), and (AAB) stacking sequences. The AgCrSe2 structure is the best-fitting model for the local atomic arrangements.
Jacquin, Hugo; Gilson, Amy; Shakhnovich, Eugene; Cocco, Simona; Monasson, Rémi
2016-05-01
Inverse statistical approaches to determine protein structure and function from Multiple Sequence Alignments (MSA) are emerging as powerful tools in computational biology. However, the underlying assumptions of the relationship between the inferred effective Potts Hamiltonian and real protein structure and energetics remain untested so far. Here we use a lattice protein (LP) model to benchmark those inverse statistical approaches. We build MSA of highly stable sequences in target LP structures, and infer the effective pairwise Potts Hamiltonians from those MSA. We find that inferred Potts Hamiltonians reproduce many important aspects of 'true' LP structures and energetics. Careful analysis reveals that effective pairwise couplings in inferred Potts Hamiltonians depend not only on the energetics of the native structure but also on competing folds; in particular, the coupling values reflect both positive design (stabilization of native conformation) and negative design (destabilization of competing folds). In addition to providing detailed structural information, the inferred Potts models used as protein Hamiltonian for design of new sequences are able to generate with high probability completely new sequences with the desired folds, which is not possible using independent-site models. Those are remarkable results as the effective LP Hamiltonians used to generate MSA are not simple pairwise models due to the competition between the folds. Our findings elucidate the reasons for the success of inverse approaches to the modelling of proteins from sequence data, and their limitations.
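As a rough illustration of what a pairwise Potts Hamiltonian assigns to a sequence, the sketch below evaluates E(s) = -sum_i h_i(s_i) - sum_{i<j} J_ij(s_i, s_j). The sign convention, the function name potts_energy, and the toy fields and couplings are assumptions made for illustration only; they are not taken from the abstract, and no inference of parameters from an MSA is attempted here.

```python
from itertools import combinations

def potts_energy(seq, fields, couplings):
    """Pairwise Potts energy: E(s) = -sum_i h_i(s_i) - sum_{i<j} J_ij(s_i, s_j).

    fields[i] maps a symbol at site i to its local field h_i.
    couplings[(i, j)] maps a symbol pair at sites i < j to J_ij.
    Missing entries are treated as zero.
    """
    energy = 0.0
    for i, a in enumerate(seq):
        energy -= fields[i].get(a, 0.0)
    for i, j in combinations(range(len(seq)), 2):
        energy -= couplings.get((i, j), {}).get((seq[i], seq[j]), 0.0)
    return energy

# Hypothetical toy parameters for a 3-site sequence over the alphabet {A, B}
fields = [{"A": 0.5}, {"B": 0.25}, {"A": 0.125}]
couplings = {(0, 2): {("A", "A"): 1.0}}

print(potts_energy("ABA", fields, couplings))  # favoured by fields and coupling
print(potts_energy("BBB", fields, couplings))  # only one field term contributes
```

An independent-site model corresponds to dropping the couplings dictionary entirely, which is the comparison the abstract draws when it notes that such models cannot generate new sequences with the desired folds.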
Solis, Armando D
2014-01-01
The most informative probability distribution functions (PDFs) describing the Ramachandran phi-psi dihedral angle pair, a fundamental descriptor of backbone conformation of protein molecules, are derived from high-resolution X-ray crystal structures using an information-theoretic approach. The Information Maximization Device (IMD) is established, based on fundamental information-theoretic concepts, and then applied specifically to derive highly resolved phi-psi maps for all 20 single amino acid and all 8000 triplet sequences at an optimal resolution determined by the volume of current data. The paper shows that utilizing the latent information contained in all viable high-resolution crystal structures found in the Protein Data Bank (PDB), totaling more than 77,000 chains, permits the derivation of a large number of optimized sequence-dependent PDFs. This work demonstrates the effectiveness of the IMD and the superiority of the resulting PDFs by extensive fold recognition experiments and rigorous comparisons with previously published triplet PDFs. Because it automatically optimizes PDFs, IMD results in improved performance of knowledge-based potentials, which rely on such PDFs. Furthermore, it provides an easy computational recipe for empirically deriving other kinds of sequence-dependent structural PDFs with greater detail and precision. The high-resolution phi-psi maps derived in this work are available for download.
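To make the notion of a binned phi-psi probability distribution concrete, here is a minimal sketch that histograms dihedral angle pairs on a regular grid and normalizes the counts. It is a plain fixed-resolution histogram, not the Information Maximization Device described above; the function name phi_psi_pdf, the 30-degree bin width, and the sample angles are illustrative assumptions.

```python
from collections import Counter

def phi_psi_pdf(angles, bin_width=30):
    """Bin (phi, psi) pairs given in degrees and return {grid cell: probability}.

    Angles are mapped onto [-180, 180); the value 180 wraps to the first bin.
    """
    n_bins = 360 // bin_width
    counts = Counter()
    for phi, psi in angles:
        i = int((phi + 180) // bin_width) % n_bins
        j = int((psi + 180) // bin_width) % n_bins
        counts[(i, j)] += 1
    total = sum(counts.values())
    return {cell: c / total for cell, c in counts.items()}

# Hypothetical sample: three helix-like pairs and one beta-like pair
sample = [(-60, -45), (-65, -40), (-120, 130), (-57, -47)]
pdf = phi_psi_pdf(sample)
for cell, p in sorted(pdf.items()):
    print(cell, p)
```

Choosing the bin width is exactly the resolution question the abstract raises: finer grids need more data per cell, which is why the optimal resolution depends on the volume of available structures.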
Xu, Yilei; Roy-Chowdhury, Amit K
2007-05-01
In this paper, we present a theory for combining the effects of motion, illumination, 3D structure, albedo, and camera parameters in a sequence of images obtained by a perspective camera. We show that the set of all Lambertian reflectance functions of a moving object, at any position, illuminated by arbitrarily distant light sources, lies "close" to a bilinear subspace consisting of nine illumination variables and six motion variables. This result implies that, given an arbitrary video sequence, it is possible to recover the 3D structure, motion, and illumination conditions simultaneously using the bilinear subspace formulation. The derivation builds upon existing work on linear subspace representations of reflectance by generalizing it to moving objects. Lighting can change slowly or suddenly, locally or globally, and can originate from a combination of point and extended sources. We experimentally compare the results of our theory with ground truth data and also provide results on real data by using video sequences of a 3D face and the entire human body with various combinations of motion and illumination directions. We also show results of our theory in estimating 3D motion and illumination model parameters from a video sequence.
Bassi, G S; Murchie, A I; Lilley, D M
1996-01-01
The hammerhead ribozyme undergoes an ion-dependent folding process into the active conformation. We find that the folding can be blocked at specific stages by changes of sequence or functionality within the core. In the absence of added metal ions, the global structure of the hammerhead is extended, with a large angle subtended between stems I and II. No core sequence changes appear to alter this geometry, consistent with an unstructured core under these conditions. Upon addition of low concentrations of magnesium ions, the hammerhead folds by an association of stems II and III, to include a large angle between them. This stage is inhibited or altered by mutations within the oligopurine sequence lying between stems II and III, and folding is completely prevented by an A14G mutation. Further increase in magnesium ion concentration brings about a second stage of folding in the natural sequence hammerhead, involving a reorientation of stem I, which rotates around into the same direction as stem II. Because this transition occurs over the same range of magnesium ion concentration over which the hammerhead ribozyme becomes active, it is likely that the final conformation is most closely related to the active form of the structure. Magnesium ion-dependent folding into this conformation is prevented by changes at G5, notably removal of the 2'-hydroxyl group and replacement of the base by cytidine. The ability to dissect the folding process by means of sequence changes suggests that two separate ion-dependent stages are involved in the folding of the hammerhead ribozyme into the active conformation. PMID:8752086
Experimental Observation of Dynamical Localization in Laser-Kicked Molecular Rotors
NASA Astrophysics Data System (ADS)
Bitter, M.; Milner, V.
2016-09-01
The periodically kicked rotor is a paradigm system for studying quantum effects on classically chaotic dynamics. The wave function of the quantum rotor localizes in angular momentum space, similarly to Anderson localization of the electronic wave function in disordered solids. Here, we observe dynamical localization in a system of true quantum rotors by subjecting nitrogen molecules to periodic sequences of femtosecond pulses. Exponential distribution of the molecular angular momentum—the hallmark of dynamical localization—is measured directly by means of coherent Raman scattering. We demonstrate the suppressed rotational energy growth with the number of laser kicks and study the dependence of the localization length on the kick strength. Because of its quantum coherent nature, both timing and amplitude noise are shown to destroy the localization and revive the diffusive growth of energy.
Tertiary structural propensities reveal fundamental sequence/structure relationships.
Zheng, Fan; Zhang, Jian; Grigoryan, Gevorg
2015-05-05
Extracting useful generalizations from the continually growing Protein Data Bank (PDB) is of central importance. We hypothesize that the PDB contains valuable quantitative information on the level of local tertiary structural motifs (TERMs). We show that by breaking a protein structure into its constituent TERMs, and querying the PDB to characterize the natural ensemble matching each, we can estimate the compatibility of the structure with a given amino acid sequence through a metric we term "structure score." Considering submissions from recent Critical Assessment of Structure Prediction (CASP) experiments, we found a strong correlation (R = 0.69) between structure score and model accuracy, with poorly predicted regions readily identifiable. This performance exceeds that of leading atomistic statistical energy functions. Furthermore, TERM-based analysis of two prototypical multi-state proteins rapidly produced structural insights fully consistent with prior extensive experimental studies. We thus find that TERM-based analysis should have considerable utility for protein structural biology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Epitope mapping of the domains of human angiotensin converting enzyme.
Kugaevskaya, Elena V; Kolesanova, Ekaterina F; Kozin, Sergey A; Veselovsky, Alexander V; Dedinsky, Ilya R; Elisseeva, Yulia E
2006-06-01
Somatic angiotensin converting enzyme (sACE) contains in its single chain two homologous domains (called N- and C-domains), each bearing a functional zinc-dependent active site. The present study aims to define the differences between the two sACE domains and to localize experimentally revealed antigenic determinants (B-epitopes) in the recently determined three-dimensional structure of testicular tACE. The predicted linear antigenic determinants of human sACE were determined by a peptide scanning ("PEPSCAN") approach. An essential difference was demonstrated between the locations of the epitopes in the N- and C-domains. Comparison of the arrangement of epitopes in the human domains with the corresponding sequences of some mammalian sACEs made it possible to classify the revealed antigenic determinants as variable or conserved areas. The location of antigenic determinants with respect to various structural elements and to functionally important sites of the human sACE C-domain was estimated. The majority of antigenic sites of the C-domain were located at the irregular elements and at the boundaries of secondary structure elements. The data show structural differences between the sACE domains. The experimentally revealed antigenic determinants were in agreement with the recently determined crystal tACE structure. New potential applications are open to successfully produce mono-specific and group-specific antipeptide antibodies.
Bialonska, Dobroslawa; Song, Kenneth; Bolton, Philip H.
2011-01-01
Tumor cell lines can replicate faster than normal cells and many also have defective DNA repair pathways. This has led to the investigation of the inhibition of DNA repair proteins as a means of therapeutic intervention. An alternative approach is to hide or mask damaged DNA from the repair systems. We have developed a protocol to investigate the structures of the complexes of damaged DNA with drug-like molecules. Nucleotide resolution structural information can be obtained using an improved hydroxyl radical cleavage protocol. The use of a dTn tail increases the length of the smallest fragments of interest and allows efficient co-precipitation of the fragments with poly(A). The use of a fluorescent label, on the 5′ end of the dTn tail, in conjunction with modified cleavage reaction conditions, avoids the lifetime and other problems with 32P labeling. The structures of duplex DNAs containing AC and CC mismatches in the presence and absence of minor groove binders have been investigated as have those of the fully complementary DNA. The results indicate that the structural perturbations of the mismatches are localized, are sequence dependent and that the presence of a mismatch can alter the binding of drug-like molecules. PMID:21893212
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring
2012-01-01
Background: Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. Results: The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appears to play an important role in the molecular structure and/or function.
Conclusions: Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family. PMID:22793672
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring.
Durston, Kirk K; Chiu, David Ky; Wong, Andrew Kc; Li, Gary Cl
2012-07-13
Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appears to play an important role in the molecular structure and/or function. Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family.
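The interdependency measure between two aligned sites can be sketched as follows. This is only an illustration of a normalized mutual information between two alignment columns; the abstract does not state which normalization the authors use, so dividing by the joint entropy is an assumption, as are the function names and the toy columns.

```python
from collections import Counter
from math import log2

def entropy(symbols):
    """Shannon entropy (in bits) of the empirical distribution of symbols."""
    n = len(symbols)
    return -sum((c / n) * log2(c / n) for c in Counter(symbols).values())

def normalized_mutual_information(col_x, col_y):
    """Mutual information I(X;Y) divided by the joint entropy H(X,Y).

    Returns a value in [0, 1]; 0.0 is returned when both columns are constant.
    The choice of H(X,Y) as the normalizer is an assumption of this sketch.
    """
    h_x, h_y = entropy(col_x), entropy(col_y)
    h_xy = entropy(list(zip(col_x, col_y)))
    if h_xy == 0.0:
        return 0.0
    return (h_x + h_y - h_xy) / h_xy

# Hypothetical alignment columns, one residue per aligned sequence
print(normalized_mutual_information("AAGG", "LLVV"))  # perfectly co-varying sites
print(normalized_mutual_information("AAGG", "LVLV"))  # statistically independent sites
```

A site clustering step would then group columns so that pairwise values of this kind are high within a group, which is the sense in which the clusters are described as maximizing intra-group interdependency.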
The identification and functional annotation of RNA structures conserved in vertebrates
Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan
2017-01-01
Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280
Hidden Structural Codes in Protein Intrinsic Disorder.
Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo
2017-10-17
Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovered the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could not have been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.
Local thermodynamic mapping for effective liquid density-functional theory
NASA Technical Reports Server (NTRS)
Kyrlidis, Agathagelos; Brown, Robert A.
1992-01-01
The structural-mapping approximation introduced by Lutsko and Baus (1990) in the generalized effective-liquid approximation is extended to include a local thermodynamic mapping based on a spatially dependent effective density for approximating the solid phase in terms of the uniform liquid. This latter approximation, called the local generalized effective-liquid approximation (LGELA), yields excellent predictions for the free energy of hard-sphere solids and for the conditions of coexistence of a hard-sphere fcc solid with a liquid. Moreover, the predicted free energy remains single valued for calculations with more loosely packed crystalline structures, such as the diamond lattice. The spatial dependence of the weighted density makes the LGELA useful in the study of inhomogeneous solids.
Höhm, Sandra; Herzlieb, Marcel; Rosenfeld, Arkadi; Krüger, Jörg; Bonse, Jörn
2015-01-12
Two-color double-fs-pulse experiments were performed on silicon wafers to study the temporally distributed energy deposition in the formation of laser-induced periodic surface structures (LIPSS). A Mach-Zehnder interferometer generated parallel or cross-polarized double-pulse sequences at 400 and 800 nm wavelength, with inter-pulse delays up to a few picoseconds between the sub-ablation 50-fs-pulses. Multiple two-color double-pulse sequences were collinearly focused by a spherical mirror to the sample. The resulting LIPSS characteristics (periods, areas) were analyzed by scanning electron microscopy. A wavelength-dependent plasmonic mechanism is proposed to explain the delay-dependence of the LIPSS. These two-color experiments extend previous single-color studies and prove the importance of the ultrafast energy deposition for LIPSS formation.
Sohn, Hae-Jin; Kim, Jong-Hyun; Shin, Myeong-Heon; Song, Kyoung-Ju; Shin, Ho-Joon
2010-03-01
Naegleria fowleri destroys target cells by trogocytosis, a phagocytosis mechanism, and a process of piecemeal ingestion of target cells by food-cups. Phagocytosis is an actin-dependent process that involves polymerization of monomeric G-actin into filamentous F-actin. However, despite the numerous studies concerning phagocytosis, its role in N. fowleri food-cup formation related to trogocytosis has been poorly reported. In this study, we cloned and characterized an Nf-actin gene to elucidate the role of the Nf-actin gene in N. fowleri pathogenesis. The Nf-actin gene is composed of 1,128 bp and produced a 54.1-kDa recombinant protein (Nf-actin). The sequence identity was 82% with nonpathogenic Naegleria gruberi, but there was no sequence identity with mammalian or human actin genes. Anti-Nf-actin polyclonal antibody was produced in BALB/c mice immunized with recombinant Nf-actin. Using an immunofluorescence assay, Nf-actin was localized to the cytoplasm, pseudopodia, and especially the food-cup structure (amoebastome) in N. fowleri trophozoites. When N. fowleri was co-cultured with Chinese hamster ovary cells, Nf-actin was observed to localize around phagocytic food-cups. We also observed that N. fowleri treated with cytochalasin D, an actin polymerization inhibitor, or transfected with an antisense oligomer of the Nf-actin gene showed reduced food-cup formation and reduced in vitro cytotoxicity. Together, these findings suggest that Nf-actin plays an important role in the phagocytic activity of pathogenic N. fowleri.
Dual-echo ASL based assessment of motor networks: a feasibility study
NASA Astrophysics Data System (ADS)
Storti, Silvia Francesca; Boscolo Galazzo, Ilaria; Pizzini, Francesca B.; Menegaz, Gloria
2018-04-01
Objective. Dual-echo arterial spin labeling (DE-ASL) technique has been recently proposed for the simultaneous acquisition of ASL and blood-oxygenation-level-dependent (BOLD)-functional magnetic resonance imaging (fMRI) data. The assessment of this technique in detecting functional connectivity at rest or during motor and motor imagery tasks is still unexplored both per-se and in comparison with conventional methods. The purpose is to quantify the sensitivity of the DE-ASL sequence with respect to the conventional fMRI sequence (cvBOLD) in detecting brain activations, and to assess and compare the relevance of node features in decoding the network structure. Approach. Thirteen volunteers were scanned acquiring a pseudo-continuous DE-ASL sequence from which the concomitant BOLD (ccBOLD) simultaneously to the ASL can be extracted. The approach consists of two steps: (i) model-based analyses for assessing brain activations at individual and group levels, followed by statistical analysis for comparing the activation elicited by the three sequences under two conditions (motor and motor imagery), respectively; (ii) brain connectivity graph-theoretical analysis for assessing and comparing the network models properties. Main results. Our results suggest that cvBOLD and ccBOLD have comparable sensitivity in detecting the regions involved in the active task, whereas ASL offers a higher degree of co-localization with smaller activation volumes. The connectivity results and the comparative analysis of node features across sequences revealed that there are no strong changes between rest and tasks and that the differences between the sequences are limited to few connections. Significance. Considering the comparable sensitivity of the ccBOLD and cvBOLD sequences in detecting activated brain regions, the results demonstrate that DE-ASL can be successfully applied in functional studies allowing to obtain both ASL and BOLD information within a single sequence. 
Further, DE-ASL is a powerful technique for research and clinical applications, allowing quantitative comparisons as well as characterization of functional connectivity.
Alternative DNA structure formation in the mutagenic human c-MYC promoter.
Del Mundo, Imee Marie A; Zewail-Foote, Maha; Kerwin, Sean M; Vasquez, Karen M
2017-05-05
Mutation 'hotspot' regions in the genome are susceptible to genetic instability, implicating them in diseases. These hotspots are not random and often co-localize with DNA sequences potentially capable of adopting alternative DNA structures (non-B DNA, e.g. H-DNA and G4-DNA), which have been identified as endogenous sources of genomic instability. There are regions that contain overlapping sequences that may form more than one non-B DNA structure. The extent to which one structure impacts the formation/stability of another, within the sequence, is not fully understood. To address this issue, we investigated the folding preferences of oligonucleotides from a chromosomal breakpoint hotspot in the human c-MYC oncogene containing both potential G4-forming and H-DNA-forming elements. We characterized the structures formed in the presence of G4-DNA-stabilizing K+ ions or H-DNA-stabilizing Mg2+ ions using multiple techniques. We found that under conditions favorable for H-DNA formation, a stable intramolecular triplex DNA structure predominated; whereas, under K+-rich, G4-DNA-forming conditions, a plurality of unfolded and folded species were present. Thus, within a limited region containing sequences with the potential to adopt multiple structures, only one structure predominates under a given condition. The predominance of H-DNA implicates this structure in the instability associated with the human c-MYC oncogene. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cluster formation in precompound nuclei in the time-dependent framework
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schuetrumpf, B.; Nazarewicz, W.
Background: Modern applications of nuclear time-dependent density functional theory (TDDFT) are often capable of providing quantitative description of heavy ion reactions. However, the structures of precompound (preequilibrium, prefission) states produced in heavy ion reactions are difficult to assess theoretically in TDDFT as the single-particle density alone is a weak indicator of shell structure and cluster states. Purpose: We employ the time-dependent nucleon localization function (NLF) to reveal the structure of precompound states in nuclear reactions involving light and medium-mass ions. We primarily focus on spin saturated systems with N = Z. Furthermore, we study reactions with oxygen and carbon ions, for which some experimental evidence for α clustering in precompound states exists. Method: We utilize the symmetry-free TDDFT approach with the Skyrme energy density functional UNEDF1 and compute the time-dependent NLFs to describe 16O + 16O, 40Ca + 16O, 40Ca + 40Ca, and 16,18O + 12C collisions at energies above the Coulomb barrier. Results: We show that NLFs reveal a variety of time-dependent modes involving cluster structures. For instance, the 16O + 16O collision results in a vibrational mode of a quasimolecular α-12C-12C-α state. For heavier ions, a variety of cluster configurations are predicted. For the collision of 16,18O + 12C, we showed that the precompound system has a tendency to form α clusters. This result supports the experimental findings that the presence of cluster structures in the projectile and target nuclei gives rise to strong entrance channel effects and enhanced α emission. Conclusion: The time-dependent nucleon localization measure is a very good indicator of cluster structures in complex precompound states formed in heavy-ion fusion reactions. Finally, the localization reveals the presence of collective vibrations involving cluster structures, which dominate the initial dynamics of the fusing system.
Cluster formation in precompound nuclei in the time-dependent framework
Schuetrumpf, B.; Nazarewicz, W.
2017-12-15
Background: Modern applications of nuclear time-dependent density functional theory (TDDFT) are often capable of providing quantitative description of heavy ion reactions. However, the structures of precompound (preequilibrium, prefission) states produced in heavy ion reactions are difficult to assess theoretically in TDDFT as the single-particle density alone is a weak indicator of shell structure and cluster states. Purpose: We employ the time-dependent nucleon localization function (NLF) to reveal the structure of precompound states in nuclear reactions involving light and medium-mass ions. We primarily focus on spin saturated systems with N = Z. Furthermore, we study reactions with oxygen and carbon ions, for which some experimental evidence for α clustering in precompound states exists. Method: We utilize the symmetry-free TDDFT approach with the Skyrme energy density functional UNEDF1 and compute the time-dependent NLFs to describe 16O + 16O, 40Ca + 16O, 40Ca + 40Ca, and 16,18O + 12C collisions at energies above the Coulomb barrier. Results: We show that NLFs reveal a variety of time-dependent modes involving cluster structures. For instance, the 16O + 16O collision results in a vibrational mode of a quasimolecular α-12C-12C-α state. For heavier ions, a variety of cluster configurations are predicted. For the collision of 16,18O + 12C, we showed that the precompound system has a tendency to form α clusters. This result supports the experimental findings that the presence of cluster structures in the projectile and target nuclei gives rise to strong entrance channel effects and enhanced α emission. Conclusion: The time-dependent nucleon localization measure is a very good indicator of cluster structures in complex precompound states formed in heavy-ion fusion reactions. Finally, the localization reveals the presence of collective vibrations involving cluster structures, which dominate the initial dynamics of the fusing system.
Cluster formation in precompound nuclei in the time-dependent framework
NASA Astrophysics Data System (ADS)
Schuetrumpf, B.; Nazarewicz, W.
2017-12-01
Background: Modern applications of nuclear time-dependent density functional theory (TDDFT) are often capable of providing quantitative description of heavy ion reactions. However, the structures of precompound (preequilibrium, prefission) states produced in heavy ion reactions are difficult to assess theoretically in TDDFT as the single-particle density alone is a weak indicator of shell structure and cluster states. Purpose: We employ the time-dependent nucleon localization function (NLF) to reveal the structure of precompound states in nuclear reactions involving light and medium-mass ions. We primarily focus on spin saturated systems with N = Z. Furthermore, we study reactions with oxygen and carbon ions, for which some experimental evidence for α clustering in precompound states exists. Method: We utilize the symmetry-free TDDFT approach with the Skyrme energy density functional UNEDF1 and compute the time-dependent NLFs to describe 16O + 16O, 40Ca + 16O, 40Ca + 40Ca, and 16,18O + 12C collisions at energies above the Coulomb barrier. Results: We show that NLFs reveal a variety of time-dependent modes involving cluster structures. For instance, the 16O + 16O collision results in a vibrational mode of a quasimolecular α - 12C - 12C - α state. For heavier ions, a variety of cluster configurations are predicted. For the collision of 16,18O + 12C, we showed that the precompound system has a tendency to form α clusters. This result supports the experimental findings that the presence of cluster structures in the projectile and target nuclei gives rise to strong entrance channel effects and enhanced α emission. Conclusion: The time-dependent nucleon localization measure is a very good indicator of cluster structures in complex precompound states formed in heavy-ion fusion reactions. The localization reveals the presence of collective vibrations involving cluster structures, which dominate the initial dynamics of the fusing system.
Steckelberg, Anna-Lena; Akiyama, Benjamin M; Costantino, David A; Sit, Tim L; Nix, Jay C; Kieft, Jeffrey S
2018-06-19
Folded RNA elements that block processive 5' → 3' cellular exoribonucleases (xrRNAs) to produce biologically active viral noncoding RNAs have been discovered in flaviviruses, potentially revealing a new mode of RNA maturation. However, whether this RNA structure-dependent mechanism exists elsewhere and, if so, whether a singular RNA fold is required, have been unclear. Here we demonstrate the existence of authentic RNA structure-dependent xrRNAs in dianthoviruses, plant-infecting viruses unrelated to animal-infecting flaviviruses. These xrRNAs have no sequence similarity to known xrRNAs; thus, we used a combination of biochemistry and virology to characterize their sequence requirements and mechanism of stopping exoribonucleases. By solving the structure of a dianthovirus xrRNA by X-ray crystallography, we reveal a complex fold that is very different from that of the flavivirus xrRNAs. However, both versions of xrRNAs contain a unique topological feature, a pseudoknot that creates a protective ring around the 5' end of the RNA structure; this may be a defining structural feature of xrRNAs. Single-molecule FRET experiments reveal that the dianthovirus xrRNAs undergo conformational changes and can use "codegradational remodeling," exploiting the exoribonucleases' degradation-linked helicase activity to help form their resistant structure; such a mechanism has not previously been reported. Convergent evolution has created RNA structure-dependent exoribonuclease resistance in different contexts, which establishes it as a general RNA maturation mechanism and defines xrRNAs as an authentic functional class of RNAs.
Pomel, Sébastien; Diogon, Marie; Bouchard, Philippe; Pradel, Lydie; Ravet, Viviane; Coffe, Gérard; Viguès, Bernard
2006-02-01
Previous attempts to identify the membrane skeleton of Paramecium cells have revealed a protein pattern that is both complex and specific. The most prominent structural elements, epiplasmic scales, are centered around ciliary units and are closely apposed to the cytoplasmic side of the inner alveolar membrane. We sought to characterize epiplasmic scale proteins (epiplasmins) at the molecular level. PCR approaches enabled the cloning and sequencing of two closely related genes by amplifications of sequences from a macronuclear genomic library. Using these two genes (EPI-1 and EPI-2), we have contributed to the annotation of the Paramecium tetraurelia macronuclear genome and identified 39 additional (paralogous) sequences. Two orthologous sequences were found in the Tetrahymena thermophila genome. Structural analysis of the 43 sequences indicates that the hallmark of this new multigenic family is a 79 aa domain flanked by two Q-, P- and V-rich stretches of sequence that are much more variable in amino-acid composition. Such features clearly distinguish members of the multigenic family from epiplasmic proteins previously sequenced in other ciliates. The expression of Green Fluorescent Protein (GFP)-tagged epiplasmin showed significant labeling of epiplasmic scales as well as oral structures. We expect that the GFP construct described herein will prove to be a useful tool for comparative subcellular localization of different putative epiplasmins in Paramecium.
Zhang, Li; Liao, Bo; Li, Dachao; Zhu, Wen
2009-07-21
Apoptosis, or programmed cell death, plays an important role in development of an organism. Obtaining information on subcellular location of apoptosis proteins is very helpful to understand the apoptosis mechanism. In this paper, based on the concept that the position distribution information of amino acids is closely related with the structure and function of proteins, we introduce the concept of distance frequency [Matsuda, S., Vert, J.P., Ueda, N., Toh, H., Akutsu, T., 2005. A novel representation of protein sequences for prediction of subcellular location using support vector machines. Protein Sci. 14, 2804-2813] and propose a novel way to calculate distance frequencies. In order to calculate the local features, each protein sequence is separated into p parts with the same length in our paper. Then we use the novel representation of protein sequences and adopt support vector machine to predict subcellular location. The overall prediction accuracy is significantly improved by jackknife test.
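The abstract above does not spell out how the distance frequencies are computed, so the following is only a rough sketch of the general idea: split a sequence into p contiguous parts, then count, within a part, the gaps between consecutive occurrences of one residue. The function names `split_parts` and `distance_counts`, and the choice of "gap between consecutive occurrences" as the distance, are assumptions made for illustration and are not the authors' definition.

```python
from collections import Counter


def split_parts(seq, p):
    """Split seq into p contiguous parts of near-equal length (assumed scheme)."""
    size, extra = divmod(len(seq), p)
    parts, start = [], 0
    for k in range(p):
        # The first `extra` parts receive one additional residue each.
        end = start + size + (1 if k < extra else 0)
        parts.append(seq[start:end])
        start = end
    return parts


def distance_counts(seq, residue):
    """Count gaps between consecutive occurrences of `residue` in seq.

    Hypothetical distance definition, chosen only for illustration.
    """
    positions = [i for i, aa in enumerate(seq) if aa == residue]
    return Counter(b - a for a, b in zip(positions, positions[1:]))
```

Under these assumptions, a feature vector for a classifier such as a support vector machine could be assembled by concatenating the per-part counts for each of the 20 amino acids.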
Dynamical decoupling of local transverse random telegraph noise in a two-qubit gate
NASA Astrophysics Data System (ADS)
D'Arrigo, A.; Falci, G.; Paladino, E.
2015-10-01
Achieving high-fidelity universal two-qubit gates is a central requisite of any implementation of quantum information processing. The presence of spurious fluctuators of various physical origin represents a limiting factor for superconducting nanodevices. Operating qubits at optimal points, where the qubit-fluctuator interaction is transverse with respect to the single qubit Hamiltonian, considerably improved single qubit gates. Further enhancement has been achieved by dynamical decoupling (DD). In this article we investigate DD of transverse random telegraph noise acting locally on each of the qubits forming an entangling gate. Our analysis is based on the exact numerical solution of the stochastic Schrödinger equation. We evaluate the gate error under local periodic, Carr-Purcell and Uhrig DD sequences. We find that a threshold value of the number, n, of pulses exists above which the gate error decreases with a sequence-specific power-law dependence on n. Below threshold, DD may even increase the error with respect to the unconditioned evolution, a behaviour reminiscent of the anti-Zeno effect.
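The three pulse sequences named in the abstract above differ only in where the n pulses fall within the total evolution time. A minimal sketch of the pulse timings follows, assuming the standard textbook definitions (Carr-Purcell at odd multiples of T/2n, Uhrig at T sin²(πj/(2n+2))) and, for the periodic case, one common convention of n equally spaced interior pulses. The function names are illustrative, and the paper's own conventions may differ.

```python
import math


def periodic_times(n, total=1.0):
    """n equally spaced pulses strictly inside (0, total); one common convention."""
    return [total * j / (n + 1) for j in range(1, n + 1)]


def carr_purcell_times(n, total=1.0):
    """Carr-Purcell: pulses at odd multiples of total / (2n)."""
    return [total * (2 * j - 1) / (2 * n) for j in range(1, n + 1)]


def uhrig_times(n, total=1.0):
    """Uhrig: pulses at total * sin^2(pi * j / (2n + 2))."""
    return [total * math.sin(math.pi * j / (2 * n + 2)) ** 2 for j in range(1, n + 1)]
```

With these definitions the Carr-Purcell and Uhrig timings coincide for n = 1 and n = 2 and differ from n = 3 onward, where the Uhrig pulses crowd toward the ends of the interval.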
Mixing due to Pulsating Turbulent Jets
NASA Astrophysics Data System (ADS)
Grosshans, Holger; Nygård, Alexander; Fuchs, Laszlo
Combustion efficiency and the formation of soot and/or NOx in internal-combustion engines depend strongly on the local air/fuel mixture, the local flow conditions and temperature. Modern diesel engines employ high injection pressure for improved atomization, but mixing is controlled largely by the flow in the cylinder. By injecting the fuel in pulses one can gain control over the atomization, evaporation and the mixing of the gaseous fuel. We show that the pulsatile injection of fuel enhances fuel break-up and the entrainment of ambient air into the fuel stream. The entrainment level depends on fuel properties, such as fuel/air viscosity and density ratio, fuel surface tension, injection speed and injection sequencing. Examples of enhanced break-up and mixing are given.
Shideler, G.L.
1994-01-01
Middle Miocene siliciclastic deposits comprising the Calvert Cliffs section at the Baltimore Gas and Electric Company's (BG&E) nuclear power plant site in southern Maryland were analyzed in terms of lithostratigraphy, sedimentary structures, and granulometric parameters, to interpret paleo-environments within a sequence-stratigraphic framework. In terms of sequence-stratigraphic models, the BG&E section can be interpreted as consisting of two genetic stratigraphic sequences (Galloway model), namely, a shelf sequence and an overlying deltaic sequence. Using the Exxon model, the section consists of two third-order (1-5 m.y. duration) depositional sequences. The stratigraphic sequences of the BG&E section reflect both relatively short-term eustatic transgressive events, as well as a long-term regressive trend with associated local deltation and coastal progradation. The regression probably signified a regional basinward shift of depocenters within the Salisbury embayment during Miocene time. -from Author
T box riboswitches in Actinobacteria: Translational regulation via novel tRNA interactions
Sherwood, Anna V.; Grundy, Frank J.; Henkin, Tina M.
2015-01-01
The T box riboswitch regulates many amino acid-related genes in Gram-positive bacteria. T box riboswitch-mediated gene regulation was shown previously to occur at the level of transcription attenuation via structural rearrangements in the 5′ untranslated (leader) region of the mRNA in response to binding of a specific uncharged tRNA. In this study, a novel group of isoleucyl-tRNA synthetase gene (ileS) T box leader sequences found in organisms of the phylum Actinobacteria was investigated. The Stem I domains of these RNAs lack several highly conserved elements that are essential for interaction with the tRNA ligand in other T box RNAs. Many of these RNAs were predicted to regulate gene expression at the level of translation initiation through tRNA-dependent stabilization of a helix that sequesters a sequence complementary to the Shine–Dalgarno (SD) sequence, thus freeing the SD sequence for ribosome binding and translation initiation. We demonstrated specific binding to the cognate tRNAIle and tRNAIle-dependent structural rearrangements consistent with regulation at the level of translation initiation, providing the first biochemical demonstration, to our knowledge, of translational regulation in a T box riboswitch. PMID:25583497
DisAp-dependent striated fiber elongation is required to organize ciliary arrays
Galati, Domenico F.; Bonney, Stephanie; Kronenberg, Zev; Clarissa, Christina; Yandell, Mark; Elde, Nels C.; Jerka-Dziadosz, Maria; Giddings, Thomas H.; Frankel, Joseph
2014-01-01
Cilia-organizing basal bodies (BBs) are microtubule scaffolds that are visibly asymmetrical because they have attached auxiliary structures, such as striated fibers. In multiciliated cells, BB orientation aligns to ensure coherent ciliary beating, but the mechanisms that maintain BB orientation are unclear. For the first time in Tetrahymena thermophila, we use comparative whole-genome sequencing to identify the mutation in the BB disorientation mutant disA-1. disA-1 abolishes the localization of the novel protein DisAp to T. thermophila striated fibers (kinetodesmal fibers; KFs), which is consistent with DisAp’s similarity to the striated fiber protein SF-assemblin. We demonstrate that DisAp is required for KFs to elongate and to resist BB disorientation in response to ciliary forces. Newly formed BBs move along KFs as they approach their cortical attachment sites. However, because they contain short KFs that are rotated, BBs in disA-1 cells display aberrant spacing and disorientation. Therefore, DisAp is a novel KF component that is essential for force-dependent KF elongation and BB orientation in multiciliary arrays. PMID:25533842
Genome-wide characterization of centromeric satellites from multiple mammalian genomes.
Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario
2011-01-01
Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
Webster, G D; Sanderson, M R; Skelly, J V; Neidle, S; Swann, P F; Li, B F; Tickle, I J
1990-01-01
The crystal structure of the dodecanucleotide d(CGCAAGCTGGCG) has been determined to a resolution of 2.5 Å and refined to an R factor of 19.3% for 1710 reflections. The sequence crystallizes as a B-type double helix, with two G(anti).A(syn) base pairs. These are stabilized by three-center hydrogen bonds to pyrimidines that induce perturbations in base-pair geometry. The central AGCT region of the helix has a wide (greater than 6 Å) minor groove. PMID:2395870
Doyle, Colleen M; Rumfeldt, Jessica A; Broom, Helen R; Sekhar, Ashok; Kay, Lewis E; Meiering, Elizabeth M
2016-03-08
The chemical shifts of backbone amide protons in proteins are sensitive reporters of local structural stability and conformational heterogeneity, which can be determined from their readily measured linear and nonlinear temperature-dependences, respectively. Here we report analyses of amide proton temperature-dependences for native dimeric Cu, Zn superoxide dismutase (holo pWT SOD1) and structurally diverse mutant SOD1s associated with amyotrophic lateral sclerosis (ALS). Holo pWT SOD1 loses structure with temperature first at its periphery and, while having extremely high global stability, nevertheless exhibits extensive conformational heterogeneity, with ∼1 in 5 residues showing evidence for population of low energy alternative states. The holo G93A and E100G ALS mutants have moderately decreased global stability, whereas V148I is slightly stabilized. Comparison of the holo mutants as well as the marginally stable immature monomeric unmetalated and disulfide-reduced (apo(2SH)) pWT with holo pWT shows that changes in the local structural stability of individual amides vary greatly, with average changes corresponding to differences in global protein stability measured by differential scanning calorimetry. Mutants also exhibit altered conformational heterogeneity compared to pWT. Strikingly, substantial increases as well as decreases in local stability and conformational heterogeneity occur, in particular upon maturation and for G93A. Thus, the temperature-dependence of amide shifts for SOD1 variants is a rich source of information on the location and extent of perturbation of structure upon covalent changes and ligand binding. The implications for potential mechanisms of toxic misfolding of SOD1 in disease and for general aspects of protein energetics, including entropy-enthalpy compensation, are discussed.
Zhang, Yiming; Jin, Quan; Wang, Shuting; Ren, Ren
2011-05-01
The mobile behavior of 1481 peptides in ion mobility spectrometry (IMS), which are generated by protease digestion of the Drosophila melanogaster proteome, is modeled and predicted based on two different types of characterization methods, i.e. sequence-based approach and structure-based approach. In this procedure, the sequence-based approach considers both the amino acid composition of a peptide and the local environment profile of each amino acid in the peptide; the structure-based approach is performed with the CODESSA protocol, which regards a peptide as a common organic compound and generates more than 200 statistically significant variables to characterize the whole structure profile of a peptide molecule. Subsequently, the nonlinear support vector machine (SVM) and Gaussian process (GP) as well as linear partial least squares (PLS) regression is employed to correlate the structural parameters of the characterizations with the IMS drift times of these peptides. The obtained quantitative structure-spectrum relationship (QSSR) models are evaluated rigorously and investigated systematically via both one-deep and two-deep cross-validations as well as the rigorous Monte Carlo cross-validation (MCCV). We also give a comprehensive comparison on the resulting statistics arising from the different combinations of variable types with modeling methods and find that the sequence-based approach can give the QSSR models with better fitting ability and predictive power but worse interpretability than the structure-based approach. In addition, though the QSSR modeling using sequence-based approach is not needed for the preparation of the minimization structures of peptides before the modeling, it would be considerably efficient as compared to that using structure-based approach. Copyright © 2011 Elsevier Ltd. All rights reserved.
Mahajan, Gaurang; Mande, Shekhar C
2017-04-04
A comprehensive map of the human-M. tuberculosis (MTB) protein interactome would help fill the gaps in our understanding of the disease, and computational prediction can aid and complement experimental studies towards this end. Several sequence-based in silico approaches tap the existing data on experimentally validated protein-protein interactions (PPIs); these PPIs serve as templates from which novel interactions between pathogen and host are inferred. Such comparative approaches typically make use of local sequence alignment, which, in the absence of structural details about the interfaces mediating the template interactions, could lead to incorrect inferences, particularly when multi-domain proteins are involved. We propose leveraging the domain-domain interaction (DDI) information in PDB complexes to score and prioritize candidate PPIs between host and pathogen proteomes based on targeted sequence-level comparisons. Our method picks out a small set of human-MTB protein pairs as candidates for physical interactions, and the use of functional meta-data suggests that some of them could contribute to the in vivo molecular cross-talk between pathogen and host that regulates the course of the infection. Further, we present numerical data for Pfam domain families that highlights interaction specificity on the domain level. Not every instance of a pair of domains, for which interaction evidence has been found in a few instances (i.e. structures), is likely to functionally interact. Our sorting approach scores candidates according to how "distant" they are in sequence space from known examples of DDIs (templates). Thus, it provides a natural way to deal with the heterogeneity in domain-level interactions. Our method represents a more informed application of local alignment to the sequence-based search for potential human-microbial interactions that uses available PPI data as a prior. 
Our approach is somewhat limited in its sensitivity by the restricted size and diversity of the template dataset, but, given the rapid accumulation of solved protein complex structures, its scope and utility are expected to keep steadily improving.
Local Structure and Anisotropy in the Amorphous Precursor to Ba-Hexaferrite Thin Films
NASA Astrophysics Data System (ADS)
Snyder, J. E.; Harris, V. G.; Koon, N. C.; Sui, X.; Kryder, M. H.
1996-03-01
Ba-hexaferrite thin-films for recording media applications are commonly fabricated by a two-step process: sputter-deposition of an amorphous precursor, followed by annealing to crystallize the BaFe_12O_19 phase. The magnetic anisotropy of the crystalline films can be either in-plane or perpendicular, depending on the sputtering process used in the first step. However, conventional characterization techniques (x-ray diffraction and TEM) have been unable to observe any structure in the amorphous precursor films. In this study, such films are investigated by PD-EXAFS (polarization-dependent extended x-ray absorption fine structure). An anisotropic local ordered structure is observed around both Fe and Ba atoms in the "amorphous" films. This anisotropic local structure appears to determine the orientation of the fast-growing basal plane directions during crystallization, and thus the directions of the c-axes and the magnetic anisotropy. Results suggest that the structure of the amorphous films consists of networks made up of units of Fe atoms surrounded by their O nearest neighbors, that are connected together. Ba atoms appear to fit into in-between spaces as network-modifiers.
Machine learning prediction for classification of outcomes in local minimisation
NASA Astrophysics Data System (ADS)
Das, Ritankar; Wales, David J.
2017-01-01
Machine learning schemes are employed to predict which local minimum will result from local energy minimisation of random starting configurations for a triatomic cluster. The input data consists of structural information at one or more of the configurations in optimisation sequences that converge to one of four distinct local minima. The ability to make reliable predictions, in terms of the energy or other properties of interest, could save significant computational resources in sampling procedures that involve systematic geometry optimisation. Results are compared for two energy minimisation schemes, and for neural network and quadratic functions of the inputs.
Ruwe, Lena; Moshammer, Kai; Hansen, Nils; Kohse-Höinghaus, Katharina
2018-04-25
In this study, we experimentally investigate the high-temperature oxidation kinetics of n-pentane, 1-pentene and 2-methyl-2-butene (2M2B) in a combustion environment using flame-sampling molecular beam mass spectrometry. The selected C5 fuels are prototypes for linear and branched, saturated and unsaturated fuel components, featuring different C-C and C-H bond structures. It is shown that the formation tendency of species, such as polycyclic aromatic hydrocarbons (PAHs), yielded through mass growth reactions increases drastically in the sequence n-pentane < 1-pentene < 2M2B. This comparative study enables valuable insights into fuel-dependent reaction sequences of the gas-phase combustion mechanism that provide explanations for the observed difference in the PAH formation tendency. First, we investigate the fuel-structure-dependent formation of small hydrocarbon species that are yielded as intermediate species during the fuel decomposition, because these species are at the origin of the subsequent mass growth reaction pathways. Second, we review typical PAH formation reactions inspecting repetitive growth sequences in dependence of the molecular fuel structure. Third, we discuss how differences in the intermediate species pool influence the formation reactions of key aromatic ring species that are important for the PAH growth process underlying soot formation. As a main result it was found that for the fuels featuring a C=C double bond, the chemistry of their allylic fuel radicals and their decomposition products strongly influences the combination reactions to the initially formed aromatic ring species and as a consequence, the PAH formation tendency.
Protein family clustering for structural genomics.
Yan, Yongpan; Moult, John
2005-10-28
A major goal of structural genomics is the provision of a structural template for a large fraction of protein domains. The magnitude of this task depends on the number and nature of protein sequence families. With a large number of bacterial genomes now fully sequenced, it is possible to obtain improved estimates of the number and diversity of families in that kingdom. We have used an automated clustering procedure to group all sequences in a set of genomes into protein families. Bench-marking shows the clustering method is sensitive at detecting remote family members, and has a low level of false positives. This comprehensive protein family set has been used to address the following questions. (1) What is the structure coverage for currently known families? (2) How will the number of known apparent families grow as more genomes are sequenced? (3) What is a practical strategy for maximizing structure coverage in future? Our study indicates that approximately 20% of known families with three or more members currently have a representative structure. The study indicates also that the number of apparent protein families will be considerably larger than previously thought: We estimate that, by the criteria of this work, there will be about 250,000 protein families when 1000 microbial genomes have been sequenced. However, the vast majority of these families will be small, and it will be possible to obtain structural templates for 70-80% of protein domains with an achievable number of representative structures, by systematically sampling the larger families.
The computational linguistics of biological sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Searls, D.
1995-12-31
This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. Protein sequences are analogous in many respects, particularly their folding behavior. Proteins have a much richer variety of interactions, but in theory the same linguistic principles could come to bear in describing dependencies between distant residues that arise by virtue of three-dimensional structure. This tutorial will concentrate on nucleic acid sequences.
Stone, Jonathan W; Bleckley, Samuel; Lavelle, Sean; Schroeder, Susan J
2015-01-01
We present new modifications to the Wuchty algorithm in order to better define and explore possible conformations for an RNA sequence. The new features, including parallelization, energy-independent lonely pair constraints, context-dependent chemical probing constraints, helix filters, and optional multibranch loops, provide useful tools for exploring the landscape of RNA folding. Chemical probing alone may not necessarily define a single unique structure. The helix filters and optional multibranch loops are global constraints on RNA structure that are an especially useful tool for generating models of encapsidated viral RNA for which cryoelectron microscopy or crystallography data may be available. The computations generate a combinatorially complete set of structures near a free energy minimum and thus provide data on the density and diversity of structures near the bottom of a folding funnel for an RNA sequence. The conformational landscapes for some RNA sequences may resemble a low, wide basin rather than a steep funnel that converges to a single structure.
Cloning and characterization of a Prevotella melaninogenica hemolysin.
Allison, H E; Hillman, J D
1997-01-01
Hemolysins have been proven to be important virulence factors in many medically relevant pathogenic organisms. Their production has also been implicated in the etiology of periodontal disease. Hemolytic strain 361B of Prevotella melaninogenica, a putative etiologic agent of periodontal disease, was used in this study. The cloning, sequencing, and characterization of phyA, the structural gene for a P. melaninogenica hemolysin, is described. No extensive sequence homology could be identified between phyA and any reported sequence at either the nucleotide or amino acid level. As predicted from sequence analysis, this gene produces a 39-kDa protein which has hemolytic activity as measured by zymogram analysis. Unlike many Ca2+-dependent bacterial hemolysins, both the cloned and native PhyA proteins were enhanced by the presence of EDTA in a dose-dependent fashion with 40 mM EDTA allowing maximum activity. Ca2+ and Mg2+ were found to be inhibitory. The hemolytic activity also was found to have a dose-dependent endpoint. Through recovery of hemolytic activity from a spent reaction, this endpoint was shown to be the result of end product inhibition. This is the first report describing the cloning and sequencing of a gene from P. melaninogenica. PMID:9199448
Cloning and characterization of a Prevotella melaninogenica hemolysin.
Allison, H E; Hillman, J D
1997-07-01
Hemolysins have been proven to be important virulence factors in many medically relevant pathogenic organisms. Their production has also been implicated in the etiology of periodontal disease. Hemolytic strain 361B of Prevotella melaninogenica, a putative etiologic agent of periodontal disease, was used in this study. The cloning, sequencing, and characterization of phyA, the structural gene for a P. melaninogenica hemolysin, is described. No extensive sequence homology could be identified between phyA and any reported sequence at either the nucleotide or amino acid level. As predicted from sequence analysis, this gene produces a 39-kDa protein which has hemolytic activity as measured by zymogram analysis. Unlike many Ca2+-dependent bacterial hemolysins, both the cloned and native PhyA proteins were enhanced by the presence of EDTA in a dose-dependent fashion with 40 mM EDTA allowing maximum activity. Ca2+ and Mg2+ were found to be inhibitory. The hemolytic activity also was found to have a dose-dependent endpoint. Through recovery of hemolytic activity from a spent reaction, this endpoint was shown to be the result of end product inhibition. This is the first report describing the cloning and sequencing of a gene from P. melaninogenica.
Barendt, Pamela A.; Shah, Najaf A.; Barendt, Gregory A.; Kothari, Parth A.; Sarkar, Casim A.
2013-01-01
While the ribosome has evolved to function in complex intracellular environments, these contexts do not easily allow for the study of its inherent capabilities. We have used a synthetic, well-defined, Escherichia coli (E. coli)-based translation system in conjunction with ribosome display, a powerful in vitro selection method, to identify ribosome binding sites (RBSs) that can promote the efficient translation of messenger RNAs (mRNAs) with a leader length representative of natural E. coli mRNAs. In previous work, we used a longer leader sequence and unexpectedly recovered highly efficient cytosine-rich sequences with complementarity to the 16S ribosomal RNA (rRNA) and similarity to eukaryotic RBSs. In the current study, Shine-Dalgarno (SD) sequences were prevalent but non-SD sequences were also heavily enriched and were dominated by novel guanine- and uracil-rich motifs which showed statistically significant complementarity to the 16S rRNA. Additionally, only SD motifs exhibited position-dependent decreases in sequence entropy, indicating that non-SD motifs likely operate by increasing the local concentration of ribosomes in the vicinity of the start codon, rather than by a position-dependent mechanism. These results further support the putative generality of mRNA-rRNA complementarity in facilitating mRNA translation, but also suggest that context (e.g., leader length and composition) dictates the specific subset of possible RBSs that are used for efficient translation of a given transcript. PMID:23427812
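The position-dependent sequence entropy mentioned in the abstract above can be illustrated with a per-column Shannon entropy over a set of aligned, equal-length sequences. This is a minimal sketch under that assumption. The function name `position_entropy` is hypothetical, and the study's actual alignment and normalization choices are not given in the abstract.

```python
import math
from collections import Counter


def position_entropy(seqs):
    """Shannon entropy (in bits) of the symbol distribution at each position.

    Assumes all sequences are already aligned and of equal length.
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must have equal length")
    total = len(seqs)
    entropies = []
    for i in range(length):
        counts = Counter(s[i] for s in seqs)
        # Sum p * log2(1/p) over the symbols observed in this column.
        entropies.append(
            sum((c / total) * math.log2(total / c) for c in counts.values())
        )
    return entropies
```

A column in which every sequence carries the same nucleotide scores 0 bits, while a column with all four nucleotides equally represented scores 2 bits, so a motif anchored at a fixed distance from the start codon would show up as a local dip in this profile.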
2014-01-01
Background Due to rapid sequencing of genomes, there are now millions of deposited protein sequences with no known function. Fast sequence-based comparisons allow detecting close homologs for a protein of interest to transfer functional information from the homologs to the given protein. Sequence-based comparison cannot detect remote homologs, in which evolution has adjusted the sequence while largely preserving structure. Structure-based comparisons can detect remote homologs but most methods for doing so are too expensive to apply at a large scale over structural databases of proteins. Recently, fragment-based structural representations have been proposed that allow fast detection of remote homologs with reasonable accuracy. These representations have also been used to obtain linearly-reducible maps of protein structure space. It has been shown, and is additionally supported by analysis in this paper, that such maps preserve functional co-localization of the protein structure space. Methods Inspired by a recent application of the Latent Dirichlet Allocation (LDA) model for conducting structural comparisons of proteins, we propose higher-order LDA-obtained topic-based representations of protein structures to provide an alternative route for remote homology detection and organization of the protein structure space in few dimensions. Various techniques based on natural language processing are proposed and employed to aid the analysis of topics in the protein structure domain. Results We show that a topic-based representation is just as effective as a fragment-based one at automated detection of remote homologs and organization of protein structure space. We conduct a detailed analysis of the information content in the topic-based representation, showing that topics have semantic meaning. The fragment-based and topic-based representations are also shown to allow prediction of superfamily membership.
Conclusions This work opens exciting avenues in designing novel representations to extract information about protein structures, as well as organizing and mining protein structure space with mature text mining tools. PMID:25080993
NASA Technical Reports Server (NTRS)
Nordheim, A.; Rich, A.
1983-01-01
Three 8-base pair (bp) segments of alternating purine-pyrimidine from the simian virus 40 enhancer region form Z-DNA on negative supercoiling; minichromosome DNase I-hypersensitive sites determined by others bracket these three segments. A survey of transcriptional enhancer sequences reveals a pattern of potential Z-DNA-forming regions which occur in pairs 50-80 bp apart. This may influence local chromatin structure and may be related to transcriptional activation.
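A survey for alternating purine-pyrimidine segments of the kind described above could be sketched as follows. This is only an illustration under stated assumptions: the 8-bp minimum is taken from the abstract, the function names are hypothetical, and the example sequence is made up rather than taken from the SV40 enhancer.

```python
PURINES = set("AG")
PYRIMIDINES = set("CT")


def base_class(base):
    """Return 'R' for purine, 'Y' for pyrimidine, None for anything else."""
    if base in PURINES:
        return "R"
    if base in PYRIMIDINES:
        return "Y"
    return None


def alternates(left, right):
    """True if two neighbouring bases are one purine and one pyrimidine."""
    a, b = base_class(left), base_class(right)
    return a is not None and b is not None and a != b


def alternating_runs(seq, min_len=8):
    """Half-open (start, end) intervals of maximal alternating runs >= min_len."""
    seq = seq.upper()
    runs = []
    start = 0
    for i in range(1, len(seq) + 1):
        # A run ends at the end of the sequence or where alternation breaks.
        if i == len(seq) or not alternates(seq[i - 1], seq[i]):
            if i - start >= min_len:
                runs.append((start, i))
            start = i
    return runs


example = "AAAAGCGCGCGCTTTT"  # made-up sequence with one 8-bp alternating run
runs = alternating_runs(example)
```

Alternation alone only flags candidate regions; whether a segment actually adopts the Z conformation depends on supercoiling and sequence context, as the abstract notes.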
Limits of neutral drift: lessons from the in vitro evolution of two ribozymes.
Petrie, Katherine L; Joyce, Gerald F
2014-10-01
The relative contributions of adaptive selection and neutral drift to genetic change are unknown but likely depend on the inherent abundance of functional genotypes in sequence space and how accessible those genotypes are to one another. To better understand the relative roles of selection and drift in evolution, local fitness landscapes for two different RNA ligase ribozymes were examined using a continuous in vitro evolution system under conditions that foster the capacity for neutral drift to mediate genetic change. The exploration of sequence space was accelerated by increasing the mutation rate using mutagenic nucleotide analogs. Drift was encouraged by carrying out evolution within millions of separate compartments to exploit the founder effect. Deep sequencing of individuals from the evolved populations revealed that the distribution of genotypes did not escape the starting local fitness peak, remaining clustered around the sequence used to initiate evolution. This is consistent with a fitness landscape where high-fitness genotypes are sparse and well isolated, and suggests, at least in this context, that neutral drift alone is not a primary driver of genetic change. Neutral drift does, however, provide a repository of genetic variation upon which adaptive selection can act.
From Globular Clusters to Tidal Dwarfs: Structure Formation in the Tidal Tails of Merging Pairs
NASA Astrophysics Data System (ADS)
Knierman, K. A.; Gallagher, S. C.; Charlton, J. C.; Hunsberger, S. D.; Whitmore, B. C.; Kundu, A.; Hibbard, J. E.; Zaritsky, D. F.
2001-05-01
Using V and I images obtained with the Wide Field Planetary Camera 2 (WFPC2) of the Hubble Space Telescope, we investigate compact stellar structures within tidal tails. Six regions of tidal debris in the four classic "Toomre Sequence" mergers: NGC 4038/9 ("Antennae"), NGC 3256, NGC 3921, and NGC 7252 ("Atoms for Peace") have been studied in order to explore how the star formation depends upon the local and global physical conditions. These mergers sample a range of stages in the evolutionary sequence, and include HI-rich and HI-poor environments. The six tails are found to contain a variety of stellar structures, with sizes ranging from those of globular clusters up to those of dwarf galaxies. From V and I WFPC2 images, we measure the luminosities and colors of the star clusters. NGC 3256 is found to have a large population of young clusters lying along both tails, similar to those found in the inner region of the merger. In contrast, NGC 4038/9 has no clusters in the observed region of the tail, only less luminous point sources likely to be individual stars. NGC 3921 and NGC 7252 have small populations of clusters that are concentrated in certain regions of the tail, and particularly in the prominent tidal dwarfs in the eastern and western tails of NGC 7252. The two cluster-rich tails of NGC 3256 are not distinguished from the others by their ages or by their total HI masses. We acknowledge support from NASA through STScI, and from NSF for an REU supplement for Karen Knierman.
Martin, Brent R; Deerinck, Thomas J; Ellisman, Mark H; Taylor, Susan S; Tsien, Roger Y
2007-09-01
The tetracysteine sequence YRECCPGCCMWR fused to the N terminus of green fluorescent protein (GFP) self-aggregates upon biarsenical labeling in living cells or in vitro. Such dye-triggered aggregates form temperature-dependent morphologies and are dispersed by photobleaching. Fusion of the biarsenical aggregating GFP to the regulatory (R) or catalytic (C) subunit of PKA traps intact holoenzyme in compact fluorescent puncta upon biarsenical labeling. Contrary to the classical model of PKA activation, elevated cAMP does not allow RIalpha and Calpha to diffuse far apart unless the pseudosubstrate inhibitor PKI or locally concentrated substrate is coexpressed. However, RIIalpha releases Calpha upon elevated cAMP alone, dependent on autophosphorylation of the RIIalpha inhibitory domain. DAKAP1alpha overexpression induced R and C outer mitochondrial colocalization and showed similar regulation. Overall, effective separation of type I PKA is substrate dependent, whereas type II PKA dissociation relies on autophosphorylation.
Yunus, Muhammad Amir; Lin, Xiaoyan; Bailey, Dalan; Karakasiliotis, Ioannis; Chaudhry, Yasmin; Vashist, Surender; Zhang, Guo; Thorne, Lucy; Kao, C. Cheng
2014-01-01
ABSTRACT All members of the Caliciviridae family of viruses produce a subgenomic RNA during infection. The subgenomic RNA typically encodes only the major and minor capsid proteins, but in murine norovirus (MNV), the subgenomic RNA also encodes the VF1 protein, which functions to suppress host innate immune responses. To date, the mechanism of norovirus subgenomic RNA synthesis has not been characterized. We have previously described the presence of an evolutionarily conserved RNA stem-loop structure on the negative-sense RNA, the complementary sequence of which codes for the viral RNA-dependent RNA polymerase (NS7). The conserved stem-loop is positioned 6 nucleotides 3′ of the start site of the subgenomic RNA in all caliciviruses. We demonstrate that the conserved stem-loop is essential for MNV viability. Mutant MNV RNAs with substitutions in the stem-loop replicated poorly until they accumulated mutations that revert to restore the stem-loop sequence and/or structure. The stem-loop sequence functions in a noncoding context, as it was possible to restore the replication of an MNV mutant by introducing an additional copy of the stem-loop between the NS7- and VP1-coding regions. Finally, in vitro biochemical data suggest that the stem-loop sequence is sufficient for the initiation of viral RNA synthesis by the recombinant MNV RNA-dependent RNA polymerase, confirming that the stem-loop forms the core of the norovirus subgenomic promoter. IMPORTANCE Noroviruses are a significant cause of viral gastroenteritis, and it is important to understand the mechanism of norovirus RNA synthesis. Here we describe the identification of an RNA stem-loop structure that functions as the core of the norovirus subgenomic RNA promoter in cells and in vitro. This work provides new insights into the molecular mechanisms of norovirus RNA synthesis and the sequences that determine the recognition of viral RNA by the RNA-dependent RNA polymerase. PMID:25392209
Yunus, Muhammad Amir; Lin, Xiaoyan; Bailey, Dalan; Karakasiliotis, Ioannis; Chaudhry, Yasmin; Vashist, Surender; Zhang, Guo; Thorne, Lucy; Kao, C Cheng; Goodfellow, Ian
2015-01-15
All members of the Caliciviridae family of viruses produce a subgenomic RNA during infection. The subgenomic RNA typically encodes only the major and minor capsid proteins, but in murine norovirus (MNV), the subgenomic RNA also encodes the VF1 protein, which functions to suppress host innate immune responses. To date, the mechanism of norovirus subgenomic RNA synthesis has not been characterized. We have previously described the presence of an evolutionarily conserved RNA stem-loop structure on the negative-sense RNA, the complementary sequence of which codes for the viral RNA-dependent RNA polymerase (NS7). The conserved stem-loop is positioned 6 nucleotides 3' of the start site of the subgenomic RNA in all caliciviruses. We demonstrate that the conserved stem-loop is essential for MNV viability. Mutant MNV RNAs with substitutions in the stem-loop replicated poorly until they accumulated mutations that revert to restore the stem-loop sequence and/or structure. The stem-loop sequence functions in a noncoding context, as it was possible to restore the replication of an MNV mutant by introducing an additional copy of the stem-loop between the NS7- and VP1-coding regions. Finally, in vitro biochemical data suggest that the stem-loop sequence is sufficient for the initiation of viral RNA synthesis by the recombinant MNV RNA-dependent RNA polymerase, confirming that the stem-loop forms the core of the norovirus subgenomic promoter. Noroviruses are a significant cause of viral gastroenteritis, and it is important to understand the mechanism of norovirus RNA synthesis. Here we describe the identification of an RNA stem-loop structure that functions as the core of the norovirus subgenomic RNA promoter in cells and in vitro. This work provides new insights into the molecular mechanisms of norovirus RNA synthesis and the sequences that determine the recognition of viral RNA by the RNA-dependent RNA polymerase. Copyright © 2015, American Society for Microbiology. 
Sequence dependence of electron-induced DNA strand breakage revealed by DNA nanoarrays
Keller, Adrian; Rackwitz, Jenny; Cauët, Emilie; Liévin, Jacques; Körzdörfer, Thomas; Rotaru, Alexandru; Gothelf, Kurt V.; Besenbacher, Flemming; Bald, Ilko
2014-01-01
The electronic structure of DNA is determined by its nucleotide sequence, which is for instance exploited in molecular electronics. Here we demonstrate that the DNA strand breakage induced by low-energy electrons (18 eV) also depends on the nucleotide sequence. To determine the absolute cross sections for electron-induced single strand breaks in specific 13-mer oligonucleotides we used atomic force microscopy analysis of DNA origami based DNA nanoarrays. We investigated the DNA sequences 5′-TT(XYX)₃TT with X = A, G, C and Y = T, BrU (5-bromouracil) and found absolute strand break cross sections between 2.66 × 10⁻¹⁴ cm² and 7.06 × 10⁻¹⁴ cm². The highest cross section was found for 5′-TT(ATA)₃TT and 5′-TT(ABrUA)₃TT, respectively. BrU is a radiosensitizer, which has been discussed for use in cancer radiation therapy. The replacement of T by BrU in the investigated DNA sequences leads to a slight increase of the absolute strand break cross sections, resulting in sequence-dependent enhancement factors between 1.14 and 1.66. Nevertheless, the variation of strand break cross sections due to the specific nucleotide sequence is considerably higher. Thus, the present results suggest the development of targeted radiosensitizers for cancer radiation therapy. PMID:25487346
Ciolkowski, Ingo; Wanke, Dierk; Birkenbihl, Rainer P; Somssich, Imre E
2008-09-01
WRKY transcription factors have been shown to play a major role in regulating, both positively and negatively, the plant defense transcriptome. Nearly all studied WRKY factors appear to have a stereotypic binding preference to one DNA element termed the W-box. How specificity for certain promoters is accomplished therefore remains completely unknown. In this study, we tested five distinct Arabidopsis WRKY transcription factor subfamily members for their DNA binding selectivity towards variants of the W-box embedded in neighboring DNA sequences. These studies revealed for the first time differences in their binding site preferences, which are partly dependent on additional adjacent DNA sequences outside of the TTGACY-core motif. A consensus WRKY binding site derived from these studies was used for in silico analysis to identify potential target genes within the Arabidopsis genome. Furthermore, we show that even subtle amino acid substitutions within the DNA binding region of AtWRKY11 strongly impinge on its binding activity. Additionally, all five factors were found localized exclusively to the plant cell nucleus and to be capable of trans-activating expression of a reporter gene construct in vivo.
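The in silico search for the TTGACY core motif can be illustrated with a short sketch. It assumes Y denotes C or T (the standard IUPAC meaning) and scans both strands; the function names and the test sequence are hypothetical, and the refined consensus derived in the study is not reproduced here.

```python
import re

# TTGACY core, with Y = C or T; the lookahead also reports overlapping hits.
WBOX_CORE = re.compile(r"(?=(TTGAC[CT]))")
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_wbox_cores(seq):
    """Return sorted (forward_start, strand, motif) tuples for both strands."""
    seq = seq.upper()
    n = len(seq)
    hits = [(m.start(), "+", m.group(1)) for m in WBOX_CORE.finditer(seq)]
    for m in WBOX_CORE.finditer(reverse_complement(seq)):
        # Convert the minus-strand start to the leftmost forward coordinate.
        hits.append((n - m.start() - 6, "-", m.group(1)))
    return sorted(hits)


promoter = "AATTGACCGGGGTCAATT"  # made-up fragment with one site per strand
hits = find_wbox_cores(promoter)
```

Because the abstract reports that flanking sequence outside the core affects binding, a realistic scan would also record the neighbouring bases of each hit rather than the core alone.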
Gene organization and alternative splicing of human prohormone convertase PC8.
Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T
1998-01-01
The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811
Accurate Simulation and Detection of Coevolution Signals in Multiple Sequence Alignments
Ackerman, Sharon H.; Tillier, Elisabeth R.; Gatti, Domenico L.
2012-01-01
Background While the conserved positions of a multiple sequence alignment (MSA) are clearly of interest, non-conserved positions can also be important because, for example, destabilizing effects at one position can be compensated by stabilizing effects at another position. Different methods have been developed to recognize the evolutionary relationship between amino acid sites, and to disentangle functional/structural dependencies from historical/phylogenetic ones. Methodology/Principal Findings We have used two complementary approaches to test the efficacy of these methods. In the first approach, we have used a new program, MSAvolve, for the in silico evolution of MSAs, which records a detailed history of all covarying positions, and builds a global coevolution matrix as the accumulated sum of individual matrices for the positions forced to co-vary, the recombinant coevolution, and the stochastic coevolution. We have simulated over 1600 MSAs for 8 protein families, which reflect sequences of different sizes and proteins with widely different functions. The calculated coevolution matrices were compared with the coevolution matrices obtained for the same evolved MSAs with different coevolution detection methods. In a second approach we have evaluated the capacity of the different methods to predict close contacts in the representative X-ray structures of an additional 150 protein families using only experimental MSAs. Conclusions/Significance Methods based on the identification of global correlations between pairs were found to be generally superior to methods based only on local correlations in their capacity to identify coevolving residues using either simulated or experimental MSAs. 
However, the significant variability in the performance of different methods with different proteins suggests that the simulation of MSAs that replicate the statistical properties of the experimental MSA can be a valuable tool to identify the coevolution detection method that is most effective in each case. PMID:23091608
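As a point of reference for the "local correlation" methods the abstract contrasts with global ones, the following sketch computes pairwise mutual information between alignment columns. It is a generic textbook statistic under simplifying assumptions (no gap handling, no sequence weighting, no phylogenetic correction) and is not the MSAvolve program or any specific method evaluated in the paper; the toy alignment is made up.

```python
import math
from collections import Counter


def mutual_information(col_a, col_b):
    """Mutual information (bits) between two alignment columns."""
    n = len(col_a)
    freq_a = Counter(col_a)
    freq_b = Counter(col_b)
    freq_ab = Counter(zip(col_a, col_b))
    mi = 0.0
    for (a, b), count in freq_ab.items():
        p_ab = count / n
        mi += p_ab * math.log2(p_ab / ((freq_a[a] / n) * (freq_b[b] / n)))
    return mi


def mi_matrix(msa):
    """Symmetric matrix of pairwise column mutual information."""
    columns = list(zip(*msa))
    size = len(columns)
    return [[mutual_information(columns[i], columns[j]) for j in range(size)]
            for i in range(size)]


# Columns 0 and 2 covary perfectly; column 1 varies independently of both.
toy_msa = ["AKL", "AEL", "DKV", "DEV"]
matrix = mi_matrix(toy_msa)
```

Each entry here depends only on the two columns involved, which is what makes the statistic local; global methods instead fit all pairs jointly to separate direct from indirect correlations.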
A critical analysis of computational protein design with sparse residue interaction graphs
Georgiev, Ivelin S.
2017-01-01
Protein design algorithms enumerate a combinatorial number of candidate structures to compute the Global Minimum Energy Conformation (GMEC). To efficiently find the GMEC, protein design algorithms must methodically reduce the conformational search space. By applying distance and energy cutoffs, the protein system to be designed can thus be represented using a sparse residue interaction graph, where the number of interacting residue pairs is less than all pairs of mutable residues, and the corresponding GMEC is called the sparse GMEC. However, ignoring some pairwise residue interactions can lead to a change in the energy, conformation, or sequence of the sparse GMEC vs. the original or the full GMEC. Despite the widespread use of sparse residue interaction graphs in protein design, the above mentioned effects of their use have not been previously analyzed. To analyze the costs and benefits of designing with sparse residue interaction graphs, we computed the GMECs for 136 different protein design problems both with and without distance and energy cutoffs, and compared their energies, conformations, and sequences. Our analysis shows that the differences between the GMECs depend critically on whether or not the design includes core, boundary, or surface residues. Moreover, neglecting long-range interactions can alter local interactions and introduce large sequence differences, both of which can result in significant structural and functional changes. Designs on proteins with experimentally measured thermostability show it is beneficial to compute both the full and the sparse GMEC accurately and efficiently. To this end, we show that a provable, ensemble-based algorithm can efficiently compute both GMECs by enumerating a small number of conformations, usually fewer than 1000. 
This provides a novel way to combine sparse residue interaction graphs with provable, ensemble-based algorithms to reap the benefits of sparse residue interaction graphs while avoiding their potential inaccuracies. PMID:28358804
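How distance and energy cutoffs turn a full set of residue pairs into a sparse residue interaction graph can be sketched as below. This is a minimal illustration, assuming one representative coordinate per residue and a precomputed pairwise energy bound; the function name, the cutoff values, and the toy data are hypothetical and do not come from the paper or from any design package.

```python
import math
from itertools import combinations


def sparse_interaction_graph(coords, pair_energy, distance_cutoff, energy_cutoff):
    """Keep a residue pair only if it is both close and energetically relevant.

    coords: residue id -> (x, y, z) representative position (assumption).
    pair_energy: (i, j) with i < j -> interaction energy bound (assumption).
    """
    edges = set()
    for i, j in combinations(sorted(coords), 2):
        close = math.dist(coords[i], coords[j]) <= distance_cutoff
        strong = abs(pair_energy.get((i, j), 0.0)) >= energy_cutoff
        if close and strong:
            edges.add((i, j))
    return edges


# Toy system: residue 4 is far away; pair (1, 3) is close but very weak.
coords = {1: (0.0, 0.0, 0.0), 2: (3.0, 0.0, 0.0),
          3: (0.0, 4.0, 0.0), 4: (20.0, 0.0, 0.0)}
pair_energy = {(1, 2): -1.2, (1, 3): -0.05, (2, 3): 0.8,
               (1, 4): -0.3, (2, 4): -0.2, (3, 4): -0.1}
edges = sparse_interaction_graph(coords, pair_energy,
                                 distance_cutoff=6.0, energy_cutoff=0.1)
full_pairs = len(coords) * (len(coords) - 1) // 2
```

Every dropped pair contributes nothing to the sparse energy function, which is why the sparse GMEC can differ from the full GMEC in the ways the abstract describes.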
Nanolayered Features of Collagen-like Peptides
NASA Technical Reports Server (NTRS)
Valluzzi, Regina; Bini, Elisabetta; Haas, Terry; Cebe, Peggy; Kaplan, David L.
2003-01-01
We have been investigating collagen-like model oligopeptides as molecular bases for complex ordered biomimetic materials. The collagen-like molecules incorporate aspects of native collagen sequence and secondary structure. Designed modifications to native primary and secondary structure have been incorporated to control the nanostructure and microstructure of the collagen-like materials produced. We find that the collagen-like molecules form a number of lyotropic rod liquid crystalline phases, which because of their strong temperature dependence in the liquid state can also be viewed as solvent intercalated thermotropic liquid crystals. The liquid crystalline phases formed by the molecules can be captured in the solid state by drying off solvent, resulting in solid nanopatterned (chemically and physically) thermally stable (to greater than 100 C) materials. Designed sequences which stabilize smectic phases have allowed a variety of nanoscale multilayered biopolymeric materials to be developed. Preliminary investigations suggest that chemical patterns running perpendicular to the smectic layer plane can be functionalized and used to localize a variety of organic, inorganic, and organometallic moieties in very simple multilayered nanocomposites. The phase behavior of collagen-like oligopeptide materials is described, emphasizing the correlation between mesophase, molecular orientation, and chemical patterning at the microscale and nanoscale. In many cases, the textures observed for smectic and hexatic phase collagens are remarkably similar to the complex (and not fully understood) helicoids observed in biological collagen-based tissues. Comparisons between biological morphologies and collagen model liquid crystalline (and solidified materials) textures may help us understand the molecular features which impart order and function to the extracellular matrix and to collagen-based mineralized tissues. 
Initial studies have utilized synthetic collagen-like peptides while future work will also focus on similar sequences generated via genetic engineering methods.
Jin, Hao; Mo, Lanxin; Pan, Lin; Hou, Qaingchaun; Li, Chuanjuan; Darima, Iaptueva; Yu, Jie
2018-05-09
Traditional fermented dairy foods including cottage cheese have been major components of the Buryatia diet for centuries. Buryatian cheeses have maintained not only their unique taste and flavor but also their rich natural lactic acid bacteria (LAB) content. However, relatively few studies have described their microbial communities or explored their potential to serve as LAB resources. In this study, the bacterial microbiota community of 7 traditional artisan cheeses produced by local Buryatian families was investigated using single-molecule, real-time sequencing. In addition, we compared the bacterial microbiota of the Buryatian cheese samples with data sets of cheeses from Kazakhstan and Italy. Furthermore, we isolated and preserved several LAB samples from Buryatian cheese. A total of 62 LAB strains (belonging to 6 genera and 14 species or subspecies) were isolated from 7 samples of Buryatian cheese. Full-length 16S rRNA sequencing of the microbiota revealed 145 species of 82 bacterial genera, belonging to 7 phyla. The most dominant species was Lactococcus lactis (43.89%). Data sets of cheeses from Italy and Kazakhstan were retrieved from public databases. Principal component analysis and multivariate ANOVA showed marked differences in the structure of the microbiota communities in the cheese data sets from the 3 regions. Linear discriminant analyses of the effect size identified 48 discriminant bacterial clades among the 3 groups, which might have contributed to the observed structural differences. Our results indicate that the bacterial communities of traditional artisan cheeses vary depending on geographic origin. In addition, we isolated novel and valuable LAB resources for the improvement of cottage cheese production.
A Novel Helicase-Type Protein in the Nucleolus: Protein NOH61
Zirwes, Rudolf F.; Eilbracht, Jens; Kneissel, Sandra; Schmidt-Zachmann, Marion S.
2000-01-01
We report the identification, cDNA cloning, and molecular characterization of a novel, constitutive nucleolar protein. The cDNA-deduced amino acid sequence of the human protein defines a polypeptide of a calculated mass of 61.5 kDa and an isoelectric point of 9.9. Inspection of the primary sequence disclosed that the protein is a member of the family of “DEAD-box” proteins, representing a subgroup of putative ATP-dependent RNA helicases. ATPase activity of the recombinant protein is evident and stimulated by a variety of polynucleotides tested. Immunolocalization studies revealed that protein NOH61 (nucleolar helicase of 61 kDa) is highly conserved during evolution and shows a strong accumulation in nucleoli. Biochemical experiments have shown that protein NOH61 synthesized in vitro sediments with ∼11.5 S, i.e., apparently as homo-oligomeric structures. By contrast, sucrose gradient centrifugation analysis of cellular extracts obtained with buffers of elevated ionic strength (600 mM NaCl) revealed that the solubilized native protein sediments with ∼4 S, suggestive of the monomeric form. Interestingly, protein NOH61 has also been identified as a specific constituent of free nucleoplasmic 65S preribosomal particles but is absent from cytoplasmic ribosomes. Treatment of cultured cells with 1) the transcription inhibitor actinomycin D and 2) RNase A results in a complete dissociation of NOH61 from nucleolar structures. The specific intracellular localization and its striking sequence homology to other known RNA helicases lead to the hypothesis that protein NOH61 might be involved in ribosome synthesis, most likely during the assembly process of the large (60S) ribosomal subunit. PMID:10749921
Modeling Structural Dynamics of Biomolecular Complexes by Coarse-Grained Molecular Simulations.
Takada, Shoji; Kanada, Ryo; Tan, Cheng; Terakawa, Tsuyoshi; Li, Wenfei; Kenzaki, Hiroo
2015-12-15
Due to the hierarchic nature of biomolecular systems, their computational modeling calls for multiscale approaches, in which coarse-grained (CG) simulations are used to address long-time dynamics of large systems. Here, we review recent developments and applications of CG modeling methods, focusing on our methods primarily for proteins, DNA, and their complexes. These methods have been implemented in the CG biomolecular simulator, CafeMol. Our CG model has resolution such that ∼10 non-hydrogen atoms are grouped into one CG particle on average. For proteins, each amino acid is represented by one CG particle. For DNA, one nucleotide is simplified by three CG particles, representing sugar, phosphate, and base. The protein modeling is based on the idea that proteins have a globally funnel-like energy landscape, which is encoded in the structure-based potential energy function. We first describe two representative minimal models of proteins, called the elastic network model and the classic Go̅ model. We then present a more elaborate protein model, which extends the minimal model to incorporate sequence and context dependent local flexibility and nonlocal contacts. For DNA, we describe a model developed by de Pablo's group that was tuned to reproduce well sequence-dependent structural and thermodynamic experimental data for single- and double-stranded DNAs. Protein-DNA interactions are modeled either by the structure-based term for specific cases or by electrostatic and excluded volume terms for nonspecific cases. We also discuss the time scale mapping in CG molecular dynamics simulations. While the apparent single time step of our CGMD is about 10 times larger than that in the fully atomistic molecular dynamics for small-scale dynamics, large-scale motions can be further accelerated by two orders of magnitude with the use of a CG model and a low friction constant in Langevin dynamics. Next, we present four examples of applications.
First, the classic Go̅ model was used to emulate one ATP cycle of a molecular motor, kinesin. Second, nonspecific protein-DNA binding was studied by a combination of elaborate protein and DNA models. Third, a transcription factor, p53, that contains highly fluctuating regions was simulated on two perpendicularly arranged DNA segments, addressing intersegmental transfer of p53. Fourth, we simulated structural dynamics of dinucleosomes connected by a linker DNA, finding distinct types of internucleosome docking and salt-concentration-dependent compaction. Finally, we discuss many of the limitations in the current approaches and future directions. In particular, more accurate electrostatic treatment and a phospholipid model that matches our CG resolutions are of immediate importance.
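The elastic network model named above as one of the minimal protein models can be sketched in a few lines. This is a generic textbook form, assuming one bead per residue, harmonic springs between all bead pairs within a cutoff of the reference structure, and a single spring constant; it is not the CafeMol implementation, and the function names, cutoff, and coordinates are hypothetical.

```python
import math
from itertools import combinations


def build_springs(native, cutoff):
    """Map each bead pair within `cutoff` in the native structure to its rest length."""
    springs = {}
    for i, j in combinations(range(len(native)), 2):
        rest = math.dist(native[i], native[j])
        if rest <= cutoff:
            springs[(i, j)] = rest
    return springs


def enm_energy(coords, springs, k=1.0):
    """Harmonic energy: sum over springs of (k / 2) * (d - d_native)^2."""
    return sum(0.5 * k * (math.dist(coords[i], coords[j]) - rest) ** 2
               for (i, j), rest in springs.items())


# Three beads on a line, 3.8 apart (arbitrary units); only neighbours are linked.
native = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
springs = build_springs(native, cutoff=5.0)

# Displace the last bead by 1.0 along x: one spring is stretched by 1.0.
stretched = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (8.6, 0.0, 0.0)]
e_native = enm_energy(native, springs)
e_stretched = enm_energy(stretched, springs)
```

By construction the energy is zero at the reference structure and grows quadratically with deformation, the simplest example of a structure-based potential with a single minimum at the native state.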
Rhythm sensitivity in macaque monkeys
Selezneva, Elena; Deike, Susann; Knyazeva, Stanislava; Scheich, Henning; Brechmann, André; Brosch, Michael
2013-01-01
This study provides evidence that monkeys are rhythm sensitive. We composed isochronous tone sequences consisting of repeating triplets of two short tones and one long tone which humans perceive as repeating triplets of two weak and one strong beat. This regular sequence was compared to an irregular sequence with the same number of randomly arranged short and long tones with no such beat structure. To search for indication of rhythm sensitivity we employed an oddball paradigm in which occasional duration deviants were introduced in the sequences. In a pilot study on humans we showed that subjects more easily detected these deviants when they occurred in a regular sequence. In the monkeys we searched for spontaneous behaviors the animals executed concomitant with the deviants. We found that monkeys more frequently exhibited changes of gaze and facial expressions to the deviants when they occurred in the regular sequence compared to the irregular sequence. In addition we recorded neuronal firing and local field potentials from 175 sites of the primary auditory cortex during sequence presentation. We found that both types of neuronal signals differentiated regular from irregular sequences. Both signals were stronger in regular sequences and occurred after the onset of the long tones, i.e., at the position of the strong beat. Local field potential responses were also significantly larger for the durational deviants in regular sequences, yet in a later time window. We speculate that these temporal pattern-selective mechanisms with a focus on strong beats and their deviants underlie the perception of rhythm in the chosen sequences. PMID:24046732
Singh, Bipin K; Pandey, Praveen C
2016-07-20
Engineering of thermally tunable terahertz photonic and omnidirectional bandgaps has been demonstrated theoretically in one-dimensional quasi-periodic photonic crystals (PCs) containing semiconductor and dielectric materials. The considered quasi-periodic structures are taken in the form of Fibonacci, Thue-Morse, and double periodic sequences. We have shown that the photonic and omnidirectional bandgaps in the quasi-periodic structures with semiconductor constituents depend strongly on the temperature, the thickness of the constituent semiconductor and dielectric material layers, and the generation of the quasi-periodic sequences. It has been found that the number of photonic bandgaps increases with layer thickness and generation of the quasi-periodic sequences. Omnidirectional bandgaps in the structures have also been obtained. Results show that the bandwidths of photonic and omnidirectional bandgaps are tunable by changing the temperature and lattice parameters of the structures. The generation of quasi-periodic sequences can also change the properties of photonic and omnidirectional bandgaps remarkably. The frequency range of the photonic and omnidirectional bandgaps can be tuned by the change of temperature and layer thickness of the considered quasi-periodic structures. This work will be useful to design tunable terahertz PC devices.
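The three layer orderings named above are commonly generated by letter-substitution rules, which the sketch below applies. It assumes A and B stand for the two layer materials and uses the standard rules (Fibonacci: A→AB, B→A; Thue-Morse: A→AB, B→BA; double-period: A→AB, B→AA); the seed, the generation numbering, and the function name are assumptions and may differ from the conventions used in the paper.

```python
# Standard substitution rules; A and B label the two layer types (assumption).
RULES = {
    "fibonacci": {"A": "AB", "B": "A"},
    "thue_morse": {"A": "AB", "B": "BA"},
    "double_period": {"A": "AB", "B": "AA"},
}


def layer_sequence(rule_name, generations, seed="A"):
    """Apply a substitution rule `generations` times, starting from `seed`."""
    rules = RULES[rule_name]
    word = seed
    for _ in range(generations):
        word = "".join(rules[layer] for layer in word)
    return word


fib = layer_sequence("fibonacci", 4)
thue = layer_sequence("thue_morse", 3)
double = layer_sequence("double_period", 3)
```

The number of layers grows with each generation (by Fibonacci numbers in the first case and by doubling in the other two), so higher generations give thicker stacks with more interfaces.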
Farjami, Elaheh; Clima, Lilia; Gothelf, Kurt V; Ferapontova, Elena E
2010-06-01
A DNA molecular beacon approach was used for the analysis of interactions between DNA and Methylene Blue (MB) as a redox indicator of a hybridization event. DNA hairpin structures of different length and guanine (G) content were immobilized onto gold electrodes in their folded states through the alkanethiol linker at the 5'-end. Binding of MB to the folded hairpin DNA was electrochemically studied and compared with binding to the duplex structure formed by hybridization of the hairpin DNA to a complementary DNA strand. Variation of the electrochemical signal from the DNA-MB complex was shown to depend primarily on the DNA length and sequence used: the G-C base pairs were the preferential sites of MB binding in the duplex. For short 20 nts long DNA sequences, the increased electrochemical response from MB bound to the duplex structure was consistent with the increased amount of bound and electrochemically readable MB molecules (i.e. MB molecules that are available for the electron transfer (ET) reaction with the electrode). With longer DNA sequences, the balance between the amounts of the electrochemically readable MB molecules bound to the hairpin DNA and to the hybrid was opposite: a part of the MB molecules bound to the long-sequence DNA duplex seem to be electrochemically mute due to long ET distance. The increasing electrochemical response from MB bound to the short-length DNA hybrid contrasts with the decreasing signal from MB bound to the long-length DNA hybrid and allows an "off"-"on" genosensor development.
Identification of a Signal-Responsive Nuclear Export Sequence in Class II Histone Deacetylases
McKinsey, Timothy A.; Zhang, Chun Li; Olson, Eric N.
2001-01-01
Activation of muscle-specific genes by the MEF2 transcription factor is inhibited by class II histone deacetylases (HDACs) 4 and 5, which contain carboxy-terminal deacetylase domains and amino-terminal extensions required for association with MEF2. The inhibitory action of HDACs is overcome by myogenic signals which disrupt MEF2-HDAC interactions and stimulate nuclear export of these transcriptional repressors. Nucleocytoplasmic trafficking of HDAC5 is mediated by binding of the chaperone protein 14-3-3 to two phosphoserine residues (Ser-259 and Ser-498) in its amino-terminal extension. Here we show that HDAC4 and -5 each contain a signal-responsive nuclear export sequence (NES) at their extreme carboxy termini. The NES is conserved in another class II HDAC, HDAC7, but is absent in class I HDACs and the HDAC-related corepressor, MEF2-interacting transcription repressor. Our results suggest that this conserved NES is inactive in unphosphorylated HDAC5, which is localized to the nucleus, and that calcium-calmodulin-dependent protein kinase (CaMK)-dependent binding of 14-3-3 to phosphoserines 259 and 498 activates the NES, with consequent export of the transcriptional repressor to the cytoplasm. A single amino acid substitution in this NES is sufficient to retain HDAC5 in the nucleus in the face of CaMK signaling. These findings provide molecular insight into the mechanism by which extracellular cues alter chromatin structure to promote muscle differentiation and other MEF2-regulated processes. PMID:11509672
Krylov, V; Tlapáková, T; Mácha, J; Curlej, J; Ryban, L; Chrenek, P
2008-01-01
For chromosomal localization of the hFVIII human transgene in F2 and F3 generation of transgenic rabbits, FISH-TSA was applied. A short cDNA probe (1250 bp) targeted chromosomes 3, 7, 8, 9 and 18 of an F2 male (animal 1-3-8). Two transgenic offspring (F3) revealed signal positions in chromosome 3 and chromosomes 3 and 7, respectively. Sequencing and structure analysis of the rabbit orthologous gene revealed high similarity to its human counterpart. Part of the sequenced cDNA (1310 bp) served as a probe for FISH-TSA analysis. The rabbit gene was localized in the q arm terminus of the X chromosome. This result is in agreement with reciprocal chromosome painting between the rabbit and the human. The presented FISH-TSA method provides strong signals without any interspecies reactivity.
Local atomic and magnetic structure of dilute magnetic semiconductor (Ba,K)(Zn,Mn)2As2
Frandsen, Benjamin A.; Gong, Zizhou; Terban, Maxwell W.; ...
2016-09-06
We studied the atomic and magnetic structure of the dilute ferromagnetic semiconductor system (Ba,K)(Zn,Mn)2As2 through atomic and magnetic pair distribution function analysis of temperature-dependent x-ray and neutron total scattering data. Furthermore, we detected a change in curvature of the temperature-dependent unit cell volume of the average tetragonal crystallographic structure at a temperature coinciding with the onset of ferromagnetic order. We also observed the existence of a well-defined local orthorhombic structure on a short length scale of ≲5 Å, resulting in a rather asymmetrical local environment of the Mn and As ions. The magnetic PDF revealed ferromagnetic alignment of Mn spins along the crystallographic c axis, with robust nearest-neighbor ferromagnetic correlations that exist even above the ferromagnetic ordering temperature. Finally, we discuss these results in the context of other experiments and theoretical studies on this system.
Conservation of Fold and Topology of Functional Elements in Thiamin Pyrophosphate Enzymes
NASA Technical Reports Server (NTRS)
Dominiak, P.; Ciszak, E. M.
2005-01-01
Thiamin pyrophosphate (TPP)-dependent enzymes are a highly divergent family of proteins binding both TPP and metal ions. They perform decarboxylation of α-ketoacids and of a common -(O=)C-C(OH)- fragment of α-hydroxyaldehydes. Prior to knowledge of three-dimensional structures of these enzymes, the GDGY25-30NN sequence was used to identify these enzymes. Subsequently, a number of structural studies on those enzymes revealed multi-subunit organization and the features of the two duplicate cofactor binding sites. Analyzing the structures of 44 structurally known enzymes, we found that the common structure of these enzymes is reduced to 180-220 amino acid long fragments of two PP and two PYR domains that form the [PP:PYR]2 binding center of two cofactor molecules. The structures of PP and PYR are arranged in a similar fold consisting of a six-stranded β-sheet with triplets of helices on both sides. Residues surrounding the cofactors are not strictly conserved, but they provide the same interatomic contacts required for the catalytic functions that these enzymes perform while maintaining interactive structural integrity. These structural and functional amino acids are topological counterparts located in the same positions of the conserved fold of sets of PP and PYR domains. Additional parallels include short fragments of sequences that link these amino acids to the fold and function. This report on the structural commonalities amongst TPP-dependent enzymes is thought to contribute new approaches to annotation that may assist in advancing the functional proteomics of TPP-dependent enzymes, and trace their complexity within evolutionary context.
Communication: On the origin of the non-Arrhenius behavior in water reorientation dynamics.
Stirnemann, Guillaume; Laage, Damien
2012-07-21
We combine molecular dynamics simulations and analytic modeling to determine the origin of the non-Arrhenius temperature dependence of liquid water's reorientation and hydrogen-bond dynamics between 235 K and 350 K. We present a quantitative model connecting hydrogen-bond exchange dynamics to local structural fluctuations, measured by the asphericity of Voronoi cells associated with each water molecule. For a fixed local structure the regular Arrhenius behavior is recovered, and the global anomalous temperature dependence is demonstrated to essentially result from a continuous shift in the unimodal structure distribution upon cooling. The non-Arrhenius behavior can thus be explained without invoking an equilibrium between distinct structures. In addition, the large width of the homogeneous structural distribution is shown to cause a growing dynamical heterogeneity and a non-exponential relaxation at low temperature.
Lasserre, Moira; Fresia, Pablo; Greif, Gonzalo; Iraola, Gregorio; Castro-Ramos, Miguel; Juambeltz, Arturo; Nuñez, Álvaro; Naya, Hugo; Robello, Carlos; Berná, Luisa
2018-01-02
Bovine tuberculosis (bTB) poses serious risks to animal welfare and economy, as well as to public health as a zoonosis. Its etiological agent, Mycobacterium bovis, belongs to the Mycobacterium tuberculosis complex (MTBC), a group of genetically monomorphic organisms characterized by a remarkably high overall nucleotide identity (99.9%). Indeed, this characteristic is of major concern for correct typing and determination of strain-specific traits based on sequence diversity. Due to its historical economic dependence on cattle production, Uruguay is deeply affected by the prevailing incidence of Mycobacterium bovis. With the world's highest number of cattle per human, and its intensive cattle production, Uruguay represents a particularly suited setting to evaluate genomic variability among isolates, and the diversity traits associated with this pathogen. We compared 186 genomes from MTBC strains isolated worldwide, and found a highly structured population in M. bovis. The analysis of 23 new M. bovis genomes, belonging to strains isolated in Uruguay, evidenced three groups present in the country. Despite presenting an expected highly conserved genomic structure and sequence, these strains segregate in a clustered manner within the worldwide phylogeny. Analysis of the non-pe/ppe differential areas against a reference genome defined four main sources of variability, namely: regions of difference (RD), variable genes, duplications and novel genes. RDs and variant analysis segregated the strains into clusters that are concordant with their spoligotype identities. Due to its high homoplasy rate, spoligotyping failed to reflect the true genomic diversity among worldwide representative strains; however, it remains a good indicator for closely related populations. This study introduces a comprehensive population structure analysis of worldwide M. bovis isolates. The incorporation and analysis of 23 novel Uruguayan M. bovis genomes sheds light on the genomic diversity of this pathogen, evidencing the existence of greater genetic variability among strains than previously contemplated.
Structures of Bacterial Biosynthetic Arginine Decarboxylases
DOE Office of Scientific and Technical Information (OSTI.GOV)
F Forouhar; S Lew; J Seetharaman
2011-12-31
Biosynthetic arginine decarboxylase (ADC; also known as SpeA) plays an important role in the biosynthesis of polyamines from arginine in bacteria and plants. SpeA is a pyridoxal-5'-phosphate (PLP)-dependent enzyme and shares weak sequence homology with several other PLP-dependent decarboxylases. Here, the crystal structure of PLP-bound SpeA from Campylobacter jejuni is reported at 3.0 Å resolution and that of Escherichia coli SpeA in complex with a sulfate ion is reported at 3.1 Å resolution. The structure of the SpeA monomer contains two large domains, an N-terminal TIM-barrel domain followed by a β-sandwich domain, as well as two smaller helical domains. The TIM-barrel and β-sandwich domains share structural homology with several other PLP-dependent decarboxylases, even though the sequence conservation among these enzymes is less than 25%. A similar tetramer is observed for both C. jejuni and E. coli SpeA, composed of two dimers of tightly associated monomers. The active site of SpeA is located at the interface of this dimer and is formed by residues from the TIM-barrel domain of one monomer and a highly conserved loop in the β-sandwich domain of the other monomer. The PLP cofactor is recognized by hydrogen-bonding, π-stacking and van der Waals interactions.
Gurevich, Svetlana V
2014-10-28
The dynamics of a single breathing localized structure in a three-component reaction-diffusion system subjected to time-delayed feedback is investigated. It is shown that variation of the delay time and the feedback strength can lead either to stabilization of the breathing or to delay-induced periodic or quasi-periodic oscillations of the localized structure. A bifurcation analysis of the system in question is provided and an order parameter equation is derived that describes the dynamics of the localized structure in the vicinity of the Andronov-Hopf bifurcation. With the aid of this equation, the boundaries of the stabilization domains as well as the dependence of the oscillation radius on delay parameters can be explicitly derived, providing a robust mechanism to control the behaviour of the breathing localized structure in a straightforward manner. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Competition between B-Z and B-L transitions in a single DNA molecule: Computational studies
NASA Astrophysics Data System (ADS)
Kwon, Ah-Young; Nam, Gi-Moon; Johner, Albert; Kim, Seyong; Hong, Seok-Cheol; Lee, Nam-Kyung
2016-02-01
Under negative torsion, DNA adopts left-handed helical forms, such as Z-DNA and L-DNA. Using the random copolymer model developed for a wormlike chain, we represent a single DNA molecule with structural heterogeneity as a helical chain consisting of monomers which can be characterized by different helical senses and pitches. By Monte Carlo simulation, where we take into account bending and twist fluctuations explicitly, we study the sequence dependence of B-Z transitions under torsional stress and tension, focusing on the interaction with B-L transitions. We consider core sequences, (GC)n repeats or (TG)n repeats, which can interconvert between the right-handed B form and the left-handed Z form, embedded in a random sequence, which can convert to the left-handed L form with a different (tension-dependent) helical pitch. We show that Z-DNA formation from the (GC)n sequence is always supported by unwinding torsional stress, but Z-DNA formation from (TG)n sequences, which are more costly to convert but numerous, can be strongly influenced by the quenched disorder in the surrounding random sequence.
Van de Cavey, Joris; Hartsuiker, Robert J
2016-01-01
Cognitive processing in many domains (e.g., sentence comprehension, music listening, and math solving) requires sequential information to be organized into an integrational structure. There appears to be some overlap in integrational processing across domains, as shown by cross-domain interference effects when, for example, linguistic and musical stimuli are jointly presented (Koelsch, Gunter, Wittfoth, & Sammler, 2005; Slevc, Rosenberg, & Patel, 2009). These findings support theories of overlapping resources for integrational processing across domains (cfr. SSIRH Patel, 2003; SWM, Kljajevic, 2010). However, there are some limitations to the studies mentioned above, such as the frequent use of unnaturalistic integrational difficulties. In recent years, the idea has arisen that evidence for domain-generality in structural processing might also be yielded through priming paradigms (cfr. Scheepers, 2003). The rationale behind this is that integrational processing across domains regularly requires the processing of dependencies across short or long distances in the sequence, involving respectively fewer or more syntactic working memory resources (cfr. SWM, Kljajevic, 2010), and such processing decisions might persist over time. However, whereas recent studies have shown suggestive priming of integrational structure between language and arithmetic (though often dependent on arithmetic performance, cfr. Scheepers et al., 2011; Scheepers & Sturt, 2014), it remains to be investigated to what extent we can also find evidence for priming in other domains, such as music and action (cfr. SWM, Kljajevic, 2010). Experiment 1a showed structural priming from the processing of musical sequences onto the position in the sentence structure (early or late) to which a relative clause was attached in subsequent sentence completion.
Importantly, Experiment 1b showed that a similar structural manipulation based on non-hierarchically ordered color sequences did not yield any priming effect, suggesting that the priming effect is not based on linear order, but integrational dependency. Finally, Experiment 2 presented primes in four domains (relative clause sentences, music, mathematics, and structured descriptions of actions), and consistently showed priming within and across domains. These findings provide clear evidence for domain-general structural processing mechanisms. Copyright © 2015 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Min, B.I.; Oguchi, T.; Jansen, H.J.F.
1986-07-15
Ground-state electronic and structural properties of Lu under pressure are investigated with use of the self-consistent all-electron total-energy linear muffin-tin orbital band-structure method within a local-density-functional approximation. Pressure-induced structural transitions are found to occur in the following sequence: hcp → (Sm-type) → dhcp → fcc, which is the same as that observed in the crystal structures of the trivalent rare-earth metals with decreasing atomic number. This structural transition is correlated with the increase in the number of d electrons under pressure.
Duan, Ming-Rui; Nan, Jie; Liang, Yu-He; Mao, Peng; Lu, Lu; Li, Lanfen; Wei, Chunhong; Lai, Luhua; Li, Yi; Su, Xiao-Dong
2007-01-01
WRKY proteins, defined by the conserved WRKYGQK sequence, comprise a large superfamily of transcription factors identified specifically from the plant kingdom. This superfamily plays important roles in plant disease resistance, abiotic stress, senescence as well as in some developmental processes. In this study, the Arabidopsis WRKY1 was shown to be involved in the salicylic acid signaling pathway and partially dependent on NPR1; a C-terminal domain of WRKY1, AtWRKY1-C, was constructed for structural studies. Previous investigations showed that DNA binding of the WRKY proteins was localized at the WRKY domains and these domains may define novel zinc-binding motifs. The crystal structure of the AtWRKY1-C determined at 1.6 Å resolution has revealed that this domain is composed of a globular structure with five β strands, forming an antiparallel β-sheet. A novel zinc-binding site is situated at one end of the β-sheet, between strands β4 and β5. Based on this high-resolution crystal structure and site-directed mutagenesis, we have defined and confirmed that the DNA-binding residues of AtWRKY1-C are located at β2 and β3 strands. These results provided us with structural information to understand the mechanism of transcriptional control and signal transduction events of the WRKY proteins. PMID:17264121
Accounting for Local Dependence with the Rasch Model: The Paradox of Information Increase.
Andrich, David
Test theories imply statistical, local independence. Where local independence is violated, models of modern test theory that account for it have been proposed. One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation between two items in the dichotomous Rasch model, this paper derives three related implications. First, it formalises how the polytomous Rasch model for an item constituted by summing the scores of the dependent items absorbs the dependence in its threshold structure. Second, it shows that as a consequence the unit when the dependence is accounted for is not the same as if the items had no response dependence. Third, it explains the paradox, known, but not explained in the literature, that the greater the dependence of the constituent items the greater the apparent information in the constituted polytomous item when it should provide less information.
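To make the kind of dependence discussed in this abstract concrete, here is a minimal sketch of the dichotomous Rasch model and of the score distribution of two summed items. The dependence mechanism is an assumption chosen for illustration, not a statement of the paper's exact formulation: a hypothetical parameter `shift` lowers the difficulty of the second item after a correct response to the first and raises it after an incorrect one. The function names are likewise made up for the example.

```python
import math


def rasch_p(theta: float, b: float) -> float:
    """P(X = 1) in the dichotomous Rasch model for ability theta, difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def sum_score_dist(theta: float, b1: float, b2: float, shift: float = 0.0):
    """Distribution of X1 + X2 over the scores 0, 1, 2.

    `shift` is a hypothetical response-dependence parameter (assumption):
    item 2 has difficulty b2 - shift after X1 = 1 and b2 + shift after X1 = 0.
    With shift = 0 the two items are locally independent.
    """
    p1 = rasch_p(theta, b1)
    p2_after_correct = rasch_p(theta, b2 - shift)
    p2_after_wrong = rasch_p(theta, b2 + shift)
    return [
        (1 - p1) * (1 - p2_after_wrong),                       # score 0
        p1 * (1 - p2_after_correct) + (1 - p1) * p2_after_wrong,  # score 1
        p1 * p2_after_correct,                                 # score 2
    ]


if __name__ == "__main__":
    print(sum_score_dist(0.0, 0.0, 0.0))             # independent items
    print(sum_score_dist(0.0, 0.0, 0.0, shift=1.0))  # dependent items
```

Under this assumed mechanism, positive dependence moves probability out of the middle score and into the extreme scores, which is the kind of change a polytomous item's thresholds would have to absorb.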
Facile rhenium-peptide conjugate synthesis using a one-pot derived Re(CO)3 reagent.
Chanawanno, Kullapa; Kondeti, Vinay; Caporoso, Joel; Paruchuri, Sailaja; Leeper, Thomas C; Herrick, Richard S; Ziegler, Christopher J
2016-03-21
We have synthesized two Re(CO)3-modified lysine complexes (1 and 2), where the metal is attached to the amino acid at the Nε position, via a one-pot Schiff base formation reaction. These compounds can be used in the solid phase synthesis of peptides, and to date we have produced four conjugate systems incorporating neurotensin, bombesin, luteinizing hormone releasing hormone, and a nuclear localization sequence. We observed uptake into human umbilical vascular endothelial cells as well as differential uptake depending on peptide sequence identity, as characterized by fluorescence and rhenium elemental analysis.
Swarm v2: highly-scalable and high-resolution amplicon clustering.
Mahé, Frédéric; Rognes, Torbjørn; Quince, Christopher; de Vargas, Colomban; Dunthorn, Micah
2015-01-01
Previously we presented Swarm v1, a novel and open source amplicon clustering program that produced fine-scale molecular operational taxonomic units (OTUs), free of arbitrary global clustering thresholds and input-order dependency. Swarm v1 worked with an initial phase that used iterative single-linkage with a local clustering threshold (d), followed by a phase that used the internal abundance structures of clusters to break chained OTUs. Here we present Swarm v2, which has two important novel features: (1) a new algorithm for d = 1 that allows the computation time of the program to scale linearly with increasing amounts of data; and (2) the new fastidious option that reduces under-grouping by grafting low abundant OTUs (e.g., singletons and doubletons) onto larger ones. Swarm v2 also directly integrates the clustering and breaking phases, dereplicates sequencing reads with d = 0, outputs OTU representatives in fasta format, and plots individual OTUs as two-dimensional networks.
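The idea of iterative single-linkage clustering with a local threshold d = 1 can be sketched as follows. This is a naive, quadratic-time illustration under stated assumptions, not the Swarm implementation: it does not reproduce the linear-time algorithm, the OTU-breaking phase, or the fastidious option mentioned in the abstract. The function names and the choice to seed clusters from the most abundant unassigned amplicon are assumptions made for the example.

```python
from collections import deque


def differs_by_at_most_one(a: str, b: str) -> bool:
    """True if a and b are within one substitution, insertion, or deletion."""
    if a == b:
        return True
    if abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make a the shorter (or equal-length) string
    i = 0
    while i < len(a) and a[i] == b[i]:
        i += 1  # skip the common prefix
    if len(a) == len(b):
        return a[i + 1:] == b[i + 1:]  # one substitution at position i
    return a[i:] == b[i + 1:]          # one extra character in b at position i


def cluster(abundance: dict[str, int]) -> list[list[str]]:
    """Grow clusters by repeatedly linking amplicons within d = 1."""
    unassigned = set(abundance)
    clusters = []
    # Seeds in decreasing abundance; ties broken alphabetically for determinism.
    for seed in sorted(abundance, key=lambda s: (-abundance[s], s)):
        if seed not in unassigned:
            continue
        unassigned.remove(seed)
        members, queue = [seed], deque([seed])
        while queue:
            current = queue.popleft()
            linked = sorted(s for s in unassigned
                            if differs_by_at_most_one(current, s))
            for s in linked:
                unassigned.remove(s)
                members.append(s)
                queue.append(s)  # newly added members can link further ones
        clusters.append(members)
    return clusters


if __name__ == "__main__":
    reads = {"ACGT": 10, "ACGA": 3, "ACCA": 1, "TTTT": 5, "TTT": 1}
    print(cluster(reads))
```

In the usage example, "ACCA" is two differences away from the seed "ACGT" but still joins its cluster through "ACGA". This chaining is what distinguishes a local threshold from a global one.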
Kaushik, Mahima; Kukreti, Shrikant
2015-01-01
Our previous work on structural polymorphism shown at a single nucleotide polymorphism (SNP) (A → G) site located on the HS4 region of the locus control region (LCR) of the β-globin gene has established a hairpin → duplex equilibrium corresponding to an A → B like DNA transition (Kaushik M, Kukreti, R., Grover, D., Brahmachari, S.K. and Kukreti S. Nucleic Acids Res. 2003; Kaushik M, Kukreti S. Nucleic Acids Res. 2006). The G-allele of the A → G SNP has been shown to be significantly associated with the occurrence of β-thalassemia. Considering the significance of this 11-nt long quasi-palindromic sequence [5'-TGGGG(G/A)CCCCA; HP(G/A)11] of the β-globin gene LCR, we further explored the differential behavior of the same DNA sequence with its RNA counterpart, using various biophysical and biochemical techniques. In contrast to its DNA counterpart exhibiting an A → B structural transition and an equilibrium between duplex and hairpin forms, the studied RNA oligonucleotide sequence [5'-UGGGG(G/A)CCCCA; RHP(G/A)11] existed only in duplex form (A-conformation) and did not form a hairpin. The single residue difference from A to G led to the unusual thermal stability of the RNA structure formed by the studied sequence. Since naturally occurring mutations and various SNP sites may stabilize or destabilize the local DNA/RNA secondary structures, these structural transitions may affect gene expression by a change in the protein-DNA recognition patterns.
Computational design of RNAs with complex energy landscapes.
Höner zu Siederdissen, Christian; Hammer, Stefan; Abfalter, Ingrid; Hofacker, Ivo L; Flamm, Christoph; Stadler, Peter F
2013-12-01
RNA has become an integral building material in synthetic biology. Dominated by their secondary structures, which can be computed efficiently, RNA molecules are amenable not only to in vitro and in vivo selection, but also to rational, computation-based design. While the inverse folding problem of constructing an RNA sequence with a prescribed ground-state structure has received considerable attention for nearly two decades, there have been few efforts to design RNAs that can switch between distinct prescribed conformations. We introduce a user-friendly tool for designing RNA sequences that fold into multiple target structures. The underlying algorithm makes use of a combination of graph coloring and heuristic local optimization to find sequences whose energy landscapes are dominated by the prescribed conformations. A flexible interface allows the specification of a wide range of design goals. We demonstrate that bi- and tri-stable "switches" can be designed easily with moderate computational effort for the vast majority of compatible combinations of desired target structures. RNAdesign is freely available under the GPL-v3 license. Copyright © 2013 Wiley Periodicals, Inc.
Davey, James A; Chica, Roberto A
2015-04-01
Computational protein design (CPD) predictions are highly dependent on the structure of the input template used. However, it is unclear how small differences in template geometry translate to large differences in stability prediction accuracy. Herein, we explored how structural changes to the input template affect the outcome of stability predictions by CPD. To do this, we prepared alternate templates by Rotamer Optimization followed by energy Minimization (ROM) and used them to recapitulate the stability of 84 protein G domain β1 mutant sequences. In the ROM process, side-chain rotamers for wild-type (WT) or mutant sequences are optimized on crystal or nuclear magnetic resonance (NMR) structures prior to template minimization, resulting in alternate structures termed ROM templates. We show that use of ROM templates prepared from sequences known to be stable results predominantly in improved prediction accuracy compared to using the minimized crystal or NMR structures. Conversely, ROM templates prepared from sequences that are less stable than the WT reduce prediction accuracy by increasing the number of false positives. These observed changes in prediction outcomes are attributed to differences in side-chain contacts made by rotamers in ROM templates. Finally, we show that ROM templates prepared from sequences that are unfolded or that adopt a nonnative fold result in the selective enrichment of sequences that are also unfolded or that adopt a nonnative fold, respectively. Our results demonstrate the existence of a rotamer bias caused by the input template that can be harnessed to skew predictions toward sequences displaying desired characteristics. © 2014 The Protein Society.
SCit: web tools for protein side chain conformation analysis.
Gautier, R; Camproux, A-C; Tufféry, P
2004-07-01
SCit is a web server providing services for protein side chain conformation analysis and side chain positioning. Specific services use the dependence of the side chain conformations on the local backbone conformation, which is described using a structural alphabet that describes the conformation of fragments of four-residue length in a limited library of structural prototypes. Based on this concept, SCit uses sets of rotameric conformations dependent on the local backbone conformation of each protein for side chain positioning and the identification of side chains with unlikely conformations. The SCit web server is accessible at http://bioserv.rpbs.jussieu.fr/SCit.
Zerze, Gül H; Best, Robert B; Mittal, Jeetain
2015-11-19
We use all-atom molecular simulation with explicit solvent to study the properties of selected intrinsically disordered proteins and unfolded states of foldable proteins, which include chain dimensions and shape, secondary structure propensity, solvent accessible surface area, and contact formation. We find that the qualitative scaling behavior of the chains matches expectations from theory under ambient conditions. In particular, unfolded globular proteins tend to be more collapsed under the same conditions than charged disordered sequences of the same length. However, inclusion of explicit solvent in addition naturally captures temperature-dependent solvation effects, which results in an initial collapse of the chains as temperature is increased, in qualitative agreement with experiment. There is a universal origin to the collapse, revealed in the change of hydration of individual residues as a function of temperature: namely, that the initial collapse is driven by unfavorable solvation free energy of individual residues, which in turn has a strong temperature dependence. We also observe that in unfolded globular proteins, increased temperature also initially favors formation of native-like (rather than non-native-like) structure. Our results help to establish how sequence encodes the degree of intrinsic disorder or order as well as its response to changes in environmental conditions.
Locality and Word Order in Active Dependency Formation in Bangla.
Chacón, Dustin A; Imtiaz, Mashrur; Dasgupta, Shirsho; Murshed, Sikder M; Dan, Mina; Phillips, Colin
2016-01-01
Research on filler-gap dependencies has revealed that there are constraints on possible gap sites, and that real-time sentence processing is sensitive to these constraints. This work has shown that comprehenders have preferences for potential gap sites, and immediately detect when these preferences are not met. However, neither the mechanisms that select preferred gap sites nor the mechanisms used to detect whether these preferences are met are well-understood. In this paper, we report on three experiments in Bangla, a language in which gaps may occur in either a pre-verbal embedded clause or a post-verbal embedded clause. This word order variation allows us to manipulate whether the first gap linearly available is contained in the same clause as the filler, which allows us to dissociate structural locality from linear locality. In Experiment 1, an untimed ambiguity resolution task, we found a global bias to resolve a filler-gap dependency with the first gap linearly available, regardless of structural hierarchy. In Experiments 2 and 3, which use the filled-gap paradigm, we found sensitivity to disruption only when the blocked gap site is both structurally and linearly local, i.e., the filler and the gap site are contained in the same clause. This suggests that comprehenders may not show sensitivity to the disruption of all preferred gap resolutions.
Subtelomeric Rearrangements and Copy Number Variations in People with Intellectual Disabilities
ERIC Educational Resources Information Center
Christofolini, D. M.; De Paula Ramos, M. A.; Kulikowski, L. D.; Da Silva Bellucco, F. T.; Belangero, S. I. N.; Brunoni, D.; Melaragno, M. I.
2010-01-01
Background: The most prevalent type of structural variation in the human genome is represented by copy number variations that can affect transcription levels, sequence, structure and function of genes. Method: In the present study, we used the multiplex ligation-dependent probe amplification (MLPA) technique and quantitative PCR for the detection…
Guerreiro, Marco Alexandre; Peršoh, Derek; Begerow, Dominik; Krauss, Jochen
2018-01-01
Epichloë endophytes associated with cool-season grass species can protect their hosts from herbivory and can suppress mycorrhizal colonization of the hosts’ roots. However, little is known about whether or not Epichloë endophyte infection can also change the foliar fungal assemblages of the host. We tested 52 grassland study sites along a land-use intensity gradient in three study regions over two seasons (spring vs. summer) to determine whether Epichloë infection of the host grass Lolium perenne changes the fungal community structure in leaves. Foliar fungal communities were assessed by Next Generation Sequencing of the ITS rRNA gene region. Fungal community structure was strongly affected by study region and season in our study, while land-use intensity and infection with Epichloë endophytes had no significant effects. We conclude that effects on non-systemic endophytes resulting from land use practices and Epichloë infection reported in other studies were masked by local and seasonal variability in this study’s grassland sites. PMID:29780665
Medhi, Darpan; Goldman, Alastair Sh; Lichten, Michael
2016-11-18
The budding yeast genome contains regions where meiotic recombination initiates more frequently than in others. This pattern parallels enrichment for the meiotic chromosome axis proteins Hop1 and Red1. These proteins are important for Spo11-catalyzed double strand break formation; their contribution to crossover recombination remains undefined. Using the sequence-specific VMA1-derived endonuclease (VDE) to initiate recombination in meiosis, we show that chromosome structure influences the choice of proteins that resolve recombination intermediates to form crossovers. At a Hop1-enriched locus, most VDE-initiated crossovers, like most Spo11-initiated crossovers, required the meiosis-specific MutLγ resolvase. In contrast, at a locus with lower Hop1 occupancy, most VDE-initiated crossovers were MutLγ-independent. In pch2 mutants, the two loci displayed similar Hop1 occupancy levels, and VDE-induced crossovers were similarly MutLγ-dependent. We suggest that meiotic and mitotic recombination pathways coexist within meiotic cells, and that features of meiotic chromosome structure determine whether one or the other predominates in different regions.
Rašić, Gordana; Schama, Renata; Powell, Rosanna; Maciel-de Freitas, Rafael; Endersby-Harshman, Nancy M; Filipović, Igor; Sylvestre, Gabriel; Máspero, Renato C; Hoffmann, Ary A
2015-01-01
Dengue is the most prevalent global arboviral disease that affects over 300 million people every year. Brazil has the highest number of dengue cases in the world, with the most severe epidemics in the city of Rio de Janeiro (Rio). The effective control of dengue is critically dependent on the knowledge of population genetic structuring in the primary dengue vector, the mosquito Aedes aegypti. We analyzed mitochondrial and nuclear genomewide single nucleotide polymorphism markers generated via Restriction-site Associated DNA sequencing, as well as traditional microsatellite markers in Ae. aegypti from Rio. We found four divergent mitochondrial lineages and a strong spatial structuring of mitochondrial variation, in contrast to the overall nuclear homogeneity across Rio. Despite a low overall differentiation in the nuclear genome, we detected strong spatial structure for variation in over 20 genes that have a significantly altered expression in response to insecticides, xenobiotics, and pathogens, including the novel biocontrol agent Wolbachia. Our results indicate that high genetic diversity, spatially unconstrained admixing likely mediated by male dispersal, along with locally heterogeneous genetic variation that could affect insecticide resistance and mosquito vectorial capacity, set limits to the effectiveness of measures to control dengue fever in Rio. PMID:26495042
Structural and sequencing analysis of local target DNA recognition by MLV integrase.
Aiyer, Sriram; Rossi, Paolo; Malani, Nirav; Schneider, William M; Chandar, Ashwin; Bushman, Frederic D; Montelione, Gaetano T; Roth, Monica J
2015-06-23
Target-site selection by retroviral integrase (IN) proteins profoundly affects viral pathogenesis. We describe the solution nuclear magnetic resonance structure of the Moloney murine leukemia virus IN (M-MLV) C-terminal domain (CTD) and a structural homology model of the catalytic core domain (CCD). In solution, the isolated MLV IN CTD adopts an SH3 domain fold flanked by a C-terminal unstructured tail. We generated a concordant MLV IN CCD structural model using SWISS-MODEL, MMM-tree and I-TASSER. Using the X-ray crystal structure of the prototype foamy virus IN target capture complex together with our MLV domain structures, residues within the CCD α2 helical region and the CTD β1-β2 loop were predicted to bind target DNA. The role of these residues was analyzed in vivo through point mutants and motif interchanges. Viable viruses with substitutions at the IN CCD α2 helical region and the CTD β1-β2 loop were tested for effects on integration target site selection. Next-generation sequencing and analysis of integration target sequences indicate that the CCD α2 helical region, in particular P187, interacts with the sequences distal to the scissile bonds whereas the CTD β1-β2 loop binds to residues proximal to it. These findings validate our structural model and disclose IN-DNA interactions relevant to target site selection. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Exploration of the Structure of the High Temperature Phase of the Hexagonal RMnO3 System
NASA Astrophysics Data System (ADS)
Wu, T.; Tyson, T. A.; Zhang, H.; Yu, T.; Page, K.; Ghose, S.
Temperature dependent structural studies of the high temperature phase of hexagonal RMnO3 systems have been conducted. Both long range and local structural probes have been utilized. Discussions of the appropriate space groups and local distortions relevant to length scale will be given. Ab initio MD simulations are used to interpret the observations. This work is supported by DOE Grant DE-FG02-07ER46402.
Network Analysis of Protein Adaptation: Modeling the Functional Impact of Multiple Mutations
Beleva Guthrie, Violeta; Masica, David L; Fraser, Andrew; Federico, Joseph; Fan, Yunfan; Camps, Manel; Karchin, Rachel
2018-01-01
The evolution of new biochemical activities frequently involves complex dependencies between mutations and rapid evolutionary radiation. Mutation co-occurrence and covariation have previously been used to identify compensating mutations that are the result of physical contacts and preserve protein function and fold. Here, we model pairwise functional dependencies and higher order interactions that enable evolution of new protein functions. We use a network model to find complex dependencies between mutations resulting from evolutionary trade-offs and pleiotropic effects. We present a method to construct these networks and to identify functionally interacting mutations in both extant and reconstructed ancestral sequences (Network Analysis of Protein Adaptation, NAPA). The time ordering of mutations can be incorporated into the networks through phylogenetic reconstruction. We apply NAPA to three distantly homologous β-lactamase protein clusters (TEM, CTX-M-3, and OXA-51), each of which has experienced recent evolutionary radiation under substantially different selective pressures. By analyzing the network properties of each protein cluster, we identify key adaptive mutations, positive pairwise interactions, different adaptive solutions to the same selective pressure, and complex evolutionary trajectories likely to increase protein fitness. We also present evidence that incorporating information from phylogenetic reconstruction and ancestral sequence inference can reduce the number of spurious links in the network while preserving overall network community structure. The analysis does not require structural or biochemical data. In contrast to function-preserving mutation dependencies, which frequently arise from structural contacts, gain-of-function mutation dependencies are most commonly between residues distal in protein structure. PMID:29522102
NASA Astrophysics Data System (ADS)
Yusof, Nik Yusnoraini; Bakar, Farah Diba Abu; Mahadi, Nor Muhammad; Raih, Mohd Firdaus; Murad, Abdul Munir Abdul
2015-09-01
A cDNA encoding an Fe(II) 2-oxoglutarate (2OG)-dependent dioxygenase was isolated from the psychrophilic yeast Glaciozyma antarctica PI12. We successfully amplified a 1,029 bp cDNA sequence that encodes 342 amino acids with a predicted molecular weight of 38 kDa. The predicted protein was analysed using various bioinformatics tools to explore its properties. Based on a BLAST search analysis, the Fe2OX amino acid sequence showed 61% identity to the sequence of oxoglutarate/iron-dependent oxygenase from Rhodosporidium toruloides NP11. SignalP prediction showed that the Fe2OX protein contains no putative signal peptide, which suggests that this enzyme is most probably localised intracellularly. The structure of Fe2OX was predicted by homology modelling using MODELLER9v11. The model with the lowest objective function was selected from one hundred models generated using MODELLER9v11. Analysis of the structure revealed a longer loop in Fe2OX from G. antarctica that might be responsible for the flexibility of the structure, which contributes to its adaptation to low temperatures. Fe2OX holds a highly conserved Fe(II)-binding HXD/E…H triad motif. In the 2-oxoglutarate binding site, Arg280 was found to be conserved among reported studies, whereas Phe268 was found to differ in Fe2OX.
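The sizes quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes that the 1,029 bp sequence ends in a single stop codon and uses a rule-of-thumb average residue mass of about 110 Da; neither assumption is stated in the abstract, and the function names are illustrative only.

```python
# Consistency check of the sequence length, residue count and mass quoted above.
CDNA_LENGTH_BP = 1029         # value taken from the abstract
AVG_RESIDUE_MASS_DA = 110.0   # assumed rule-of-thumb average, not from the abstract


def residues_from_cdna(length_bp: int, stop_codons: int = 1) -> int:
    """Number of encoded amino acids, assuming the sequence ends in stop codon(s)."""
    if length_bp % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return length_bp // 3 - stop_codons


def approx_mass_kda(n_residues: int, avg_mass_da: float = AVG_RESIDUE_MASS_DA) -> float:
    """Rough molecular weight in kDa estimated from the residue count."""
    return n_residues * avg_mass_da / 1000.0


n_res = residues_from_cdna(CDNA_LENGTH_BP)
print(n_res, round(approx_mass_kda(n_res), 1))
```

Under these assumptions the residue count matches the 342 amino acids reported, and the rough mass estimate is close to the predicted 38 kDa.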
Vembanur, Srivathsan; Venkateshwaran, Vasudevan; Garde, Shekhar
2014-04-29
We focus on the conformational stability, structure, and dynamics of hydrophobic/charged homopolymers and heteropolymers at the vapor-liquid interface of water using extensive molecular dynamics simulations. Hydrophobic polymers collapse into globular structures in bulk water but unfold and sample a broad range of conformations at the vapor-liquid interface of water. We show that adding a pair of charges to a hydrophobic polymer at the interface can dramatically change its conformations, stabilizing hairpinlike structures, with molecular details depending on the location of the charged pair in the sequence. The translational dynamics of homopolymers and heteropolymers also differ: whereas the homopolymers skate on the interface with low drag, the tendency of charged groups to remain hydrated pulls the heteropolymers toward the liquid side of the interface, thus pinning them, increasing drag, and slowing the translational dynamics. The conformational dynamics of heteropolymers are also slower than those of the homopolymer and depend on the location of the charged groups in the sequence. Conformational dynamics are most restricted for the end-charged heteropolymer and speed up as the charge pair is moved toward the center of the sequence. We rationalize these trends using the fundamental understanding of the effects of the interface on primitive pair-level interactions between two hydrophobic groups and between oppositely charged ions in its vicinity.
Pairwise graphical models for structural health monitoring with dense sensor arrays
NASA Astrophysics Data System (ADS)
Mohammadi Ghazi, Reza; Chen, Justin G.; Büyüköztürk, Oral
2017-09-01
Through advances in sensor technology and the development of camera-based measurement techniques, it has become affordable to obtain high spatial resolution data from structures. Although measured datasets become more informative as the number of sensors increases, the spatial dependencies between sensor data increase at the same time. Therefore, appropriate data analysis techniques are needed to handle the inference problem in the presence of these dependencies. In this paper, we propose a novel approach that uses graphical models (GMs) to account for the spatial dependencies between sensor measurements in dense sensor networks or arrays, with the aim of improving damage localization accuracy in structural health monitoring (SHM) applications. Because there are always unobserved damaged states in this application, the available information is insufficient for learning the GMs. To overcome this challenge, we propose an approximated model that uses the mutual information between sensor measurements to learn the GMs. The study is backed by experimental validation of the method on two test structures. The first is a three-story two-bay steel model structure that is instrumented with MEMS accelerometers. The second experimental setup consists of a plate structure and a video camera to measure the displacement field of the plate. Our results show that considering the spatial dependencies with the proposed algorithm can significantly improve damage localization accuracy.
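To illustrate the quantity this abstract relies on, the sketch below estimates the mutual information between two sensor signals with a simple histogram estimator. The abstract does not say how the authors estimate mutual information, so the estimator, the bin count and the synthetic signals (`base`, `near`, `far`) are all assumptions made for illustration.

```python
import numpy as np


def mutual_information(x, y, bins=8):
    """Histogram (plug-in) estimate of mutual information in nats."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()                  # joint probability table
    px = pxy.sum(axis=1, keepdims=True)        # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)        # marginal of y, shape (1, bins)
    nz = pxy > 0                               # skip empty cells to avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))


# Hypothetical signals: `near` shares most of its content with `base`,
# while `far` is statistically independent of it.
rng = np.random.default_rng(0)
base = rng.normal(size=5000)
near = base + 0.1 * rng.normal(size=5000)
far = rng.normal(size=5000)

mi_near = mutual_information(base, near)
mi_far = mutual_information(base, far)
print(mi_near > mi_far)
```

In a dense array, pairwise values of this kind could serve as edge weights between sensors, which is one plausible reading of how mutual information feeds into learning a pairwise graphical model.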
Role of internal demagnetizing field for the dynamics of a surface-modulated magnonic crystal
NASA Astrophysics Data System (ADS)
Langer, M.; Röder, F.; Gallardo, R. A.; Schneider, T.; Stienen, S.; Gatel, C.; Hübner, R.; Bischoff, L.; Lenz, K.; Lindner, J.; Landeros, P.; Fassbender, J.
2017-05-01
This work aims to demonstrate and understand the key role of local demagnetizing fields in hybrid structures consisting of a continuous thin film with a stripe modulation on top. To understand the complex spin dynamics of these structures, the magnonic crystal was reconstructed in two different ways—performing micromagnetic simulations based on the structural shape as well as based on the internal demagnetizing field, which both are mapped on the nanoscale using electron holography. The simulations yield the frequency-field dependence as well as the angular dependence revealing the governing role of the internal field landscape around the backward-volume geometry. Simple rules for the propagation vector and the mode localization are formulated in order to explain the calculated mode profiles. Treating internal demagnetizing fields equivalent to anisotropies, the complex angle-dependent spin-wave behavior is described for an in-plane rotation of the external field.
[Processes of logical thought in a case of cerebral vascular lesion].
Blanco Menéndez, R; Aguado Balsas, A M
Reasoning and logical thought processes have traditionally been attributed to frontal lobe function or, on the other hand, have been considered diffuse functions of the brain. However, there is now enough evidence that dissociations in thought processes can be found, depending on the logical structure of the experimental tasks and involving different areas of the brain, both frontal and post-rolandic. The aim was to study possible dissociations between thought structures corresponding to categorical and relational logic, on the one hand, and propositional logic, on the other. The case of a brain-injured patient with a lesion of vascular etiology, localized in the left frontal parietal cortex, is presented. A specific battery of reasoning tests was administered. A differential performance on some experimental reasoning tasks was found, depending on these logical conceptual structures. The possibility of establishing dissociations among certain logical thought and intellectual functions depending on the localization of a possible brain lesion (frontal versus temporal) is discussed.
Community detection in sequence similarity networks based on attribute clustering
Chowdhary, Janamejaya; Loeffler, Frank E.; Smith, Jeremy C.
2017-07-24
Networks are powerful tools for the presentation and analysis of interactions in multi-component systems. A commonly studied mesoscopic feature of networks is their community structure, which arises from grouping together similar nodes into one community and dissimilar nodes into separate communities. In this paper, the community structure of protein sequence similarity networks is determined with a new method: Attribute Clustering Dependent Communities (ACDC). Sequence similarity has hitherto typically been quantified by the alignment score or its expectation value. However, pair alignments with the same score or expectation value cannot thus be differentiated. To overcome this deficiency, the method constructs, for pair alignments, an extended alignment metric, the link attribute vector, which includes the score and other alignment characteristics. Rescaling components of the attribute vectors qualitatively identifies a systematic variation of sequence similarity within protein superfamilies. The problem of community detection is then mapped to clustering the link attribute vectors, selection of an optimal subset of links and community structure refinement based on the partition density of the network. ACDC-predicted communities are found to be in good agreement with gold standard sequence databases for which the "ground truth" community structures (or families) are known. ACDC is therefore a community detection method for sequence similarity networks based entirely on pair similarity information. A serial implementation of ACDC is available from https://cmb.ornl.gov/resources/developments
Dynamically corrected gates for singlet-triplet spin qubits with control-dependent errors
NASA Astrophysics Data System (ADS)
Jacobson, N. Tobias; Witzel, Wayne M.; Nielsen, Erik; Carroll, Malcolm S.
2013-03-01
Magnetic field inhomogeneity due to random polarization of quasi-static local magnetic impurities is a major source of environmentally induced error for singlet-triplet double quantum dot (DQD) spin qubits. Moreover, for singlet-triplet qubits this error may depend on the applied controls. This effect is significant when a static magnetic field gradient is applied to enable full qubit control. Through a configuration interaction analysis, we observe that the dependence of the field inhomogeneity-induced error on the DQD bias voltage can vary systematically as a function of the controls for certain experimentally relevant operating regimes. To account for this effect, we have developed a straightforward prescription for adapting dynamically corrected gate sequences that assume control-independent errors into sequences that compensate for systematic control-dependent errors. We show that accounting for such errors may lead to a substantial increase in gate fidelities. Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. DOE's National Nuclear Security Administration under contract DE-AC04-94AL85000.