Science.gov

Sample records for structure based predictive

  1. Structure-Based Predictions of Activity Cliffs

    PubMed Central

    Husby, Jarmila; Bottegoni, Giovanni; Kufareva, Irina; Abagyan, Ruben; Cavalli, Andrea

    2015-01-01

    In drug discovery, it is generally accepted that neighboring molecules in a given descriptors' space display similar activities. However, even in regions that provide strong predictability, structurally similar molecules can occasionally display large differences in potency. In QSAR jargon, these discontinuities in the activity landscape are known as ‘activity cliffs’. In this study, we assessed the reliability of ligand docking and virtual ligand screening schemes in predicting activity cliffs. We performed our calculations on a diverse, independently collected database of cliff-forming co-crystals. Starting from ideal situations, which allowed us to establish our baseline, we progressively moved toward simulating more realistic scenarios. Ensemble- and template-docking achieved a significant level of accuracy, suggesting that, despite the well-known limitations of empirical scoring schemes, activity cliffs can be accurately predicted by advanced structure-based methods. PMID:25918827

  2. Improving structure-based function prediction using molecular dynamics

    PubMed Central

    Glazer, Dariya S.; Radmer, Randall J.; Altman, Russ B.

    2009-01-01

    Summary The number of molecules with solved three-dimensional structure but unknown function is increasing rapidly. Particularly problematic are novel folds with little detectable similarity to molecules of known function. Experimental assays can determine the functions of such molecules, but are time-consuming and expensive. Computational approaches can identify potential functional sites; however, these approaches generally rely on single static structures and do not use information about dynamics. In fact, structural dynamics can enhance function prediction: we coupled molecular dynamics simulations with structure-based function prediction algorithms that identify Ca2+ binding sites. When applied to 11 challenging proteins, both methods showed substantial improvement in performance, revealing 22 more sites in one case and 12 more in the other, with a modest increase in apparent false positives. Thus, we show that treating molecules as dynamic entities improves the performance of structure-based function prediction methods. PMID:19604472

  3. OPTIMIZATION BIAS IN ENERGY-BASED STRUCTURE PREDICTION

    PubMed Central

    Petrella, Robert J.

    2014-01-01

    Physics-based computational approaches to predicting the structure of macromolecules such as proteins are gaining increased use, but there are remaining challenges. In the current work, it is demonstrated that in energy-based prediction methods, the degree of optimization of the sampled structures can influence the prediction results. In particular, discrepancies in the degree of local sampling can bias the predictions in favor of the oversampled structures by shifting the local probability distributions of the minimum sampled energies. In simple systems, it is shown that the magnitude of the errors can be calculated from the energy surface, and for certain model systems, derived analytically. Further, it is shown that for energy wells whose forms differ only by a randomly assigned energy shift, the optimal accuracy of prediction is achieved when the sampling around each structure is equal. Energy correction terms can be used in cases of unequal sampling to reproduce the total probabilities that would occur under equal sampling, but optimal corrections only partially restore the prediction accuracy lost to unequal sampling. For multiwell systems, the determination of the correction terms is a multibody problem; it is shown that the involved cross-correlation multiple integrals can be reduced to simpler integrals. The possible implications of the current analysis for macromolecular structure prediction are discussed. PMID:25552783

  4. Benchmark data sets for structure-based computational target prediction.

    PubMed

    Schomburg, Karen T; Rarey, Matthias

    2014-08-25

    Structure-based computational target prediction methods identify potential targets for a bioactive compound. Methods based on protein-ligand docking so far face many challenges, where the greatest probably is the ranking of true targets in a large data set of protein structures. Currently, no standard data sets for evaluation exist, rendering comparison and demonstration of improvements of methods cumbersome. Therefore, we propose two data sets and evaluation strategies for a meaningful evaluation of new target prediction methods, i.e., a small data set consisting of three target classes for detailed proof-of-concept and selectivity studies and a large data set consisting of 7992 protein structures and 72 drug-like ligands allowing statistical evaluation with performance metrics on a drug-like chemical space. Both data sets are built from openly available resources, and any information needed to perform the described experiments is reported. We describe the composition of the data sets, the setup of screening experiments, and the evaluation strategy. Performance metrics capable to measure the early recognition of enrichments like AUC, BEDROC, and NSLR are proposed. We apply a sequence-based target prediction method to the large data set to analyze its content of nontrivial evaluation cases. The proposed data sets are used for method evaluation of our new inverse screening method iRAISE. The small data set reveals the method's capability and limitations to selectively distinguish between rather similar protein structures. The large data set simulates real target identification scenarios. iRAISE achieves in 55% excellent or good enrichment a median AUC of 0.67 and RMSDs below 2.0 Å for 74% and was able to predict the first true target in 59 out of 72 cases in the top 2% of the protein data set of about 8000 structures.

  5. Protein secondary structure prediction using logic-based machine learning.

    PubMed

    Muggleton, S; King, R D; Sternberg, M J

    1992-10-01

    Many attempts have been made to solve the problem of predicting protein secondary structure from the primary sequence but the best performance results are still disappointing. In this paper, the use of a machine learning algorithm which allows relational descriptions is shown to lead to improved performance. The Inductive Logic Programming computer program, Golem, was applied to learning secondary structure prediction rules for alpha/alpha domain type proteins. The input to the program consisted of 12 non-homologous proteins (1612 residues) of known structure, together with a background knowledge describing the chemical and physical properties of the residues. Golem learned a small set of rules that predict which residues are part of the alpha-helices--based on their positional relationships and chemical and physical properties. The rules were tested on four independent non-homologous proteins (416 residues) giving an accuracy of 81% (+/- 2%). This is an improvement, on identical data, over the previously reported result of 73% by King and Sternberg (1990, J. Mol. Biol., 216, 441-457) using the machine learning program PROMIS, and of 72% using the standard Garnier-Osguthorpe-Robson method. The best previously reported result in the literature for the alpha/alpha domain type is 76%, achieved using a neural net approach. Machine learning also has the advantage over neural network and statistical methods in producing more understandable results. PMID:1480619

  6. SAM-T08, HMM-based protein structure prediction

    PubMed Central

    Karplus, Kevin

    2009-01-01

    The SAM-T08 web server is a protein structure prediction server that provides several useful intermediate results in addition to the final predicted 3D structure: three multiple sequence alignments of putative homologs using different iterated search procedures, prediction of local structure features including various backbone and burial properties, calibrated E-values for the significance of template searches of PDB and residue–residue contact predictions. The server has been validated as part of the CASP8 assessment of structure prediction as having good performance across all classes of predictions. The SAM-T08 server is available at http://compbio.soe.ucsc.edu/SAM_T08/T08-query.html PMID:19483096

  7. ArchPRED: a template based loop structure prediction server.

    PubMed

    Fernandez-Fuentes, Narcis; Zhai, Jun; Fiser, András

    2006-07-01

    ArchPRED server (http://www.fiserlab.org/servers/archpred) implements a novel fragment-search based method for predicting loop conformations. The inputs to the server are the atomic coordinates of the query protein and the position of the loop. The algorithm selects candidate loop fragments from a regularly updated loop library (Search Space) by matching the length, the types of bracing secondary structures of the query and by satisfying the geometrical restraints imposed by the stem residues. Subsequently, candidate loops are inserted in the query protein framework where their side chains are rebuilt and their fit is assessed by the root mean square deviation (r.m.s.d.) of stem regions and by the number of rigid body clashes with the environment. In the final step remaining candidate loops are ranked by a Z-score that combines information on sequence similarity and fit of predicted and observed [/psi] main chain dihedral angle propensities. The final loop conformation is built in the protein structure and annealed in the environment using conjugate gradient minimization. The prediction method was benchmarked on artificially prepared search datasets where all trivial sequence similarities on the SCOP superfamily level were removed. Under these conditions it was possible to predict loops of length 4, 8 and 12 with coverage of 98, 78 and 28% with at least of 0.22, 1.38 and 2.47 A of r.m.s.d. accuracy, respectively. In a head to head comparison on loops extracted from freshly deposited new protein folds the current method outperformed in a approximately 5:1 ratio an earlier developed database search method. PMID:16844985

  8. PROTEUS2: a web server for comprehensive protein structure prediction and structure-based annotation.

    PubMed

    Montgomerie, Scott; Cruz, Joseph A; Shrivastava, Savita; Arndt, David; Berjanskii, Mark; Wishart, David S

    2008-07-01

    PROTEUS2 is a web server designed to support comprehensive protein structure prediction and structure-based annotation. PROTEUS2 accepts either single sequences (for directed studies) or multiple sequences (for whole proteome annotation) and predicts the secondary and, if possible, tertiary structure of the query protein(s). Unlike most other tools or servers, PROTEUS2 bundles signal peptide identification, transmembrane helix prediction, transmembrane beta-strand prediction, secondary structure prediction (for soluble proteins) and homology modeling (i.e. 3D structure generation) into a single prediction pipeline. Using a combination of progressive multi-sequence alignment, structure-based mapping, hidden Markov models, multi-component neural nets and up-to-date databases of known secondary structure assignments, PROTEUS is able to achieve among the highest reported levels of predictive accuracy for signal peptides (Q2 = 94%), membrane spanning helices (Q2 = 87%) and secondary structure (Q3 score of 81.3%). PROTEUS2's homology modeling services also provide high quality 3D models that compare favorably with those generated by SWISS-MODEL and 3D JigSaw (within 0.2 A RMSD). The average PROTEUS2 prediction takes approximately 3 min per query sequence. The PROTEUS2 server along with source code for many of its modules is accessible a http://wishart.biology.ualberta.ca/proteus2.

  9. Characterization and Prediction of Protein Flexibility Based on Structural Alphabets

    PubMed Central

    Liu, Bin

    2016-01-01

    Motivation. To assist efforts in determining and exploring the functional properties of proteins, it is desirable to characterize and predict protein flexibilities. Results. In this study, the conformational entropy is used as an indicator of the protein flexibility. We first explore whether the conformational change can capture the protein flexibility. The well-defined decoy structures are converted into one-dimensional series of letters from a structural alphabet. Four different structure alphabets, including the secondary structure in 3-class and 8-class, the PB structure alphabet (16-letter), and the DW structure alphabet (28-letter), are investigated. The conformational entropy is then calculated from the structure alphabet letters. Some of the proteins show high correlation between the conformation entropy and the protein flexibility. We then predict the protein flexibility from basic amino acid sequence. The local structures are predicted by the dual-layer model and the conformational entropy of the predicted class distribution is then calculated. The results show that the conformational entropy is a good indicator of the protein flexibility, but false positives remain a problem. The DW structure alphabet performs the best, which means that more subtle local structures can be captured by large number of structure alphabet letters. Overall this study provides a simple and efficient method for the characterization and prediction of the protein flexibility. PMID:27660756

  10. Characterization and Prediction of Protein Flexibility Based on Structural Alphabets

    PubMed Central

    Liu, Bin

    2016-01-01

    Motivation. To assist efforts in determining and exploring the functional properties of proteins, it is desirable to characterize and predict protein flexibilities. Results. In this study, the conformational entropy is used as an indicator of the protein flexibility. We first explore whether the conformational change can capture the protein flexibility. The well-defined decoy structures are converted into one-dimensional series of letters from a structural alphabet. Four different structure alphabets, including the secondary structure in 3-class and 8-class, the PB structure alphabet (16-letter), and the DW structure alphabet (28-letter), are investigated. The conformational entropy is then calculated from the structure alphabet letters. Some of the proteins show high correlation between the conformation entropy and the protein flexibility. We then predict the protein flexibility from basic amino acid sequence. The local structures are predicted by the dual-layer model and the conformational entropy of the predicted class distribution is then calculated. The results show that the conformational entropy is a good indicator of the protein flexibility, but false positives remain a problem. The DW structure alphabet performs the best, which means that more subtle local structures can be captured by large number of structure alphabet letters. Overall this study provides a simple and efficient method for the characterization and prediction of the protein flexibility.

  11. SAbPred: a structure-based antibody prediction server

    PubMed Central

    Dunbar, James; Krawczyk, Konrad; Leem, Jinwoo; Marks, Claire; Nowak, Jaroslaw; Regep, Cristian; Georges, Guy; Kelm, Sebastian; Popovic, Bojana; Deane, Charlotte M.

    2016-01-01

    SAbPred is a server that makes predictions of the properties of antibodies focusing on their structures. Antibody informatics tools can help improve our understanding of immune responses to disease and aid in the design and engineering of therapeutic molecules. SAbPred is a single platform containing multiple applications which can: number and align sequences; automatically generate antibody variable fragment homology models; annotate such models with estimated accuracy alongside sequence and structural properties including potential developability issues; predict paratope residues; and predict epitope patches on protein antigens. The server is available at http://opig.stats.ox.ac.uk/webapps/sabpred. PMID:27131379

  12. Improving protein structure prediction using multiple sequence-based contact predictions

    PubMed Central

    Wu, Sitao; Szilagyi, Andras; Zhang, Yang

    2011-01-01

    Summary Although residue-residue contact maps dictate the topology of proteins, sequence-based ab initio contact predictions have been found little use in actual structure prediction due to the low accuracy. We developed a composite set of nine SVM-based contact predictors which are used in I-TASSER simulation in combination with sparse template contact restraints. When testing the strategy on 273 non-homologous targets, remarkable improvements of I-TASSER models were observed for both easy and hard targets, with P-value by student s t-test below 0.00001 and 0.001, respectively. In several cases, TM-score increases by >30%, which essentially converts “non-foldable” targets into “foldable” ones. In CASP9, I-TASSER employed ab initio contact predictions, and generated models for 26 FM targets with a GDT-score 16% and 44% higher than the second and third best servers from other groups, respectively. These findings demonstrate a new avenue to improve the accuracy of protein structure prediction especially for free-modeling targets. PMID:21827953

  13. Prediction of Silicon-Based Layered Structures for Optoelectronic Applications

    NASA Astrophysics Data System (ADS)

    Luo, Wei; Ma, Yanming; Gong, Xingao; Xiang, Hongjun; CCMG Team

    2015-03-01

    A method based on the particle swarm optimization (PSO) algorithm is presented to design quasi-two-dimensional (Q2D) materials. With this development, various single-layer and bi-layer materials in C, Si, Ge, Sn, and Pb were predicted. A new Si bi-layer structure is found to have a much-favored energy than the previously widely accepted configuration. Both single-layer and bi-layer Si materials have small band gaps, limiting their usages in optoelectronic applications. Hydrogenation has therefore been used to tune the electronic and optical properties of Si layers. We discover two hydrogenated materials of layered Si8H2andSi6H2 possessing quasi-direct band gaps of 0.75 eV and 1.59 eV, respectively. Their potential applications for light emitting diode and photovoltaics are proposed and discussed. Our study opened up the possibility of hydrogenated Si layered materials as next-generation optoelectronic devices.

  14. Structure-Based Prediction of Protein-Folding Transition Paths.

    PubMed

    Jacobs, William M; Shakhnovich, Eugene I

    2016-09-01

    We propose a general theory to describe the distribution of protein-folding transition paths. We show that transition paths follow a predictable sequence of high-free-energy transient states that are separated by free-energy barriers. Each transient state corresponds to the assembly of one or more discrete, cooperative units, which are determined directly from the native structure. We show that the transition state on a folding pathway is reached when a small number of critical contacts are formed between a specific set of substructures, after which folding proceeds downhill in free energy. This approach suggests a natural resolution for distinguishing parallel folding pathways and provides a simple means to predict the rate-limiting step in a folding reaction. Our theory identifies a common folding mechanism for proteins with diverse native structures and establishes general principles for the self-assembly of polymers with specific interactions. PMID:27602721

  15. Structure-Based Prediction of Protein-Folding Transition Paths

    NASA Astrophysics Data System (ADS)

    Jacobs, William M.; Shakhnovich, Eugene I.

    2016-09-01

    We propose a general theory to describe the distribution of protein-folding transition paths. We show that transition paths follow a predictable sequence of high-free-energy transient states that are separated by free-energy barriers. Each transient state corresponds to the assembly of one or more discrete, cooperative units, which are determined directly from the native structure. We show that the transition state on a folding pathway is reached when a small number of critical contacts are formed between a specific set of substructures, after which folding proceeds downhill in free energy. This approach suggests a natural resolution for distinguishing parallel folding pathways and provides a simple means to predict the rate-limiting step in a folding reaction. Our theory identifies a common folding mechanism for proteins with diverse native structures and establishes general principles for the self-assembly of polymers with specific interactions.

  16. Finite Element Based HWB Centerbody Structural Optimization and Weight Prediction

    NASA Technical Reports Server (NTRS)

    Gern, Frank H.

    2012-01-01

    This paper describes a scalable structural model suitable for Hybrid Wing Body (HWB) centerbody analysis and optimization. The geometry of the centerbody and primary wing structure is based on a Vehicle Sketch Pad (VSP) surface model of the aircraft and a FLOPS compatible parameterization of the centerbody. Structural analysis, optimization, and weight calculation are based on a Nastran finite element model of the primary HWB structural components, featuring centerbody, mid section, and outboard wing. Different centerbody designs like single bay or multi-bay options are analyzed and weight calculations are compared to current FLOPS results. For proper structural sizing and weight estimation, internal pressure and maneuver flight loads are applied. Results are presented for aerodynamic loads, deformations, and centerbody weight.

  17. Shape and secondary structure prediction for ncRNAs including pseudoknots based on linear SVM

    PubMed Central

    2013-01-01

    Background Accurate secondary structure prediction provides important information to undefirstafinding the tertiary structures and thus the functions of ncRNAs. However, the accuracy of the native structure derivation of ncRNAs is still not satisfactory, especially on sequences containing pseudoknots. It is recently shown that using the abstract shapes, which retain adjacency and nesting of structural features but disregard the length details of helix and loop regions, can improve the performance of structure prediction. In this work, we use SVM-based feature selection to derive the consensus abstract shape of homologous ncRNAs and apply the predicted shape to structure prediction including pseudoknots. Results Our approach was applied to predict shapes and secondary structures on hundreds of ncRNA data sets with and without psuedoknots. The experimental results show that we can achieve 18% higher accuracy in shape prediction than the state-of-the-art consensus shape prediction tools. Using predicted shapes in structure prediction allows us to achieve approximate 29% higher sensitivity and 10% higher positive predictive value than other pseudoknot prediction tools. Conclusions Extensive analysis of RNA properties based on SVM allows us to identify important properties of sequences and structures related to their shapes. The combination of mass data analysis and SVM-based feature selection makes our approach a promising method for shape and structure prediction. The implemented tools, Knot Shape and Knot Structure are open source software and can be downloaded at: http://www.cse.msu.edu/~achawana/KnotShape. PMID:23369147

  18. Structure-Based Predictive model for Coal Char Combustion.

    SciTech Connect

    Hurt, R.; Colo, J; Essenhigh, R.; Hadad, C; Stanley, E.

    1997-09-24

    During the third quarter of this project, progress was made on both major technical tasks. Progress was made in the chemistry department at OSU on the calculation of thermodynamic properties for a number of model organic compounds. Modelling work was carried out at Brown to adapt a thermodynamic model of carbonaceous mesophase formation, originally applied to pitch carbonization, to the prediction of coke texture in coal combustion. This latter work makes use of the FG-DVC model of coal pyrolysis developed by Advanced Fuel Research to specify the pool of aromatic clusters that participate in the order/disorder transition. This modelling approach shows promise for the mechanistic prediction of the rank dependence of char structure and will therefore be pursued further. Crystalline ordering phenomena were also observed in a model char prepared from phenol-formaldehyde carbonized at 900{degrees}C and 1300{degrees}C using high-resolution TEM fringe imaging. Dramatic changes occur in the structure between 900 and 1300{degrees}C, making this char a suitable candidate for upcoming in situ work on the hot stage TEM. Work also proceeded on molecular dynamics simulations at Boston University and on equipment modification and testing for the combustion experiments with widely varying flame types at Ohio State.

  19. Protein structure prediction provides comparable performance to crystallographic structures in docking-based virtual screening.

    PubMed

    Du, Hongying; Brender, Jeffrey R; Zhang, Jian; Zhang, Yang

    2015-01-01

    Structure based virtual screening has largely been limited to protein targets for which either an experimental structure is available or a strongly homologous template exists so that a high-resolution model can be constructed. The performance of state of the art protein structure predictions in virtual screening in systems where only weakly homologous templates are available is largely untested. Using the challenging DUD database of structural decoys, we show here that even using templates with only weak sequence homology (<30% sequence identity) structural models can be constructed by I-TASSER which achieve comparable enrichment rates to using the experimental bound crystal structure in the majority of the cases studied. For 65% of the targets, the I-TASSER models, which are constructed essentially in the apo conformations, reached 70% of the virtual screening performance of using the holo-crystal structures. A correlation was observed between the success of I-TASSER in modeling the global fold and local structures in the binding pockets of the proteins versus the relative success in virtual screening. The virtual screening performance can be further improved by the recognition of chemical features of the ligand compounds. These results suggest that the combination of structure-based docking and advanced protein structure modeling methods should be a valuable approach to the large-scale drug screening and discovery studies, especially for the proteins lacking crystallographic structures.

  20. An RNA secondary structure prediction method based on minimum and suboptimal free energy structures.

    PubMed

    Fu, Haoyue; Yang, Lianping; Zhang, Xiangde

    2015-09-01

    The function of an RNA-molecule is mainly determined by its tertiary structures. And its secondary structure is an important determinant of its tertiary structure. The comparative methods usually give better results than the single-sequence methods. Based on minimum and suboptimal free energy structures, the paper presents a novel method for predicting conserved secondary structure of a group of related RNAs. In the method, the information from the known RNA structures is used as training data in a SVM (Support Vector Machine) classifier. Our method has been tested on the benchmark dataset given by Puton et al. The results show that the average sensitivity of our method is higher than that of other comparative methods such as CentroidAlifold, MXScrana, RNAalifold, and TurboFold. PMID:26100179

  1. Structural kinematics based damage zone prediction in gradient structures using vibration database

    NASA Astrophysics Data System (ADS)

    Talha, Mohammad; Ashokkumar, Chimpalthradi R.

    2014-05-01

    To explore the applications of functionally graded materials (FGMs) in dynamic structures, structural kinematics based health monitoring technique becomes an important problem. Depending upon the displacements in three dimensions, the health of the material to withstand dynamic loads is inferred in this paper, which is based on the net compressive and tensile displacements that each structural degree of freedom takes. These net displacements at each finite element node predicts damage zones of the FGM where the material is likely to fail due to a vibration response which is categorized according to loading condition. The damage zone prediction of a dynamically active FGMs plate have been accomplished using Reddy's higher-order theory. The constituent material properties are assumed to vary in the thickness direction according to the power-law behavior. The proposed C0 finite element model (FEM) is applied to get net tensile and compressive displacement distributions across the structures. A plate made of Aluminum/Ziconia is considered to illustrate the concept of structural kinematics-based health monitoring aspects of FGMs.

  2. Structure-Based Predictive model for Coal Char Combustion.

    SciTech Connect

    Hurt, R.; Calo, J.; Essenhigh, R.; Hadad, C.; Stanley, E.

    1997-06-25

    During the second quarter of this project, progress was made on both major technical tasks. Three parallel efforts were initiated on the modeling of carbon structural evolution. Structural ordering during carbonization was studied by a numerical simulation scheme proposed by Alan Kerstein involving molecular weight growth and rotational mobility. Work was also initiated to adapt a model of carbonaceous mesophase formation, originally developed under parallel NSF funding, to the prediction of coke texture. This latter work makes use of the FG-DVC model of coal pyrolysis developed by Advanced Fuel Research to specify the pool of aromatic clusters that participate in the order/disorder transition. Boston University has initiated molecular dynamics simulations of carbonization processes and Ohio State has begun theoretical treatment of surface reactions. Experimental work has also begun on model compound studies at Brown and on pilot-scale combustion systems with widely varying flame types at OSE. The work on mobility / growth models shows great promise and is discussed in detail in the body of the report.

  3. STRUCTURE BASED PREDICTIVE MODEL FOR COAL CHAR COMBUSTION

    SciTech Connect

    Robert Hurt; Joseph Calo; Robert Essenhigh; Christopher Hadad

    2001-06-15

    This report is part on the ongoing effort at Brown University and Ohio State University to develop structure based models of coal combustion. A very fundamental approach is taken to the description of coal chars and their reaction processes, and the results are therefore expected to have broad applicability to the spectrum of carbon materials of interest in energy technologies. This quarter, the project was in a period no-cost extension and discussions were held about the end phase of the project and possible continuations. The technical tasks were essentially dormant this period, but presentations of results were made, and plans were formulated for renewed activity in the fiscal year 2001.

  4. Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.

    PubMed

    Washietl, Stefan; Bernhart, Stephan H; Kellis, Manolis

    2014-01-01

    Many biologically important RNA structures are conserved in evolution leading to characteristic mutational patterns. RNAalifold is a widely used program to predict consensus secondary structures in multiple alignments by combining evolutionary information with traditional energy-based RNA folding algorithms. Here we describe the theory and applications of the RNAalifold algorithm. Consensus secondary structure prediction not only leads to significantly more accurate structure models, but it also allows to study structural conservation of functional RNAs. PMID:24639158

  5. Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.

    PubMed

    Washietl, Stefan; Bernhart, Stephan H; Kellis, Manolis

    2014-01-01

    Many biologically important RNA structures are conserved in evolution leading to characteristic mutational patterns. RNAalifold is a widely used program to predict consensus secondary structures in multiple alignments by combining evolutionary information with traditional energy-based RNA folding algorithms. Here we describe the theory and applications of the RNAalifold algorithm. Consensus secondary structure prediction not only leads to significantly more accurate structure models, but it also allows to study structural conservation of functional RNAs.

  6. Structure Based Predictive Model for Coal Char Combustion

    SciTech Connect

    Robert Hurt; Joseph Calo; Robert Essenhigh; Christopher Hadad

    2000-12-30

    This unique collaborative project has taken a very fundamental look at the origin of structure, and combustion reactivity of coal chars. It was a combined experimental and theoretical effort involving three universities and collaborators from universities outside the U.S. and from U.S. National Laboratories and contract research companies. The project goal was to improve our understanding of char structure and behavior by examining the fundamental chemistry of its polyaromatic building blocks. The project team investigated the elementary oxidative attack on polyaromatic systems, and coupled with a study of the assembly processes that convert these polyaromatic clusters to mature carbon materials (or chars). We believe that the work done in this project has defined a powerful new science-based approach to the understanding of char behavior. The work on aromatic oxidation pathways made extensive use of computational chemistry, and was led by Professor Christopher Hadad in the Department of Chemistry at Ohio State University. Laboratory experiments on char structure, properties, and combustion reactivity were carried out at both OSU and Brown, led by Principle Investigators Joseph Calo, Robert Essenhigh, and Robert Hurt. Modeling activities were divided into two parts: first unique models of crystal structure development were formulated by the team at Brown (PI'S Hurt and Calo) with input from Boston University and significant collaboration with Dr. Alan Kerstein at Sandia and with Dr. Zhong-Ying chen at SAIC. Secondly, new combustion models were developed and tested, led by Professor Essenhigh at OSU, Dieter Foertsch (a collaborator at the University of Stuttgart), and Professor Hurt at Brown. One product of this work is the CBK8 model of carbon burnout, which has already found practical use in CFD codes and in other numerical models of pulverized fuel combustion processes, such as EPRI's NOxLOI Predictor. The remainder of the report consists of detailed technical

  7. Knowledge base and neural network approach for protein secondary structure prediction.

    PubMed

    Patel, Maulika S; Mazumdar, Himanshu S

    2014-11-21

    Protein structure prediction is of great relevance given the abundant genomic and proteomic data generated by the genome sequencing projects. Protein secondary structure prediction is addressed as a sub task in determining the protein tertiary structure and function. In this paper, a novel algorithm, KB-PROSSP-NN, which is a combination of knowledge base and modeling of the exceptions in the knowledge base using neural networks for protein secondary structure prediction (PSSP), is proposed. The knowledge base is derived from a proteomic sequence-structure database and consists of the statistics of association between the 5-residue words and corresponding secondary structure. The predicted results obtained using knowledge base are refined with a Backpropogation neural network algorithm. Neural net models the exceptions of the knowledge base. The Q3 accuracy of 90% and 82% is achieved on the RS126 and CB396 test sets respectively which suggest improvement over existing state of art methods.

  8. A permutation based simulated annealing algorithm to predict pseudoknotted RNA secondary structures.

    PubMed

    Tsang, Herbert H; Wiese, Kay C

    2015-01-01

    Pseudoknots are RNA tertiary structures which perform essential biological functions. This paper discusses SARNA-Predict-pk, a RNA pseudoknotted secondary structure prediction algorithm based on Simulated Annealing (SA). The research presented here extends previous work of SARNA-Predict and further examines the effect of the new algorithm to include prediction of RNA secondary structure with pseudoknots. An evaluation of the performance of SARNA-Predict-pk in terms of prediction accuracy is made via comparison with several state-of-the-art prediction algorithms using 20 individual known structures from seven RNA classes. We measured the sensitivity and specificity of nine prediction algorithms. Three of these are dynamic programming algorithms: Pseudoknot (pknotsRE), NUPACK, and pknotsRG-mfe. One is using the statistical clustering approach: Sfold and the other five are heuristic algorithms: SARNA-Predict-pk, ILM, STAR, IPknot and HotKnots algorithms. The results presented in this paper demonstrate that SARNA-Predict-pk can out-perform other state-of-the-art algorithms in terms of prediction accuracy. This supports the use of the proposed method on pseudoknotted RNA secondary structure prediction of other known structures. PMID:26558299

  9. AWSEM-MD: Protein Structure Prediction Using Coarse-grained Physical Potentials and Bioinformatically Based Local Structure Biasing

    PubMed Central

    Davtyan, Aram; Schafer, Nicholas P.; Zheng, Weihua; Clementi, Cecilia; Wolynes, Peter G.; Papoian, Garegin A.

    2012-01-01

    The Associative memory, Water mediated, Structure and Energy Model (AWSEM) is a coarse-grained protein force field. AWSEM contains physically motivated terms, such as hydrogen bonding, as well as a bioinformatically based local structure biasing term, which efficiently takes into account many-body effects that are modulated by the local sequence. When combined with appropriate local or global alignments to choose memories, AWSEM can be used to perform de novo protein structure prediction. Herein we present structure prediction results for a particular choice of local sequence alignment method based on short residue sequences called fragments. We demonstrate the model’s structure prediction capabilities for three levels of global homology between the target sequence and those proteins used for local structure biasing, all of which assume that the structure of the target sequence is not known. When there are no homologs in the database of structures used for local structure biasing, AWSEM calculations produce structural predictions that are somewhat improved compared with prior works using related approaches. The inclusion of a small number of structures from homologous sequences improves structure prediction only marginally but when the fragment search is restricted to only homologous sequences, AWSEM can perform high resolution structure prediction and can be used for kinetics and dynamics studies. PMID:22545654

  10. PSRna: Prediction of small RNA secondary structures based on reverse complementary folding method.

    PubMed

    Li, Jin; Xu, Chengzhen; Wang, Lei; Liang, Hong; Feng, Weixing; Cai, Zhongxi; Wang, Ying; Cong, Wang; Liu, Yunlong

    2016-08-01

    Prediction of RNA secondary structures is an important problem in computational biology and bioinformatics, since RNA secondary structures are fundamental for functional analysis of RNA molecules. However, small RNA secondary structures are scarce and few algorithms have been specifically designed for predicting the secondary structures of small RNAs. Here we propose an algorithm named "PSRna" for predicting small-RNA secondary structures using reverse complementary folding and characteristic hairpin loops of small RNAs. Unlike traditional algorithms that usually generate multi-branch loops and 5[Formula: see text] end self-folding, PSRna first estimated the maximum number of base pairs of RNA secondary structures based on the dynamic programming algorithm and a path matrix is constructed at the same time. Second, the backtracking paths are extracted from the path matrix based on backtracking algorithm, and each backtracking path represents a secondary structure. To improve accuracy, the predicted RNA secondary structures are filtered based on their free energy, where only the secondary structure with the minimum free energy was identified as the candidate secondary structure. Our experiments on real data show that the proposed algorithm is superior to two popular methods, RNAfold and RNAstructure, in terms of sensitivity, specificity and Matthews correlation coefficient (MCC). PMID:27045556

  11. PSRna: Prediction of small RNA secondary structures based on reverse complementary folding method.

    PubMed

    Li, Jin; Xu, Chengzhen; Wang, Lei; Liang, Hong; Feng, Weixing; Cai, Zhongxi; Wang, Ying; Cong, Wang; Liu, Yunlong

    2016-08-01

    Prediction of RNA secondary structures is an important problem in computational biology and bioinformatics, since RNA secondary structures are fundamental for functional analysis of RNA molecules. However, small RNA secondary structures are scarce and few algorithms have been specifically designed for predicting the secondary structures of small RNAs. Here we propose an algorithm named "PSRna" for predicting small-RNA secondary structures using reverse complementary folding and characteristic hairpin loops of small RNAs. Unlike traditional algorithms that usually generate multi-branch loops and 5[Formula: see text] end self-folding, PSRna first estimated the maximum number of base pairs of RNA secondary structures based on the dynamic programming algorithm and a path matrix is constructed at the same time. Second, the backtracking paths are extracted from the path matrix based on backtracking algorithm, and each backtracking path represents a secondary structure. To improve accuracy, the predicted RNA secondary structures are filtered based on their free energy, where only the secondary structure with the minimum free energy was identified as the candidate secondary structure. Our experiments on real data show that the proposed algorithm is superior to two popular methods, RNAfold and RNAstructure, in terms of sensitivity, specificity and Matthews correlation coefficient (MCC).

  12. Protein Structure Prediction using a Docking-based Hierarchical Folding scheme

    PubMed Central

    Kifer, Ilona; Nussinov, Ruth; Wolfson, Haim J.

    2011-01-01

    The pathways by which proteins fold into their specific native structure is still an unsolved mystery. Currently many methods for protein structure prediction are available, most of them tackle the problem by relying on the vast amounts of data collected from known protein structures. These methods are often not concerned with the route the protein follows to reach its final fold. This work is based on the premise that proteins fold in a hierarchical manner. We present FOBIA, an automated method for predicting a protein structure. FOBIA consists of two main stages: the first finds matches between parts of the target sequence and independently-folding structural units using profile-profile comparison. The second assembles these units into a 3D structure by searching and ranking their possible orientations towards each other using a docking-based approach. We have previously reported an application of an initial version of this strategy to homology based targets. Since then we have considerably enhanced our method’s abilities to allow it to address the more difficult template-based target category. This allows us to now apply FOBIA to the Template-Based targets of CASP8 and to show that it is both very efficient and promising. Our method can provide an alternative for Template-Based structure prediction, and in particular, the docking-based ranking technique presented here can be incorporated into any profile-profile comparison based method. PMID:21445943

  13. Protein structure prediction using a docking-based hierarchical folding scheme.

    PubMed

    Kifer, Ilona; Nussinov, Ruth; Wolfson, Haim J

    2011-06-01

    The pathways by which proteins fold into their specific native structure are still an unsolved mystery. Currently, many methods for protein structure prediction are available, and most of them tackle the problem by relying on the vast amounts of data collected from known protein structures. These methods are often not concerned with the route the protein follows to reach its final fold. This work is based on the premise that proteins fold in a hierarchical manner. We present FOBIA, an automated method for predicting a protein structure. FOBIA consists of two main stages: the first finds matches between parts of the target sequence and independently folding structural units using profile-profile comparison. The second assembles these units into a 3D structure by searching and ranking their possible orientations toward each other using a docking-based approach. We have previously reported an application of an initial version of this strategy to homology based targets. Since then we have considerably enhanced our method's abilities to allow it to address the more difficult template-based target category. This allows us to now apply FOBIA to the template-based targets of CASP8 and to show that it is both very efficient and promising. Our method can provide an alternative for template-based structure prediction, and in particular, the docking-basedranking technique presented here can be incorporated into any profile-profile comparison based method. PMID:21445943

  14. RNA secondary structure prediction based on SHAPE data in helix regions.

    PubMed

    Lotfi, Mohadeseh; Zare-Mirakabad, Fatemeh; Montaseri, Soheila

    2015-09-01

    RNA molecules play important and fundamental roles in biological processes. Frequently, the functional form of single-stranded RNA molecules requires a specific tertiary structure. Classically, RNA structure determination has mostly been accomplished by X-Ray crystallography or Nuclear Magnetic Resonance approaches. These experimental methods are time consuming and expensive. In the past two decades, some computational methods and algorithms have been developed for RNA secondary structure prediction. In these algorithms, minimum free energy is known as the best criterion. However, the results of algorithms show that minimum free energy is not a sufficient criterion to predict RNA secondary structure. These algorithms need some additional knowledge about the structure, which has to be added in the methods. Recently, the information obtained from some experimental data, called SHAPE, can greatly improve the consistency between the native and predicted RNA secondary structure. In this paper, we investigate the influence of SHAPE data on four types of RNA substructures, helices, loops, base pairs from the start and end of helices and two base pairs from the start and end of helices. The results show that SHAPE data in helix regions can improve the prediction. We represent a new method to apply SHAPE data in helix regions for finding RNA secondary structure. Finally, we compare the results of the method on a set of RNAs to predict minimum free energy structure based on considering all SHAPE data and only SHAPE data in helix regions as pseudo free energy and without SHAPE data (without any pseudo free energy). The results show that RNA secondary structure prediction based on considering only SHAPE data in helix regions is more successful than not considering SHAPE data and it provides competitive results in comparison with considering all SHAPE data.

  15. Link prediction based on hyperbolic mapping with community structure for complex networks

    NASA Astrophysics Data System (ADS)

    Wang, Zuxi; Wu, Yao; Li, Qingguang; Jin, Fengdong; Xiong, Wei

    2016-05-01

    Link prediction is becoming a concerned topic in the complex network field in recent years. However, the existing link prediction methods are unsatisfactory for processing topological information and have high time complexity. This paper presents a novel method of Link Prediction with Community Structure (LPCS) based on hyperbolic mapping. Different from the existing link prediction methods, to utilize global structure information of the network, LPCS deals with the network from an overall perspective. LPCS takes full advantage of the community structure and its hierarchical organization to map networks into hyperbolic space, and obtains the hyperbolic coordinates which depict the global structure information of the network, then uses hyperbolic distance to describe the similarity between the nodes, finally predicts missing links according to the degree of the similarity between unconnected node pairs. The combination of the hyperbolic geometry framework and the community structure makes LPCS perform well in predicting missing links, and the time complexity of LPCS is linear, which makes LPCS can be applied to handle large scale networks in acceptable time. LPCS outperforms many state-of-the-art link prediction methods in the networks obeying power-law degree distribution.

  16. Towards universal structure-based prediction of class II MHC epitopes for diverse allotypes.

    PubMed

    Bordner, Andrew J

    2010-01-01

    The binding of peptide fragments of antigens to class II MHC proteins is a crucial step in initiating a helper T cell immune response. The discovery of these peptide epitopes is important for understanding the normal immune response and its misregulation in autoimmunity and allergies and also for vaccine design. In spite of their biomedical importance, the high diversity of class II MHC proteins combined with the large number of possible peptide sequences make comprehensive experimental determination of epitopes for all MHC allotypes infeasible. Computational methods can address this need by predicting epitopes for a particular MHC allotype. We present a structure-based method for predicting class II epitopes that combines molecular mechanics docking of a fully flexible peptide into the MHC binding cleft followed by binding affinity prediction using a machine learning classifier trained on interaction energy components calculated from the docking solution. Although the primary advantage of structure-based prediction methods over the commonly employed sequence-based methods is their applicability to essentially any MHC allotype, this has not yet been convincingly demonstrated. In order to test the transferability of the prediction method to different MHC proteins, we trained the scoring method on binding data for DRB1*0101 and used it to make predictions for multiple MHC allotypes with distinct peptide binding specificities including representatives from the other human class II MHC loci, HLA-DP and HLA-DQ, as well as for two murine allotypes. The results showed that the prediction method was able to achieve significant discrimination between epitope and non-epitope peptides for all MHC allotypes examined, based on AUC values in the range 0.632-0.821. We also discuss how accounting for peptide binding in multiple registers to class II MHC largely explains the systematically worse performance of prediction methods for class II MHC compared with those for class I MHC

  17. Effect of Using Suboptimal Alignments in Template-Based Protein Structure Prediction

    PubMed Central

    Chen, Hao; Kihara, Daisuke

    2010-01-01

    Computational protein structure prediction remains a challenging task in protein bioinformatics. In the recent years, the importance of template-based structure prediction is increasing due to the growing number of protein structures solved by the structural genomics projects. To capitalize the significant efforts and investments paid on the structural genomics projects, it is urgent to establish effective ways to use the solved structures as templates by developing methods for exploiting remotely related proteins that cannot be simply identified by homology. In this work, we examine the effect of employing suboptimal alignments in template-based protein structure prediction. We showed that suboptimal alignments are often more accurate than the optimal one, and such accurate suboptimal alignments can occur even at a very low rank of the alignment score. Suboptimal alignments contain a significant number of correct amino acid residue contacts. Moreover, suboptimal alignments can improve template-based models when used as input to Modeller. Finally, we employ suboptimal alignments for handling a contact potential in a probabilistic way in a threading program, SUPRB. The probabilistic contacts strategy outperforms the partly thawed approach which only uses the optimal alignment in defining residue contacts and also the reranking strategy, which uses the contact potential in reranking alignments. The comparison with existing methods in the template-recognition test shows that SUPRB is very competitive and outperform existing methods. PMID:21058297

  18. Structural link prediction based on ant colony approach in social networks

    NASA Astrophysics Data System (ADS)

    Sherkat, Ehsan; Rahgozar, Maseud; Asadpour, Masoud

    2015-02-01

    As the size and number of online social networks are increasing day by day, social network analysis has become a popular issue in many branches of science. The link prediction is one of the key rolling issues in the analysis of social network's evolution. As the size of social networks is increasing, the necessity for scalable link prediction algorithms is being felt more. The aim of this paper is to introduce a new unsupervised structural link prediction algorithm based on the ant colony approach. Recently, ant colony approach has been used for solving some graph problems. Different kinds of networks are used for testing the proposed approach. In some networks, the proposed scalable algorithm has the best result in comparison to other structural unsupervised link prediction algorithms. In order to evaluate the algorithm results, methods like the top- n precision, area under the Receiver Operating Characteristic (ROC) and Precision-Recall curves are carried out on real-world networks.

  19. Facing the challenges of structure-based target prediction by inverse virtual screening.

    PubMed

    Schomburg, Karen T; Bietz, Stefan; Briem, Hans; Henzler, Angela M; Urbaczek, Sascha; Rarey, Matthias

    2014-06-23

    Computational target prediction for bioactive compounds is a promising field in assessing off-target effects. Structure-based methods not only predict off-targets, but, simultaneously, binding modes, which are essential for understanding the mode of action and rationally designing selective compounds. Here, we highlight the current open challenges of computational target prediction methods based on protein structures and show why inverse screening rather than sequential pairwise protein-ligand docking methods are needed. A new inverse screening method based on triangle descriptors is introduced: iRAISE (inverse Rapid Index-based Screening Engine). A Scoring Cascade considering the reference ligand as well as the ligand and active site coverage is applied to overcome interprotein scoring noise of common protein-ligand scoring functions. Furthermore, a statistical evaluation of a score cutoff for each individual protein pocket is used. The ranking and binding mode prediction capabilities are evaluated on different datasets and compared to inverse docking and pharmacophore-based methods. On the Astex Diverse Set, iRAISE ranks more than 35% of the targets to the first position and predicts more than 80% of the binding modes with a root-mean-square deviation (RMSD) accuracy of <2.0 Å. With a median computing time of 5 s per protein, large amounts of protein structures can be screened rapidly. On a test set with 7915 protein structures and 117 query ligands, iRAISE predicts the first true positive in a ranked list among the top eight ranks (median), i.e., among 0.28% of the targets. PMID:24851945

  20. Genomic-scale comparison of sequence- and structure-based methods of function prediction: Does structure provide additional insight?

    PubMed Central

    Fetrow, Jacquelyn S.; Siew, Naomi; Di Gennaro, Jeannine A.; Martinez-Yamout, Maria; Dyson, H. Jane; Skolnick, Jeffrey

    2001-01-01

    A function annotation method using the sequence-to-structure-to-function paradigm is applied to the identification of all disulfide oxidoreductases in the Saccharomyces cerevisiae genome. The method identifies 27 sequences as potential disulfide oxidoreductases. All previously known thioredoxins, glutaredoxins, and disulfide isomerases are correctly identified. Three of the 27 predictions are probable false-positives. Three novel predictions, which subsequently have been experimentally validated, are presented. Two additional novel predictions suggest a disulfide oxidoreductase regulatory mechanism for two subunits (OST3 and OST6) of the yeast oligosaccharyltransferase complex. Based on homology, this prediction can be extended to a potential tumor suppressor gene, N33, in humans, whose biochemical function was not previously known. Attempts to obtain a folded, active N33 construct to test the prediction were unsuccessful. The results show that structure prediction coupled with biochemically relevant structural motifs is a powerful method for the function annotation of genome sequences and can provide more detailed, robust predictions than function prediction methods that rely on sequence comparison alone. PMID:11316881

  1. FINDSITE: a combined evolution/structure-based approach to protein function prediction

    PubMed Central

    Brylinski, Michal

    2009-01-01

    A key challenge of the post-genomic era is the identification of the function(s) of all the molecules in a given organism. Here, we review the status of sequence and structure-based approaches to protein function inference and ligand screening that can provide functional insights for a significant fraction of the ∼50% of ORFs of unassigned function in an average proteome. We then describe FINDSITE, a recently developed algorithm for ligand binding site prediction, ligand screening and molecular function prediction, which is based on binding site conservation across evolutionary distant proteins identified by threading. Importantly, FINDSITE gives comparable results when high-resolution experimental structures as well as predicted protein models are used. PMID:19324930

  2. Structure- and Sequence-Based Function Prediction for Non-Homologous Proteins

    PubMed Central

    Sael, Lee; Chitale, Meghana; Kihara, Daisuke

    2012-01-01

    The structural genomics projects have been accumulating an increasing number of protein structures, many of which remain functionally unknown. In parallel effort to experimental methods, computational methods are expected to make a significant contribution for functional elucidation of such proteins. However, conventional computational methods that transfer functions from homologous proteins do not help much for these uncharacterized protein structures because they do not have apparent structural or sequence similarity with the known proteins. Here, we briefly review two avenues of computational function prediction methods, i.e. structure-based methods and sequence-based methods. The focus is on our recently developments of local structure-based methods and sequence-based methods, which can effectively extract function information from distantly related proteins. Two structure-based methods, Pocket-Surfer and Patch-Surfer, identify similar known ligand binding sites for pocket regions in a query protein without using global protein fold similarity information. Two sequence-based methods, PFP and ESG, make use of weakly similar sequences that are conventionally discarded in homology based function annotation. Combined together with experimental methods we hope that computational methods will make leading contribution in functional elucidation of the protein structures. PMID:22270458

  3. Structural predictions based on the compositions of cathodic materials by first-principles calculations

    NASA Astrophysics Data System (ADS)

    Li, Yang; Lian, Fang; Chen, Ning; Hao, Zhen-jia; Chou, Kuo-chih

    2015-05-01

    A first-principles method is applied to comparatively study the stability of lithium metal oxides with layered or spinel structures to predict the most energetically favorable structure for different compositions. The binding and reaction energies of the real or virtual layered LiMO2 and spinel LiM2O4 (M = Sc-Cu, Y-Ag, Mg-Sr, and Al-In) are calculated. The effect of element M on the structural stability, especially in the case of multiple-cation compounds, is discussed herein. The calculation results indicate that the phase stability depends on both the binding and reaction energies. The oxidation state of element M also plays a role in determining the dominant structure, i.e., layered or spinel phase. Moreover, calculation-based theoretical predictions of the phase stability of the doped materials agree with the previously reported experimental data.

  4. An integrative structure-based framework for predicting biological effects mediated by antipeptide antibodies.

    PubMed

    Caoili, Salvador Eugenio C

    2015-12-01

    A general framework is presented for predicting quantitative biological effects mediated by antipeptide antibodies, primarily on the basis of antigen structure (possibly featuring intrinsic disorder) analyzed to estimate epitope-paratope binding affinities, which in turn is considered within the context of dose-response relationships as regards antibody concentration. This is illustrated mainly using an approach based on protein structural energetics, whereby expected amounts of solvent-accessible surface area buried upon epitope-paratope binding are related to the corresponding binding affinity, which is estimated from putative B-cell epitope structure with implicit treatment of paratope structure, for antipeptide antibodies either reacting with peptides or cross-reacting with cognate protein antigens. Key methods described are implemented in SAPPHIRE/SUITE (Structural-energetic Analysis Program for Predicting Humoral Immune Response Epitopes/SAPPHIRE User Interface Tool Ensemble; publicly accessible via http://freeshell.de/~badong/suite.htm). Representative results thus obtained are compared with published experimental data on binding affinities and quantitative biological effects, with special attention to loss of paratope sidechain conformational entropy (neglected in previous analyses) and in light of key in-vivo constraints on antigen-antibody binding affinity and antibody-mediated effects. Implications for further refinement of B-cell epitope prediction methods are discussed as regards envisioned biomedical applications including the development of prophylactic and therapeutic antibodies, peptide-based vaccines and immunodiagnostics. PMID:26410103

  5. Towards Universal Structure-Based Prediction of Class II MHC Epitopes for Diverse Allotypes

    PubMed Central

    Bordner, Andrew J.

    2010-01-01

    The binding of peptide fragments of antigens to class II MHC proteins is a crucial step in initiating a helper T cell immune response. The discovery of these peptide epitopes is important for understanding the normal immune response and its misregulation in autoimmunity and allergies and also for vaccine design. In spite of their biomedical importance, the high diversity of class II MHC proteins combined with the large number of possible peptide sequences make comprehensive experimental determination of epitopes for all MHC allotypes infeasible. Computational methods can address this need by predicting epitopes for a particular MHC allotype. We present a structure-based method for predicting class II epitopes that combines molecular mechanics docking of a fully flexible peptide into the MHC binding cleft followed by binding affinity prediction using a machine learning classifier trained on interaction energy components calculated from the docking solution. Although the primary advantage of structure-based prediction methods over the commonly employed sequence-based methods is their applicability to essentially any MHC allotype, this has not yet been convincingly demonstrated. In order to test the transferability of the prediction method to different MHC proteins, we trained the scoring method on binding data for DRB1*0101 and used it to make predictions for multiple MHC allotypes with distinct peptide binding specificities including representatives from the other human class II MHC loci, HLA-DP and HLA-DQ, as well as for two murine allotypes. The results showed that the prediction method was able to achieve significant discrimination between epitope and non-epitope peptides for all MHC allotypes examined, based on AUC values in the range 0.632–0.821. We also discuss how accounting for peptide binding in multiple registers to class II MHC largely explains the systematically worse performance of prediction methods for class II MHC compared with those for class I

  6. An Energy Based Fatigue Life Prediction Framework for In-Service Structural Components

    SciTech Connect

    H. Ozaltun; M. H.H. Shen; T. George; C. Cross

    2011-06-01

    An energy based fatigue life prediction framework has been developed for calculation of remaining fatigue life of in service gas turbine materials. The purpose of the life prediction framework is to account aging effect caused by cyclic loadings on fatigue strength of gas turbine engines structural components which are usually designed for very long life. Previous studies indicate the total strain energy dissipated during a monotonic fracture process and a cyclic process is a material property that can be determined by measuring the area underneath the monotonic true stress-strain curve and the sum of the area within each hysteresis loop in the cyclic process, respectively. The energy-based fatigue life prediction framework consists of the following entities: (1) development of a testing procedure to achieve plastic energy dissipation per life cycle and (2) incorporation of an energy-based fatigue life calculation scheme to determine the remaining fatigue life of in-service gas turbine materials. The accuracy of the remaining fatigue life prediction method was verified by comparison between model approximation and experimental results of Aluminum 6061-T6. The comparison shows promising agreement, thus validating the capability of the framework to produce accurate fatigue life prediction.

  7. Protein subcellular localization prediction based on compartment-specific features and structure conservation

    PubMed Central

    Su, Emily Chia-Yu; Chiu, Hua-Sheng; Lo, Allan; Hwang, Jenn-Kang; Sung, Ting-Yi; Hsu, Wen-Lian

    2007-01-01

    Background Protein subcellular localization is crucial for genome annotation, protein function prediction, and drug discovery. Determination of subcellular localization using experimental approaches is time-consuming; thus, computational approaches become highly desirable. Extensive studies of localization prediction have led to the development of several methods including composition-based and homology-based methods. However, their performance might be significantly degraded if homologous sequences are not detected. Moreover, methods that integrate various features could suffer from the problem of low coverage in high-throughput proteomic analyses due to the lack of information to characterize unknown proteins. Results We propose a hybrid prediction method for Gram-negative bacteria that combines a one-versus-one support vector machines (SVM) model and a structural homology approach. The SVM model comprises a number of binary classifiers, in which biological features derived from Gram-negative bacteria translocation pathways are incorporated. In the structural homology approach, we employ secondary structure alignment for structural similarity comparison and assign the known localization of the top-ranked protein as the predicted localization of a query protein. The hybrid method achieves overall accuracy of 93.7% and 93.2% using ten-fold cross-validation on the benchmark data sets. In the assessment of the evaluation data sets, our method also attains accurate prediction accuracy of 84.0%, especially when testing on sequences with a low level of homology to the training data. A three-way data split procedure is also incorporated to prevent overestimation of the predictive performance. In addition, we show that the prediction accuracy should be approximately 85% for non-redundant data sets of sequence identity less than 30%. Conclusion Our results demonstrate that biological features derived from Gram-negative bacteria translocation pathways yield a significant

  8. Genetic programming based quantitative structure-retention relationships for the prediction of Kovats retention indices.

    PubMed

    Goel, Purva; Bapat, Sanket; Vyas, Renu; Tambe, Amruta; Tambe, Sanjeev S

    2015-11-13

    The development of quantitative structure-retention relationships (QSRR) aims at constructing an appropriate linear/nonlinear model for the prediction of the retention behavior (such as Kovats retention index) of a solute on a chromatographic column. Commonly, multi-linear regression and artificial neural networks are used in the QSRR development in the gas chromatography (GC). In this study, an artificial intelligence based data-driven modeling formalism, namely genetic programming (GP), has been introduced for the development of quantitative structure based models predicting Kovats retention indices (KRI). The novelty of the GP formalism is that given an example dataset, it searches and optimizes both the form (structure) and the parameters of an appropriate linear/nonlinear data-fitting model. Thus, it is not necessary to pre-specify the form of the data-fitting model in the GP-based modeling. These models are also less complex, simple to understand, and easy to deploy. The effectiveness of GP in constructing QSRRs has been demonstrated by developing models predicting KRIs of light hydrocarbons (case study-I) and adamantane derivatives (case study-II). In each case study, two-, three- and four-descriptor models have been developed using the KRI data available in the literature. The results of these studies clearly indicate that the GP-based models possess an excellent KRI prediction accuracy and generalization capability. Specifically, the best performing four-descriptor models in both the case studies have yielded high (>0.9) values of the coefficient of determination (R(2)) and low values of root mean squared error (RMSE) and mean absolute percent error (MAPE) for training, test and validation set data. The characteristic feature of this study is that it introduces a practical and an effective GP-based method for developing QSRRs in gas chromatography that can be gainfully utilized for developing other types of data-driven models in chromatography science

  9. Genetic programming based quantitative structure-retention relationships for the prediction of Kovats retention indices.

    PubMed

    Goel, Purva; Bapat, Sanket; Vyas, Renu; Tambe, Amruta; Tambe, Sanjeev S

    2015-11-13

    The development of quantitative structure-retention relationships (QSRR) aims at constructing an appropriate linear/nonlinear model for the prediction of the retention behavior (such as Kovats retention index) of a solute on a chromatographic column. Commonly, multi-linear regression and artificial neural networks are used in the QSRR development in the gas chromatography (GC). In this study, an artificial intelligence based data-driven modeling formalism, namely genetic programming (GP), has been introduced for the development of quantitative structure based models predicting Kovats retention indices (KRI). The novelty of the GP formalism is that given an example dataset, it searches and optimizes both the form (structure) and the parameters of an appropriate linear/nonlinear data-fitting model. Thus, it is not necessary to pre-specify the form of the data-fitting model in the GP-based modeling. These models are also less complex, simple to understand, and easy to deploy. The effectiveness of GP in constructing QSRRs has been demonstrated by developing models predicting KRIs of light hydrocarbons (case study-I) and adamantane derivatives (case study-II). In each case study, two-, three- and four-descriptor models have been developed using the KRI data available in the literature. The results of these studies clearly indicate that the GP-based models possess an excellent KRI prediction accuracy and generalization capability. Specifically, the best performing four-descriptor models in both the case studies have yielded high (>0.9) values of the coefficient of determination (R(2)) and low values of root mean squared error (RMSE) and mean absolute percent error (MAPE) for training, test and validation set data. The characteristic feature of this study is that it introduces a practical and an effective GP-based method for developing QSRRs in gas chromatography that can be gainfully utilized for developing other types of data-driven models in chromatography science.

  10. Automated protein motif generation in the structure-based protein function prediction tool ProMOL.

    PubMed

    Osipovitch, Mikhail; Lambrecht, Mitchell; Baker, Cameron; Madha, Shariq; Mills, Jeffrey L; Craig, Paul A; Bernstein, Herbert J

    2015-12-01

    ProMOL, a plugin for the PyMOL molecular graphics system, is a structure-based protein function prediction tool. ProMOL includes a set of routines for building motif templates that are used for screening query structures for enzyme active sites. Previously, each motif template was generated manually and required supervision in the optimization of parameters for sensitivity and selectivity. We developed an algorithm and workflow for the automation of motif building and testing routines in ProMOL. The algorithm uses a set of empirically derived parameters for optimization and requires little user intervention. The automated motif generation algorithm was first tested in a performance comparison with a set of manually generated motifs based on identical active sites from the same 112 PDB entries. The two sets of motifs were equally effective in identifying alignments with homologs and in rejecting alignments with unrelated structures. A second set of 296 active site motifs were generated automatically, based on Catalytic Site Atlas entries with literature citations, as an expansion of the library of existing manually generated motif templates. The new motif templates exhibited comparable performance to the existing ones in terms of hit rates against native structures, homologs with the same EC and Pfam designations, and randomly selected unrelated structures with a different EC designation at the first EC digit, as well as in terms of RMSD values obtained from local structural alignments of motifs and query structures. This research is supported by NIH grant GM078077. PMID:26573864

  11. 3D Chemical Similarity Networks for Structure-Based Target Prediction and Scaffold Hopping.

    PubMed

    Lo, Yu-Chen; Senese, Silvia; Damoiseaux, Robert; Torres, Jorge Z

    2016-08-19

    Target identification remains a major challenge for modern drug discovery programs aimed at understanding the molecular mechanisms of drugs. Computational target prediction approaches like 2D chemical similarity searches have been widely used but are limited to structures sharing high chemical similarity. Here, we present a new computational approach called chemical similarity network analysis pull-down 3D (CSNAP3D) that combines 3D chemical similarity metrics and network algorithms for structure-based drug target profiling, ligand deorphanization, and automated identification of scaffold hopping compounds. In conjunction with 2D chemical similarity fingerprints, CSNAP3D achieved a >95% success rate in correctly predicting the drug targets of 206 known drugs. Significant improvement in target prediction was observed for HIV reverse transcriptase (HIVRT) compounds, which consist of diverse scaffold hopping compounds targeting the nucleotidyltransferase binding site. CSNAP3D was further applied to a set of antimitotic compounds identified in a cell-based chemical screen and identified novel small molecules that share a pharmacophore with Taxol and display a Taxol-like mechanism of action, which were validated experimentally using in vitro microtubule polymerization assays and cell-based assays.

  12. Structure Based Thermostability Prediction Models for Protein Single Point Mutations with Machine Learning Tools

    PubMed Central

    Jia, Lei; Yarlagadda, Ramya; Reed, Charles C.

    2015-01-01

    Thermostability issue of protein point mutations is a common occurrence in protein engineering. An application which predicts the thermostability of mutants can be helpful for guiding decision making process in protein design via mutagenesis. An in silico point mutation scanning method is frequently used to find “hot spots” in proteins for focused mutagenesis. ProTherm (http://gibk26.bio.kyutech.ac.jp/jouhou/Protherm/protherm.html) is a public database that consists of thousands of protein mutants’ experimentally measured thermostability. Two data sets based on two differently measured thermostability properties of protein single point mutations, namely the unfolding free energy change (ddG) and melting temperature change (dTm) were obtained from this database. Folding free energy change calculation from Rosetta, structural information of the point mutations as well as amino acid physical properties were obtained for building thermostability prediction models with informatics modeling tools. Five supervised machine learning methods (support vector machine, random forests, artificial neural network, naïve Bayes classifier, K nearest neighbor) and partial least squares regression are used for building the prediction models. Binary and ternary classifications as well as regression models were built and evaluated. Data set redundancy and balancing, the reverse mutations technique, feature selection, and comparison to other published methods were discussed. Rosetta calculated folding free energy change ranked as the most influential features in all prediction models. Other descriptors also made significant contributions to increasing the accuracy of the prediction models. PMID:26361227

  13. Structure Based Thermostability Prediction Models for Protein Single Point Mutations with Machine Learning Tools.

    PubMed

    Jia, Lei; Yarlagadda, Ramya; Reed, Charles C

    2015-01-01

    Thermostability issue of protein point mutations is a common occurrence in protein engineering. An application which predicts the thermostability of mutants can be helpful for guiding decision making process in protein design via mutagenesis. An in silico point mutation scanning method is frequently used to find "hot spots" in proteins for focused mutagenesis. ProTherm (http://gibk26.bio.kyutech.ac.jp/jouhou/Protherm/protherm.html) is a public database that consists of thousands of protein mutants' experimentally measured thermostability. Two data sets based on two differently measured thermostability properties of protein single point mutations, namely the unfolding free energy change (ddG) and melting temperature change (dTm) were obtained from this database. Folding free energy change calculation from Rosetta, structural information of the point mutations as well as amino acid physical properties were obtained for building thermostability prediction models with informatics modeling tools. Five supervised machine learning methods (support vector machine, random forests, artificial neural network, naïve Bayes classifier, K nearest neighbor) and partial least squares regression are used for building the prediction models. Binary and ternary classifications as well as regression models were built and evaluated. Data set redundancy and balancing, the reverse mutations technique, feature selection, and comparison to other published methods were discussed. Rosetta calculated folding free energy change ranked as the most influential features in all prediction models. Other descriptors also made significant contributions to increasing the accuracy of the prediction models.

  14. Structure Based Thermostability Prediction Models for Protein Single Point Mutations with Machine Learning Tools.

    PubMed

    Jia, Lei; Yarlagadda, Ramya; Reed, Charles C

    2015-01-01

    Thermostability issue of protein point mutations is a common occurrence in protein engineering. An application which predicts the thermostability of mutants can be helpful for guiding decision making process in protein design via mutagenesis. An in silico point mutation scanning method is frequently used to find "hot spots" in proteins for focused mutagenesis. ProTherm (http://gibk26.bio.kyutech.ac.jp/jouhou/Protherm/protherm.html) is a public database that consists of thousands of protein mutants' experimentally measured thermostability. Two data sets based on two differently measured thermostability properties of protein single point mutations, namely the unfolding free energy change (ddG) and melting temperature change (dTm) were obtained from this database. Folding free energy change calculation from Rosetta, structural information of the point mutations as well as amino acid physical properties were obtained for building thermostability prediction models with informatics modeling tools. Five supervised machine learning methods (support vector machine, random forests, artificial neural network, naïve Bayes classifier, K nearest neighbor) and partial least squares regression are used for building the prediction models. Binary and ternary classifications as well as regression models were built and evaluated. Data set redundancy and balancing, the reverse mutations technique, feature selection, and comparison to other published methods were discussed. Rosetta calculated folding free energy change ranked as the most influential features in all prediction models. Other descriptors also made significant contributions to increasing the accuracy of the prediction models. PMID:26361227

  15. Structure-based prediction of transcription factor binding specificity using an integrative energy function

    PubMed Central

    Farrel, Alvin; Murphy, Jonathan; Guo, Jun-tao

    2016-01-01

    Transcription factors (TFs) regulate gene expression through binding to specific target DNA sites. Accurate annotation of transcription factor binding sites (TFBSs) at genome scale represents an essential step toward our understanding of gene regulation networks. In this article, we present a structure-based method for computational prediction of TFBSs using a novel, integrative energy (IE) function. The new energy function combines a multibody (MB) knowledge-based potential and two atomic energy terms (hydrogen bond and π interaction) that might not be accurately captured by the knowledge-based potential owing to the mean force nature and low count problem. We applied the new energy function to the TFBS prediction using a non-redundant dataset that consists of TFs from 12 different families. Our results show that the new IE function improves the prediction accuracy over the knowledge-based, statistical potentials, especially for homeodomain TFs, the second largest TF family in mammals. Contact: jguo4@uncc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27307632

  16. B-Pred, a structure based B-cell epitopes prediction server.

    PubMed

    Giacò, Luciano; Amicosante, Massimo; Fraziano, Maurizio; Gherardini, Pier Federico; Ausiello, Gabriele; Helmer-Citterich, Manuela; Colizzi, Vittorio; Cabibbo, Andrea

    2012-01-01

    The ability to predict immunogenic regions in selected proteins by in-silico methods has broad implications, such as allowing a quick selection of potential reagents to be used as diagnostics, vaccines, immunotherapeutics, or research tools in several branches of biological and biotechnological research. However, the prediction of antibody target sites in proteins using computational methodologies has proven to be a highly challenging task, which is likely due to the somewhat elusive nature of B-cell epitopes. This paper proposes a web-based platform for scoring potential immunological reagents based on the structures or 3D models of the proteins of interest. The method scores a protein's peptides set, which is derived from a sliding window, based on the average solvent exposure, with a filter on the average local model quality for each peptide. The platform was validated on a custom-assembled database of 1336 experimentally determined epitopes from 106 proteins for which a reliable 3D model could be obtained through standard modeling techniques. Despite showing poor sensitivity, this method can achieve a specificity of 0.70 and a positive predictive value of 0.29 by combining these two simple parameters. These values are slightly higher than those obtained with other established sequence-based or structure-based methods that have been evaluated using the same epitopes dataset. This method is implemented in a web server called B-Pred, which is accessible at http://immuno.bio.uniroma2.it/bpred. The server contains a number of original features that allow users to perform personalized reagent searches by manipulating the sliding window's width and sliding step, changing the exposure and model quality thresholds, and running sequential queries with different parameters. The B-Pred server should assist experimentalists in the rational selection of epitope antigens for a wide range of applications. PMID:22888263

  17. CASP11--An Evaluation of a Modular BCL::Fold-Based Protein Structure Prediction Pipeline.

    PubMed

    Fischer, Axel W; Heinze, Sten; Putnam, Daniel K; Li, Bian; Pino, James C; Xia, Yan; Lopez, Carlos F; Meiler, Jens

    2016-01-01

    In silico prediction of a protein's tertiary structure remains an unsolved problem. The community-wide Critical Assessment of Protein Structure Prediction (CASP) experiment provides a double-blind study to evaluate improvements in protein structure prediction algorithms. We developed a protein structure prediction pipeline employing a three-stage approach, consisting of low-resolution topology search, high-resolution refinement, and molecular dynamics simulation to predict the tertiary structure of proteins from the primary structure alone or including distance restraints either from predicted residue-residue contacts, nuclear magnetic resonance (NMR) nuclear overhauser effect (NOE) experiments, or mass spectroscopy (MS) cross-linking (XL) data. The protein structure prediction pipeline was evaluated in the CASP11 experiment on twenty regular protein targets as well as thirty-three 'assisted' protein targets, which also had distance restraints available. Although the low-resolution topology search module was able to sample models with a global distance test total score (GDT_TS) value greater than 30% for twelve out of twenty proteins, frequently it was not possible to select the most accurate models for refinement, resulting in a general decay of model quality over the course of the prediction pipeline. In this study, we provide a detailed overall analysis, study one target protein in more detail as it travels through the protein structure prediction pipeline, and evaluate the impact of limited experimental data.

  18. Delineating the relationship between amyotrophic lateral sclerosis and frontotemporal dementia: Sequence and structure-based predictions.

    PubMed

    Kumar, Vijay; Islam, Asimul; Hassan, Md Imtaiyaz; Ahmad, Faizan

    2016-09-01

    Amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) are related neurodegenerative disorders which are characterized by a rapid decline in cognitive and motor functions, and short survival. Both syndromes may be present within the same family or even in the same person. The genetic findings for both diseases also support the existence of a continuum, with mutations in the same genes being found in patients with ALS, FTD or FTD/ALS. Little is known about the molecular mechanisms underlying the differences in mutations of the same protein causing either ALS or FTD. Here, we shed light on 348 ALS and FTD missense mutations in 14 genes focusing on genic intolerance and protein stability based on available 3D structures. Using EvoTol, we prioritized the disease-causing genes and their domain. The most intolerant genes predicted by EvoTol are SQSTM1 and OPTN which are involved in protein homeostasis. Further, using ENCoM (Elastic Network Contact Model) that predicts stability based on vibrational entropy, we predicted that most of the missense mutations with destabilizing energies are in the structural regions that control the protein-protein interaction, and only a few mutations affect protein folding. We found a trend that energy changes are higher for ALS compared to FTD mutations. The stability of the ALS mutants correlated well with the duration of disease progression as compared to FTD-ALS mutants. This study provides a comprehensive understanding of the mechanism of ALS and illustrates the significance of structure-energy based studies in differentiating ALS and FTD mutations. PMID:27318084

  19. Genome-based discovery, structure prediction and functional analysis of cyclic lipopeptide antibiotics in Pseudomonas species.

    PubMed

    de Bruijn, Irene; de Kock, Maarten J D; Yang, Meng; de Waard, Pieter; van Beek, Teris A; Raaijmakers, Jos M

    2007-01-01

    Analysis of microbial genome sequences have revealed numerous genes involved in antibiotic biosynthesis. In Pseudomonads, several gene clusters encoding non-ribosomal peptide synthetases (NRPSs) were predicted to be involved in the synthesis of cyclic lipopeptide (CLP) antibiotics. Most of these predictions, however, are untested and the association between genome sequence and biological function of the predicted metabolite is lacking. Here we report the genome-based identification of previously unknown CLP gene clusters in plant pathogenic Pseudomonas syringae strains B728a and DC3000 and in plant beneficial Pseudomonas fluorescens Pf0-1 and SBW25. For P. fluorescens SBW25, a model strain in studying bacterial evolution and adaptation, the structure of the CLP with a predicted 9-amino acid peptide moiety was confirmed by chemical analyses. Mutagenesis confirmed that the three identified NRPS genes are essential for CLP synthesis in strain SBW25. CLP production was shown to play a key role in motility, biofilm formation and in activity of SBW25 against zoospores of Phytophthora infestans. This is the first time that an antimicrobial metabolite is identified from strain SBW25. The results indicate that genome mining may enable the discovery of unknown gene clusters and traits that are highly relevant in the lifestyle of plant beneficial and plant pathogenic bacteria.

  20. Combining global and local measures for structure-based druggability predictions.

    PubMed

    Volkamer, Andrea; Kuhn, Daniel; Grombacher, Thomas; Rippmann, Friedrich; Rarey, Matthias

    2012-02-27

    Predicting druggability and prioritizing certain disease modifying targets for the drug development process is of high practical relevance in pharmaceutical research. DoGSiteScorer is a fully automatic algorithm for pocket and druggability prediction. Besides consideration of global properties of the pocket, also local similarities shared between pockets are reflected. Druggability scores are predicted by means of a support vector machine (SVM), trained, and tested on the druggability data set (DD) and its nonredundant version (NRDD). The DD consists of 1069 targets with assigned druggable, difficult, and undruggable classes. In 90% of the NRDD, the SVM model based on global descriptors correctly classifies a target as either druggable or undruggable. Nevertheless, global properties suffer from binding site changes due to ligand binding and from the pocket boundary definition. Therefore, local pocket properties are additionally investigated in terms of a nearest neighbor search. Local similarities are described by distance dependent histograms between atom pairs. In 88% of the DD pocket set, the nearest neighbor and the structure itself conform with their druggability type. A discriminant feature between druggable and undruggable pockets is having less short-range hydrophilic-hydrophilic pairs and more short-range lipophilic-lipophilic pairs. Our findings for global pocket descriptors coincide with previously published methods affirming that size, shape, and hydrophobicity are important global pocket descriptors for automatic druggability prediction. Nevertheless, the variety of pocket shapes and their flexibility upon ligand binding limit the automatic projection of druggable features onto descriptors. Incorporating local pocket properties is another step toward a reliable descriptor-based druggability prediction. PMID:22148551

  1. Prediction of contact residue pairs based on co-substitution between sites in protein structures.

    PubMed

    Miyazawa, Sanzo

    2013-01-01

    Residue-residue interactions that fold a protein into a unique three-dimensional structure and make it play a specific function impose structural and functional constraints in varying degrees on each residue site. Selective constraints on residue sites are recorded in amino acid orders in homologous sequences and also in the evolutionary trace of amino acid substitutions. A challenge is to extract direct dependences between residue sites by removing phylogenetic correlations and indirect dependences through other residues within a protein or even through other molecules. Rapid growth of protein families with unknown folds requires an accurate de novo prediction method for protein structure. Recent attempts of disentangling direct from indirect dependences of amino acid types between residue positions in multiple sequence alignments have revealed that inferred residue-residue proximities can be sufficient information to predict a protein fold without the use of known three-dimensional structures. Here, we propose an alternative method of inferring coevolving site pairs from concurrent and compensatory substitutions between sites in each branch of a phylogenetic tree. Substitution probability and physico-chemical changes (volume, charge, hydrogen-bonding capability, and others) accompanied by substitutions at each site in each branch of a phylogenetic tree are estimated with the likelihood of each substitution, and their direct correlations between sites are used to detect concurrent and compensatory substitutions. In order to extract direct dependences between sites, partial correlation coefficients of the characteristic changes along branches between sites, in which linear multiple dependences on feature vectors at other sites are removed, are calculated and used to rank coevolving site pairs. Accuracy of contact prediction based on the present coevolution score is comparable to that achieved by a maximum entropy model of protein sequences for 15 protein families

  2. A nonlinear viscoelastic approach to durability predictions for polymer based composite structures

    NASA Technical Reports Server (NTRS)

    Brinson, Hal F.; Hiel, C. C.

    1990-01-01

    Current industry approaches for the durability assessment of metallic structures are briefly reviewed. For polymer based composite structures, it is suggested that new approaches must be adopted to include memory or viscoelastic effects which could lead to delayed failures that might not be predicted using current techniques. A durability or accelerated life assessment plan for fiber reinforced plastics (FRP) developed and documented over the last decade or so is reviewed and discussed. Limitations to the plan are outlined and suggestions to remove the limitations are given. These include the development of a finite element code to replace the previously used lamination theory code and the development of new specimen geometries to evaluate delamination failures. The new DCB model is reviewed and results are presented. Finally, it is pointed out that new procedures are needed to determine interfacial properties and current efforts underway to determine such properties are reviewed. Suggestions for additional efforts to develop a consistent and accurate durability predictive approach for FRP structures is outlined.

  3. The Prediction of Botulinum Toxin Structure Based on in Silico and in Vitro Analysis

    NASA Astrophysics Data System (ADS)

    Suzuki, Tomonori; Miyazaki, Satoru

    2011-01-01

    Many of biological system mediated through protein-protein interactions. Knowledge of protein-protein complex structure is required for understanding the function. The determination of huge size and flexible protein-protein complex structure by experimental studies remains difficult, costly and five-consuming, therefore computational prediction of protein structures by homolog modeling and docking studies is valuable method. In addition, MD simulation is also one of the most powerful methods allowing to see the real dynamics of proteins. Here, we predict protein-protein complex structure of botulinum toxin to analyze its property. These bioinformatics methods are useful to report the relation between the flexibility of backbone structure and the activity.

  4. TFpredict and SABINE: sequence-based prediction of structural and functional characteristics of transcription factors.

    PubMed

    Eichner, Johannes; Topf, Florian; Dräger, Andreas; Wrzodek, Clemens; Wanke, Dierk; Zell, Andreas

    2013-01-01

    One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1) discriminates TFs from other proteins, (2) determines the structural superclass of TFs, (3) identifies the DNA-binding domains of TFs and (4) predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http://www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http://www.cogsys.cs.uni-tuebingen.de/software/SABINE/.

  5. Structure-based approach to the prediction of disulfide bonds in proteins.

    PubMed

    Salam, Noeris K; Adzhigirey, Matvey; Sherman, Woody; Pearlman, David A

    2014-10-01

    Protein engineering remains an area of growing importance in pharmaceutical and biotechnology research. Stabilization of a folded protein conformation is a frequent goal in projects that deal with affinity optimization, enzyme design, protein construct design, and reducing the size of functional proteins. Indeed, it can be desirable to assess and improve protein stability in order to avoid liabilities such as aggregation, degradation, and immunogenic response that may arise during development. One way to stabilize a protein is through the introduction of disulfide bonds. Here, we describe a method to predict pairs of protein residues that can be mutated to form a disulfide bond. We combine a physics-based approach that incorporates implicit solvent molecular mechanics with a knowledge-based approach. We first assign relative weights to the terms that comprise our scoring function using a genetic algorithm applied to a set of 75 wild-type structures that each contains a disulfide bond. The method is then tested on a separate set of 13 engineered proteins comprising 15 artificial stabilizing disulfides introduced via site-directed mutagenesis. We find that the native disulfide in the wild-type proteins is scored well, on average (within the top 6% of the reasonable pairs of residues that could form a disulfide bond) while 6 out of the 15 artificial stabilizing disulfides scored within the top 13% of ranked predictions. Overall, this suggests that the physics-based approach presented here can be useful for triaging possible pairs of mutations for disulfide bond formation to improve protein stability.

  6. Thermodynamic Properties of Asphaltenes: A Predictive Approach Based On Computer Assisted Structure Elucidation and Atomistic Simulations

    SciTech Connect

    Diallo, Mamadou S.; Cagin, Tahir; Faulon, Jean Loup; Goddard, William A.

    2000-08-01

    The authors describe a new methodology for predicting the thermodynamic properties of petroleum geomacromolecules (asphaltenes and resins). This methodology combines computer assisted structure elucidation (CASE) with atomistic simulations (molecular mechanics and molecular dynamics and statistical mechanics). They use quantitative and qualitative structural data as input to a CASE program (SIGNATURE) to generate a sample of ten asphaltene model structures for a Saudi crude oil (Arab Berri). MM calculations and MD simulations are used to estimate selected volumetric and thermal properties of the model structures.

  7. Combined computational metabolite prediction and automated structure-based analysis of mass spectrometric data.

    PubMed

    Stranz, David D; Miao, Shichang; Campbell, Scott; Maydwell, George; Ekins, Sean

    2008-01-01

    ABSTRACT As high-throughput technologies have developed in the pharmaceutical industry, the demand for identification of possible metabolites using predominantly liquid chromatographic/mass spectrometry-mass spectrometry/mass spectrometry (LC/MS-MS/MS) for a large number of molecules in drug discovery has also increased. In parallel, computational technologies have also been developed to generate predictions for metabolites alongside methods to predict MS spectra and score the quality of the match with experimental spectra. The goal of the current study was to generate metabolite predictions from molecular structure with a software product, MetaDrug. In vitro microsomal incubations were used to ultimately produce MS data that could be used to verify the predictions with Apex, which is a new software tool that can predict the molecular ion spectrum and a fragmentation spectrum, automating the detailed examination of both MS and MS/MS spectra. For the test molecule imipramine used to illustrate the combined in vitro/in silico process proposed, MetaDrug predicts 16 metabolites. Following rat microsomal incubations with imipramine and analysis of the MS(n) data using the Apex software, strong evidence was found for imipramine and five metabolites and weaker evidence for five additional metabolites. This study suggests a new approach to streamline MS data analysis using a combination of predictive computational approaches with software capable of comparing the predicted metabolite output with empirical data when looking at drug metabolites.

  8. Prediction of protein secondary structure based on residue pair types and conformational states using dynamic programming algorithm.

    PubMed

    Sadeghi, Mehdi; Parto, Sahar; Arab, Shahriar; Ranjbar, Bijan

    2005-06-20

    We have used a statistical approach for protein secondary structure prediction based on information theory and simultaneously taking into consideration pairwise residue types and conformational states. Since the prediction of residue secondary structure by one residue window sliding make ambiguity in state prediction, we used a dynamic programming algorithm to find the path with maximum score. A score system for residue pairs in particular conformations is derived for adjacent neighbors up to ten residue apart in sequence. The three state overall per-residue accuracy, Q3, of this method in a jackknife test with dataset created from PDBSELECT is more than 70%.

  9. Protein Tertiary Structure Prediction Based on Main Chain Angle Using a Hybrid Bees Colony Optimization Algorithm

    NASA Astrophysics Data System (ADS)

    Mahmood, Zakaria N.; Mahmuddin, Massudi; Mahmood, Mohammed Nooraldeen

    Encoding proteins of amino acid sequence to predict classified into their respective families and subfamilies is important research area. However for a given protein, knowing the exact action whether hormonal, enzymatic, transmembranal or nuclear receptors does not depend solely on amino acid sequence but on the way the amino acid thread folds as well. This study provides a prototype system that able to predict a protein tertiary structure. Several methods are used to develop and evaluate the system to produce better accuracy in protein 3D structure prediction. The Bees Optimization algorithm which inspired from the honey bees food foraging method, is used in the searching phase. In this study, the experiment is conducted on short sequence proteins that have been used by the previous researches using well-known tools. The proposed approach shows a promising result.

  10. Predictions of Crystal Structure Based on Radius Ratio: How Reliable Are They?

    ERIC Educational Resources Information Center

    Nathan, Lawrence C.

    1985-01-01

    Discussion of crystalline solids in undergraduate curricula often includes the use of radius ratio rules as a method for predicting which type of crystal structure is likely to be adopted by a given ionic compound. Examines this topic, establishing more definitive guidelines for the use and reliability of the rules. (JN)

  11. Local structure based method for prediction of the biochemical function of proteins: Applications to glycoside hydrolases.

    PubMed

    Parasuram, Ramya; Mills, Caitlyn L; Wang, Zhouxi; Somasundaram, Saroja; Beuning, Penny J; Ondrechen, Mary Jo

    2016-01-15

    Thousands of protein structures of unknown or uncertain function have been reported as a result of high-throughput structure determination techniques developed by Structural Genomics (SG) projects. However, many of the putative functional assignments of these SG proteins in the Protein Data Bank (PDB) are incorrect. While high-throughput biochemical screening techniques have provided valuable functional information for limited sets of SG proteins, the biochemical functions for most SG proteins are still unknown or uncertain. Therefore, computational methods for the reliable prediction of protein function from structure can add tremendous value to the existing SG data. In this article, we show how computational methods may be used to predict the function of SG proteins, using examples from the six-hairpin glycosidase (6-HG) and the concanavalin A-like lectin/glucanase (CAL/G) superfamilies. Using a set of predicted functional residues, obtained from computed electrostatic and chemical properties for each protein structure, it is shown that these superfamilies may be sorted into functional families according to biochemical function. Within these superfamilies, a total of 18 SG proteins were analyzed according to their predicted, local functional sites: 13 from the 6-HG superfamily, five from the CAL/G superfamily. Within the 6-HG superfamily, an uncharacterized protein BACOVA_03626 from Bacteroides ovatus (PDB 3ON6) and a hypothetical protein BT3781 from Bacteroides thetaiotaomicron (PDB 2P0V) are shown to have very strong active site matches with exo-α-1,6-mannosidases, thus likely possessing this function. Also in this superfamily, it is shown that protein BH0842, a putative glycoside hydrolase from Bacillus halodurans (PDB 2RDY), has a predicted active site that matches well with a known α-L-galactosidase. In the CAL/G superfamily, an uncharacterized glycosyl hydrolase family 16 protein from Mycobacterium smegmatis (PDB 3RQ0) is shown to have local structural

  12. Fine-grained parallelism accelerating for RNA secondary structure prediction with pseudoknots based on FPGA.

    PubMed

    Xia, Fei; Jin, Guoqing

    2014-06-01

    PKNOTS is a most famous benchmark program and has been widely used to predict RNA secondary structure including pseudoknots. It adopts the standard four-dimensional (4D) dynamic programming (DP) method and is the basis of many variants and improved algorithms. Unfortunately, the O(N(6)) computing requirements and complicated data dependency greatly limits the usefulness of PKNOTS package with the explosion in gene database size. In this paper, we present a fine-grained parallel PKNOTS package and prototype system for accelerating RNA folding application based on FPGA chip. We adopted a series of storage optimization strategies to resolve the "Memory Wall" problem. We aggressively exploit parallel computing strategies to improve computational efficiency. We also propose several methods that collectively reduce the storage requirements for FPGA on-chip memory. To the best of our knowledge, our design is the first FPGA implementation for accelerating 4D DP problem for RNA folding application including pseudoknots. The experimental results show a factor of more than 50x average speedup over the PKNOTS-1.08 software running on a PC platform with Intel Core2 Q9400 Quad CPU for input RNA sequences. However, the power consumption of our FPGA accelerator is only about 50% of the general-purpose micro-processors.

  13. Structural MRI-Based Predictions in Patients with Treatment-Refractory Depression (TRD)

    PubMed Central

    Johnston, Blair A.; Steele, J. Douglas; Tolomeo, Serenella; Christmas, David; Matthews, Keith

    2015-01-01

    The application of machine learning techniques to psychiatric neuroimaging offers the possibility to identify robust, reliable and objective disease biomarkers both within and between contemporary syndromal diagnoses that could guide routine clinical practice. The use of quantitative methods to identify psychiatric biomarkers is consequently important, particularly with a view to making predictions relevant to individual patients, rather than at a group-level. Here, we describe predictions of treatment-refractory depression (TRD) diagnosis using structural T1-weighted brain scans obtained from twenty adult participants with TRD and 21 never depressed controls. We report 85% accuracy of individual subject diagnostic prediction. Using an automated feature selection method, the major brain regions supporting this significant classification were in the caudate, insula, habenula and periventricular grey matter. It was not, however, possible to predict the degree of ‘treatment resistance’ in individual patients, at least as quantified by the Massachusetts General Hospital (MGH-S) clinical staging method; but the insula was again identified as a region of interest. Structural brain imaging data alone can be used to predict diagnostic status, but not MGH-S staging, with a high degree of accuracy in patients with TRD. PMID:26186455

  14. Structural MRI-Based Predictions in Patients with Treatment-Refractory Depression (TRD).

    PubMed

    Johnston, Blair A; Steele, J Douglas; Tolomeo, Serenella; Christmas, David; Matthews, Keith

    2015-01-01

    The application of machine learning techniques to psychiatric neuroimaging offers the possibility to identify robust, reliable and objective disease biomarkers both within and between contemporary syndromal diagnoses that could guide routine clinical practice. The use of quantitative methods to identify psychiatric biomarkers is consequently important, particularly with a view to making predictions relevant to individual patients, rather than at a group-level. Here, we describe predictions of treatment-refractory depression (TRD) diagnosis using structural T1-weighted brain scans obtained from twenty adult participants with TRD and 21 never depressed controls. We report 85% accuracy of individual subject diagnostic prediction. Using an automated feature selection method, the major brain regions supporting this significant classification were in the caudate, insula, habenula and periventricular grey matter. It was not, however, possible to predict the degree of 'treatment resistance' in individual patients, at least as quantified by the Massachusetts General Hospital (MGH-S) clinical staging method; but the insula was again identified as a region of interest. Structural brain imaging data alone can be used to predict diagnostic status, but not MGH-S staging, with a high degree of accuracy in patients with TRD.

  15. Evaluation of machine learning algorithms for treatment outcome prediction in patients with epilepsy based on structural connectome data.

    PubMed

    Munsell, Brent C; Wee, Chong-Yaw; Keller, Simon S; Weber, Bernd; Elger, Christian; da Silva, Laura Angelica Tomaz; Nesland, Travis; Styner, Martin; Shen, Dinggang; Bonilha, Leonardo

    2015-09-01

    The objective of this study is to evaluate machine learning algorithms aimed at predicting surgical treatment outcomes in groups of patients with temporal lobe epilepsy (TLE) using only the structural brain connectome. Specifically, the brain connectome is reconstructed using white matter fiber tracts from presurgical diffusion tensor imaging. To achieve our objective, a two-stage connectome-based prediction framework is developed that gradually selects a small number of abnormal network connections that contribute to the surgical treatment outcome, and in each stage a linear kernel operation is used to further improve the accuracy of the learned classifier. Using a 10-fold cross validation strategy, the first stage in the connectome-based framework is able to separate patients with TLE from normal controls with 80% accuracy, and second stage in the connectome-based framework is able to correctly predict the surgical treatment outcome of patients with TLE with 70% accuracy. Compared to existing state-of-the-art methods that use VBM data, the proposed two-stage connectome-based prediction framework is a suitable alternative with comparable prediction performance. Our results additionally show that machine learning algorithms that exclusively use structural connectome data can predict treatment outcomes in epilepsy with similar accuracy compared with "expert-based" clinical decision. In summary, using the unprecedented information provided in the brain connectome, machine learning algorithms may uncover pathological changes in brain network organization and improve outcome forecasting in the context of epilepsy.

  16. Prediction of protein structural features from sequence data based on Shannon entropy and Kolmogorov complexity.

    PubMed

    Bywater, Robert Paul

    2015-01-01

    While the genome for a given organism stores the information necessary for the organism to function and flourish it is the proteins that are encoded by the genome that perhaps more than anything else characterize the phenotype for that organism. It is therefore not surprising that one of the many approaches to understanding and predicting protein folding and properties has come from genomics and more specifically from multiple sequence alignments. In this work I explore ways in which data derived from sequence alignment data can be used to investigate in a predictive way three different aspects of protein structure: secondary structures, inter-residue contacts and the dynamics of switching between different states of the protein. In particular the use of Kolmogorov complexity has identified a novel pathway towards achieving these goals.

  17. Damage Prediction and Estimation in Structural Mechanics Based on Data Mining

    SciTech Connect

    Sandhu, S S; Kanapady, R; Tamma, K K; Kamath, C; Kumar, V

    2001-07-23

    Damage in a material includes localized softening or cracks in a structural component due to high operational loads, or the presence of flaws in a structure due to various manufacturing processes. Methods that identify the presence, the location and the severity of damage in the structure are useful for non-destructive evaluation procedures that are typically employed in agile manufacturing and rapid prototyping systems. The current state-of-the art techniques for these inverse problems are computationally intensive or ill conditioned when insufficient data exists. Early work by a number of researchers has shown that data mining techniques can provide a potential solution to this problem. In this paper, they investigate the use of data mining techniques for predicting failure in a variety of 2D and 3D structures using artificial neural networks (ANNs) and decision trees. This work shows that if the correct features are chosen to build the model, and the model is trained on an adequate amount of data, the model can then correctly classify the failure event as well as predict location and severity of the damage in these structures.

  18. Evaluation of machine learning algorithms for treatment outcome prediction in patients with epilepsy based on structural connectome data

    PubMed Central

    Munsell, Brent C.; Wee, Chong-Yaw; Keller, Simon S.; Weber, Bernd; Elger, Christian; da Silva, Laura Angelica Tomaz; Nesland, Travis; Styner, Martin; Shen, Dinggang; Bonilha, Leonardo

    2015-01-01

    The objective of this study is to evaluate machine learning algorithms aimed at predicting surgical treatment outcomes in groups of patients with temporal lobe epilepsy (TLE) using only the structural brain connectome. Specifically, the brain connectome is reconstructed using white matter fiber tracts from presurgical diffusion tensor imaging. To achieve our objective, a two-stage connectome-based prediction framework is developed that gradually selects a small number of abnormal network connections that contribute to the surgical treatment outcome, and in each stage a linear kernel operation is used to further improve the accuracy of the learned classifier. Using a 10-fold cross validation strategy, the first stage in the connectome-based framework is able to separate patients with TLE from normal controls with 80% accuracy, and second stage in the connectome-based framework is able to correctly predict the surgical treatment outcome of patients with TLE with 70% accuracy. Compared to existing state-of-the-art methods that use VBM data, the proposed two-stage connectome-based prediction framework is a suitable alternative with comparable prediction performance. Our results additionally show that machine learning algorithms that exclusively use structural connectome data can predict treatment outcomes in epilepsy with similar accuracy compared with “expert-based” clinical decision. In summary, using the unprecedented information provided in the brain connectome, machine learning algorithms may uncover pathological changes in brain network organization and improve outcome forecasting in the context of epilepsy. PMID:26054876

  19. CRYSpred: accurate sequence-based protein crystallization propensity prediction using sequence-derived structural characteristics.

    PubMed

    Mizianty, Marcin J; Kurgan, Lukasz A

    2012-01-01

    Relatively low success rates of X-ray crystallography, which is the most popular method for solving proteins structures, motivate development of novel methods that support selection of tractable protein targets. This aspect is particularly important in the context of the current structural genomics efforts that allow for a certain degree of flexibility in the target selection. We propose CRYSpred, a novel in-silico crystallization propensity predictor that uses a set of 15 novel features which utilize a broad range of inputs including charge, hydrophobicity, and amino acid composition derived from the protein chain, and the solvent accessibility and disorder predicted from the protein sequence. Our method outperforms seven modern crystallization propensity predictors on three, independent from training dataset, benchmark test datasets. The strong predictive performance offered by the CRYSpred is attributed to the careful design of the features, utilization of the comprehensive set of inputs, and the usage of the Support Vector Machine classifier. The inputs utilized by CRYSpred are well-aligned with the existing rules-of-thumb that are used in the structural genomics studies. PMID:21919861

  20. CRYSpred: accurate sequence-based protein crystallization propensity prediction using sequence-derived structural characteristics.

    PubMed

    Mizianty, Marcin J; Kurgan, Lukasz A

    2012-01-01

    Relatively low success rates of X-ray crystallography, which is the most popular method for solving proteins structures, motivate development of novel methods that support selection of tractable protein targets. This aspect is particularly important in the context of the current structural genomics efforts that allow for a certain degree of flexibility in the target selection. We propose CRYSpred, a novel in-silico crystallization propensity predictor that uses a set of 15 novel features which utilize a broad range of inputs including charge, hydrophobicity, and amino acid composition derived from the protein chain, and the solvent accessibility and disorder predicted from the protein sequence. Our method outperforms seven modern crystallization propensity predictors on three, independent from training dataset, benchmark test datasets. The strong predictive performance offered by the CRYSpred is attributed to the careful design of the features, utilization of the comprehensive set of inputs, and the usage of the Support Vector Machine classifier. The inputs utilized by CRYSpred are well-aligned with the existing rules-of-thumb that are used in the structural genomics studies.

  1. Decomposing cerebral blood flow MRI into functional and structural components: A non-local approach based on prediction

    PubMed Central

    Kandel, Benjamin M.; Wang, Danny JJ; Detre, John A.; Gee, James C.; Avants, Brian B.

    2014-01-01

    We present RIPMMARC (Rotation Invariant Patch-based Multi-Modality Analysis aRChitecture), a flexible and widely applicable method for extracting information unique to a given modality from a multi-modal data set. We use RIPMMARC to improve interpretation of arterial spin labeling (ASL) perfusion images by removing the component of perfusion that is predicted by the underlying anatomy. Using patch-based, rotation invariant descriptors derived from the anatomical image, we learn a predictive relationship between local neuroanatomical structure and the corresponding perfusion image. This relation allows us to produce an image of perfusion that would be predicted given only the underlying anatomy and a residual image that represents perfusion information that cannot be predicted by anatomical features. Our learned structural features are significantly better at predicting brain perfusion than tissue probability maps, which are the input to standard partial volume correction techniques. Studies in test-retest data show that both the anatomically predicted and residual perfusion signal are highly replicable for a given subject. In a pediatric population, both the raw perfusion and structurally predicted images are tightly linked to age throughout adolescence throughout the brain. Interestingly, the residual perfusion also shows a strong correlation with age in select regions including the hippocampi (corr= 0.38, p-value < 10−6), precuneus (corr= −0.44, p < 10−5), and combined default mode network regions (corr= −0.45, p < 10−8) that is independent of global anatomy-perfusion trends. This finding suggests that there is a regionally heterogeneous pattern of functional specialization that is distinct from that of cortical structural development. PMID:25449745

  2. A Structure Based Model for the Prediction of Phospholipidosis Induction Potential of Small Molecules

    PubMed Central

    Sun, Hongmao; Shahane, Sampada; Xia, Menghang; Austin, Christopher P.; Huang, Ruili

    2012-01-01

    Drug-induced phospholipidosis (PLD), characterized by an intracellular accumulation of phospholipids and formation of concentric lamellar bodies, has raised concerns in the drug discovery community, due to its potential adverse effects. To evaluate the PLD induction potential, 4,161 non-redundant drug-like molecules from the National Institutes of Health Chemical Genomics Center (NCGC) Pharmaceutical Collection (NPC), the Library of Pharmacologically Active Compounds (LOPAC) and the Tocris Biosciences collection were screened in a quantitative high-throughput screening (qHTS) format. The potential of drug-lipid complex formation can be linked directly to the structures of drug molecules, and many PLD inducing drugs were found to share common structural features. Support vector machine (SVM) models were constructed by using customized atom types or Molecular Operating Environment (MOE) 2D descriptors as structural descriptors. Either the compounds from LOPAC or randomly selected from the entire dataset were used as the training set. The impact of training data with biased structural features and the impact of molecule descriptors emphasizing whole-molecule properties or detailed functional groups at the atom level on model performance were analyzed and discussed. Rebalancing strategies were applied to improve the predictive power of the SVM models. Using the under-sampling method, the consensus model using one third of the compounds randomly selected from the data set as the training set achieved high accuracy of 0.90 in predicting the remaining two thirds of the compounds constituting the test set, as measured by the area under the receiver operator characteristic curve (AUC-ROC). PMID:22725677

  3. HMM-based prediction for protein structural motifs' two local properties: solvent accessibility and backbone torsion angles.

    PubMed

    Yu, Jianyong; Xiang, Leijun; Hong, Jiang; Zhang, Weidong

    2013-02-01

    Protein structure prediction is often assisted by predicting one-dimensional structural properties including relative solvent accessibility (RSA) surface and backbone torsion angles (BTA) of residues, and these two properties are continuously varying variables because proteins can move freely in a three-dimensional space. Instead of subdividing them into a few arbitrarily defined states that many popular approaches used, this paper proposes an integrated system for realvalue prediction of protein structural motifs' two local properties, based on the modified Hidden Markov Model that we previously presented. The model was used to capture the relevance of RSA and the dependency of BTA between adjacent residues along the local protein chain in motifs with definite probabilities. These two properties were predicted according to their own probability distribution. The method was applied to a protein fragment library. For nine different classes of motifs, real values of RSA were predicted with mean absolute error (MAE) of 0.122-0.175 and Pearson's correlation coefficient (PCC) of 0.623-0.714 between predicted and actual RSA. Meanwhile, real values of BTA were obtained with MAE of 8.5⁰-29.4⁰ for Φ angles, 11.2⁰-38.5⁰ for ψ angles and PCC of 0.601-0.716 for Φ, 0.597-0.713 for ψ. The results were compared with well-known Real-SPINE Server, and indicate the proposed method may at least serve as the foundation to obtain better local properties from structural motifs for protein structure prediction. PMID:22894152

  4. Altered sphingoid base profiles predict compromised membrane structure and permeability in atopic dermatitis

    PubMed Central

    Loiseau, Nicolas; Obata, Yasuko; Moradian, Sam; Sano, Hiromu; Yoshino, Saeko; Aburai, Kenichi; Takayama, Kozo; Sakamoto, Kazutami; Holleran, Walter M.; Elias, Peter M.; Uchida, Yoshikazu

    2013-01-01

    Background Ceramide hydrolysis by ceramidase in the stratum corneum (SC) yields both sphingoid bases and free fatty acids (FFA). While FFA are key constituents of the lamellar bilayers that mediate the epidermal permeability barrier, whether sphingoid bases influence permeability barrier homeostasis remains unknown. Pertinently, alterations of lipid profile, including ceramide and ceramidase activities occur in atopic dermatitis (AD). Object We investigated alterations in sphingoid base levels and/or profiles (sphingosine to sphinganine ratio) in the SC of normal vs. AD mice, a model that faithfully replicates human AD, and then whether altered sphingoid base levels and/or profiles influence(s) membrane stability and/or structures. Methods Unilamellar vesicles (LV), incorporating the three major SC lipids (ceramides/FFA/cholesterol) and different ratios of sphingosine/sphinganine, encapsulating carboxyfluorescein, were used as the model of SC lipids. Membrane stability was measured as release of carboxyfluorescein. Thermal analysis of LV was conducted by Differential scanning calorimetry (DSC). Results LV containing AD levels of sphingosine/sphinganine (AD-LV) displayed altered membrane permeability vs. normal-LV. DSC analyses revealed decreases in orthorhombic structures that form tightly-packed lamellar structures in AD-LV. Conclusion Sphingoid base composition influences lamellar membrane architecture in SC, suggesting that altered sphingoid base profiles could contribute to the barrier abnormality in AD. PMID:24070864

  5. Structure-based comparative analysis and prediction of N-linked glycosylation sites in evolutionarily distant eukaryotes.

    PubMed

    Lam, Phuc Vinh Nguyen; Goldman, Radoslav; Karagiannis, Konstantinos; Narsule, Tejas; Simonyan, Vahan; Soika, Valerii; Mazumder, Raja

    2013-04-01

    The asparagine-X-serine/threonine (NXS/T) motif, where X is any amino acid except proline, is the consensus motif for N-linked glycosylation. Significant numbers of high-resolution crystal structures of glycosylated proteins allow us to carry out structural analysis of the N-linked glycosylation sites (NGS). Our analysis shows that there is enough structural information from diverse glycoproteins to allow the development of rules which can be used to predict NGS. A Python-based tool was developed to investigate asparagines implicated in N-glycosylation in five species: Homo sapiens, Mus musculus, Drosophila melanogaster, Arabidopsis thaliana and Saccharomyces cerevisiae. Our analysis shows that 78% of all asparagines of NXS/T motif involved in N-glycosylation are localized in the loop/turn conformation in the human proteome. Similar distribution was revealed for all the other species examined. Comparative analysis of the occurrence of NXS/T motifs not known to be glycosylated and their reverse sequence (S/TXN) shows a similar distribution across the secondary structural elements, indicating that the NXS/T motif in itself is not biologically relevant. Based on our analysis, we have defined rules to determine NGS. Using machine learning methods based on these rules we can predict with 93% accuracy if a particular site will be glycosylated. If structural information is not available the tool uses structural prediction results resulting in 74% accuracy. The tool was used to identify glycosylation sites in 108 human proteins with structures and 2247 proteins without structures that have acquired NXS/T site/s due to non-synonymous variation. The tool, Structure Feature Analysis Tool (SFAT), is freely available to the public at http://hive.biochemistry.gwu.edu/tools/sfat. PMID:23459159

  6. High pressure structures of "111" type iron-based superconductors predicted from first-principles.

    PubMed

    Zhang, Xinxin; Wang, Yanchao; Ma, Yanming

    2012-11-21

    The high-pressure crystal structures of the "111" type iron-based superconductors: NaFeAs, LiFeP and LiFeAs have been systematically explored by using particle-swarm structural searches. It was found that though these iron-based superconductors are chemically similar, they adopted distinct structural phase transitions: P4/nmm→Cmcm→P3m1 for NaFeAs, P4/nmm→Cmcm→I4mm for LiFeP, and P4/nmm→P3m1 →I4mm→P6(3)/mmc for LiFeAs under high pressure. The high pressure orthorhombic Cmcm phase preserved the structural features of FeX(4)(X = As, P) tetrahedral layers present in the ambient-pressure P4/nmm structure. However, the FeX(4) tetrahedrons in the Cmcm phase were clearly distorted, leading to changes in the electronic behavior around the Fermi level. Under higher pressures, the FeX(4) layered structural features were no longer persistent and three-dimensional crystal structures were stabilized in other P3m1, I4mm, and P6(3)/mmc phases, which featured FeAs(5)/FeAs(6) hexahedron and octahedrons, FeX(5) tetragonal pyramids, and FeAs(6) octahedrons, respectively. Analysis of the electronic density of states suggests that most of the high pressure phases are metallic except for the tetragonal I4mm phase, which possesses a narrow band gap. This semiconducting state might relate to the tetragonal pyramid structure formed by FeX(5) unit, which might be favorable for charge localization.

  7. Information theory-based scoring function for the structure-based prediction of protein-ligand binding affinity.

    PubMed

    Kulharia, Mahesh; Goody, Roger S; Jackson, Richard M

    2008-10-01

    The development and validation of a new knowledge based scoring function (SIScoreJE) to predict binding energy between proteins and ligands is presented. SIScoreJE efficiently predicts the binding energy between a small molecule and its protein receptor. Protein-ligand atomic contact information was derived from a Non-Redundant Data set (NRD) of over 3000 X-ray crystal structures of protein-ligand complexes. This information was classified for individual "atom contact pairs" (ACP) which is used to calculate the atomic contact preferences. In addition to the two schemes generated in this study we have assessed a number of other common atom-type classification schemes. The preferences were calculated using an information theoretic relationship of joint entropy. Among 18 different atom-type classification schemes "ScoreJE Atom Type set2" (SATs2) was found to be the most suitable for our approach. To test the sensitivity of the method to the inclusion of solvent, Single-body Solvation Potentials (SSP) were also derived from the atomic contacts between the protein atom types and water molecules modeled using AQUARIUS2. Validation was carried out using an evaluation data set of 100 protein-ligand complexes with known binding energies to test the ability of the scoring functions to reproduce known binding affinities. In summary, it was found that a combined SSP/ScoreJE (SIScoreJE) performed significantly better than ScoreJE alone, and SIScoreJE and ScoreJE performed better than GOLD::GoldScore, GOLD::ChemScore, and XScore.

  8. Physics-based protein-structure prediction using a hierarchical protocol based on the UNRES force field: assessment in two blind tests.

    PubMed

    Ołdziej, S; Czaplewski, C; Liwo, A; Chinchio, M; Nanias, M; Vila, J A; Khalili, M; Arnautova, Y A; Jagielska, A; Makowski, M; Schafroth, H D; Kaźmierkiewicz, R; Ripoll, D R; Pillardy, J; Saunders, J A; Kang, Y K; Gibson, K D; Scheraga, H A

    2005-05-24

    Recent improvements in the protein-structure prediction method developed in our laboratory, based on the thermodynamic hypothesis, are described. The conformational space is searched extensively at the united-residue level by using our physics-based UNRES energy function and the conformational space annealing method of global optimization. The lowest-energy coarse-grained structures are then converted to an all-atom representation and energy-minimized with the ECEPP/3 force field. The procedure was assessed in two recent blind tests of protein-structure prediction. During the first blind test, we predicted large fragments of alpha and alpha+beta proteins [60-70 residues with C(alpha) rms deviation (rmsd) <6 A]. However, for alpha+beta proteins, significant topological errors occurred despite low rmsd values. In the second exercise, we predicted whole structures of five proteins (two alpha and three alpha+beta, with sizes of 53-235 residues) with remarkably good accuracy. In particular, for the genomic target TM0487 (a 102-residue alpha+beta protein from Thermotoga maritima), we predicted the complete, topologically correct structure with 7.3-A C(alpha) rmsd. So far this protein is the largest alpha+beta protein predicted based solely on the amino acid sequence and a physics-based potential-energy function and search procedure. For target T0198, a phosphate transport system regulator PhoU from T. maritima (a 235-residue mainly alpha-helical protein), we predicted the topology of the whole six-helix bundle correctly within 8 A rmsd, except the 32 C-terminal residues, most of which form a beta-hairpin. These and other examples described in this work demonstrate significant progress in physics-based protein-structure prediction.

  9. Physics-based protein-structure prediction using a hierarchical protocol based on the UNRES force field: Assessment in two blind tests

    PubMed Central

    Ołdziej, S.; Czaplewski, C.; Liwo, A.; Chinchio, M.; Nanias, M.; Vila, J. A.; Khalili, M.; Arnautova, Y. A.; Jagielska, A.; Makowski, M.; Schafroth, H. D.; Kaźmierkiewicz, R.; Ripoll, D. R.; Pillardy, J.; Saunders, J. A.; Kang, Y. K.; Gibson, K. D.; Scheraga, H. A.

    2005-01-01

    Recent improvements in the protein-structure prediction method developed in our laboratory, based on the thermodynamic hypothesis, are described. The conformational space is searched extensively at the united-residue level by using our physics-based UNRES energy function and the conformational space annealing method of global optimization. The lowest-energy coarse-grained structures are then converted to an all-atom representation and energy-minimized with the ECEPP/3 force field. The procedure was assessed in two recent blind tests of protein-structure prediction. During the first blind test, we predicted large fragments of α and α+β proteins [60–70 residues with Cα rms deviation (rmsd) <6 Å]. However, for α+β proteins, significant topological errors occurred despite low rmsd values. In the second exercise, we predicted whole structures of five proteins (two α and three α+β, with sizes of 53–235 residues) with remarkably good accuracy. In particular, for the genomic target TM0487 (a 102-residue α+β protein from Thermotoga maritima), we predicted the complete, topologically correct structure with 7.3-Å Cα rmsd. So far this protein is the largest α+β protein predicted based solely on the amino acid sequence and a physics-based potential-energy function and search procedure. For target T0198, a phosphate transport system regulator PhoU from T. maritima (a 235-residue mainly α-helical protein), we predicted the topology of the whole six-helix bundle correctly within 8 Å rmsd, except the 32 C-terminal residues, most of which form a β-hairpin. These and other examples described in this work demonstrate significant progress in physics-based protein-structure prediction. PMID:15894609

  10. 3D Structure Prediction of Human β1-Adrenergic Receptor via Threading-Based Homology Modeling for Implications in Structure-Based Drug Designing

    PubMed Central

    Ul-Haq, Zaheer; Saeed, Maria; Halim, Sobia Ahsan; Khan, Waqasuddin

    2015-01-01

    Dilated cardiomyopathy is a disease of left ventricular dysfunction accompanied by impairment of the β1-adrenergic receptor (β1-AR) signal cascade. The disturbed β1-AR function may be based on an elevated sympathetic tone observed in patients with heart failure. Prolonged adrenergic stimulation may induce metabolic and electrophysiological disturbances in the myocardium, resulting in tachyarrhythmia that leads to the development of heart failure in human and sudden death. Hence, β1-AR is considered as a promising drug target but attempts to develop effective and specific drug against this tempting pharmaceutical target is slowed down due to the lack of 3D structure of Homo sapiens β1-AR (hsβADR1). This study encompasses elucidation of 3D structural and physicochemical properties of hsβADR1 via threading-based homology modeling. Furthermore, the docking performance of several docking programs including Surflex-Dock, FRED, and GOLD were validated by re-docking and cross-docking experiments. GOLD and Surflex-Dock performed best in re-docking and cross docking experiments, respectively. Consequently, Surflex-Dock was used to predict the binding modes of four hsβADR1 agonists. This study provides clear understanding of hsβADR1 structure and its binding mechanism, thus help in providing the remedial solutions of cardiovascular, effective treatment of asthma and other diseases caused by malfunctioning of the target protein. PMID:25860348

  11. 3D structure prediction of human β1-adrenergic receptor via threading-based homology modeling for implications in structure-based drug designing.

    PubMed

    Ul-Haq, Zaheer; Saeed, Maria; Halim, Sobia Ahsan; Khan, Waqasuddin

    2015-01-01

    Dilated cardiomyopathy is a disease of left ventricular dysfunction accompanied by impairment of the β1-adrenergic receptor (β1-AR) signal cascade. The disturbed β1-AR function may be based on an elevated sympathetic tone observed in patients with heart failure. Prolonged adrenergic stimulation may induce metabolic and electrophysiological disturbances in the myocardium, resulting in tachyarrhythmia that leads to the development of heart failure in human and sudden death. Hence, β1-AR is considered as a promising drug target but attempts to develop effective and specific drug against this tempting pharmaceutical target is slowed down due to the lack of 3D structure of Homo sapiens β1-AR (hsβADR1). This study encompasses elucidation of 3D structural and physicochemical properties of hsβADR1 via threading-based homology modeling. Furthermore, the docking performance of several docking programs including Surflex-Dock, FRED, and GOLD were validated by re-docking and cross-docking experiments. GOLD and Surflex-Dock performed best in re-docking and cross docking experiments, respectively. Consequently, Surflex-Dock was used to predict the binding modes of four hsβADR1 agonists. This study provides clear understanding of hsβADR1 structure and its binding mechanism, thus help in providing the remedial solutions of cardiovascular, effective treatment of asthma and other diseases caused by malfunctioning of the target protein. PMID:25860348

  12. Structural Bioinformatics-Based Prediction of Exceptional Selectivity of p38 MAP Kinase Inhibitor PH-797804

    SciTech Connect

    Xing, Li; Shieh, Huey S.; Selness, Shaun R.; Devraj, Rajesh V.; Walker, John K.; Devadas, Balekudru; Hope, Heidi R.; Compton, Robert P.; Schindler, John F.; Hirsch, Jeffrey L.; Benson, Alan G.; Kurumbail, Ravi G.; Stegeman, Roderick A.; Williams, Jennifer M.; Broadus, Richard M.; Walden, Zara; Monahan, Joseph B.; Pfizer

    2009-07-24

    PH-797804 is a diarylpyridinone inhibitor of p38{alpha} mitogen-activated protein (MAP) kinase derived from a racemic mixture as the more potent atropisomer (aS), first proposed by molecular modeling and subsequently confirmed by experiments. On the basis of structural comparison with a different biaryl pyrazole template and supported by dozens of high-resolution crystal structures of p38{alpha} inhibitor complexes, PH-797804 is predicted to possess a high level of specificity across the broad human kinase genome. We used a structural bioinformatics approach to identify two selectivity elements encoded by the TXXXG sequence motif on the p38{alpha} kinase hinge: (i) Thr106 that serves as the gatekeeper to the buried hydrophobic pocket occupied by 2,4-difluorophenyl of PH-797804 and (ii) the bidentate hydrogen bonds formed by the pyridinone moiety with the kinase hinge requiring an induced 180{sup o} rotation of the Met109-Gly110 peptide bond. The peptide flip occurs in p38{alpha} kinase due to the critical glycine residue marked by its conformational flexibility. Kinome-wide sequence mining revealed rare presentation of the selectivity motif. Corroboratively, PH-797804 exhibited exceptionally high specificity against MAP kinases and the related kinases. No cross-reactivity was observed in large panels of kinase screens (selectivity ratio of >500-fold). In cellular assays, PH-797804 demonstrated superior potency and selectivity consistent with the biochemical measurements. PH-797804 has met safety criteria in human phase I studies and is under clinical development for several inflammatory conditions. Understanding the rationale for selectivity at the molecular level helps elucidate the biological function and design of specific p38{alpha} kinase inhibitors.

  13. A Multi-Objective Approach for Protein Structure Prediction Based on an Energy Model and Backbone Angle Preferences.

    PubMed

    Tsay, Jyh-Jong; Su, Shih-Chieh; Yu, Chin-Sheng

    2015-07-03

    Protein structure prediction (PSP) is concerned with the prediction of protein tertiary structure from primary structure and is a challenging calculation problem. After decades of research effort, numerous solutions have been proposed for optimisation methods based on energy models. However, further investigation and improvement is still needed to increase the accuracy and similarity of structures. This study presents a novel backbone angle preference factor, which is one of the factors inducing protein folding. The proposed multiobjective optimisation approach simultaneously considers energy models and backbone angle preferences to solve the ab initio PSP. To prove the effectiveness of the multiobjective optimisation approach based on the energy models and backbone angle preferences, 75 amino acid sequences with lengths ranging from 22 to 88 amino acids were selected from the CB513 data set to be the benchmarks. The data sets were highly dissimilar, therefore indicating that they are meaningful. The experimental results showed that the root-mean-square deviation (RMSD) of the multiobjective optimization approach based on energy model and backbone angle preferences was superior to those of typical energy models, indicating that the proposed approach can facilitate the ab initio PSP.

  14. A Multi-Objective Approach for Protein Structure Prediction Based on an Energy Model and Backbone Angle Preferences

    PubMed Central

    Tsay, Jyh-Jong; Su, Shih-Chieh; Yu, Chin-Sheng

    2015-01-01

    Protein structure prediction (PSP) is concerned with the prediction of protein tertiary structure from primary structure and is a challenging calculation problem. After decades of research effort, numerous solutions have been proposed for optimisation methods based on energy models. However, further investigation and improvement is still needed to increase the accuracy and similarity of structures. This study presents a novel backbone angle preference factor, which is one of the factors inducing protein folding. The proposed multiobjective optimisation approach simultaneously considers energy models and backbone angle preferences to solve the ab initio PSP. To prove the effectiveness of the multiobjective optimisation approach based on the energy models and backbone angle preferences, 75 amino acid sequences with lengths ranging from 22 to 88 amino acids were selected from the CB513 data set to be the benchmarks. The data sets were highly dissimilar, therefore indicating that they are meaningful. The experimental results showed that the root-mean-square deviation (RMSD) of the multiobjective optimization approach based on energy model and backbone angle preferences was superior to those of typical energy models, indicating that the proposed approach can facilitate the ab initio PSP. PMID:26151847

  15. Structure-Based Prediction of Unstable Regions in Proteins: Applications to Protein Misfolding Diseases

    NASA Astrophysics Data System (ADS)

    Guest, Will; Cashman, Neil; Plotkin, Steven

    2009-03-01

    Protein misfolding is a necessary step in the pathogenesis of many diseases, including Creutzfeldt-Jakob disease (CJD) and familial amyotrophic lateral sclerosis (fALS). Identifying unstable structural elements in their causative proteins elucidates the early events of misfolding and presents targets for inhibition of the disease process. An algorithm was developed to calculate the Gibbs free energy of unfolding for all sequence-contiguous regions of a protein using three methods to parameterize energy changes: a modified G=o model, changes in solvent-accessible surface area, and solution of the Poisson-Boltzmann equation. The entropic effects of disulfide bonds and post-translational modifications are treated analytically. It incorporates a novel method for finding local dielectric constants inside a protein to accurately handle charge effects. We have predicted the unstable parts of prion protein and superoxide dismutase 1, the proteins involved in CJD and fALS respectively, and have used these regions as epitopes to prepare antibodies that are specific to the misfolded conformation and show promise as therapeutic agents.

  16. Ecotoxicity quantitative structure-activity relationships for alcohol ethoxylate mixtures based on substance-specific toxicity predictions.

    PubMed

    Boeije, G M; Cano, M L; Marshall, S J; Belanger, S E; Van Compernolle, R; Dorn, P B; Gümbel, H; Toy, R; Wind, T

    2006-05-01

    Traditionally, ecotoxicity quantitative structure-activity relationships (QSARs) for alcohol ethoxylate (AE) surfactants have been developed by assigning the measured ecotoxicity for commercial products to the average structures (alkyl chain length and ethoxylate chain length) of these materials. Acute Daphnia magna toxicity tests for binary mixtures indicate that mixtures are more toxic than the individual AE substances corresponding with their average structures (due to the nonlinear relation of toxicity with structure). Consequently, the ecotoxicity value (expressed as effects concentration) attributed to the average structures that are used to develop the existing QSARs is expected to be too low. A new QSAR technique for complex substances, which interprets the mixture toxicity with regard to the "ethoxymers" distribution (i.e., the individual AE components) rather than the average structure, was developed. This new technique was then applied to develop new AE ecotoxicity QSARs for invertebrates, fish, and mesocosms. Despite the higher complexity, the fit and accuracy of the new QSARs are at least as good as those for the existing QSARs based on the same data set. As expected from typical ethoxymer distributions of commercial AEs, the new QSAR generally predicts less toxicity than the QSARs based on average structure. PMID:16256196

  17. DFGmodel: Predicting Protein Kinase Structures in Inactive States for Structure-Based Discovery of Type-II Inhibitors

    PubMed Central

    2015-01-01

    Protein kinases exist in equilibrium of active and inactive states, in which the aspartate-phenylalanine-glycine motif in the catalytic domain undergoes conformational changes that are required for function. Drugs targeting protein kinases typically bind the primary ATP-binding site of an active state (type-I inhibitors) or utilize an allosteric pocket adjacent to the ATP-binding site in the inactive state (type-II inhibitors). Limited crystallographic data of protein kinases in the inactive state hampers the application of rational drug discovery methods for developing type-II inhibitors. Here, we present a computational approach to generate structural models of protein kinases in the inactive conformation. We first perform a comprehensive analysis of all protein kinase structures deposited in the Protein Data Bank. We then develop DFGmodel, a method that takes either a known structure of a kinase in the active conformation or a sequence of a kinase without a structure, to generate kinase models in the inactive conformation. Evaluation of DFGmodel’s performance using various measures indicates that the inactive kinase models are accurate, exhibiting RMSD of 1.5 Å or lower. The kinase models also accurately distinguish type-II kinase inhibitors from likely nonbinders (AUC > 0.70), suggesting that they are useful for virtual screening. Finally, we demonstrate the applicability of our approach with three case studies. For example, the models are able to capture inhibitors with unintended off-target activity. Our computational approach provides a structural framework for chemical biologists to characterize kinases in the inactive state and to explore new chemical spaces with structure-based drug design. PMID:25420233

  18. Defect Prediction and Control for Ultra-high-strength Steel Complex Structure in Hot Forming Based on FEM

    NASA Astrophysics Data System (ADS)

    Shang, Xin; Zhou, Jie; Zhuo, Fang; Luo, Yan; Li, Yang

    2015-06-01

    Cracking is the main defect in ultra-high-strength steel (UHSS) forming products. In order to avoid cracking, either adjusting process parameters or changing die's design is usually applied. However, under the condition of forming parts with unreasonable structure design, it makes little difference through the traditional methods of modifying process parameters. In this paper, true stress-strain curves under different strain rates and temperatures are obtained via the hot tensile tests. Then, the material constitutive model of UHSS is introduced into software CAE; this step is used to analyze and predict defects of UHSS hot forming complex structural parts based on FEM. In addition, simulation results of changed structure (open end) are compared with original structure (closed end). The results have shown that both maximum reduction ratio and stress in all directions are sharply reduced, i.e., the forming quality is improved significantly after changing the end structure. Finally, the prediction and control methods of forming defects are verified to be feasible in actual production.

  19. Toward improving the reliability of hydrologic prediction: Model structure uncertainty and its quantification using ensemble-based genetic programming framework

    NASA Astrophysics Data System (ADS)

    Parasuraman, Kamban; Elshorbagy, Amin

    2008-12-01

    Uncertainty analysis is starting to be widely acknowledged as an integral part of hydrological modeling. The conventional treatment of uncertainty analysis in hydrologic modeling is to assume a deterministic model structure, and treat its associated parameters as imperfectly known, thereby neglecting the uncertainty associated with the model structure. In this paper, a modeling framework that can explicitly account for the effect of model structure uncertainty has been proposed. The modeling framework is based on initially generating different realizations of the original data set using a non-parametric bootstrap method, and then exploiting the ability of the self-organizing algorithms, namely genetic programming, to evolve their own model structure for each of the resampled data sets. The resulting ensemble of models is then used to quantify the uncertainty associated with the model structure. The performance of the proposed modeling framework is analyzed with regards to its ability in characterizing the evapotranspiration process at the Southwest Sand Storage facility, located near Ft. McMurray, Alberta. Eddy-covariance-measured actual evapotranspiration is modeled as a function of net radiation, air temperature, ground temperature, relative humidity, and wind speed. Investigating the relation between model complexity, prediction accuracy, and uncertainty, two sets of experiments were carried out by varying the level of mathematical operators that can be used to define the predictand-predictor relationship. While the first set uses just the additive operators, the second set uses both the additive and the multiplicative operators to define the predictand-predictor relationship. The results suggest that increasing the model complexity may lead to better prediction accuracy but at an expense of increasing uncertainty. Compared to the model parameter uncertainty, the relative contribution of model structure uncertainty to the predictive uncertainty of a model is

  20. Quantitative Vapor-phase IR Intensities and DFT Computations to Predict Absolute IR Spectra based on Molecular Structure: I. Alkanes

    SciTech Connect

    Williams, Stephen D.; Johnson, Timothy J.; Sharpe, Steven W.; Yavelak, Veronica; Oats, R. P.; Brauer, Carolyn S.

    2013-11-13

    Recently recorded quantitative IR spectra of a variety of gas-phase alkanes are shown to have integrated intensities in both the C-H stretching and C-H bending regions that depend linearly on the molecular size, i.e. the number of C-H bonds. This result is well predicted from CH4 to C15H32 by DFT computations of IR spectra at the B3LYP/6-31+G(d,p) level of DFT theory. A simple model predicting the absolute IR band intensities of alkanes based only on structural formula is proposed: For the C-H stretching band near 2930 cm-1 this is given by (in km/mol): CH¬_str = (34±3)*CH – (41±60) where CH is number of C-H bonds in the alkane. The linearity is explained in terms of coordinated motion of methylene groups rather than the summed intensities of autonomous -CH2- units. The effect of alkyl chain length on the intensity of a C-H bending mode is explored and interpreted in terms of conformer distribution. The relative intensity contribution of a methyl mode compared to the total C-H stretch intensity is shown to be linear in the number of terminal methyl groups in the alkane, and can be used to predict quantitative spectra a priori based on structure alone.

  1. Secondary Structure Prediction of Single Sequences Using RNAstructure.

    PubMed

    Xu, Zhenjiang Zech; Mathews, David H

    2016-01-01

    RNA secondary structure is often predicted using folding thermodynamics. RNAstructure is a software package that includes structure prediction by free energy minimization, prediction of base pairing probabilities, prediction of structures composed of highly probably base pairs, and prediction of structures with pseudoknots. A user-friendly graphical user interface is provided, and this interface works on Windows, Apple OS X, and Linux. This chapter provides protocols for using RNAstructure for structure prediction. PMID:27665590

  2. Multiscale modeling of interwoven Kevlar fibers based on random walk to predict yarn structural response

    NASA Astrophysics Data System (ADS)

    Recchia, Stephen

    Kevlar is the most common high-end plastic filament yarn used in body armor, tire reinforcement, and wear resistant applications. Kevlar is a trade name for an aramid fiber. These are fibers in which the chain molecules are highly oriented along the fiber axis, so the strength of the chemical bond can be exploited. The bulk material is extruded into filaments that are bound together into yarn, which may be chorded with other materials as in car tires, woven into a fabric, or layered in an epoxy to make composite panels. The high tensile strength to low weight ratio makes this material ideal for designs that decrease weight and inertia, such as automobile tires, body panels, and body armor. For designs that use Kevlar, increasing the strength, or tenacity, to weight ratio would improve performance or reduce cost of all products that are based on this material. This thesis computationally and experimentally investigates the tenacity and stiffness of Kevlar yarns with varying twist ratios. The test boundary conditions were replicated with a geometrically accurate finite element model, resulting in a customized code that can reproduce tortuous filaments in a yarn was developed. The solid model geometry capturing filament tortuosity was implemented through a random walk method of axial geometry creation. A finite element analysis successfully recreated the yarn strength and stiffness dependency observed during the tests. The physics applied in the finite element model was reproduced in an analytical equation that was able to predict the failure strength and strain dependency of twist ratio. The analytical solution can be employed to optimize yarn design for high strength applications.

  3. Impact of the subtle differences in MMP-12 structure on Glide-based molecular docking for pose prediction of inhibitors

    NASA Astrophysics Data System (ADS)

    Zhang, Huan; Wang, Yajing; Xu, Feng

    2014-11-01

    Human MMP-12 is involved in many aspects of disease pathology. Substantial efforts have been made to develop MMP-12 inhibitors. However, the mechanism of some MMP-12 inhibitors is still unclear. Recently, the method of molecular modeling was used to explore the mechanism, but selecting the best candidate among the wealth of MMP-12 structures poses a challenge. In this study, we attempted to identify several criteria to predict the most appropriate MMP-12 PDB ID for enzyme-ligand interaction studies based on cross-docking by Glide. Furthermore, the parameters from PDB files such as R-free, resolution, B factor, and the molecular volume of the ligand in the complex can provide useful clues for choosing a suitable approximate initial model for pose prediction for MMP-12 inhibitors. This work might also provide a useful reference for other drug targets.

  4. Finite element prediction of seismic response modification of monumental structures utilizing base isolation

    NASA Astrophysics Data System (ADS)

    Spanos, Konstantinos; Anifantis, Nikolaos; Kakavas, Panayiotis

    2015-05-01

    The analysis of the mechanical behavior of ancient structures is an essential engineering task concerning the preservation of architectural heritage. As many monuments of classical antiquity are located in regions of earthquake activity, the safety assessment of these structures, as well as the selection of possible restoration interventions, requires numerical models capable of correctly representing their seismic response. The work presented herein was part of a research project in which a better understanding of the dynamics of classical column-architrave structures was sought by means of numerical techniques. In this paper, the seismic behavior of ancient monumental structures with multi-drum classical columns is investigated. In particular, the column-architrave classical structure under strong ground excitations was represented by a finite element method. This approach simulates the individual rock blocks as distinct rigid blocks interconnected with slidelines and incorporates seismic isolation dampers under the basement of the structure. Sliding and rocking motions of individual stone blocks and drums are modeled utilizing non-linear frictional contact conditions. The seismic isolation is modeled through the application of pad bearings under the basement of the structure. These pads are interpreted by appropriate rubber and steel layers. Time domain analyses were performed, considering the geometric and material non-linear behavior at the joints and the characteristics of pad bearings. The deformation and failure modes of drum columns subject to seismic excitations of various types and intensities were analyzed. The adverse influence of drum imperfections on structural safety was also examined.

  5. Structure-based prediction of subtype-selectivity of Histamine H3 receptor selective antagonists in clinical trials

    PubMed Central

    Kim, Soo-Kyung; Fristrup, Peter; Abrol, Ravinder; Goddard, William A.

    2011-01-01

    Histamine receptors (HRs) are excellent drug targets for the treatment of diseases such as schizophrenia, psychosis, depression, migraine, allergies, asthma ulcers, and hypertension. Among them, the human H3 Histamine receptor (hH3HR) antagonists have been proposed for specific therapeutic applications, including treatment of Alzheimer's disease, attention deficit hyperactivity disorder (ADHD), epilepsy, and obesity.1 However, many of these drug candidates cause undesired side effects through the cross-reactivity with other histamine receptor subtypes. In order to develop improved selectivity and activity for such treatments it would be useful to have the three dimensional structures for all four HRs. We report here the predicted structures of four HR subtypes (H1, H2, H3, and H4) using the GEnSeMBLE (GPCR Ensemble of Structures in Membrane BiLayer Environment) Monte Carlo protocol.2 sampling ~ 35 million combinations of helix packings to predict the 10 most stable packings for each of the four subtypes. Then we used these best 10 protein structures with the DarwinDock Monte Carlo protocol to sample ~ 50,000*20 poses to predict the optimum ligand-protein structures for various agonists and antagonists. We find that E2065.46 contributes most in binding H3 selective agonists (5, 6, 7) in agreement with experimental mutation studies. We also find that conserved E5.46/ S5.43 in both of hH3HR and hH4HR are involved in H3/ H4 subtype selectivity. In addition, we find that M3786.55 in hH3HR provides additional hydrophobic interactions different from hH4HR (the corresponding amino acid of T3236.55 in hH4HR) to provide additional subtype bias. From these studies we developed a pharmacophore model based on our predictions for known hH3HR selective antagonists in clinical study [ABT-239 1, GSK-189,254 2, PF-3654746 3, and BF2.649 (Tiprolisant) 4] that suggests critical selectivity directing elements are: the basic proton interacting with D1143.32, the spacer, the aromatic

  6. Real-time prediction of atmospheric Lagrangian coherent structures based on forecast data: An application and error analysis

    NASA Astrophysics Data System (ADS)

    BozorgMagham, Amir E.; Ross, Shane D.; Schmale, David G.

    2013-09-01

    The language of Lagrangian coherent structures (LCSs) provides a new means for studying transport and mixing of passive particles advected by an atmospheric flow field. Recent observations suggest that LCSs govern the large-scale atmospheric motion of airborne microorganisms, paving the way for more efficient models and management strategies for the spread of infectious diseases affecting plants, domestic animals, and humans. In addition, having reliable predictions of the timing of hyperbolic LCSs may contribute to improved aerobiological sampling of microorganisms with unmanned aerial vehicles and LCS-based early warning systems. Chaotic atmospheric dynamics lead to unavoidable forecasting errors in the wind velocity field, which compounds errors in LCS forecasting. In this study, we reveal the cumulative effects of errors of (short-term) wind field forecasts on the finite-time Lyapunov exponent (FTLE) fields and the associated LCSs when realistic forecast plans impose certain limits on the forecasting parameters. Objectives of this paper are to (a) quantify the accuracy of prediction of FTLE-LCS features and (b) determine the sensitivity of such predictions to forecasting parameters. Results indicate that forecasts of attracting LCSs exhibit less divergence from the archive-based LCSs than the repelling features. This result is important since attracting LCSs are the backbone of long-lived features in moving fluids. We also show under what circumstances one can trust the forecast results if one merely wants to know if an LCS passed over a region and does not need to precisely know the passage time.

  7. Protein structure prediction using hybrid AI methods

    SciTech Connect

    Guan, X.; Mural, R.J.; Uberbacher, E.C.

    1993-11-01

    This paper describes a new approach for predicting protein structures based on Artificial Intelligence methods and genetic algorithms. We combine nearest neighbor searching algorithms, neural networks, heuristic rules and genetic algorithms to form an integrated system to predict protein structures from their primary amino acid sequences. First we describe our methods and how they are integrated, and then apply our methods to several protein sequences. The results are very close to the real structures obtained by crystallography. Parallel genetic algorithms are also implemented.

  8. Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10.

    PubMed

    Zhang, Yang

    2014-02-01

    We develop and test a new pipeline in CASP10 to predict protein structures based on an interplay of I-TASSER and QUARK for both free-modeling (FM) and template-based modeling (TBM) targets. The most noteworthy observation is that sorting through the threading template pool using the QUARK-based ab initio models as probes allows the detection of distant-homology templates which might be ignored by the traditional sequence profile-based threading alignment algorithms. Further template assembly refinement by I-TASSER resulted in successful folding of two medium-sized FM targets with >150 residues. For TBM, the multiple threading alignments from LOMETS are, for the first time, incorporated into the ab initio QUARK simulations, which were further refined by I-TASSER assembly refinement. Compared with the traditional threading assembly refinement procedures, the inclusion of the threading-constrained ab initio folding models can consistently improve the quality of the full-length models as assessed by the GDT-HA and hydrogen-bonding scores. Despite the success, significant challenges still exist in domain boundary prediction and consistent folding of medium-size proteins (especially beta-proteins) for nonhomologous targets. Further developments of sensitive fold-recognition and ab initio folding methods are critical for solving these problems.

  9. Structure Based Prediction of Subtype-Selectivity for Adenosine Receptor Antagonists

    PubMed Central

    Katritch, Vsevolod; Kufareva, Irina; Abagyan, Ruben

    2010-01-01

    One of the major hurdles in the development of safe and effective drugs targeting G-protein coupled receptors (GPCRs) is finding ligands that are highly selective for a specific receptor subtype. Structural understanding of subtype-specific binding pocket variations and ligand-receptor interactions may greatly facilitate design of selective ligands. To gain insights into the structural basis of ligand subtype selectivity within the family of adenosine receptors (AR: A1, A2A, A2B, and A3) we generated 3D models of all four subtypes using the recently determined crystal structure of the AA2AR as a template, and employing the methodology of ligand-guided receptor optimization for refinement. This approach produced 3D conformational models of AR subtypes that effectively explain binding modes and subtype selectivity for a diverse set of known AR antagonists. Analysis of the subtype-specific ligand-receptor interactions allowed identification of the major determinants of ligand selectivity, which may facilitate discovery of more efficient drug candidates. PMID:20637786

  10. A population-based evolutionary search approach to the multiple minima problem in de novo protein structure prediction

    PubMed Central

    2013-01-01

    Background Elucidating the native structure of a protein molecule from its sequence of amino acids, a problem known as de novo structure prediction, is a long standing challenge in computational structural biology. Difficulties in silico arise due to the high dimensionality of the protein conformational space and the ruggedness of the associated energy surface. The issue of multiple minima is a particularly troublesome hallmark of energy surfaces probed with current energy functions. In contrast to the true energy surface, these surfaces are weakly-funneled and rich in comparably deep minima populated by non-native structures. For this reason, many algorithms seek to be inclusive and obtain a broad view of the low-energy regions through an ensemble of low-energy (decoy) conformations. Conformational diversity in this ensemble is key to increasing the likelihood that the native structure has been captured. Methods We propose an evolutionary search approach to address the multiple-minima problem in decoy sampling for de novo structure prediction. Two population-based evolutionary search algorithms are presented that follow the basic approach of treating conformations as individuals in an evolving population. Coarse graining and molecular fragment replacement are used to efficiently obtain protein-like child conformations from parents. Potential energy is used both to bias parent selection and determine which subset of parents and children will be retained in the evolving population. The effect on the decoy ensemble of sampling minima directly is measured by additionally mapping a conformation to its nearest local minimum before considering it for retainment. The resulting memetic algorithm thus evolves not just a population of conformations but a population of local minima. Results and conclusions Results show that both algorithms are effective in terms of sampling conformations in proximity of the known native structure. The additional minimization is shown to be

  11. Predicting the Effect of Mutations on Protein-Protein Binding Interactions through Structure-Based Interface Profiles

    PubMed Central

    Brender, Jeffrey R.; Zhang, Yang

    2015-01-01

    The formation of protein-protein complexes is essential for proteins to perform their physiological functions in the cell. Mutations that prevent the proper formation of the correct complexes can have serious consequences for the associated cellular processes. Since experimental determination of protein-protein binding affinity remains difficult when performed on a large scale, computational methods for predicting the consequences of mutations on binding affinity are highly desirable. We show that a scoring function based on interface structure profiles collected from analogous protein-protein interactions in the PDB is a powerful predictor of protein binding affinity changes upon mutation. As a standalone feature, the differences between the interface profile score of the mutant and wild-type proteins has an accuracy equivalent to the best all-atom potentials, despite being two orders of magnitude faster once the profile has been constructed. Due to its unique sensitivity in collecting the evolutionary profiles of analogous binding interactions and the high speed of calculation, the interface profile score has additional advantages as a complementary feature to combine with physics-based potentials for improving the accuracy of composite scoring approaches. By incorporating the sequence-derived and residue-level coarse-grained potentials with the interface structure profile score, a composite model was constructed through the random forest training, which generates a Pearson correlation coefficient >0.8 between the predicted and observed binding free-energy changes upon mutation. This accuracy is comparable to, or outperforms in most cases, the current best methods, but does not require high-resolution full-atomic models of the mutant structures. The binding interface profiling approach should find useful application in human-disease mutation recognition and protein interface design studies. PMID:26506533

  12. Sequence and structure-based prediction of fructosyltransferase activity for functional subclassification of fungal GH32 enzymes.

    PubMed

    Trollope, Kim M; van Wyk, Niël; Kotjomela, Momo A; Volschenk, Heinrich

    2015-12-01

    Sucrolytic enzymes catalyse sucrose hydrolysis or the synthesis of fructooligosaccharides (FOSs), a prebiotic in human and animal nutrition. FOS synthesis capacity differs between sucrolytic enzymes. Amino-acid-sequence-based classification of FOS synthesizing enzymes would greatly facilitate the in silico identification of novel catalysts, as large amounts of sequence data lie untapped. The development of a bioinformatics tool to rapidly distinguish between high-level FOSs synthesizing predominantly sucrose hydrolysing enzymes from fungal genomic data is presented. Sequence comparison of functionally characterized enzymes displaying low- and high-level FOS synthesis revealed conserved motifs unique to each group. New light is shed on the sequence context of active site residues in three previously identified conserved motifs. We characterized two enzymes predicted to possess low- and high-level FOS synthesis activities based on their conserved motif sequences. FOS data for the enzymes confirmed our successful prediction of their FOS synthesis capacity. Structural comparison of enzymes displaying low- and high-level FOS synthesis identified steric hindrance between nystose and a long loop region present only in low-level FOS synthesizers. This loop is proposed to limit the synthesis of FOS species with higher degrees of polymerization, a phenomenon observed among enzymes displaying low-level FOS synthesis. Conserved sequence motifs surrounding catalytic residues and a distant structural determinant were identifiers of FOS synthesis capacity and allow for functional annotation of sucrolytic enzymes directly from amino acid sequence. The tool presented may also be useful to study the structure-function relationships of β-fructofuranosidases by identifying mutations present in a group of closely related enzymes displaying similar function.

  13. Predicting the Effect of Mutations on Protein-Protein Binding Interactions through Structure-Based Interface Profiles.

    PubMed

    Brender, Jeffrey R; Zhang, Yang

    2015-10-01

    The formation of protein-protein complexes is essential for proteins to perform their physiological functions in the cell. Mutations that prevent the proper formation of the correct complexes can have serious consequences for the associated cellular processes. Since experimental determination of protein-protein binding affinity remains difficult when performed on a large scale, computational methods for predicting the consequences of mutations on binding affinity are highly desirable. We show that a scoring function based on interface structure profiles collected from analogous protein-protein interactions in the PDB is a powerful predictor of protein binding affinity changes upon mutation. As a standalone feature, the differences between the interface profile score of the mutant and wild-type proteins has an accuracy equivalent to the best all-atom potentials, despite being two orders of magnitude faster once the profile has been constructed. Due to its unique sensitivity in collecting the evolutionary profiles of analogous binding interactions and the high speed of calculation, the interface profile score has additional advantages as a complementary feature to combine with physics-based potentials for improving the accuracy of composite scoring approaches. By incorporating the sequence-derived and residue-level coarse-grained potentials with the interface structure profile score, a composite model was constructed through the random forest training, which generates a Pearson correlation coefficient >0.8 between the predicted and observed binding free-energy changes upon mutation. This accuracy is comparable to, or outperforms in most cases, the current best methods, but does not require high-resolution full-atomic models of the mutant structures. The binding interface profiling approach should find useful application in human-disease mutation recognition and protein interface design studies.

  14. Sequence and structure-based prediction of fructosyltransferase activity for functional subclassification of fungal GH32 enzymes.

    PubMed

    Trollope, Kim M; van Wyk, Niël; Kotjomela, Momo A; Volschenk, Heinrich

    2015-12-01

    Sucrolytic enzymes catalyse sucrose hydrolysis or the synthesis of fructooligosaccharides (FOSs), a prebiotic in human and animal nutrition. FOS synthesis capacity differs between sucrolytic enzymes. Amino-acid-sequence-based classification of FOS synthesizing enzymes would greatly facilitate the in silico identification of novel catalysts, as large amounts of sequence data lie untapped. The development of a bioinformatics tool to rapidly distinguish between high-level FOSs synthesizing predominantly sucrose hydrolysing enzymes from fungal genomic data is presented. Sequence comparison of functionally characterized enzymes displaying low- and high-level FOS synthesis revealed conserved motifs unique to each group. New light is shed on the sequence context of active site residues in three previously identified conserved motifs. We characterized two enzymes predicted to possess low- and high-level FOS synthesis activities based on their conserved motif sequences. FOS data for the enzymes confirmed our successful prediction of their FOS synthesis capacity. Structural comparison of enzymes displaying low- and high-level FOS synthesis identified steric hindrance between nystose and a long loop region present only in low-level FOS synthesizers. This loop is proposed to limit the synthesis of FOS species with higher degrees of polymerization, a phenomenon observed among enzymes displaying low-level FOS synthesis. Conserved sequence motifs surrounding catalytic residues and a distant structural determinant were identifiers of FOS synthesis capacity and allow for functional annotation of sucrolytic enzymes directly from amino acid sequence. The tool presented may also be useful to study the structure-function relationships of β-fructofuranosidases by identifying mutations present in a group of closely related enzymes displaying similar function. PMID:26426731

  15. Static compressive strength prediction of open-hole structure based on non-linear shear behavior and micro-mechanics

    NASA Astrophysics Data System (ADS)

    Li, Wangnan; Cai, Hongneng; Li, Chao

    2014-11-01

    This paper deals with the characterization of the strength of the constituents of carbon fiber reinforced plastic laminate (CFRP), and a prediction of the static compressive strength of open-hole structure of polymer composites. The approach combined with non-linear analysis in macro-level and a linear elastic micromechanical failure analysis in microlevel (non-linear MMF) is proposed to improve the prediction accuracy. A face-centered cubic micromechanics model is constructed to analyze the stresses in fiber and matrix in microlevel. Non-interactive failure criteria are proposed to characterize the strength of fiber and matrix. The non-linear shear behavior of the laminate is studied experimentally, and a novel approach of cubic spline interpolation is used to capture significant non-linear shear behavior of laminate. The user-defined material subroutine UMAT for the non-linear share behavior is developed and combined in the mechanics analysis in the macro-level using the Abaqus Python codes. The failure mechanism and static strength of open-hole compressive (OHC) structure of polymer composites is studied based on non-linear MMF. The UTS50/E51 CFRP is used to demonstrate the application of theory of non-linear MMF.

  16. Identification of ALK5 inhibitor via structure-based virtual screening and ADMET prediction.

    PubMed

    Lin, Aibin; Cai, Zhongqi; Hu, Guoping; Li, Qiang

    2015-01-01

    TGF-β plays a critical role in the initiation and progression of fibrosis in various organ systems such as kidney, heart, lung and liver. TGF-β and its receptors (ALK5 and TβR II) are able to control the cellular growth and promote several biological responses. To date, many pharmaceutical companies have employed virtual screening to identify potent inhibitors against ALK5. Nevertheless, none of these studies had involved the in silico ADMET evaluation and Raccoon filtering. In our experiment, all 57423 molecules were downloaded from TCM database and were filtered and converted to PDBQT formats by Raccoon software. Then 24 189 structures were run through AutoDock Vina in PyRx 0.8, 164 molecules were selected and further evaluated by ADMET Predictor 6.5, and 56 structures were selected and docked by Glide 6.2. Finally, the top 10 hits were identified as promising oral ALK5 inhibitors according to their Glide scores. The Glide scores of the best two compounds, 40686 and 33534, were -10.75 and -10.30 kcal/mol, respectively. This research provides a set of combined and detailed virtual screening protocol and is helpful for explaining the mechanism of receptor-ligand interactions.

  17. De Novo Protein Structure Prediction

    NASA Astrophysics Data System (ADS)

    Hung, Ling-Hong; Ngan, Shing-Chung; Samudrala, Ram

    An unparalleled amount of sequence data is being made available from large-scale genome sequencing efforts. The data provide a shortcut to the determination of the function of a gene of interest, as long as there is an existing sequenced gene with similar sequence and of known function. This has spurred structural genomic initiatives with the goal of determining as many protein folds as possible (Brenner and Levitt, 2000; Burley, 2000; Brenner, 2001; Heinemann et al., 2001). The purpose of this is twofold: First, the structure of a gene product can often lead to direct inference of its function. Second, since the function of a protein is dependent on its structure, direct comparison of the structures of gene products can be more sensitive than the comparison of sequences of genes for detecting homology. Presently, structural determination by crystallography and NMR techniques is still slow and expensive in terms of manpower and resources, despite attempts to automate the processes. Computer structure prediction algorithms, while not providing the accuracy of the traditional techniques, are extremely quick and inexpensive and can provide useful low-resolution data for structure comparisons (Bonneau and Baker, 2001). Given the immense number of structures which the structural genomic projects are attempting to solve, there would be a considerable gain even if the computer structure prediction approach were applicable to a subset of proteins.

  18. Water-Regulated Self-Assembly Structure Transformation and Gelation Behavior Prediction Based on a Hydrazide Derivative.

    PubMed

    Li, Yajie; Ran, Xia; Li, Qiuyue; Gao, Qiongqiong; Guo, Lijun

    2016-08-01

    Herein, we report the water-regulated supramolecular self-assembly structure transformation and the predictability of the gelation ability based on an azobenzene derivative bearing a hydrazide group, namely, N-(3,4,5-tributoxyphenyl)-N'-4-[(4-hydroxyphenyl)azophenyl] benzohydrazide (BNB-t4). The regulation effects are demonstrated in the morphological transformation from spherical to lamellar particles then back to spherical in different solvent ratios of n-propanol/water. The self-assembly behavior of BNB-t4 was characterized by minimum gelation concentration, microstructure, thermal, and mechanical stabilities. From the spectroscopy studies, it is suggested that gel formation of BNB-t4 is mainly driven by intermolecular hydrogen bonding, accompanied with the contribution from π-π stacking as well as hydrophobic interactions. The successfully established correlation between the self-assembly behavior and solubility parameters yields a facile way to predict the gelation performance of other molecules in other single or mixed solvents.

  19. PROCARB: A Database of Known and Modelled Carbohydrate-Binding Protein Structures with Sequence-Based Prediction Tools

    PubMed Central

    Malik, Adeel; Firoz, Ahmad; Jha, Vivekanand; Ahmad, Shandar

    2010-01-01

    Understanding of the three-dimensional structures of proteins that interact with carbohydrates covalently (glycoproteins) as well as noncovalently (protein-carbohydrate complexes) is essential to many biological processes and plays a significant role in normal and disease-associated functions. It is important to have a central repository of knowledge available about these protein-carbohydrate complexes as well as preprocessed data of predicted structures. This can be significantly enhanced by tools de novo which can predict carbohydrate-binding sites for proteins in the absence of structure of experimentally known binding site. PROCARB is an open-access database comprising three independently working components, namely, (i) Core PROCARB module, consisting of three-dimensional structures of protein-carbohydrate complexes taken from Protein Data Bank (PDB), (ii) Homology Models module, consisting of manually developed three-dimensional models of N-linked and O-linked glycoproteins of unknown three-dimensional structure, and (iii) CBS-Pred prediction module, consisting of web servers to predict carbohydrate-binding sites using single sequence or server-generated PSSM. Several precomputed structural and functional properties of complexes are also included in the database for quick analysis. In particular, information about function, secondary structure, solvent accessibility, hydrogen bonds and literature reference, and so forth, is included. In addition, each protein in the database is mapped to Uniprot, Pfam, PDB, and so forth. PMID:20671979

  20. Vertical Chlorophyll Canopy Structure Affects the Remote Sensing Based Predictability of LAI, Chlorophyll and Leaf Nitrogen in Agricultural Fields

    NASA Astrophysics Data System (ADS)

    Boegh, E.; Houborg, R.; Bienkowski, J.; Braban, C. F.; Dalgaard, T.; van Dijk, N.; Dragosits, U.; Holmes, E.; Magliulo, V.; Schelde, K.; Di Tommasi, P.; Vitale, L.; Theobald, M.; Cellier, P.; Sutton, M.

    2012-12-01

    Leaf nitrogen and leaf surface area influence the exchange of gases between terrestrial ecosystems and the atmosphere, and they play a significant role in the global cycles of carbon, nitrogen and water. Remote sensing can be used to estimate leaf area index (LAI), chlorophyll content (CHL) and leaf nitrogen (N), but methods are often developed using plot-scale data and not verified over extended regions characterized by variations in environmental boundary conditions (soil, atmosphere) and canopy structures. Estimation of N can be indirect due to its association with CHL, however N is also included in pigments such as carotenoids and anthocyanin which have different spectral signatures than CHL. Photosynthesis optimization theory suggests that plants will distribute their N resources in proportion to the light gradient within the canopy. Such vertical variation in CHL and N complicates the evaluation of remote sensing-based methods. Typically remote sensing studies measure CHL of the upper leaf, which is then multiplied by the green LAI to represent canopy chlorophyll content, or random sampling is used. In this study, field measurements and high spatial resolution (10-20 m) remote sensing images acquired from the HRG and HRVIR sensors aboard the SPOT satellites were used to assess the predictability of LAI, CHL and N in five European agricultural landscapes located in Denmark, Scotland (United Kingdom), Poland, The Netherlands and Italy . All satellite images were atmospherically using the 6SV1 model with atmospheric inputs estimated by MODIS and AIRS data. Five spectral vegetation indices (SVIs) were calculated (the Normalized Difference Vegetation index, the Simple Ratio, the Enhanced Vegetation Index-2, the Green Normalized Difference Vegetation Index, and the green Chlorophyll Index), and an image-based inverse canopy radiative transfer modelling system, REGFLEC (REGularized canopy reFLECtance) was applied to each of the five European landscapes. While the

  1. Interface Structure Prediction from First-Principles

    SciTech Connect

    Zhao, Xin; Shu, Qiang; Nguyen, Manh Cuong; Wang, Yangang; Ji, Min; Xiang, Hongjun; Ho, Kai-Ming; Gong, Xingao; Wang, Cai-Zhuang

    2014-05-08

    Information about the atomic structures at solid–solid interfaces is crucial for understanding and predicting the performance of materials. Due to the complexity of the interfaces, it is very challenging to resolve their atomic structures using either experimental techniques or computer simulations. In this paper, we present an efficient first-principles computational method for interface structure prediction based on an adaptive genetic algorithm. This approach significantly reduces the computational cost, while retaining the accuracy of first-principles prediction. The method is applied to the investigation of both stoichiometric and nonstoichiometric SrTiO3 Σ3(112)[1¯10] grain boundaries with unit cell containing up to 200 atoms. Several novel low-energy structures are discovered, which provide fresh insights into the structure and stability of the grain boundaries.

  2. Prediction of RNA multiloop and pseudoknot conformations from a lattice-based, coarse-grain tertiary structure model

    NASA Astrophysics Data System (ADS)

    Jost, Daniel; Everaers, Ralf

    2010-03-01

    We present a semiquantitative lattice model of RNA folding, which is able to reproduce complex folded structures such as multiloops and pseudoknots without relying on the frequently employed ad hoc generalization of the Jacobson-Stockmayer loop entropy. We derive the model parameters from the Turner description of simple secondary structural elements and pay particular attention to the unification of mismatch and coaxial stacking parameters as well as of border and nonlocal loop parameters, resulting in a reduced, unified parameter set for simple loops of arbitrary type and size. For elementary structures, the predictive power of the model is comparable to the standard secondary structure approaches, from which its parameters are derived. For complex structures, our approach offers a systematic treatment of generic effects of chain connectivity as well as of excluded volume or attractive interactions between and within all elements of the secondary structure. We reproduce the native structures of tRNA multiloops and of viral frameshift signal pseudoknots.

  3. Interval prediction in structural dynamic analysis

    NASA Technical Reports Server (NTRS)

    Hasselman, Timothy K.; Chrostowski, Jon D.; Ross, Timothy J.

    1992-01-01

    Methods for assessing the predictive accuracy of structural dynamic models are examined with attention given to the effects of modal mass, stiffness, and damping uncertainties. The methods are based on a nondeterministic analysis called 'interval prediction' in which interval variables are used to describe parameters and responses that are unknown. Statistical databases for generic modeling uncertainties are derived from experimental data and incorporated analytically to evaluate responses. Covariance matrices of modal mass, stiffness, and damping parameters are propagated numerically in models of large space structures by means of three methods. The test data tend to fall within the predicted intervals of uncertainty determined by the statistical databases. The present findings demonstrate the suitability of using data from previously analyzed and tested space structures for assessing the predictive accuracy of an analytical model.

  4. DR_bind: a web server for predicting DNA-binding residues from the protein structure based on electrostatics, evolution and geometry.

    PubMed

    Chen, Yao Chi; Wright, Jon D; Lim, Carmay

    2012-07-01

    DR_bind is a web server that automatically predicts DNA-binding residues, given the respective protein structure based on (i) electrostatics, (ii) evolution and (iii) geometry. In contrast to machine-learning methods, DR_bind does not require a training data set or any parameters. It predicts DNA-binding residues by detecting a cluster of conserved, solvent-accessible residues that are electrostatically stabilized upon mutation to Asp(-)/Glu(-). The server requires as input the DNA-binding protein structure in PDB format and outputs a downloadable text file of the predicted DNA-binding residues, a 3D visualization of the predicted residues highlighted in the given protein structure, and a downloadable PyMol script for visualization of the results. Calibration on 83 and 55 non-redundant DNA-bound and DNA-free protein structures yielded a DNA-binding residue prediction accuracy/precision of 90/47% and 88/42%, respectively. Since DR_bind does not require any training using protein-DNA complex structures, it may predict DNA-binding residues in novel structures of DNA-binding proteins resulting from structural genomics projects with no conservation data. The DR_bind server is freely available with no login requirement at http://dnasite.limlab.ibms.sinica.edu.tw.

  5. Comparison of in silico tools for binding site prediction applied for structure-based design of autolysin inhibitors.

    PubMed

    Tibaut, T; Borišek, J; Novič, M; Turk, D

    2016-07-01

    Autolysin E (AtlE) is a bacteriolytic enzyme which plays an important role in division and growth of bacterial cells and therefore represents a promising potential drug target. Its 3D structure has been recently elucidated. We used in silico prediction tools to study substrate or ligand (inhibitor) binding regions of AtlE. We applied several freely available tools and a commercial tool for binding site identification and compared results of the prediction. Calculation time, number of predictions and output data provided by specific software vary according to the different approaches utilized by specific method categories. Despite different approaches, binding sites in similar locations on the protein were predicted. Specific amino acid residues that form these binding sites were predicted as binding residues. The predicted residues, especially those with predicted highest conservation score, could theoretically have catalytic and binding properties. According to our results, we assume that E138, which has the highest conservation score, is the catalytic residue and F161, G162 and Y224, which are also highly conserved, are responsible for substrate binding. Ligands developed with binding specificity towards these residues could inhibit the catalysis and binding of the substrate of AtlE. The molecules with inhibitory potency could therefore represent potential new antibacterial agents. PMID:27686112

  6. Definition of the applicability domains of knowledge-based predictive toxicology expert systems by using a structural fragment-based approach.

    PubMed

    Ellison, Claire M; Enoch, Steven J; Cronin, Mark Td; Madden, Judith C; Judson, Philip

    2009-11-01

    The applicability domain of a (quantitative) structure-activity relationship ([Q]SAR) must be defined, if a model is to be used successfully for toxicity prediction, particularly for regulatory purposes. Previous efforts to set guidelines on the definition of applicability domains have often been biased toward quantitative, rather than qualitative, models. As a result, novel techniques are still required to define the applicability domains of structural alert models and knowledge-based systems. By using Derek for Windows as an example, this study defined the domain for the skin sensitisation structural alert rule-base. This was achieved by fragmenting the molecules within a training set of compounds, then searching the fragments for those created from a test compound. This novel method was able to highlight test chemicals which differed from those in the training set. The information was then used to designate chemicals as being either within or outside the domain of applicability for the structural alert on which that training set was based.

  7. Characteristics and Prediction of RNA Structure

    PubMed Central

    Zhu, Daming; Zhang, Caiming; Han, Huijian; Crandall, Keith A.

    2014-01-01

    RNA secondary structures with pseudoknots are often predicted by minimizing free energy, which is NP-hard. Most RNAs fold during transcription from DNA into RNA through a hierarchical pathway wherein secondary structures form prior to tertiary structures. Real RNA secondary structures often have local instead of global optimization because of kinetic reasons. The performance of RNA structure prediction may be improved by considering dynamic and hierarchical folding mechanisms. This study is a novel report on RNA folding that accords with the golden mean characteristic based on the statistical analysis of the real RNA secondary structures of all 480 sequences from RNA STRAND, which are validated by NMR or X-ray. The length ratios of domains in these sequences are approximately 0.382L, 0.5L, 0.618L, and L, where L is the sequence length. These points are just the important golden sections of sequence. With this characteristic, an algorithm is designed to predict RNA hierarchical structures and simulate RNA folding by dynamically folding RNA structures according to the above golden section points. The sensitivity and number of predicted pseudoknots of our algorithm are better than those of the Mfold, HotKnots, McQfold, ProbKnot, and Lhw-Zhu algorithms. Experimental results reflect the folding rules of RNA from a new angle that is close to natural folding. PMID:25110687

  8. Predicting protein dynamics from structural ensembles

    NASA Astrophysics Data System (ADS)

    Copperman, J.; Guenza, M. G.

    2015-12-01

    The biological properties of proteins are uniquely determined by their structure and dynamics. A protein in solution populates a structural ensemble of metastable configurations around the global fold. From overall rotation to local fluctuations, the dynamics of proteins can cover several orders of magnitude in time scales. We propose a simulation-free coarse-grained approach which utilizes knowledge of the important metastable folded states of the protein to predict the protein dynamics. This approach is based upon the Langevin Equation for Protein Dynamics (LE4PD), a Langevin formalism in the coordinates of the protein backbone. The linear modes of this Langevin formalism organize the fluctuations of the protein, so that more extended dynamical cooperativity relates to increasing energy barriers to mode diffusion. The accuracy of the LE4PD is verified by analyzing the predicted dynamics across a set of seven different proteins for which both relaxation data and NMR solution structures are available. Using experimental NMR conformers as the input structural ensembles, LE4PD predicts quantitatively accurate results, with correlation coefficient ρ = 0.93 to NMR backbone relaxation measurements for the seven proteins. The NMR solution structure derived ensemble and predicted dynamical relaxation is compared with molecular dynamics simulation-derived structural ensembles and LE4PD predictions and is consistent in the time scale of the simulations. The use of the experimental NMR conformers frees the approach from computationally demanding simulations.

  9. Dynameomics: data-driven methods and models for utilizing large-scale protein structure repositories for improving fragment-based loop prediction.

    PubMed

    Rysavy, Steven J; Beck, David A C; Daggett, Valerie

    2014-11-01

    Protein function is intimately linked to protein structure and dynamics yet experimentally determined structures frequently omit regions within a protein due to indeterminate data, which is often due protein dynamics. We propose that atomistic molecular dynamics simulations provide a diverse sampling of biologically relevant structures for these missing segments (and beyond) to improve structural modeling and structure prediction. Here we make use of the Dynameomics data warehouse, which contains simulations of representatives of essentially all known protein folds. We developed novel computational methods to efficiently identify, rank and retrieve small peptide structures, or fragments, from this database. We also created a novel data model to analyze and compare large repositories of structural data, such as contained within the Protein Data Bank and the Dynameomics data warehouse. Our evaluation compares these structural repositories for improving loop predictions and analyzes the utility of our methods and models. Using a standard set of loop structures, containing 510 loops, 30 for each loop length from 4 to 20 residues, we find that the inclusion of Dynameomics structures in fragment-based methods improves the quality of the loop predictions without being dependent on sequence homology. Depending on loop length, ∼ 25-75% of the best predictions came from the Dynameomics set, resulting in lower main chain root-mean-square deviations for all fragment lengths using the combined fragment library. We also provide specific cases where Dynameomics fragments provide better predictions for NMR loop structures than fragments from crystal structures. Online access to these fragment libraries is available at http://www.dynameomics.org/fragments.

  10. A Novel Peptide Binding Prediction Approach for HLA-DR Molecule Based on Sequence and Structural Information.

    PubMed

    Li, Zhao; Zhao, Yilei; Pan, Gaofeng; Tang, Jijun; Guo, Fei

    2016-01-01

    MHC molecule plays a key role in immunology, and the molecule binding reaction with peptide is an important prerequisite for T cell immunity induced. MHC II molecules do not have conserved residues, so they appear as open grooves. As a consequence, this will increase the difficulty in predicting MHC II molecules binding peptides. In this paper, we aim to propose a novel prediction method for MHC II molecules binding peptides. First, we calculate sequence similarity and structural similarity between different MHC II molecules. Then, we reorder pseudosequences according to descending similarity values and use a weight calculation formula to calculate new pocket profiles. Finally, we use three scoring functions to predict binding cores and evaluate the accuracy of prediction to judge performance of each scoring function. In the experiment, we set a parameter α in the weight formula. By changing α value, we can observe different performances of each scoring function. We compare our method with the best function to some popular prediction methods and ultimately find that our method outperforms them in identifying binding cores of HLA-DR molecules. PMID:27340658

  11. Aza analogues of nucleic acid bases: experimental determination and computational prediction of the crystal structure of anhydrous 5-azauracil

    NASA Astrophysics Data System (ADS)

    Potter, Brian S.; Palmer, Rex A.; Withnall, Robert; Chowdhry, Babur Z.; Price, Sarah L.

    1999-08-01

    The crystal and molecular structure of 5-azauracil, C 3H 3O 2N 3, Mr=113.07 Da, was determined from X-ray diffraction data. The material crystallizes in the orthorhombic space group Pbca with eight molecules of 5-azauracil in a cell of dimensions a=6.5135(3), b=13.5217(4), c=6.5824(4) Å, crystal density Dc=1.779 g cm -3. The structure was determined using direct methods and refined by full-matrix least-squares to a conventional R index of 0.0337 for 763 observed reflections and 86 parameters. Two strong hydrogen bonds, N1H1…O4 and N3H3…N5, and several weaker intermolecular interactions produce a crinkled sheet structure. This crystal structure was independently predicted by a search for minima in the lattice energy, as calculated using an ab initio optimised molecular structure and a distributed multipole model for the electrostatic interactions. Indeed, the global minimum in the search corresponded to the same Pbca space group, with rms errors in the cell lengths of 3.7%. There is a larger energy gap separating the observed hydrogen bonding motif structure from alternative structures, with different hydrogen bonds and connectivity, for 5-azauracil than for 6-azauracil and uracil.

  12. BiPPred: Combined sequence- and structure-based prediction of peptide binding to the Hsp70 chaperone BiP.

    PubMed

    Schneider, Markus; Rosam, Mathias; Glaser, Manuel; Patronov, Atanas; Shah, Harpreet; Back, Katrin Christiane; Daake, Marina Angelika; Buchner, Johannes; Antes, Iris

    2016-10-01

    Substrate binding to Hsp70 chaperones is involved in many biological processes, and the identification of potential substrates is important for a comprehensive understanding of these events. We present a multi-scale pipeline for an accurate, yet efficient prediction of peptides binding to the Hsp70 chaperone BiP by combining sequence-based prediction with molecular docking and MMPBSA calculations. First, we measured the binding of 15mer peptides from known substrate proteins of BiP by peptide array (PA) experiments and performed an accuracy assessment of the PA data by fluorescence anisotropy studies. Several sequence-based prediction models were fitted using this and other peptide binding data. A structure-based position-specific scoring matrix (SB-PSSM) derived solely from structural modeling data forms the core of all models. The matrix elements are based on a combination of binding energy estimations, molecular dynamics simulations, and analysis of the BiP binding site, which led to new insights into the peptide binding specificities of the chaperone. Using this SB-PSSM, peptide binders could be predicted with high selectivity even without training of the model on experimental data. Additional training further increased the prediction accuracies. Subsequent molecular docking (DynaDock) and MMGBSA/MMPBSA-based binding affinity estimations for predicted binders allowed the identification of the correct binding mode of the peptides as well as the calculation of nearly quantitative binding affinities. The general concept behind the developed multi-scale pipeline can readily be applied to other protein-peptide complexes with linearly bound peptides, for which sufficient experimental binding data for the training of classical sequence-based prediction models is not available. Proteins 2016; 84:1390-1407. © 2016 Wiley Periodicals, Inc.

  13. FINDSITE-metal: Integrating evolutionary information and machine learning for structure-based metal binding site prediction at the proteome level

    PubMed Central

    Brylinski, Michal; Skolnick, Jeffrey

    2010-01-01

    The rapid accumulation of gene sequences, many of which are hypothetical proteins with unknown function, has stimulated the development of accurate computational tools for protein function prediction with evolution/structure-based approaches showing considerable promise. In this paper, we present FINDSITE-metal, a new threading-based method designed specifically to detect metal binding sites in modeled protein structures. Comprehensive benchmarks using different quality protein structures show that weakly homologous protein models provide sufficient structural information for quite accurate annotation by FINDSITE-metal. Combining structure/evolutionary information with machine learning results in highly accurate metal binding annotations; for protein models constructed by TASSER, whose average Cα RMSD from the native structure is 8.9 Å, 59.5% (71.9%) of the best of top five predicted metal locations are within 4 Å (8 Å) from a bound metal in the crystal structure. For most of the targets, multiple metal binding sites are detected with the best predicted binding site at rank 1 and within the top 2 ranks in 65.6% and 83.1% of the cases, respectively. Furthermore, for iron, copper, zinc, calcium and magnesium ions, the binding metal can be predicted with high, typically 70-90%, accuracy. FINDSITE-metal also provides a set of confidence indexes that help assess the reliability of predictions. Finally, we describe the proteome-wide application of FINDSITE-metal that quantifies the metal binding complement of the human proteome. FINDSITE-metal is freely available to the academic community at http://cssb.biology.gatech.edu/findsite-metal/. PMID:21287609

  14. RNA secondary structure prediction using soft computing.

    PubMed

    Ray, Shubhra Sankar; Pal, Sankar K

    2013-01-01

    Prediction of RNA structure is invaluable in creating new drugs and understanding genetic diseases. Several deterministic algorithms and soft computing-based techniques have been developed for more than a decade to determine the structure from a known RNA sequence. Soft computing gained importance with the need to get approximate solutions for RNA sequences by considering the issues related with kinetic effects, cotranscriptional folding, and estimation of certain energy parameters. A brief description of some of the soft computing-based techniques, developed for RNA secondary structure prediction, is presented along with their relevance. The basic concepts of RNA and its different structural elements like helix, bulge, hairpin loop, internal loop, and multiloop are described. These are followed by different methodologies, employing genetic algorithms, artificial neural networks, and fuzzy logic. The role of various metaheuristics, like simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. A relative comparison among different techniques, in predicting 12 known RNA secondary structures, is presented, as an example. Future challenging issues are then mentioned. PMID:23702539

  15. Ko Displacement Theory for Structural Shape Predictions

    NASA Technical Reports Server (NTRS)

    Ko, William L.

    2010-01-01

    The development of the Ko displacement theory for predictions of structure deformed shapes was motivated in 2003 by the Helios flying wing, which had a 247-ft (75-m) wing span with wingtip deflections reaching 40 ft (12 m). The Helios flying wing failed in midair in June 2003, creating the need to develop new technology to predict in-flight deformed shapes of unmanned aircraft wings for visual display before the ground-based pilots. Any types of strain sensors installed on a structure can only sense the surface strains, but are incapable to sense the overall deformed shapes of structures. After the invention of the Ko displacement theory, predictions of structure deformed shapes could be achieved by feeding the measured surface strains into the Ko displacement transfer functions for the calculations of out-of-plane deflections and cross sectional rotations at multiple locations for mapping out overall deformed shapes of the structures. The new Ko displacement theory combined with a strain-sensing system thus created a revolutionary new structure- shape-sensing technology.

  16. Input-based structure-specific proficiency predicts the neural mechanism of adult L2 syntactic processing.

    PubMed

    Deng, Taiping; Zhou, Huixia; Bi, Hong-Yan; Chen, Baoguo

    2015-06-12

    This study used Event-Related Potentials (ERPs) to explore the role of input-based structure-specific proficiency in L2 syntactic processing, using English subject-verb agreement structures as the stimuli. A pre-test/trainings/post-test paradigm of experimental and control groups was employed, and Chinese speakers who learned English as a second language (L2) participated in the experiment. At pre-test, no ERP component related to the subject-verb agreement structures violations was observed in either group. At training session, the experimental group learned the subject-verb agreement structures, while the control group learned other syntactic structures. After two continuously intensive input trainings, at post-test, a significant P600 component related to the subject-verb agreement structures violations was elicited in the experimental group, but not in the control group. These findings suggest that input training improves structure-specific proficiency, which is reflected in the neural mechanism of L2 syntactic processing.

  17. Predictive models of biohydrogen and biomethane production based on the compositional and structural features of lignocellulosic materials.

    PubMed

    Monlau, Florian; Sambusiti, Cecilia; Barakat, Abdellatif; Guo, Xin Mei; Latrille, Eric; Trably, Eric; Steyer, Jean-Philippe; Carrere, Hélène

    2012-11-01

    In an integrated biorefinery concept, biological hydrogen and methane production from lignocellulosic substrates appears to be one of the most promising alternatives to produce energy from renewable sources. However, lignocellulosic substrates present compositional and structural features that can limit their conversion into biohydrogen and methane. In this study, biohydrogen and methane potentials of 20 lignocellulosic residues were evaluated. Compositional (lignin, cellulose, hemicelluloses, total uronic acids, proteins, and soluble sugars) as well as structural features (crystallinity) were determined for each substrate. Two predictive partial least square (PLS) models were built to determine which compositional and structural parameters affected biohydrogen or methane production from lignocellulosic substrates, among proteins, total uronic acids, soluble sugars, crystalline cellulose, amorphous holocelluloses, and lignin. Only soluble sugars had a significant positive effect on biohydrogen production. Besides, methane potentials correlated negatively to the lignin contents and, to a lower extent, crystalline cellulose showed also a negative impact, whereas soluble sugars, proteins, and amorphous hemicelluloses showed a positive impact. These findings will help to develop further pretreatment strategies for enhancing both biohydrogen and methane production.

  18. Prediction of RNA secondary structures with pseudoknots

    NASA Astrophysics Data System (ADS)

    Bon, M.; Orland, H.

    2010-08-01

    We present a new algorithm to predict RNA secondary structures with pseudoknots. The method is based on a classification of RNA structures according to their topological genus. The algorithm utilizes a simplified parametrization of the free energies for pair stacking, loop penalties, etc. and in addition a free energy penalty proportional to the topological genus of the pairing graph. Our method can take into account all pseudoknot topologies and achieves high success rates compared to state-of-the-art methods. This shows that the genus is a promising concept to classify pseudoknots.

  19. Predicting structured metadata from unstructured metadata

    PubMed Central

    Posch, Lisa; Panahiazar, Maryam; Dumontier, Michel; Gevaert, Olivier

    2016-01-01

    Enormous amounts of biomedical data have been and are being produced by investigators all over the world. However, one crucial and limiting factor in data reuse is accurate, structured and complete description of the data or data about the data—defined as metadata. We propose a framework to predict structured metadata terms from unstructured metadata for improving quality and quantity of metadata, using the Gene Expression Omnibus (GEO) microarray database. Our framework consists of classifiers trained using term frequency-inverse document frequency (TF-IDF) features and a second approach based on topics modeled using a Latent Dirichlet Allocation model (LDA) to reduce the dimensionality of the unstructured data. Our results on the GEO database show that structured metadata terms can be the most accurately predicted using the TF-IDF approach followed by LDA both outperforming the majority vote baseline. While some accuracy is lost by the dimensionality reduction of LDA, the difference is small for elements with few possible values, and there is a large improvement over the majority classifier baseline. Overall this is a promising approach for metadata prediction that is likely to be applicable to other datasets and has implications for researchers interested in biomedical metadata curation and metadata prediction. Database URL: http://www.yeastgenome.org/

  20. Reliable resonance assignments of selected residues of proteins with known structure based on empirical NMR chemical shift prediction

    NASA Astrophysics Data System (ADS)

    Li, Da-Wei; Meng, Dan; Brüschweiler, Rafael

    2015-05-01

    A robust NMR resonance assignment method is introduced for proteins whose 3D structure has previously been determined by X-ray crystallography. The goal of the method is to obtain a subset of correct assignments from a parsimonious set of 3D NMR experiments of 15N, 13C labeled proteins. Chemical shifts of sequential residue pairs are predicted from static protein structures using PPM_One, which are then compared with the corresponding experimental shifts. Globally optimized weighted matching identifies the assignments that are robust with respect to small changes in NMR cross-peak positions. The method, termed PASSPORT, is demonstrated for 4 proteins with 100-250 amino acids using 3D NHCA and a 3D CBCA(CO)NH experiments as input producing correct assignments with high reliability for 22% of the residues. The method, which works best for Gly, Ala, Ser, and Thr residues, provides assignments that serve as anchor points for additional assignments by both manual and semi-automated methods or they can be directly used for further studies, e.g. on ligand binding, protein dynamics, or post-translational modification, such as phosphorylation.

  1. RNAComposer and RNA 3D structure prediction for nanotechnology.

    PubMed

    Biesiada, Marcin; Pachulska-Wieczorek, Katarzyna; Adamiak, Ryszard W; Purzycka, Katarzyna J

    2016-07-01

    RNAs adopt specific, stable tertiary architectures to perform their activities. Knowledge of RNA tertiary structure is fundamental to understand RNA functions beginning with transcription and ending with turnover. Contrary to advanced RNA secondary structure prediction algorithms, which allow good accuracy when experimental data are integrated into the prediction, tertiary structure prediction of large RNAs still remains a significant challenge. However, the field of RNA tertiary structure prediction is rapidly developing and new computational methods based on different strategies are emerging. RNAComposer is a user-friendly and freely available server for 3D structure prediction of RNA up to 500 nucleotide residues. RNAComposer employs fully automated fragment assembly based on RNA secondary structure specified by the user. Importantly, this method allows incorporation of distance restraints derived from the experimental data to strengthen the 3D predictions. The potential and limitations of RNAComposer are discussed and an application to RNA design for nanotechnology is presented.

  2. Prediction of binding affinity and efficacy of thyroid hormone receptor ligands using QSAR and structure based modeling methods

    PubMed Central

    Politi, Regina; Rusyn, Ivan; Tropsha, Alexander

    2016-01-01

    The thyroid hormone receptor (THR) is an important member of the nuclear receptor family that can be activated by endocrine disrupting chemicals (EDC). Quantitative Structure-Activity Relationship (QSAR) models have been developed to facilitate the prioritization of THR-mediated EDC for the experimental validation. The largest database of binding affinities available at the time of the study for ligand binding domain (LBD) of THRβ was assembled to generate both continuous and classification QSAR models with an external accuracy of R2=0.55 and CCR=0.76, respectively. In addition, for the first time a QSAR model was developed to predict binding affinities of antagonists inhibiting the interaction of coactivators with the AF-2 domain of THRβ (R2=0.70). Furthermore, molecular docking studies were performed for a set of THRβ ligands (57 agonists and 15 antagonists of LBD, 210 antagonists of the AF-2 domain, supplemented by putative decoys/non-binders) using several THRβ structures retrieved from the Protein Data Bank. We found that two agonist-bound THRβ conformations could effectively discriminate their corresponding ligands from presumed non-binders. Moreover, one of the agonist conformations could discriminate agonists from antagonists. Finally, we have conducted virtual screening of a chemical library compiled by the EPA as part of the Tox21 program to identify potential THRβ-mediated EDCs using both QSAR models and docking. We concluded that the library is unlikely to have any EDC that would bind to the THRβ. Models developed in this study can be employed either to identify environmental chemicals interacting with the THR or, conversely, to eliminate the THR-mediated mechanism of action for chemicals of concern. PMID:25058446

  3. Mechanics based model for predicting structure-induced rolling resistance (SRR) of the tire-pavement system

    NASA Astrophysics Data System (ADS)

    Shakiba, Maryam; Ozer, Hasan; Ziyadi, Mojtaba; Al-Qadi, Imad L.

    2016-05-01

    The structure-induced rolling resistance of pavements, and its impact on vehicle fuel consumption, is investigated in this study. The structural response of pavement causes additional rolling resistance and fuel consumption of vehicles through deformation of pavement and various dissipation mechanisms associated with inelastic material properties and damping. Accurate and computationally efficient models are required to capture these mechanisms and obtain realistic estimates of changes in vehicle fuel consumption. Two mechanistic-based approaches are currently used to calculate vehicle fuel consumption as related to structural rolling resistance: dissipation-induced and deflection-induced methods. The deflection-induced approach is adopted in this study, and realistic representation of pavement-vehicle interactions (PVIs) is incorporated. In addition to considering viscoelastic behavior of asphalt concrete layers, the realistic representation of PVIs in this study includes non-uniform three-dimensional tire contact stresses and dynamic analysis in pavement simulations. The effects of analysis type, tire contact stresses, pavement viscoelastic properties, pavement damping coefficients, vehicle speed, and pavement temperature are then investigated.

  4. Theoretical prediction of crystal structures of rubrene

    NASA Astrophysics Data System (ADS)

    Obata, Shigeaki; Miura, Toshiaki; Shimoi, Yukihiro

    2014-01-01

    We theoretically predict crystal structures and molecular arrangements for rubrene molecule using CONFLEX program and compare them with the experimental ones. The most, second-most, and fourth-most stable predicted crystal structures show good agreement with the triclinic, orthorhombic, and monoclinic polymorphs of rubrene, respectively. The change in molecular conformation is also predicted between crystalline and gas phases: the tetracene backbone takes flat conformation in crystalline phase as in the observed structure. Meanwhile, it is twisted in gas phase. The theoretical prediction method used in this work provides the successful results on the determination of the three kinds of crystal structures and molecular arrangements for rubrene molecule.

  5. Implementation of pseudoreceptor-based pharmacophore queries in the prediction of probable protein targets: explorations in the protein structural profile of Zea mays.

    PubMed

    Kumar, Sivakumar Prasanth; Jha, Prakash C; Pandya, Himanshu A; Jasrai, Yogesh T

    2014-07-01

    Molecular docking plays an important role in the protein target identification by prioritizing probable druggable proteins using docking energies. Due to the limitations of docking scoring schemes, there arises a need for structure-based approaches to acquire confidence in theoretical binding affinities. In this direction, we present here a receptor (protein)-based approach to predict probable protein targets using a small molecule of interest. We adopted a reverse approach wherein the ligand pharmacophore features were used to decipher interaction complementary amino acids of protein cavities (a pseudoreceptor) and expressed as queries to match the cavities or binding sites of the protein dataset. These pseudoreceptor-based pharmacophore queries were used to estimate total probabilities of each protein cavity thereby representing the ligand binding efficiency of the protein. We applied this approach to predict 3 experimental protein targets among 28 Zea mays structural data using 3 co-crystallized ligands as inputs and compared its effectiveness using conventional docking results. We suggest that the combination of total probabilities and docking energies increases the confidence in prioritizing probable protein targets using docking methods. These prediction hypotheses were further supported by DrugScoreX (DSX) pair potential calculations and molecular dynamic simulations. PMID:24756543

  6. Prediction of binding affinity and efficacy of thyroid hormone receptor ligands using QSAR and structure-based modeling methods

    SciTech Connect

    Politi, Regina; Rusyn, Ivan; Tropsha, Alexander

    2014-10-01

    The thyroid hormone receptor (THR) is an important member of the nuclear receptor family that can be activated by endocrine disrupting chemicals (EDC). Quantitative Structure–Activity Relationship (QSAR) models have been developed to facilitate the prioritization of THR-mediated EDC for the experimental validation. The largest database of binding affinities available at the time of the study for ligand binding domain (LBD) of THRβ was assembled to generate both continuous and classification QSAR models with an external accuracy of R{sup 2} = 0.55 and CCR = 0.76, respectively. In addition, for the first time a QSAR model was developed to predict binding affinities of antagonists inhibiting the interaction of coactivators with the AF-2 domain of THRβ (R{sup 2} = 0.70). Furthermore, molecular docking studies were performed for a set of THRβ ligands (57 agonists and 15 antagonists of LBD, 210 antagonists of the AF-2 domain, supplemented by putative decoys/non-binders) using several THRβ structures retrieved from the Protein Data Bank. We found that two agonist-bound THRβ conformations could effectively discriminate their corresponding ligands from presumed non-binders. Moreover, one of the agonist conformations could discriminate agonists from antagonists. Finally, we have conducted virtual screening of a chemical library compiled by the EPA as part of the Tox21 program to identify potential THRβ-mediated EDCs using both QSAR models and docking. We concluded that the library is unlikely to have any EDC that would bind to the THRβ. Models developed in this study can be employed either to identify environmental chemicals interacting with the THR or, conversely, to eliminate the THR-mediated mechanism of action for chemicals of concern. - Highlights: • This is the largest curated dataset for ligand binding domain (LBD) of the THRβ. • We report the first QSAR model for antagonists of AF-2 domain of THRβ. • A combination of QSAR and docking enables

  7. Topolology-symmetry law of structure of natural titanosilicate micas and related heterophyllosilicates based on the extended OD theory: Structure prediction

    NASA Astrophysics Data System (ADS)

    Belokoneva, E. L.; Topnikova, A. P.; Aksenov, S. M.

    2015-01-01

    A topology-symmetry analysis of the structures in the family of titanosilicate micas and related heterophyllosilicates based on the extended OD theory reveals their kinship with the family of rhodezite, delhayelite, and other minerals that had been analyzed earlier by distinguishing sheets common for all the structures. Like in the family studied earlier, the structural variety of a more complex titanosilicate family is determined by different local symmetries of sheets. Sheets consist of central O layers of edge-sharing octahedra and H layers formed by tetrahedra connected into diortho groups and Ti(Nb,Fe) semioctahedra (octahedra). Three patterns of connection of O and H layers correspond to sheet symmetry P2/ m, P21/ m, and . Various symmetry modes of sheet connection in the structures are analyzed. Hypothetical structures, including structures with a higher degree of disorder, which can be found in nature or obtained by crystal synthesis, are deduced. Factors responsible for structural variety, including the existence of two main sheet varieties (with P2/ m and P21/ m symmetry) are considered a consequence of the difference in the chemism of the mineral formation medium.

  8. Inference of Expanded Lrp-Like Feast/Famine Transcription Factor Targets in a Non-Model Organism Using Protein Structure-Based Prediction

    PubMed Central

    Ashworth, Justin; Plaisier, Christopher L.; Lo, Fang Yin; Reiss, David J.; Baliga, Nitin S.

    2014-01-01

    Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer. PMID:25255272

  9. Inference of expanded Lrp-like feast/famine transcription factor targets in a non-model organism using protein structure-based prediction.

    PubMed

    Ashworth, Justin; Plaisier, Christopher L; Lo, Fang Yin; Reiss, David J; Baliga, Nitin S

    2014-01-01

    Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer.

  10. RNA-SSPT: RNA Secondary Structure Prediction Tools.

    PubMed

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes. PMID:24250115

  11. Improving RNA secondary structure prediction with structure mapping data.

    PubMed

    Sloma, Michael F; Mathews, David H

    2015-01-01

    Methods to probe RNA secondary structure, such as small molecule modifying agents, secondary structure-specific nucleases, inline probing, and SHAPE chemistry, are widely used to study the structure of functional RNA. Computational secondary structure prediction programs can incorporate probing data to predict structure with high accuracy. In this chapter, an overview of current methods for probing RNA secondary structure is provided, including modern high-throughput methods. Methods for guiding secondary structure prediction algorithms using these data are explained, and best practices for using these data are provided. This chapter concludes by listing a number of open questions about how to best use probing data, and what these data can provide.

  12. Prediction of the electronic structures, thermodynamic and mechanical properties in manganese doped magnesium-based alloys and their saturated hydrides based on density functional theory

    NASA Astrophysics Data System (ADS)

    Zhang, Ziying; Zhang, Huizhen; Zhao, Hui; Yu, Zhishui; He, Liang; Li, Jin

    2015-04-01

    The crystal structures, electronic structures, thermodynamic and mechanical properties of Mg2Ni alloy and its saturated hydride with different Mn-doping contents are investigated using first-principles density functional theory. The lattice parameters for the Mn-doped Mg2Ni alloys and their saturated hydrides decreased with an increasing Mn-doping content because of the smaller atomic size of Mn compared with that of Mg. Analysis of the formation enthalpies and electronic structures reveal that the partial substitution of Mg with Mn reduces the stability of Mg2Ni alloy and its saturated hydride. The calculated elastic constants indicate that, although the partial substitution of Mg with Mn lowers the toughness of the hexagonal Mg2Ni alloy, the charge/discharge cycles are elevated when the Mn-doping content is high enough to form the predicted intermetallic compound Mg3MnNi2.

  13. Evolutionary Structure Prediction of Stoichiometric Compounds

    NASA Astrophysics Data System (ADS)

    Zhu, Qiang; Oganov, Artem

    2014-03-01

    In general, for a given ionic compound AmBn\\ at ambient pressure condition, its stoichiometry reflects the valence state ratio between per chemical specie (i.e., the charges for each anion and cation). However, compounds under high pressure exhibit significantly behavior, compared to those analogs at ambient condition. Here we developed a method to solve the crystal structure prediction problem based on the evolutionary algorithms, which can predict both the stable compounds and their crystal structures at arbitrary P,T-conditions, given just the set of chemical elements. By applying this method to a wide range of binary ionic systems (Na-Cl, Mg-O, Xe-O, Cs-F, etc), we discovered a lot of compounds with brand new stoichimetries which can become thermodynamically stable. Further electronic structure analysis on these novel compounds indicates that several factors can contribute to this extraordinary phenomenon: (1) polyatomic anions; (2) free electron localization; (3) emergence of new valence states; (4) metallization. In particular, part of the results have been confirmed by experiment, which warrants that this approach can play a crucial role in new materials design under extreme pressure conditions. This work is funded by DARPA (Grants No. W31P4Q1210008 and W31P4Q1310005), NSF (EAR-1114313 and DMR-1231586).

  14. Prediction of estrogen receptor binding for 58,000 chemicals using an integrated system of a tree-based model with structural alerts.

    PubMed Central

    Hong, Huixiao; Tong, Weida; Fang, Hong; Shi, Leming; Xie, Qian; Wu, Jie; Perkins, Roger; Walker, John D; Branham, William; Sheehan, Daniel M

    2002-01-01

    A number of environmental chemicals, by mimicking natural hormones, can disrupt endocrine function in experimental animals, wildlife, and humans. These chemicals, called "endocrine-disrupting chemicals" (EDCs), are such a scientific and public concern that screening and testing 58,000 chemicals for EDC activities is now statutorily mandated. Computational chemistry tools are important to biologists because they identify chemicals most important for in vitro and in vivo studies. Here we used a computational approach with integration of two rejection filters, a tree-based model, and three structural alerts to predict and prioritize estrogen receptor (ER) ligands. The models were developed using data for 232 structurally diverse chemicals (training set) with a 10(6) range of relative binding affinities (RBAs); we then validated the models by predicting ER RBAs for 463 chemicals that had ER activity data (testing set). The integrated model gave a lower false negative rate than any single component for both training and testing sets. When the integrated model was applied to approximately 58,000 potential EDCs, 80% (approximately 46,000 chemicals) were predicted to have negligible potential (log RBA < -4.5, with log RBA = 2.0 for estradiol) to bind ER. The ability to process large numbers of chemicals to predict inactivity for ER binding and to categorically prioritize the remainder provides one biologic measure to prioritize chemicals for entry into more expensive assays (most chemicals have no biologic data of any kind). The general approach for predicting ER binding reported here may be applied to other receptors and/or reversible binding mechanisms involved in endocrine disruption. PMID:11781162

  15. kPROT: a knowledge-based scale for the propensity of residue orientation in transmembrane segments. Application to membrane protein structure prediction.

    PubMed

    Pilpel, Y; Ben-Tal, N; Lancet, D

    1999-12-10

    Modeling of integral membrane proteins and the prediction of their functional sites requires the identification of transmembrane (TM) segments and the determination of their angular orientations. Hydrophobicity scales predict accurately the location of TM helices, but are less accurate in computing angular disposition. Estimating lipid-exposure propensities of the residues from statistics of solved membrane protein structures has the disadvantage of relying on relatively few proteins. As an alternative, we propose here a scale of knowledge-based Propensities for Residue Orientation in Transmembrane segments (kPROT), derived from the analysis of more than 5000 non-redundant protein sequences. We assume that residues that tend to be exposed to the membrane are more frequent in TM segments of single-span proteins, while residues that prefer to be buried in the transmembrane bundle interior are present mainly in multi-span TMs. The kPROT value for each residue is thus defined as the logarithm of the ratio of its proportions in single and multiple TM spans. The scale is refined further by defining it for three discrete sections of the TM segment; namely, extracellular, central, and intracellular. The capacity of the kPROT scale to predict angular helical orientation was compared to that of alternative methods in a benchmark test, using a diversity of multi-span alpha-helical transmembrane proteins with a solved 3D structure. kPROT yielded an average angular error of 41 degrees, significantly lower than that of alternative scales (62 degrees -68 degrees ). The new scale thus provides a useful general tool for modeling and prediction of functional residues in membrane proteins. A WWW server (http://bioinfo.weizmann.ac.il/kPROT) is available for automatic helix orientation prediction with kPROT. PMID:10588897

  16. Novel and efficient RNA secondary structure prediction using hierarchical folding.

    PubMed

    Jabbari, Hosna; Condon, Anne; Zhao, Shelly

    2008-03-01

    Algorithms for prediction of RNA secondary structure-the set of base pairs that form when an RNA molecule folds-are valuable to biologists who aim to understand RNA structure and function. Improving the accuracy and efficiency of prediction methods is an ongoing challenge, particularly for pseudoknotted secondary structures, in which base pairs overlap. This challenge is biologically important, since pseudoknotted structures play essential roles in functions of many RNA molecules, such as splicing and ribosomal frameshifting. State-of-the-art methods, which are based on free energy minimization, have high run-time complexity (typically Theta(n(5)) or worse), and can handle (minimize over) only limited types of pseudoknotted structures. We propose a new approach for prediction of pseudoknotted structures, motivated by the hypothesis that RNA structures fold hierarchically, with pseudoknot-free (non-overlapping) base pairs forming first, and pseudoknots forming later so as to minimize energy relative to the folded pseudoknot-free structure. Our HFold algorithm uses two-phase energy minimization to predict hierarchically formed secondary structures in O(n(3)) time, matching the complexity of the best algorithms for pseudoknot-free secondary structure prediction via energy minimization. Our algorithm can handle a wide range of biological structures, including kissing hairpins and nested kissing hairpins, which have previously required Theta(n(6)) time.

  17. Phylogenetic approaches to natural product structure prediction.

    PubMed

    Ziemert, Nadine; Jensen, Paul R

    2012-01-01

    Phylogenetics is the study of the evolutionary relatedness among groups of organisms. Molecular phylogenetics uses sequence data to infer these relationships for both organisms and the genes they maintain. With the large amount of publicly available sequence data, phylogenetic inference has become increasingly important in all fields of biology. In the case of natural product research, phylogenetic relationships are proving to be highly informative in terms of delineating the architecture and function of the genes involved in secondary metabolite biosynthesis. Polyketide synthases and nonribosomal peptide synthetases provide model examples in which individual domain phylogenies display different predictive capacities, resolving features ranging from substrate specificity to structural motifs associated with the final metabolic product. This chapter provides examples in which phylogeny has proven effective in terms of predicting functional or structural aspects of secondary metabolism. The basics of how to build a reliable phylogenetic tree are explained along with information about programs and tools that can be used for this purpose. Furthermore, it introduces the Natural Product Domain Seeker, a recently developed Web tool that employs phylogenetic logic to classify ketosynthase and condensation domains based on established enzyme architecture and biochemical function.

  18. Prediction of Protein Structure Using Surface Accessibility Data

    PubMed Central

    Hartlmüller, Christoph; Göbl, Christoph

    2016-01-01

    Abstract An approach to the de novo structure prediction of proteins is described that relies on surface accessibility data from NMR paramagnetic relaxation enhancements by a soluble paramagnetic compound (sPRE). This method exploits the distance‐to‐surface information encoded in the sPRE data in the chemical shift‐based CS‐Rosetta de novo structure prediction framework to generate reliable structural models. For several proteins, it is demonstrated that surface accessibility data is an excellent measure of the correct protein fold in the early stages of the computational folding algorithm and significantly improves accuracy and convergence of the standard Rosetta structure prediction approach. PMID:27560616

  19. Prediction of Protein Structure Using Surface Accessibility Data.

    PubMed

    Hartlmüller, Christoph; Göbl, Christoph; Madl, Tobias

    2016-09-19

    An approach to the de novo structure prediction of proteins is described that relies on surface accessibility data from NMR paramagnetic relaxation enhancements by a soluble paramagnetic compound (sPRE). This method exploits the distance-to-surface information encoded in the sPRE data in the chemical shift-based CS-Rosetta de novo structure prediction framework to generate reliable structural models. For several proteins, it is demonstrated that surface accessibility data is an excellent measure of the correct protein fold in the early stages of the computational folding algorithm and significantly improves accuracy and convergence of the standard Rosetta structure prediction approach.

  20. Prediction of Protein Structure Using Surface Accessibility Data.

    PubMed

    Hartlmüller, Christoph; Göbl, Christoph; Madl, Tobias

    2016-09-19

    An approach to the de novo structure prediction of proteins is described that relies on surface accessibility data from NMR paramagnetic relaxation enhancements by a soluble paramagnetic compound (sPRE). This method exploits the distance-to-surface information encoded in the sPRE data in the chemical shift-based CS-Rosetta de novo structure prediction framework to generate reliable structural models. For several proteins, it is demonstrated that surface accessibility data is an excellent measure of the correct protein fold in the early stages of the computational folding algorithm and significantly improves accuracy and convergence of the standard Rosetta structure prediction approach. PMID:27560616

  1. Prediction of binary hard-sphere crystal structures.

    PubMed

    Filion, Laura; Dijkstra, Marjolein

    2009-04-01

    We present a method based on a combination of a genetic algorithm and Monte Carlo simulations to predict close-packed crystal structures in hard-core systems. We employ this method to predict the binary crystal structures in a mixture of large and small hard spheres with various stoichiometries and diameter ratios between 0.4 and 0.84. In addition to known binary hard-sphere crystal structures similar to NaCl and AlB2, we predict additional crystal structures with the symmetry of CrB, gammaCuTi, alphaIrV, HgBr2, AuTe2, Ag2Se, and various structures for which an atomic analog was not found. In order to determine the crystal structures at infinite pressures, we calculate the maximum packing density as a function of size ratio for the crystal structures predicted by our GA using a simulated annealing approach. PMID:19518387

  2. Comparative analysis of QSAR models for predicting pK(a) of organic oxygen acids and nitrogen bases from molecular structure.

    PubMed

    Yu, Haiying; Kühne, Ralph; Ebert, Ralf-Uwe; Schüürmann, Gerrit

    2010-11-22

    For 1143 organic compounds comprising 580 oxygen acids and 563 nitrogen bases that cover more than 17 orders of experimental pK(a) (from -5.00 to 12.23), the pK(a) prediction performances of ACD, SPARC, and two calibrations of a semiempirical quantum chemical (QC) AM1 approach have been analyzed. The overall root-mean-square errors (rms) for the acids are 0.41, 0.58 (0.42 without ortho-substituted phenols with intramolecular H-bonding), and 0.55 and for the bases are 0.65, 0.70, 1.17, and 1.27 for ACD, SPARC, and both QC methods, respectively. Method-specific performances are discussed in detail for six acid subsets (phenols and aromatic and aliphatic carboxylic acids with different substitution patterns) and nine base subsets (anilines, primary, secondary and tertiary amines, meta/para-substituted and ortho-substituted pyridines, pyrimidines, imidazoles, and quinolines). The results demonstrate an overall better performance for acids than for bases but also a substantial variation across subsets. For the overall best-performing ACD, rms ranges from 0.12 to 1.11 and 0.40 to 1.21 pK(a) units for the acid and base subsets, respectively. With regard to the squared correlation coefficient r², the results are 0.86 to 0.96 (acids) and 0.79 to 0.95 (bases) for ACD, 0.77 to 0.95 (acids) and 0.85 to 0.97 (bases) for SPARC, and 0.64 to 0.87 (acids) and 0.43 to 0.83 (bases) for the QC methods, respectively. Attention is paid to structural and method-specific causes for observed pitfalls. The significant subset dependence of the prediction performances suggests a consensus modeling approach. PMID:21033677

  3. Transmembrane beta-barrel protein structure prediction

    NASA Astrophysics Data System (ADS)

    Randall, Arlo; Baldi, Pierre

    Transmembrane β-barrel (TMB) proteins are embedded in the outer membranes of mitochondria, Gram-negative bacteria, and chloroplasts. These proteins perform critical functions, including active ion-transport and passive nutrient intake. Therefore, there is a need for accurate prediction of secondary and tertiary structures of TMB proteins. A variety of methods have been developed for predicting the secondary structure and these predictions are very useful for constructing a coarse topology of TMB structure; however, they do not provide enough information to construct a low-resolution tertiary structure for a TMB protein. In addition, while the overall structural architecture is well conserved among TMB proteins, the amino acid sequences are highly divergent. Thus, traditional homology modeling methods cannot be applied to many putative TMB proteins. Here, we describe the TMBpro: a pipeline of methods for predicting TMB secondary structure, β-residue contacts, and finally tertiary structure. The tertiary prediction method relies on the specific construction rules that TMB proteins adhere to and on the predicted β-residue contacts to dramatically reduce the search space for the model building procedure.

  4. The MULTICOM protein tertiary structure prediction system.

    PubMed

    Li, Jilong; Bhattacharya, Debswapna; Cao, Renzhi; Adhikari, Badri; Deng, Xin; Eickholt, Jesse; Cheng, Jianlin

    2014-01-01

    With the expansion of genomics and proteomics data aided by the rapid progress of next-generation sequencing technologies, computational prediction of protein three-dimensional structure is an essential part of modern structural genomics initiatives. Prediction of protein structure through understanding of the theories behind protein sequence-structure relationship, however, remains one of the most challenging problems in contemporary life sciences. Here, we describe MULTICOM, a multi-level combination technique, intended to predict moderate- to high-resolution structure of a protein through a novel approach of combining multiple sources of complementary information derived from the experimentally solved protein structures in the Protein Data Bank. The MULTICOM web server is freely available at http://sysbio.rnet.missouri.edu/multicom_toolbox/.

  5. Predictive Models of Primary Tropical Forest Structure from Geomorphometric Variables Based on SRTM in the Tapajós Region, Brazilian Amazon

    PubMed Central

    Bispo, Polyanna da Conceição; dos Santos, João Roberto; Valeriano, Márcio de Morisson; Graça, Paulo Maurício Lima de Alencastro; Balzter, Heiko; França, Helena; Bispo, Pitágoras da Conceição

    2016-01-01

    Surveying primary tropical forest over large regions is challenging. Indirect methods of relating terrain information or other external spatial datasets to forest biophysical parameters can provide forest structural maps at large scales but the inherent uncertainties need to be evaluated fully. The goal of the present study was to evaluate relief characteristics, measured through geomorphometric variables, as predictors of forest structural characteristics such as average tree basal area (BA) and height (H) and average percentage canopy openness (CO). Our hypothesis is that geomorphometric variables are good predictors of the structure of primary tropical forest, even in areas, with low altitude variation. The study was performed at the Tapajós National Forest, located in the Western State of Pará, Brazil. Forty-three plots were sampled. Predictive models for BA, H and CO were parameterized based on geomorphometric variables using multiple linear regression. Validation of the models with nine independent sample plots revealed a Root Mean Square Error (RMSE) of 3.73 m2/ha (20%) for BA, 1.70 m (12%) for H, and 1.78% (21%) for CO. The coefficient of determination between observed and predicted values were r2 = 0.32 for CO, r2 = 0.26 for H and r2 = 0.52 for BA. The models obtained were able to adequately estimate BA and CO. In summary, it can be concluded that relief variables are good predictors of vegetation structure and enable the creation of forest structure maps in primary tropical rainforest with an acceptable uncertainty. PMID:27089013

  6. Predictive Models of Primary Tropical Forest Structure from Geomorphometric Variables Based on SRTM in the Tapajós Region, Brazilian Amazon.

    PubMed

    Bispo, Polyanna da Conceição; Dos Santos, João Roberto; Valeriano, Márcio de Morisson; Graça, Paulo Maurício Lima de Alencastro; Balzter, Heiko; França, Helena; Bispo, Pitágoras da Conceição

    2016-01-01

    Surveying primary tropical forest over large regions is challenging. Indirect methods of relating terrain information or other external spatial datasets to forest biophysical parameters can provide forest structural maps at large scales but the inherent uncertainties need to be evaluated fully. The goal of the present study was to evaluate relief characteristics, measured through geomorphometric variables, as predictors of forest structural characteristics such as average tree basal area (BA) and height (H) and average percentage canopy openness (CO). Our hypothesis is that geomorphometric variables are good predictors of the structure of primary tropical forest, even in areas, with low altitude variation. The study was performed at the Tapajós National Forest, located in the Western State of Pará, Brazil. Forty-three plots were sampled. Predictive models for BA, H and CO were parameterized based on geomorphometric variables using multiple linear regression. Validation of the models with nine independent sample plots revealed a Root Mean Square Error (RMSE) of 3.73 m2/ha (20%) for BA, 1.70 m (12%) for H, and 1.78% (21%) for CO. The coefficient of determination between observed and predicted values were r2 = 0.32 for CO, r2 = 0.26 for H and r2 = 0.52 for BA. The models obtained were able to adequately estimate BA and CO. In summary, it can be concluded that relief variables are good predictors of vegetation structure and enable the creation of forest structure maps in primary tropical rainforest with an acceptable uncertainty. PMID:27089013

  7. Predictive Models of Primary Tropical Forest Structure from Geomorphometric Variables Based on SRTM in the Tapajós Region, Brazilian Amazon.

    PubMed

    Bispo, Polyanna da Conceição; Dos Santos, João Roberto; Valeriano, Márcio de Morisson; Graça, Paulo Maurício Lima de Alencastro; Balzter, Heiko; França, Helena; Bispo, Pitágoras da Conceição

    2016-01-01

    Surveying primary tropical forest over large regions is challenging. Indirect methods of relating terrain information or other external spatial datasets to forest biophysical parameters can provide forest structural maps at large scales but the inherent uncertainties need to be evaluated fully. The goal of the present study was to evaluate relief characteristics, measured through geomorphometric variables, as predictors of forest structural characteristics such as average tree basal area (BA) and height (H) and average percentage canopy openness (CO). Our hypothesis is that geomorphometric variables are good predictors of the structure of primary tropical forest, even in areas, with low altitude variation. The study was performed at the Tapajós National Forest, located in the Western State of Pará, Brazil. Forty-three plots were sampled. Predictive models for BA, H and CO were parameterized based on geomorphometric variables using multiple linear regression. Validation of the models with nine independent sample plots revealed a Root Mean Square Error (RMSE) of 3.73 m2/ha (20%) for BA, 1.70 m (12%) for H, and 1.78% (21%) for CO. The coefficient of determination between observed and predicted values were r2 = 0.32 for CO, r2 = 0.26 for H and r2 = 0.52 for BA. The models obtained were able to adequately estimate BA and CO. In summary, it can be concluded that relief variables are good predictors of vegetation structure and enable the creation of forest structure maps in primary tropical rainforest with an acceptable uncertainty.

  8. Defining and predicting structurally conserved regions in protein superfamilies

    PubMed Central

    Huang, Ivan K.; Grishin, Nick V.

    2013-01-01

    Motivation: The structures of homologous proteins are generally better conserved than their sequences. This phenomenon is demonstrated by the prevalence of structurally conserved regions (SCRs) even in highly divergent protein families. Defining SCRs requires the comparison of two or more homologous structures and is affected by their availability and divergence, and our ability to deduce structurally equivalent positions among them. In the absence of multiple homologous structures, it is necessary to predict SCRs of a protein using information from only a set of homologous sequences and (if available) a single structure. Accurate SCR predictions can benefit homology modelling and sequence alignment. Results: Using pairwise DaliLite alignments among a set of homologous structures, we devised a simple measure of structural conservation, termed structural conservation index (SCI). SCI was used to distinguish SCRs from non-SCRs. A database of SCRs was compiled from 386 SCOP superfamilies containing 6489 protein domains. Artificial neural networks were then trained to predict SCRs with various features deduced from a single structure and homologous sequences. Assessment of the predictions via a 5-fold cross-validation method revealed that predictions based on features derived from a single structure perform similarly to ones based on homologous sequences, while combining sequence and structural features was optimal in terms of accuracy (0.755) and Matthews correlation coefficient (0.476). These results suggest that even without information from multiple structures, it is still possible to effectively predict SCRs for a protein. Finally, inspection of the structures with the worst predictions pinpoints difficulties in SCR definitions. Availability: The SCR database and the prediction server can be found at http://prodata.swmed.edu/SCR. Contact: 91huangi@gmail.com or grishin@chop.swmed.edu Supplementary information: Supplementary data are available at Bioinformatics

  9. Protein Structure and Function Prediction Using I-TASSER.

    PubMed

    Yang, Jianyi; Zhang, Yang

    2015-01-01

    I-TASSER is a hierarchical protocol for automated protein structure prediction and structure-based function annotation. Starting from the amino acid sequence of target proteins, I-TASSER first generates full-length atomic structural models from multiple threading alignments and iterative structural assembly simulations followed by atomic-level structure refinement. The biological functions of the protein, including ligand-binding sites, enzyme commission number, and gene ontology terms, are then inferred from known protein function databases based on sequence and structure profile comparisons. I-TASSER is freely available as both an on-line server and a stand-alone package. This unit describes how to use the I-TASSER protocol to generate structure and function prediction and how to interpret the prediction results, as well as alternative approaches for further improving the I-TASSER modeling quality for distant-homologous and multi-domain protein targets.

  10. Development of a Support Vector Machine-Based System to Predict Whether a Compound Is a Substrate of a Given Drug Transporter Using Its Chemical Structure.

    PubMed

    Ose, Atsushi; Toshimoto, Kota; Ikeda, Kazushi; Maeda, Kazuya; Yoshida, Shuya; Yamashita, Fumiyoshi; Hashida, Mitsuru; Ishida, Takashi; Akiyama, Yutaka; Sugiyama, Yuichi

    2016-07-01

    The aim of this study was to develop an in silico prediction system to assess which of 7 categories of drug transporters (organic anion transporting polypeptide [OATP] 1B1/1B3, multidrug resistance-associated protein [MRP] 2/3/4, organic anion transporter [OAT] 1, OAT3, organic cation transporter [OCT] 1/2/multidrug and toxin extrusion [MATE] 1/2-K, multidrug resistance protein 1 [MDR1], and breast cancer resistance protein [BCRP]) can recognize compounds as substrates using its chemical structure alone. We compiled an internal data set consisting of 260 compounds that are substrates for at least 1 of the 7 categories of drug transporters. Four physicochemical parameters (charge, molecular weight, lipophilicity, and plasma unbound fraction) of each compound were used as the basic descriptors. Furthermore, a greedy algorithm was used to select 3 additional physicochemical descriptors from 731 available descriptors. In addition, transporter nonsubstrates tend not to be in the public domain; we, thus, tried to compile an expert-curated data set of putative nonsubstrates for each transporter using personal opinions of 11 researchers in the field of drug transporters. The best prediction was finally achieved by a support vector machine based on 4 basic and 3 additional descriptors. The model correctly judged that 364 of 412 compounds (internal data set) and 111 of 136 compounds (external data set) were substrates, indicating that this model performs well enough to predict the specificity of transporter substrates. PMID:27262201

  11. Predicting the Fatigue life of Structures

    NASA Technical Reports Server (NTRS)

    Besuner, P. M.; Harris, D. O.; Thomas, J. M.; Allison, D. E.; Bannantine, J. M.; Brown, S. B.; Davis, C. S.; Derbalian, G. A.; Eischen, J. W.; Fowler, G. F.; Osteraas, J. D.; Robinson, J. N.; Sire, R. A.; Vroman, G. A.

    1985-01-01

    Report reviews fracture-mechanics technology for predicting life expectancy of structural components subjected to cyclic loads. Report covers analytical tools for modeling and forecasting subcritical fatigue-crack growth in structures. It emphasizes use of tools in practical, day-to-day problems of engineering design, development, and decisionmaking.

  12. Predicting structure in nonsymmetric sparse matrix factorizations

    SciTech Connect

    Gilbert, J.R.; Ng, E.G.

    1992-10-01

    Many computations on sparse matrices have a phase that predicts the nonzero structure of the output, followed by a phase that actually performs the numerical computation. We study structure prediction for computations that involve nonsymmetric row and column permutations and nonsymmetric or non-square matrices. Our tools are bipartite graphs, matchings, and alternating paths. Our main new result concerns LU factorization with partial pivoting. We show that if a square matrix A has the strong Hall property (i.e., is fully indecomposable) then an upper bound due to George and Ng on the nonzero structure of L + U is as tight as possible. To show this, we prove a crucial result about alternating paths in strong Hall graphs. The alternating-paths theorem seems to be of independent interest: it can also be used to prove related results about structure prediction for QR factorization that are due to Coleman, Edenbrandt, Gilbert, Hare, Johnson, Olesky, Pothen, and van den Driessche.

  13. Automated RNA 3D Structure Prediction with RNAComposer.

    PubMed

    Biesiada, Marcin; Purzycka, Katarzyna J; Szachniuk, Marta; Blazewicz, Jacek; Adamiak, Ryszard W

    2016-01-01

    RNAs adopt specific structures to perform their activities and these are critical to virtually all RNA-mediated processes. Because of difficulties in experimentally assessing structures of large RNAs using NMR, X-ray crystallography, or cryo-microscopy, there is currently great demand for new high-resolution 3D structure prediction methods. Recently we reported on RNAComposer, a knowledge-based method for the fully automated RNA 3D structure prediction from a user-defined secondary structure. RNAComposer method is especially suited for structural biology users. Since our initial report in 2012, both servers, freely available at http://rnacomposer.ibch.poznan.pl and http://rnacomposer.cs.put.poznan.pl have been often visited. Therefore this chapter provides guidance for using RNAComposer and discusses points that should be considered when predicting 3D RNA structure. An application example presents current scope and limitations of RNAComposer. PMID:27665601

  14. Predicting polymeric crystal structures by evolutionary algorithms

    NASA Astrophysics Data System (ADS)

    Zhu, Qiang; Sharma, Vinit; Oganov, Artem R.; Ramprasad, Ramamurthy

    2014-10-01

    The recently developed evolutionary algorithm USPEX proved to be a tool that enables accurate and reliable prediction of structures. Here we extend this method to predict the crystal structure of polymers by constrained evolutionary search, where each monomeric unit is treated as a building block with fixed connectivity. This greatly reduces the search space and allows the initial structure generation with different sequences and packings of these blocks. The new constrained evolutionary algorithm is successfully tested and validated on a diverse range of experimentally known polymers, namely, polyethylene, polyacetylene, poly(glycolic acid), poly(vinyl chloride), poly(oxymethylene), poly(phenylene oxide), and poly (p-phenylene sulfide). By fixing the orientation of polymeric chains, this method can be further extended to predict the structures of complex linear polymers, such as all polymorphs of poly(vinylidene fluoride), nylon-6 and cellulose. The excellent agreement between predicted crystal structures and experimentally known structures assures a major role of this approach in the efficient design of the future polymeric materials.

  15. Protein Structure Prediction with Evolutionary Algorithms

    SciTech Connect

    Hart, W.E.; Krasnogor, N.; Pelta, D.A.; Smith, J.

    1999-02-08

    Evolutionary algorithms have been successfully applied to a variety of molecular structure prediction problems. In this paper we reconsider the design of genetic algorithms that have been applied to a simple protein structure prediction problem. Our analysis considers the impact of several algorithmic factors for this problem: the confirmational representation, the energy formulation and the way in which infeasible conformations are penalized, Further we empirically evaluated the impact of these factors on a small set of polymer sequences. Our analysis leads to specific recommendations for both GAs as well as other heuristic methods for solving PSP on the HP model.

  16. Predicting the protein-protein interactions using primary structures with predicted protein surface

    PubMed Central

    2010-01-01

    Background Many biological functions involve various protein-protein interactions (PPIs). Elucidating such interactions is crucial for understanding general principles of cellular systems. Previous studies have shown the potential of predicting PPIs based on only sequence information. Compared to approaches that require other auxiliary information, these sequence-based approaches can be applied to a broader range of applications. Results This study presents a novel sequence-based method based on the assumption that protein-protein interactions are more related to amino acids at the surface than those at the core. The present method considers surface information and maintains the advantage of relying on only sequence data by including an accessible surface area (ASA) predictor recently proposed by the authors. This study also reports the experiments conducted to evaluate a) the performance of PPI prediction achieved by including the predicted surface and b) the quality of the predicted surface in comparison with the surface obtained from structures. The experimental results show that surface information helps to predict interacting protein pairs. Furthermore, the prediction performance achieved by using the surface estimated with the ASA predictor is close to that using the surface obtained from protein structures. Conclusion This work presents a sequence-based method that takes into account surface information for predicting PPIs. The proposed procedure of surface identification improves the prediction performance with an F-measure of 5.1%. The extracted surfaces are also valuable in other biomedical applications that require similar information. PMID:20122202

  17. Near-edge band structures and band gaps of Cu-based semiconductors predicted by the modified Becke-Johnson potential plus an on-site Coulomb U

    NASA Astrophysics Data System (ADS)

    Zhang, Yubo; Zhang, Jiawei; Gao, Weiwei; Abtew, Tesfaye A.; Wang, Youwei; Zhang, Peihong; Zhang, Wenqing

    2013-11-01

    Diamond-like Cu-based multinary semiconductors are a rich family of materials that hold promise in a wide range of applications. Unfortunately, accurate theoretical understanding of the electronic properties of these materials is hindered by the involvement of Cu d electrons. Density functional theory (DFT) based calculations using the local density approximation or generalized gradient approximation often give qualitative wrong electronic properties of these materials, especially for narrow-gap systems. The modified Becke-Johnson (mBJ) method has been shown to be a promising alternative to more elaborate theory such as the GW approximation for fast materials screening and predictions. However, straightforward applications of the mBJ method to these materials still encounter significant difficulties because of the insufficient treatment of the localized d electrons. We show that combining the promise of mBJ potential and the spirit of the well-established DFT + U method leads to a much improved description of the electronic structures, including the most challenging narrow-gap systems. A survey of the band gaps of about 20 Cu-based semiconductors calculated using the mBJ + U method shows that the results agree with reliable values to within ±0.2 eV.

  18. A new protein structure representation for efficient protein function prediction.

    PubMed

    Maghawry, Huda A; Mostafa, Mostafa G M; Gharib, Tarek F

    2014-12-01

    One of the challenging problems in bioinformatics is the prediction of protein function. Protein function is the main key that can be used to classify different proteins. Protein function can be inferred experimentally with very small throughput or computationally with very high throughput. Computational methods are sequence based or structure based. Structure-based methods produce more accurate protein function prediction. In this article, we propose a new protein structure representation for efficient protein function prediction. The representation is based on three-dimensional patterns of protein residues. In the analysis, we used protein function based on enzyme activity through six mechanistically diverse enzyme superfamilies: amidohydrolase, crotonase, haloacid dehalogenase, isoprenoid synthase type I, and vicinal oxygen chelate. We applied three different classification methods, naïve Bayes, k-nearest neighbors, and random forest, to predict the enzyme superfamily of a given protein. The prediction accuracy using the proposed representation outperforms a recently introduced representation method that is based only on the distance patterns. The results show that the proposed representation achieved prediction accuracy up to 98%, with improvement of about 10% on average.

  19. WeFold: A Coopetition for Protein Structure Prediction

    PubMed Central

    Khoury, George A.; Liwo, Adam; Khatib, Firas; Zhou, Hongyi; Chopra, Gaurav; Bacardit, Jaume; Bortot, Leandro O.; Faccioli, Rodrigo A.; Deng, Xin; He, Yi; Krupa, Pawel; Li, Jilong; Mozolewska, Magdalena A.; Sieradzan, Adam K.; Smadbeck, James; Wirecki, Tomasz; Cooper, Seth; Flatten, Jeff; Xu, Kefan; Baker, David; Cheng, Jianlin; Delbem, Alexandre C. B.; Floudas, Christodoulos A.; Keasar, Chen; Levitt, Michael; Popović, Zoran; Scheraga, Harold A.; Skolnick, Jeffrey; Crivelli, Silvia N.; Players, Foldit

    2014-01-01

    The protein structure prediction problem continues to elude scientists. Despite the introduction of many methods, only modest gains were made over the last decade for certain classes of prediction targets. To address this challenge, a social-media based worldwide collaborative effort, named WeFold, was undertaken by thirteen labs. During the collaboration, the labs were simultaneously competing with each other. Here, we present the first attempt at “coopetition” in scientific research applied to the protein structure prediction and refinement problems. The coopetition was possible by allowing the participating labs to contribute different components of their protein structure prediction pipelines and create new hybrid pipelines that they tested during CASP10. This manuscript describes both successes and areas needing improvement as identified throughout the first WeFold experiment and discusses the efforts that are underway to advance this initiative. A footprint of all contributions and structures are publicly accessible at http://www.wefold.org. PMID:24677212

  20. Status of research aimed at predicting structural integrity

    SciTech Connect

    Reuter, W.G.

    1997-12-31

    Considerable research has been performed throughout the world on measuring the fracture toughness of metals. The existing capability fills the need encountered when selecting materials, thermal-mechanical treatments, welding procedures, etc., but cannot predict the fracture process of structural components containing cracks. The Idaho National Engineering and Environmental Laboratory and the Massachusetts Institute of Technology have been collaborating for a number of years on developing capabilities for using fracture toughness results to predict structural integrity. Because of the high cost of fabricating and testing structural components, these studies have been limited to predicting the fracture process in specimens containing surface cracks. This paper summarizes the present status of the experimental studies of using fracture toughness data to predict crack growth initiation in specimens (structural components) containing surface cracks. These results are limited to homogeneous base materials.

  1. Data-Based Predictive Control with Multirate Prediction Step

    NASA Technical Reports Server (NTRS)

    Barlow, Jonathan S.

    2010-01-01

    Data-based predictive control is an emerging control method that stems from Model Predictive Control (MPC). MPC computes current control action based on a prediction of the system output a number of time steps into the future and is generally derived from a known model of the system. Data-based predictive control has the advantage of deriving predictive models and controller gains from input-output data. Thus, a controller can be designed from the outputs of complex simulation code or a physical system where no explicit model exists. If the output data happens to be corrupted by periodic disturbances, the designed controller will also have the built-in ability to reject these disturbances without the need to know them. When data-based predictive control is implemented online, it becomes a version of adaptive control. One challenge of MPC is computational requirements increasing with prediction horizon length. This paper develops a closed-loop dynamic output feedback controller that minimizes a multi-step-ahead receding-horizon cost function with multirate prediction step. One result is a reduced influence of prediction horizon and the number of system outputs on the computational requirements of the controller. Another result is an emphasis on portions of the prediction window that are sampled more frequently. A third result is the ability to include more outputs in the feedback path than in the cost function.

  2. Protein structure prediction enhanced with evolutionary diversity : SPEED.

    SciTech Connect

    DeBartolo, J.; Hocky, G.; Wilde, M.; Xu, J.; Freed, K. F.; Sosnick, T. R.; Univ. of Chicago; Toyota Technological Inst. at Chicago

    2010-03-01

    For naturally occurring proteins, similar sequence implies similar structure. Consequently, multiple sequence alignments (MSAs) often are used in template-based modeling of protein structure and have been incorporated into fragment-based assembly methods. Our previous homology-free structure prediction study introduced an algorithm that mimics the folding pathway by coupling the formation of secondary and tertiary structure. Moves in the Monte Carlo procedure involve only a change in a single pair of {phi},{psi} backbone dihedral angles that are obtained from a Protein Data Bank-based distribution appropriate for each amino acid, conditional on the type and conformation of the flanking residues. We improve this method by using MSAs to enrich the sampling distribution, but in a manner that does not require structural knowledge of any protein sequence (i.e., not homologous fragment insertion). In combination with other tools, including clustering and refinement, the accuracies of the predicted secondary and tertiary structures are substantially improved and a global and position-resolved measure of confidence is introduced for the accuracy of the predictions. Performance of the method in the Critical Assessment of Structure Prediction (CASP8) is discussed.

  3. Prediction of packing of secondary structure.

    PubMed

    Nagano, K; Ponnuswamy, P K

    1984-01-01

    An improved method of picking up candidates for predicting the packing arrangement of beta-strands and alpha-helices of the alpha/beta type domains is described here. The method of judging whether the region of the protein would fold into the alpha/beta type or not is also described. The folding constraints of globular proteins are analysed and presented in this article for application to the prediction of packing of secondary structure. The analysis of the residue-fluctuations is also applicable for the purpose.

  4. Three-dimensional quantitative structure-activity relationship analysis for human pregnane X receptor for the prediction of CYP3A4 induction in human hepatocytes: structure-based comparative molecular field analysis.

    PubMed

    Handa, Koichi; Nakagome, Izumi; Yamaotsu, Noriyuki; Gouda, Hiroaki; Hirono, Shuichi

    2015-01-01

    The pregnane X receptor [PXR (NR1I2)] induces the expression of xenobiotic metabolic genes and transporter genes. In this study, we aimed to establish a computational method for quantifying the enzyme-inducing potencies of different compounds via their ability to activate PXR, for the application in drug discovery and development. To achieve this purpose, we developed a three-dimensional quantitative structure-activity relationship (3D-QSAR) model using comparative molecular field analysis (CoMFA) for predicting enzyme-inducing potencies, based on computer-ligand docking to multiple PXR protein structures sampled from the trajectory of a molecular dynamics simulation. Molecular mechanics-generalized born/surface area scores representing the ligand-protein-binding free energies were calculated for each ligand. As a result, the predicted enzyme-inducing potencies for compounds generated by the CoMFA model were in good agreement with the experimental values. Finally, we concluded that this 3D-QSAR model has the potential to predict the enzyme-inducing potencies of novel compounds with high precision and therefore has valuable applications in the early stages of the drug discovery process.

  5. Predicting meningococcal disease outbreaks in structured populations.

    PubMed

    Ranta, J; Mäkelä, P H; Arjas, E

    2004-03-30

    Rational decision making on whether some form of intervention would be necessary to control the spread of a meningococcal epidemic is based on predictions concerning its potential natural progression. Unfortunately, reliable predictions are difficult to make during the early stages of an outbreak. A stochastic discrete time epidemic model was applied to adaptively predict the development of outbreaks of meningococcal disease in 'closed' populations such as military garrisons or boarding schools, which are further divided into subgroups called 'units'. The performance of the adaptive method was assessed by using 3 simulated epidemics representing substantially different realizations in a 'garrison' of 20 units, with 68 men in each. Predictions of the weekly number of disease cases, of the number of carriers, and of the number of new infections were computed. Simulations suggest that predictions based only on the observed numbers of disease cases are generally inaccurate. These predictions can be improved if temporal observations on asymptomatic carriers in different units are utilized together with observed time series of the disease. A sample of 15 per cent from all units can be sufficient for a major improvement if the alternative is to obtain a full sample of only some units. Exploiting fully such information requires computer intensive Markov chain Monte Carlo methods. PMID:15027081

  6. GSAFold: a new application of GSA to protein structure prediction.

    PubMed

    Melo, Marcelo C R; Bernardi, Rafael C; Fernandes, Tácio V A; Pascutti, Pedro G

    2012-08-01

    The folding process defines three-dimensional protein structures from their amino acid chains. A protein's structure determines its activity and properties; thus knowing such conformation on an atomic level is essential for both basic and applied studies of protein function and dynamics. However, the acquisition of such structures by experimental methods is slow and expensive, and current computational methods mostly depend on previously known structures to determine new ones. Here we present a new software called GSAFold that applies the generalized simulated annealing (GSA) algorithm on ab initio protein structure prediction. The GSA is a stochastic search algorithm employed in energy minimization and used in global optimization problems, especially those that depend on long-range interactions, such as gravity models and conformation optimization of small molecules. This new implementation applies, for the first time in ab initio protein structure prediction, an analytical inverse for the Visitation function of GSA. It also employs the broadly used NAMD Molecular Dynamics package to carry out energy calculations, allowing the user to select different force fields and parameterizations. Moreover, the software also allows the execution of several simulations simultaneously. Applications that depend on protein structures include rational drug design and structure-based protein function prediction. Applying GSAFold in a test peptide, it was possible to predict the structure of mastoparan-X to a root mean square deviation of 3.00 Å. PMID:22622959

  7. Coarse-Grained Prediction of RNA Loop Structures

    PubMed Central

    Liu, Liang; Chen, Shi-Jie

    2012-01-01

    One of the key issues in the theoretical prediction of RNA folding is the prediction of loop structure from the sequence. RNA loop free energies are dependent on the loop sequence content. However, most current models account only for the loop length-dependence. The previously developed “Vfold” model (a coarse-grained RNA folding model) provides an effective method to generate the complete ensemble of coarse-grained RNA loop and junction conformations. However, due to the lack of sequence-dependent scoring parameters, the method is unable to identify the native and near-native structures from the sequence. In this study, using a previously developed iterative method for extracting the knowledge-based potential parameters from the known structures, we derive a set of dinucleotide-based statistical potentials for RNA loops and junctions. A unique advantage of the approach is its ability to go beyond the the (known) native structures by accounting for the full free energy landscape, including all the nonnative folds. The benchmark tests indicate that for given loop/junction sequences, the statistical potentials enable successful predictions for the coarse-grained 3D structures from the complete conformational ensemble generated by the Vfold model. The predicted coarse-grained structures can provide useful initial folds for further detailed structural refinement. PMID:23144887

  8. Structure Prediction of RNA Loops with a Probabilistic Approach

    PubMed Central

    Li, Jun; Zhang, Jian; Wang, Jun; Li, Wenfei; Wang, Wei

    2016-01-01

    The knowledge of the tertiary structure of RNA loops is important for understanding their functions. In this work we develop an efficient approach named RNApps, specifically designed for predicting the tertiary structure of RNA loops, including hairpin loops, internal loops, and multi-way junction loops. It includes a probabilistic coarse-grained RNA model, an all-atom statistical energy function, a sequential Monte Carlo growth algorithm, and a simulated annealing procedure. The approach is tested with a dataset including nine RNA loops, a 23S ribosomal RNA, and a large dataset containing 876 RNAs. The performance is evaluated and compared with a homology modeling based predictor and an ab initio predictor. It is found that RNApps has comparable performance with the former one and outdoes the latter in terms of structure predictions. The approach holds great promise for accurate and efficient RNA tertiary structure prediction. PMID:27494763

  9. 3D protein structure prediction using Imperialist Competitive algorithm and half sphere exposure prediction.

    PubMed

    Khaji, Erfan; Karami, Masoumeh; Garkani-Nejad, Zahra

    2016-02-21

    Predicting the native structure of proteins based on half-sphere exposure and contact numbers has been studied deeply within recent years. Online predictors of these vectors and secondary structures of amino acids sequences have made it possible to design a function for the folding process. By choosing variant structures and directs for each secondary structure, a random conformation can be generated, and a potential function can then be assigned. Minimizing the potential function utilizing meta-heuristic algorithms is the final step of finding the native structure of a given amino acid sequence. In this work, Imperialist Competitive algorithm was used in order to accelerate the process of minimization. Moreover, we applied an adaptive procedure to apply revolutionary changes. Finally, we considered a more accurate tool for prediction of secondary structure. The results of the computational experiments on standard benchmark show the superiority of the new algorithm over the previous methods with similar potential function. PMID:26718864

  10. 3D protein structure prediction using Imperialist Competitive algorithm and half sphere exposure prediction.

    PubMed

    Khaji, Erfan; Karami, Masoumeh; Garkani-Nejad, Zahra

    2016-02-21

    Predicting the native structure of proteins based on half-sphere exposure and contact numbers has been studied deeply within recent years. Online predictors of these vectors and secondary structures of amino acids sequences have made it possible to design a function for the folding process. By choosing variant structures and directs for each secondary structure, a random conformation can be generated, and a potential function can then be assigned. Minimizing the potential function utilizing meta-heuristic algorithms is the final step of finding the native structure of a given amino acid sequence. In this work, Imperialist Competitive algorithm was used in order to accelerate the process of minimization. Moreover, we applied an adaptive procedure to apply revolutionary changes. Finally, we considered a more accurate tool for prediction of secondary structure. The results of the computational experiments on standard benchmark show the superiority of the new algorithm over the previous methods with similar potential function.

  11. Data-directed RNA secondary structure prediction using probabilistic modeling.

    PubMed

    Deng, Fei; Ledda, Mirko; Vaziri, Sana; Aviran, Sharon

    2016-08-01

    Structure dictates the function of many RNAs, but secondary RNA structure analysis is either labor intensive and costly or relies on computational predictions that are often inaccurate. These limitations are alleviated by integration of structure probing data into prediction algorithms. However, existing algorithms are optimized for a specific type of probing data. Recently, new chemistries combined with advances in sequencing have facilitated structure probing at unprecedented scale and sensitivity. These novel technologies and anticipated wealth of data highlight a need for algorithms that readily accommodate more complex and diverse input sources. We implemented and investigated a recently outlined probabilistic framework for RNA secondary structure prediction and extended it to accommodate further refinement of structural information. This framework utilizes direct likelihood-based calculations of pseudo-energy terms per considered structural context and can readily accommodate diverse data types and complex data dependencies. We use real data in conjunction with simulations to evaluate performances of several implementations and to show that proper integration of structural contexts can lead to improvements. Our tests also reveal discrepancies between real data and simulations, which we show can be alleviated by refined modeling. We then propose statistical preprocessing approaches to standardize data interpretation and integration into such a generic framework. We further systematically quantify the information content of data subsets, demonstrating that high reactivities are major drivers of SHAPE-directed predictions and that better understanding of less informative reactivities is key to further improvements. Finally, we provide evidence for the adaptive capability of our framework using mock probe simulations.

  12. Adaptive modelling of structured molecular representations for toxicity prediction

    NASA Astrophysics Data System (ADS)

    Bertinetto, Carlo; Duce, Celia; Micheli, Alessio; Solaro, Roberto; Tiné, Maria Rosaria

    2012-12-01

    We investigated the possibility of modelling structure-toxicity relationships by direct treatment of the molecular structure (without using descriptors) through an adaptive model able to retain the appropriate structural information. With respect to traditional descriptor-based approaches, this provides a more general and flexible way to tackle prediction problems that is particularly suitable when little or no background knowledge is available. Our method employs a tree-structured molecular representation, which is processed by a recursive neural network (RNN). To explore the realization of RNN modelling in toxicological problems, we employed a data set containing growth impairment concentrations (IGC50) for Tetrahymena pyriformis.

  13. MUFOLD: A new solution for protein 3D structure prediction

    PubMed Central

    Zhang, Jingfen; Wang, Qingguo; Barz, Bogdan; He, Zhiquan; Kosztin, Ioan; Shang, Yi; Xu, Dong

    2010-01-01

    There have been steady improvements in protein structure prediction during the past 2 decades. However, current methods are still far from consistently predicting structural models accurately with computing power accessible to common users. Toward achieving more accurate and efficient structure prediction, we developed a number of novel methods and integrated them into a software package, MUFOLD. First, a systematic protocol was developed to identify useful templates and fragments from Protein Data Bank for a given target protein. Then, an efficient process was applied for iterative coarse-grain model generation and evaluation at the Cα or backbone level. In this process, we construct models using interresidue spatial restraints derived from alignments by multidimensional scaling, evaluate and select models through clustering and static scoring functions, and iteratively improve the selected models by integrating spatial restraints and previous models. Finally, the full-atom models were evaluated using molecular dynamics simulations based on structural changes under simulated heating. We have continuously improved the performance of MUFOLD by using a benchmark of 200 proteins from the Astral database, where no template with >25% sequence identity to any target protein is included. The average root-mean-square deviation of the best models from the native structures is 4.28 Å, which shows significant and systematic improvement over our previous methods. The computing time of MUFOLD is much shorter than many other tools, such as Rosetta. MUFOLD demonstrated some success in the 2008 community-wide experiment for protein structure prediction CASP8. PMID:19927325

  14. Service life prediction of reinforced concrete structures

    SciTech Connect

    Liang, M.T.; Wang, K.L.; Liang, C.H.

    1999-09-01

    This paper is focused on the estimation of durability and service life of reinforced concrete structures. Assuming that the chloride ion in concrete can be absorbed on tricalcium aluminate, calcium silicate hydrate, and by other constituents of hardened cement paste, hydrated or not, the exact analytical solution of the governing partial differential equation together with its boundary and initial conditions can be obtained through nondimensional parameters and Laplace's transform. When the results of an exact analytical solution using suitable parameters were compared with the results of previous experimental work, the differences were found to be very small. This suggests that the absorption model is of considerable value. The exact analytical solution with the saturation parameter and time and diffusion coefficients under different effective electrical potential could be used to predict both the experimental results and the service life of reinforced concrete structures.

  15. Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.

    PubMed

    de Moraes, Fábio R; Neshich, Izabella A P; Mazoni, Ivan; Yano, Inácio H; Pereira, José G C; Salim, José A; Jardine, José G; Neshich, Goran

    2014-01-01

    Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now

  16. Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages

    PubMed Central

    de Moraes, Fábio R.; Neshich, Izabella A. P.; Mazoni, Ivan; Yano, Inácio H.; Pereira, José G. C.; Salim, José A.; Jardine, José G.; Neshich, Goran

    2014-01-01

    Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now

  17. Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.

    PubMed

    de Moraes, Fábio R; Neshich, Izabella A P; Mazoni, Ivan; Yano, Inácio H; Pereira, José G C; Salim, José A; Jardine, José G; Neshich, Goran

    2014-01-01

    Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now

  18. Structure prediction and targeted synthesis: a new Na(n)N2 diazenide crystalline structure.

    PubMed

    Zhang, Xiuwen; Zunger, Alex; Trimarchi, Giancarlo

    2010-11-21

    Significant progress in theoretical and computational techniques for predicting stable crystal structures has recently begun to stimulate targeted synthesis of such predicted structures. Using a global space-group optimization (GSGO) approach that locates ground-state structures and stable stoichiometries from first-principles energy functionals by objectively starting from randomly selected lattice vectors and random atomic positions, we predict the first alkali diazenide compound Na(n)N(2), manifesting homopolar N-N bonds. The previously predicted Na(3)N structure manifests only heteropolar Na-N bonds and has positive formation enthalpy. It was calculated based on local Hartree-Fock relaxation of a fixed-structure type (Li(3)P-type) found by searching an electrostatic point-ion model. Synthesis attempts of this positive ΔH compound using activated nitrogen yielded another structure (anti-ReO(3)-type). The currently predicted (negative formation enthalpy) diazenide Na(2)N(2) completes the series of previously known BaN(2) and SrN(2) diazenides where the metal sublattice transfers charge into the empty N(2) Π(g) orbital. This points to a new class of alkali nitrides with fundamentally different bonding, i.e., homopolar rather than heteropolar bonds and, at the same time, illustrates some of the crucial subtleties and pitfalls involved in structure predictions versus planned synthesis. Attempts at synthesis of the stable Na(2)N(2) predicted here will be interesting.

  19. (PS)2: protein structure prediction server version 3.0.

    PubMed

    Huang, Tsun-Tsao; Hwang, Jenn-Kang; Chen, Chu-Huang; Chu, Chih-Sheng; Lee, Chi-Wen; Chen, Chih-Chieh

    2015-07-01

    Protein complexes are involved in many biological processes. Examining coupling between subunits of a complex would be useful to understand the molecular basis of protein function. Here, our updated (PS)(2) web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, this server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure could be indicated and visualized by Java-based 3D graphics viewers and the structural and evolutionary profiles are shown and compared chain-by-chain. For each subunit, considerations with or without the packing contribution of other subunits cause the differences in similarities between structural and evolutionary profiles, and these differences imply which form, complex or monomeric, is preferred in the biological condition for the subunit. We believe that the (PS)(2) server would be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. The (PS)(2) is freely available at http://ps2v3.life.nctu.edu.tw/. PMID:25943546

  20. (PS)2: protein structure prediction server version 3.0.

    PubMed

    Huang, Tsun-Tsao; Hwang, Jenn-Kang; Chen, Chu-Huang; Chu, Chih-Sheng; Lee, Chi-Wen; Chen, Chih-Chieh

    2015-07-01

    Protein complexes are involved in many biological processes. Examining coupling between subunits of a complex would be useful to understand the molecular basis of protein function. Here, our updated (PS)(2) web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, this server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure could be indicated and visualized by Java-based 3D graphics viewers and the structural and evolutionary profiles are shown and compared chain-by-chain. For each subunit, considerations with or without the packing contribution of other subunits cause the differences in similarities between structural and evolutionary profiles, and these differences imply which form, complex or monomeric, is preferred in the biological condition for the subunit. We believe that the (PS)(2) server would be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. The (PS)(2) is freely available at http://ps2v3.life.nctu.edu.tw/.

  1. PredictProtein—an open resource for online prediction of protein structural and functional features

    PubMed Central

    Yachdav, Guy; Kloppmann, Edda; Kajan, Laszlo; Hecht, Maximilian; Goldberg, Tatyana; Hamp, Tobias; Hönigschmid, Peter; Schafferhans, Andrea; Roos, Manfred; Bernhofer, Michael; Richter, Lothar; Ashkenazy, Haim; Punta, Marco; Schlessinger, Avner; Bromberg, Yana; Schneider, Reinhard; Vriend, Gerrit; Sander, Chris; Ben-Tal, Nir; Rost, Burkhard

    2014-01-01

    PredictProtein is a meta-service for sequence analysis that has been predicting structural and functional features of proteins since 1992. Queried with a protein sequence it returns: multiple sequence alignments, predicted aspects of structure (secondary structure, solvent accessibility, transmembrane helices (TMSEG) and strands, coiled-coil regions, disulfide bonds and disordered regions) and function. The service incorporates analysis methods for the identification of functional regions (ConSurf), homology-based inference of Gene Ontology terms (metastudent), comprehensive subcellular localization prediction (LocTree3), protein–protein binding sites (ISIS2), protein–polynucleotide binding sites (SomeNA) and predictions of the effect of point mutations (non-synonymous SNPs) on protein function (SNAP2). Our goal has always been to develop a system optimized to meet the demands of experimentalists not highly experienced in bioinformatics. To this end, the PredictProtein results are presented as both text and a series of intuitive, interactive and visually appealing figures. The web server and sources are available at http://ppopen.rostlab.org. PMID:24799431

  2. Structural prediction of peptides bound to MHC class I.

    PubMed

    Fagerberg, Theres; Cerottini, Jean-Charles; Michielin, Olivier

    2006-02-17

    An ab initio structure prediction approach adapted to the peptide-major histocompatibility complex (MHC) class I system is presented. Based on structure comparisons of a large set of peptide-MHC class I complexes, a molecular dynamics protocol is proposed using simulated annealing (SA) cycles to sample the conformational space of the peptide in its fixed MHC environment. A set of 14 peptide-human leukocyte antigen (HLA) A0201 and 27 peptide-non-HLA A0201 complexes for which X-ray structures are available is used to test the accuracy of the prediction method. For each complex, 1000 peptide conformers are obtained from the SA sampling. A graph theory clustering algorithm based on heavy atom root-mean-square deviation (RMSD) values is applied to the sampled conformers. The clusters are ranked using cluster size, mean effective or conformational free energies, with solvation free energies computed using Generalized Born MV 2 (GB-MV2) and Poisson-Boltzmann (PB) continuum models. The final conformation is chosen as the center of the best-ranked cluster. With conformational free energies, the overall prediction success is 83% using a 1.00 Angstroms crystal RMSD criterion for main-chain atoms, and 76% using a 1.50 Angstroms RMSD criterion for heavy atoms. The prediction success is even higher for the set of 14 peptide-HLA A0201 complexes: 100% of the peptides have main-chain RMSD values < or =1.00 Angstroms and 93% of the peptides have heavy atom RMSD values < or =1.50 Angstroms. This structure prediction method can be applied to complexes of natural or modified antigenic peptides in their MHC environment with the aim to perform rational structure-based optimizations of tumor vaccines.

  3. Prediction of the structure of a novel amylopectin-based Cd-associated molecule in the stem of common reed grown in the presence of Cd.

    PubMed

    Higuchi, Kyoko; Ito, Naho; Nukada, Tomoo

    2016-10-01

    We previously found a novel Cd-associated molecule with an apparent molecular weight of 10-50 kDa in common reeds grown in the presence of Cd. The partial structure of this molecule was predicted by enzymatic digestion to release Cd from a trace amount that had been partially purified from the cell sap. The major component was branched α-glucan, whereas a peptide, β-1,4 glucan, and mannose were found as minor components. Uronic acids appeared to provide functional groups that bind Cd.

  4. Structure prediction of magnetosome-associated proteins.

    PubMed

    Nudelman, Hila; Zarivach, Raz

    2014-01-01

    Magnetotactic bacteria (MTB) are Gram-negative bacteria that can navigate along geomagnetic fields. This ability is a result of a unique intracellular organelle, the magnetosome. These organelles are composed of membrane-enclosed magnetite (Fe3O4) or greigite (Fe3S4) crystals ordered into chains along the cell. Magnetosome formation, assembly, and magnetic nano-crystal biomineralization are controlled by magnetosome-associated proteins (MAPs). Most MAP-encoding genes are located in a conserved genomic region - the magnetosome island (MAI). The MAI appears to be conserved in all MTB that were analyzed so far, although the MAI size and organization differs between species. It was shown that MAI deletion leads to a non-magnetic phenotype, further highlighting its important role in magnetosome formation. Today, about 28 proteins are known to be involved in magnetosome formation, but the structures and functions of most MAPs are unknown. To reveal the structure-function relationship of MAPs we used bioinformatics tools in order to build homology models as a way to understand their possible role in magnetosome formation. Here we present a predicted 3D structural models' overview for all known Magnetospirillum gryphiswaldense strain MSR-1 MAPs.

  5. Structure Prediction for Multicomponent Materials Using Biminima

    NASA Astrophysics Data System (ADS)

    Schebarchov, D.; Wales, D. J.

    2014-10-01

    The potential energy surface of a heteroparticle system will contain points that are local minima in both coordinate space and permutation space for the different species. We introduce the term biminima to describe these special points, and we formulate a deterministic scheme for finding them. Our search algorithm generates a converging sequence of particle-identity swaps, each accompanied by a number of local geometry relaxations. For selected binary atomic clusters of size N=NA+NB≤98, convergence to a biminimum on average takes 3NANB relaxations, and the number of biminima grows with the preference for mixing. The new framework unifies continuous and combinatorial optimization, providing a powerful tool for structure prediction and rational design of multicomponent materials.

  6. Quantitative structure-activity relationship models for predicting drug-induced liver injury based on FDA-approved drug labeling annotation and using a large collection of drugs.

    PubMed

    Chen, Minjun; Hong, Huixiao; Fang, Hong; Kelly, Reagan; Zhou, Guangxu; Borlak, Jürgen; Tong, Weida

    2013-11-01

    Drug-induced liver injury (DILI) is one of the leading causes of the termination of drug development programs. Consequently, identifying the risk of DILI in humans for drug candidates during the early stages of the development process would greatly reduce the drug attrition rate in the pharmaceutical industry but would require the implementation of new research and development strategies. In this regard, several in silico models have been proposed as alternative means in prioritizing drug candidates. Because the accuracy and utility of a predictive model rests largely on how to annotate the potential of a drug to cause DILI in a reliable and consistent way, the Food and Drug Administration-approved drug labeling was given prominence. Out of 387 drugs annotated, 197 drugs were used to develop a quantitative structure-activity relationship (QSAR) model and the model was subsequently challenged by the left of drugs serving as an external validation set with an overall prediction accuracy of 68.9%. The performance of the model was further assessed by the use of 2 additional independent validation sets, and the 3 validation data sets have a total of 483 unique drugs. We observed that the QSAR model's performance varied for drugs with different therapeutic uses; however, it achieved a better estimated accuracy (73.6%) as well as negative predictive value (77.0%) when focusing only on these therapeutic categories with high prediction confidence. Thus, the model's applicability domain was defined. Taken collectively, the developed QSAR model has the potential utility to prioritize compound's risk for DILI in humans, particularly for the high-confidence therapeutic subgroups like analgesics, antibacterial agents, and antihistamines.

  7. Predictive modeling of neuroanatomic structures for brain atrophy detection

    NASA Astrophysics Data System (ADS)

    Hu, Xintao; Guo, Lei; Nie, Jingxin; Li, Kaiming; Liu, Tianming

    2010-03-01

    In this paper, we present an approach of predictive modeling of neuroanatomic structures for the detection of brain atrophy based on cross-sectional MRI image. The underlying premise of applying predictive modeling for atrophy detection is that brain atrophy is defined as significant deviation of part of the anatomy from what the remaining normal anatomy predicts for that part. The steps of predictive modeling are as follows. The central cortical surface under consideration is reconstructed from brain tissue map and Regions of Interests (ROI) on it are predicted from other reliable anatomies. The vertex pair-wise distance between the predicted vertex and the true one within the abnormal region is expected to be larger than that of the vertex in normal brain region. Change of white matter/gray matter ratio within a spherical region is used to identify the direction of vertex displacement. In this way, the severity of brain atrophy can be defined quantitatively by the displacements of those vertices. The proposed predictive modeling method has been evaluated by using both simulated atrophies and MRI images of Alzheimer's disease.

  8. A wave based method to predict the absorption, reflection and transmission coefficient of two-dimensional rigid frame porous structures with periodic inclusions

    NASA Astrophysics Data System (ADS)

    Deckers, Elke; Claeys, Claus; Atak, Onur; Groby, Jean-Philippe; Dazel, Olivier; Desmet, Wim

    2016-05-01

    This paper presents an extension to the Wave Based Method to predict the absorption, reflection and transmission coefficients of a porous material with an embedded periodic set of inclusions. The porous unit cell is described using the Multi-Level methodology and by embedding Bloch-Floquet periodicity conditions in the weighted residual scheme. The dynamic pressure field in the semi-infinite acoustic domains is approximated using a novel wave function set that fulfils the Helmholtz equation, the Bloch-Floquet periodicity conditions and the Sommerfeld radiation condition. The method is meshless and computationally efficient, which makes it well suited for optimisation studies.

  9. THE FUTURE OF COMPUTER-BASED TOXICITY PREDICTION: MECHANISM-BASED MODELS VS. INFORMATION MINING APPROACHES

    EPA Science Inventory


    The Future of Computer-Based Toxicity Prediction:
    Mechanism-Based
    Models vs. Information Mining Approaches

    When we speak of computer-based toxicity prediction, we are generally referring to a broad array of approaches which rely primarily upon chemical structure ...

  10. Predicting modes of toxic action from chemical structure: an overview.

    PubMed

    Bradbury, S P

    1994-01-01

    In the field of environmental toxicology, and especially aquatic toxicology, quantitative structure activity relationships (QSARs) have developed as scientifically-credible tools for predicting the toxicity of chemicals when little or no empirical data are available. A basic and fundamental understanding of toxicological principles has been considered crucial to the continued acceptance and application of these techniques as biologically relevant. As a consequence, there has been an evolution of QSAR development and application from that of a chemical-class perspective to one that is more consistent with assumptions regarding modes of toxic action. The assessment of a compound's likely mode of toxic action is critical for a correct QSAR selection; incorrect mode of action-based QSAR selections can result in 10- to 1000-fold errors in toxicity predictions. The establishment of toxicologically-credible techniques to assess mode of toxic action from chemical structure requires toxicodynamic knowledge bases that are clearly defined with regard to exposure regimes and biological models/endpoints and based on compounds that adequately span the diversity of chemicals anticipated for future applications. With such knowledge bases classification systems, including rule-based experts systems, have been established for use in predictive aquatic toxicology applications. PMID:8790641

  11. How Good Are Simplified Models for Protein Structure Prediction?

    PubMed Central

    Newton, M. A. Hakim; Rashid, Mahmood A.; Pham, Duc Nghia; Sattar, Abdul

    2014-01-01

    Protein structure prediction (PSP) has been one of the most challenging problems in computational biology for several decades. The challenge is largely due to the complexity of the all-atomic details and the unknown nature of the energy function. Researchers have therefore used simplified energy models that consider interaction potentials only between the amino acid monomers in contact on discrete lattices. The restricted nature of the lattices and the energy models poses a twofold concern regarding the assessment of the models. Can a native or a very close structure be obtained when structures are mapped to lattices? Can the contact based energy models on discrete lattices guide the search towards the native structures? In this paper, we use the protein chain lattice fitting (PCLF) problem to address the first concern; we developed a constraint-based local search algorithm for the PCLF problem for cubic and face-centered cubic lattices and found very close lattice fits for the native structures. For the second concern, we use a number of techniques to sample the conformation space and find correlations between energy functions and root mean square deviation (RMSD) distance of the lattice-based structures with the native structures. Our analysis reveals weakness of several contact based energy models used that are popular in PSP. PMID:24876837

  12. Predicting inclusion behaviour and framework structures in organic crystals.

    PubMed

    Cruz-Cabeza, Aurora J; Day, Graeme M; Jones, William

    2009-12-01

    We have used well-established computational methods to generate and explore the crystal structure landscapes of four organic molecules of well-known inclusion behaviour. Using these methods, we are able to generate both close-packed crystal structures and high-energy open frameworks containing voids of molecular dimensions. Some of these high-energy open frameworks correspond to real structures observed experimentally when the appropriate guest molecules are present during crystallisation. We propose a combination of crystal structure prediction methodologies with structure rankings based on relative lattice energy and solvent-accessible volume as a way of selecting likely inclusion frameworks completely ab initio. This methodology can be used as part of a rational strategy in the design of inclusion compounds, and also for the anticipation of inclusion behaviour in organic molecules. PMID:19876969

  13. Template-based prediction of protein function.

    PubMed

    Petrey, Donald; Chen, T Scott; Deng, Lei; Garzon, Jose Ignacio; Hwang, Howook; Lasso, Gorka; Lee, Hunjoong; Silkov, Antonina; Honig, Barry

    2015-06-01

    We discuss recent approaches for structure-based protein function annotation. We focus on template-based methods where the function of a query protein is deduced from that of a template for which both the structure and function are known. We describe the different ways of identifying a template. These are typically based on sequence analysis but new methods based on purely structural similarity are also being developed that allow function annotation based on structural relationships that cannot be recognized by sequence. The growing number of available structures of known function, improved homology modeling techniques and new developments in the use of structure allow template-based methods to be applied on a proteome-wide scale and in many different biological contexts. This progress significantly expands the range of applicability of structural information in function annotation to a level that previously was only achievable by sequence comparison.

  14. Gogny HFB prediction of nuclear structure properties

    SciTech Connect

    Goriely, S.; Hilaire, S.; Girod, M.

    2011-10-28

    Large scale mean field calculations from proton to neutron drip lines have been performed using the Hartree-Fock-Bogoliubov method based on the Gogny nucleon-nucleon effective interaction. This extensive study has shown the ability of the method to reproduce bulk nuclear structure data available experimentally. This includes nuclear masses, radii, matter densities, deformations, moment of inertia as well as collective mode (low energy and giant resonances). In particular, the first mass table based on a Gogny-Hartree-Fock-Bogolyubov calculation including an explicit and coherent account of all the quadrupole correlation energies is presented. The rms deviation with respect to essentially all the available mass data is 798 keV. Nearly 8000 nuclei have been studied under the axial symmetry hypothesis and going beyond the mean-field approach.

  15. Prediction of Secondary Structures Conserved in Multiple RNA Sequences.

    PubMed

    Xu, Zhenjiang Zech; Mathews, David H

    2016-01-01

    RNA structure is conserved by evolution to a greater extent than sequence. Predicting the conserved structure for multiple homologous sequences can be much more accurate than predicting the structure for a single sequence. RNAstructure is a software package that includes the programs Dynalign, Multilign, TurboFold, and PARTS for predicting conserved RNA secondary structure. This chapter provides protocols for using these programs. PMID:27665591

  16. PREDICTING TOXICOLOGICAL ENDPOINTS OF CHEMICALS USING QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS (QSARS)

    EPA Science Inventory

    Quantitative structure-activity relationships (QSARs) are being developed to predict the toxicological endpoints for untested chemicals similar in structure to chemicals that have known experimental toxicological data. Based on a very large number of predetermined descriptors, a...

  17. RNAex: an RNA secondary structure prediction server enhanced by high-throughput structure-probing data.

    PubMed

    Wu, Yang; Qu, Rihao; Huang, Yiming; Shi, Binbin; Liu, Mengrong; Li, Yang; Lu, Zhi John

    2016-07-01

    Several high-throughput technologies have been developed to probe RNA base pairs and loops at the transcriptome level in multiple species. However, to obtain the final RNA secondary structure, extensive effort and considerable expertise is required to statistically process the probing data and combine them with free energy models. Therefore, we developed an RNA secondary structure prediction server that is enhanced by experimental data (RNAex). RNAex is a web interface that enables non-specialists to easily access cutting-edge structure-probing data and predict RNA secondary structures enhanced by in vivo and in vitro data. RNAex annotates the RNA editing, RNA modification and SNP sites on the predicted structures. It provides four structure-folding methods, restrained MaxExpect, SeqFold, RNAstructure (Fold) and RNAfold that can be selected by the user. The performance of these four folding methods has been verified by previous publications on known structures. We re-mapped the raw sequencing data of the probing experiments to the whole genome for each species. RNAex thus enables users to predict secondary structures for both known and novel RNA transcripts in human, mouse, yeast and Arabidopsis The RNAex web server is available at http://RNAex.ncrnalab.org/.

  18. RNAex: an RNA secondary structure prediction server enhanced by high-throughput structure-probing data

    PubMed Central

    Wu, Yang; Qu, Rihao; Huang, Yiming; Shi, Binbin; Liu, Mengrong; Li, Yang; Lu, Zhi John

    2016-01-01

    Several high-throughput technologies have been developed to probe RNA base pairs and loops at the transcriptome level in multiple species. However, to obtain the final RNA secondary structure, extensive effort and considerable expertise is required to statistically process the probing data and combine them with free energy models. Therefore, we developed an RNA secondary structure prediction server that is enhanced by experimental data (RNAex). RNAex is a web interface that enables non-specialists to easily access cutting-edge structure-probing data and predict RNA secondary structures enhanced by in vivo and in vitro data. RNAex annotates the RNA editing, RNA modification and SNP sites on the predicted structures. It provides four structure-folding methods, restrained MaxExpect, SeqFold, RNAstructure (Fold) and RNAfold that can be selected by the user. The performance of these four folding methods has been verified by previous publications on known structures. We re-mapped the raw sequencing data of the probing experiments to the whole genome for each species. RNAex thus enables users to predict secondary structures for both known and novel RNA transcripts in human, mouse, yeast and Arabidopsis. The RNAex web server is available at http://RNAex.ncrnalab.org/. PMID:27137891

  19. Using the RNAstructure Software Package to Predict Conserved RNA Structures.

    PubMed

    Mathews, David H

    2014-01-01

    The structures of many non-coding RNA (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. This unit provides protocols for the use of four programs in the RNAstructure suite for prediction of conserved structures, Multilign, TurboFold, Dynalign, and PARTS. These programs can be run via Web servers, on the command line, or with graphical interfaces.

  20. Structure-based predictions of 13C-NMR chemical shifts for a series of 2-functionalized 5-(methylsulfonyl)-1-phenyl-1H-indoles derivatives using GA-based MLR method

    NASA Astrophysics Data System (ADS)

    Ghavami, Raouf; Sadeghi, Faridoon; Rasouli, Zolikha; Djannati, Farhad

    2012-12-01

    Experimental values for the 13C NMR chemical shifts (ppm, TMS = 0) at 300 K ranging from 96.28 ppm (C4' of indole derivative 17) to 159.93 ppm (C4' of indole derivative 23) relative to deuteride chloroform (CDCl3, 77.0 ppm) or dimethylsulfoxide (DMSO, 39.50 ppm) as internal reference in CDCl3 or DMSO-d6 solutions have been collected from literature for thirty 2-functionalized 5-(methylsulfonyl)-1-phenyl-1H-indole derivatives containing different substituted groups. An effective quantitative structure-property relationship (QSPR) models were built using hybrid method combining genetic algorithm (GA) based on stepwise selection multiple linear regression (SWS-MLR) as feature-selection tools and correlation models between each carbon atom of indole derivative and calculated descriptors. Each compound was depicted by molecular structural descriptors that encode constitutional, topological, geometrical, electrostatic, and quantum chemical features. The accuracy of all developed models were confirmed using different types of internal and external procedures and various statistical tests. Furthermore, the domain of applicability for each model which indicates the area of reliable predictions was defined.

  1. Predicting fracture in micron-scale polycrystalline silicon MEMS structures.

    SciTech Connect

    Hazra, Siddharth S.; de Boer, Maarten Pieter; Boyce, Brad Lee; Ohlhausen, James Anthony; Foulk, James W., III; Reedy, Earl David, Jr.

    2010-09-01

    Designing reliable MEMS structures presents numerous challenges. Polycrystalline silicon fractures in a brittle manner with considerable variability in measured strength. Furthermore, it is not clear how to use a measured tensile strength distribution to predict the strength of a complex MEMS structure. To address such issues, two recently developed high throughput MEMS tensile test techniques have been used to measure strength distribution tails. The measured tensile strength distributions enable the definition of a threshold strength as well as an inferred maximum flaw size. The nature of strength-controlling flaws has been identified and sources of the observed variation in strength investigated. A double edge-notched specimen geometry was also tested to study the effect of a severe, micron-scale stress concentration on the measured strength distribution. Strength-based, Weibull-based, and fracture mechanics-based failure analyses were performed and compared with the experimental results.

  2. Experiment-Assisted Secondary Structure Prediction with RNAstructure.

    PubMed

    Xu, Zhenjiang Zech; Mathews, David H

    2016-01-01

    Experimental probing data can be used to improve the accuracy of RNA secondary structure prediction. The software package RNAstructure can take advantage of enzymatic cleavage data, FMN cleavage data, traditional chemical modification reactivity data, and SHAPE reactivity data for secondary structure modeling. This chapter provides protocols for using experimental probing data with RNAstructure to restrain or constrain RNA secondary structure prediction. PMID:27665598

  3. Statistical energy analysis response prediction methods for structural systems

    NASA Technical Reports Server (NTRS)

    Davis, R. F.

    1979-01-01

    The results of an effort to document methods for accomplishing response predictions for commonly encountered aerospace structural configurations is presented. Application of these methods to specified aerospace structure to provide sample analyses is included. An applications manual, with the structural analyses appended as example problems is given. Comparisons of the response predictions with measured data are provided for three of the example problems.

  4. Graphlet kernels for prediction of functional residues in protein structures.

    PubMed

    Vacic, Vladimir; Iakoucheva, Lilia M; Lonardi, Stefano; Radivojac, Predrag

    2010-01-01

    We introduce a novel graph-based kernel method for annotating functional residues in protein structures. A structure is first modeled as a protein contact graph, where nodes correspond to residues and edges connect spatially neighboring residues. Each vertex in the graph is then represented as a vector of counts of labeled non-isomorphic subgraphs (graphlets), centered on the vertex of interest. A similarity measure between two vertices is expressed as the inner product of their respective count vectors and is used in a supervised learning framework to classify protein residues. We evaluated our method on two function prediction problems: identification of catalytic residues in proteins, which is a well-studied problem suitable for benchmarking, and a much less explored problem of predicting phosphorylation sites in protein structures. The performance of the graphlet kernel approach was then compared against two alternative methods, a sequence-based predictor and our implementation of the FEATURE framework. On both tasks, the graphlet kernel performed favorably; however, the margin of difference was considerably higher on the problem of phosphorylation site prediction. While there is data that phosphorylation sites are preferentially positioned in intrinsically disordered regions, we provide evidence that for the sites that are located in structured regions, neither the surface accessibility alone nor the averaged measures calculated from the residue microenvironments utilized by FEATURE were sufficient to achieve high accuracy. The key benefit of the graphlet representation is its ability to capture neighborhood similarities in protein structures via enumerating the patterns of local connectivity in the corresponding labeled graphs.

  5. "Well-determined" regions in RNA secondary structure prediction: analysis of small subunit ribosomal RNA.

    PubMed Central

    Zuker, M; Jacobson, A B

    1995-01-01

    Recent structural analyses of genomic RNAs from RNA coliphages suggest that both well-determined base paired helices and well-determined structural domains that are identified by "energy dot plot" analysis using the RNA folding package mfold, are likely to be predicted correctly. To test these observations with another group of large RNAs, we have analyzed 15 ribosomal RNAs. Published secondary structure models that were derived by comparative sequence analysis were used to evaluate the predicted structures. Both the optimal predicted fold and the predicted "energy dot plot" of each sequence were examined. Each prediction was obtained from a single computer run on an entire ribosomal RNA sequence. All predicted base pairs in optimal foldings were examined for agreement with proven base pairs in the comparative models. Our analyses show that the overall correspondence between the predicted and comparative models varied for different RNAs and ranges from a low of 27% to high of 70%, with a mean value of 49%. The correspondence improves to a mean value of 81% when the analysis is limited to well-determined helices. In addition to well-determined helices, large well-determined structural domains can be observed in "energy dot plots" of some 16S ribosomal RNAs. The predicted domains correspond closely with structural domains that are found by the comparative method in the same RNAs. Our analyses also show that measuring the agreement between predicted and comparative secondary structure models underestimates the reliability of structural prediction by mfold. PMID:7544463

  6. Integrating chemical footprinting data into RNA secondary structure prediction.

    PubMed

    Zarringhalam, Kourosh; Meyer, Michelle M; Dotu, Ivan; Chuang, Jeffrey H; Clote, Peter

    2012-01-01

    Chemical and enzymatic footprinting experiments, such as shape (selective 2'-hydroxyl acylation analyzed by primer extension), yield important information about RNA secondary structure. Indeed, since the [Formula: see text]-hydroxyl is reactive at flexible (loop) regions, but unreactive at base-paired regions, shape yields quantitative data about which RNA nucleotides are base-paired. Recently, low error rates in secondary structure prediction have been reported for three RNAs of moderate size, by including base stacking pseudo-energy terms derived from shape data into the computation of minimum free energy secondary structure. Here, we describe a novel method, RNAsc (RNA soft constraints), which includes pseudo-energy terms for each nucleotide position, rather than only for base stacking positions. We prove that RNAsc is self-consistent, in the sense that the nucleotide-specific probabilities of being unpaired in the low energy Boltzmann ensemble always become more closely correlated with the input shape data after application of RNAsc. From this mathematical perspective, the secondary structure predicted by RNAsc should be 'correct', in as much as the shape data is 'correct'. We benchmark RNAsc against the previously mentioned method for eight RNAs, for which both shape data and native structures are known, to find the same accuracy in 7 out of 8 cases, and an improvement of 25% in one case. Furthermore, we present what appears to be the first direct comparison of shape data and in-line probing data, by comparing yeast asp-tRNA shape data from the literature with data from in-line probing experiments we have recently performed. With respect to several criteria, we find that shape data appear to be more robust than in-line probing data, at least in the case of asp-tRNA.

  7. Improved hybrid optimization algorithm for 3D protein structure prediction.

    PubMed

    Zhou, Changjun; Hou, Caixia; Wei, Xiaopeng; Zhang, Qiang

    2014-07-01

    A new improved hybrid optimization algorithm - PGATS algorithm, which is based on toy off-lattice model, is presented for dealing with three-dimensional protein structure prediction problems. The algorithm combines the particle swarm optimization (PSO), genetic algorithm (GA), and tabu search (TS) algorithms. Otherwise, we also take some different improved strategies. The factor of stochastic disturbance is joined in the particle swarm optimization to improve the search ability; the operations of crossover and mutation that are in the genetic algorithm are changed to a kind of random liner method; at last tabu search algorithm is improved by appending a mutation operator. Through the combination of a variety of strategies and algorithms, the protein structure prediction (PSP) in a 3D off-lattice model is achieved. The PSP problem is an NP-hard problem, but the problem can be attributed to a global optimization problem of multi-extremum and multi-parameters. This is the theoretical principle of the hybrid optimization algorithm that is proposed in this paper. The algorithm combines local search and global search, which overcomes the shortcoming of a single algorithm, giving full play to the advantage of each algorithm. In the current universal standard sequences, Fibonacci sequences and real protein sequences are certified. Experiments show that the proposed new method outperforms single algorithms on the accuracy of calculating the protein sequence energy value, which is proved to be an effective way to predict the structure of proteins. PMID:25069136

  8. Structure prediction and targeted synthesis: a new NanN2 diazenide crystalline structure

    SciTech Connect

    Zhang, Xiuwen; Zunger, Alex; Trimarchi, Giancarlo

    2010-01-01

    Significant progress in theoretical and computational techniques for predicting stable crystal structures has recently begun to stimulate targeted synthesis of such predicted structures. Using a global space-group optimization (GSGO) approach that locates ground-statestructures and stable stoichiometries from first-principles energy functionals by objectively starting from randomly selected lattice vectors and random atomic positions, we predict the first alkali diazenide compound Na{sub n} N{sub 2} , manifesting homopolar N–N bonds. The previously predicted Na{sub 3} N structure manifests only heteropolar Na–N bonds and has positive formation enthalpy. It was calculated based on local Hartree–Fock relaxation of a fixed-structure type (Li{sub 3} P -type) found by searching an electrostatic point-ion model. Synthesis attempts of this positive ΔH compound using activated nitrogen yielded another structure (anti-ReO{sub 3} -type). The currently predicted (negative formation enthalpy) diazenide Na{sub 2} N{sub 2} completes the series of previously known BaN{sub 2} and SrN{sub 2} diazenides where the metal sublattice transfers charge into the empty N{sub 2} Π{sub g} orbital. This points to a new class of alkali nitrides with fundamentally different bonding, i.e., homopolar rather than heteropolar bonds and, at the same time, illustrates some of the crucial subtleties and pitfalls involved in structure predictions versus planned synthesis. Attempts at synthesis of the stable Na{sub 2} N{sub 2} predicted here will be interesting.

  9. Prediction-based threshold for medication alert.

    PubMed

    Kawazoe, Yoshimasa; Miyo, Kengo; Kurahashi, Issei; Sakurai, Ryota; Ohe, Kazuhiko

    2013-01-01

    This study presents a prediction-based approach to determine thresholds for a medication alert in a computerized physician order entry. Traditional static thresholds can sometimes lead to physician's alert fatigue or overlook potentially excessive medication even if the doses are belowthe configured threshold. To address this problem, we applied a random forest algorithm to develop a prediction model for medication doses, and applied a boxplot to determine the thresholds based on the prediction results. An evaluation of the eight drugs most frequently causing alerts in our hospital showed that the performances of the prediction were high, except for two drugs. It was also found that using the thresholds based on the predictions would reduce the alerts to a half of those when using the static thresholds. Notably, some cases were detected only by the prediction thresholds. The significance of the thresholds should be discussed in terms of the trade-offs between gains and losses; however, our approach, which relies on physicians' collective experiences, has practical advantages. PMID:23920550

  10. Addressing the Role of Conformational Diversity in Protein Structure Prediction.

    PubMed

    Palopoli, Nicolas; Monzon, Alexander Miguel; Parisi, Gustavo; Fornasari, Maria Silvina

    2016-01-01

    Computational modeling of tertiary structures has become of standard use to study proteins that lack experimental characterization. Unfortunately, 3D structure prediction methods and model quality assessment programs often overlook that an ensemble of conformers in equilibrium populates the native state of proteins. In this work we collected sets of publicly available protein models and the corresponding target structures experimentally solved and studied how they describe the conformational diversity of the protein. For each protein, we assessed the quality of the models against known conformers by several standard measures and identified those models ranked best. We found that model rankings are defined by both the selected target conformer and the similarity measure used. 70% of the proteins in our datasets show that different models are structurally closest to different conformers of the same protein target. We observed that model building protocols such as template-based or ab initio approaches describe in similar ways the conformational diversity of the protein, although for template-based methods this description may depend on the sequence similarity between target and template sequences. Taken together, our results support the idea that protein structure modeling could help to identify members of the native ensemble, highlight the importance of considering conformational diversity in protein 3D quality evaluations and endorse the study of the variability of the native structure for a meaningful biological analysis. PMID:27159429

  11. Addressing the Role of Conformational Diversity in Protein Structure Prediction

    PubMed Central

    Parisi, Gustavo; Fornasari, Maria Silvina

    2016-01-01

    Computational modeling of tertiary structures has become of standard use to study proteins that lack experimental characterization. Unfortunately, 3D structure prediction methods and model quality assessment programs often overlook that an ensemble of conformers in equilibrium populates the native state of proteins. In this work we collected sets of publicly available protein models and the corresponding target structures experimentally solved and studied how they describe the conformational diversity of the protein. For each protein, we assessed the quality of the models against known conformers by several standard measures and identified those models ranked best. We found that model rankings are defined by both the selected target conformer and the similarity measure used. 70% of the proteins in our datasets show that different models are structurally closest to different conformers of the same protein target. We observed that model building protocols such as template-based or ab initio approaches describe in similar ways the conformational diversity of the protein, although for template-based methods this description may depend on the sequence similarity between target and template sequences. Taken together, our results support the idea that protein structure modeling could help to identify members of the native ensemble, highlight the importance of considering conformational diversity in protein 3D quality evaluations and endorse the study of the variability of the native structure for a meaningful biological analysis. PMID:27159429

  12. Ichthyophonus parasite phylogeny based on ITS rDNA structure prediction and alignment identifies six clades, with a single dominant marine type.

    PubMed

    Gregg, Jacob L; Powers, Rachel L; Purcell, Maureen K; Friedman, Carolyn S; Hershberger, Paul K

    2016-07-01

    Despite their widespread, global impact in both wild and cultured fishes, little is known of the diversity, transmission patterns, and phylogeography of parasites generally identified as Ichthyophonus. This study constructed a phylogeny based on the structural alignment of internal transcribed spacer (ITS) rDNA sequences to compare Ichthyophonus isolates from fish hosts in the Atlantic and Pacific oceans, and several rivers and aquaculture sites in North America, Europe, and Japan. Structure of the Ichthyophonus ITS1-5.8S-ITS2 transcript exhibited several homologies with other eukaryotes, and 6 distinct clades were identified within Ichthyophonus. A single clade contained a majority (71 of 98) of parasite isolations. This ubiquitous Ichthyophonus type occurred in 13 marine and anadromous hosts and was associated with epizootics in Atlantic herring, Chinook salmon, and American shad. A second clade contained all isolates from aquaculture, despite great geographic separation of the freshwater hosts. Each of the 4 remaining clades contained isolates from single host species. This study is the first to evaluate the genetic relationships among Ichthyophonus species across a significant portion of their host and geographic range. Additionally, parasite infection prevalence is reported in 16 fish species.

  13. Ichthyophonus parasite phylogeny based on ITS rDNA structure prediction and alignment identifies six clades, with a single dominant marine type.

    PubMed

    Gregg, Jacob L; Powers, Rachel L; Purcell, Maureen K; Friedman, Carolyn S; Hershberger, Paul K

    2016-07-01

    Despite their widespread, global impact in both wild and cultured fishes, little is known of the diversity, transmission patterns, and phylogeography of parasites generally identified as Ichthyophonus. This study constructed a phylogeny based on the structural alignment of internal transcribed spacer (ITS) rDNA sequences to compare Ichthyophonus isolates from fish hosts in the Atlantic and Pacific oceans, and several rivers and aquaculture sites in North America, Europe, and Japan. Structure of the Ichthyophonus ITS1-5.8S-ITS2 transcript exhibited several homologies with other eukaryotes, and 6 distinct clades were identified within Ichthyophonus. A single clade contained a majority (71 of 98) of parasite isolations. This ubiquitous Ichthyophonus type occurred in 13 marine and anadromous hosts and was associated with epizootics in Atlantic herring, Chinook salmon, and American shad. A second clade contained all isolates from aquaculture, despite great geographic separation of the freshwater hosts. Each of the 4 remaining clades contained isolates from single host species. This study is the first to evaluate the genetic relationships among Ichthyophonus species across a significant portion of their host and geographic range. Additionally, parasite infection prevalence is reported in 16 fish species. PMID:27409236

  14. Ichthyophonus parasite phylogeny based on ITS rDNA structure prediction and alignment identifies six clades, with a single dominant marine type

    USGS Publications Warehouse

    Gregg, Jacob; Thompson, Rachel L.; Purcell, Maureen; Friedman, Carolyn S.; Hershberger, Paul

    2016-01-01

    Despite their widespread, global impact in both wild and cultured fishes, little is known of the diversity, transmission patterns, and phylogeography of parasites generally identified as Ichthyophonus. This study constructed a phylogeny based on the structural alignment of internal transcribed spacer (ITS) rDNA sequences to compare Ichthyophonus isolates from fish hosts in the Atlantic and Pacific oceans, and several rivers and aquaculture sites in North America, Europe, and Japan. Structure of the Ichthyophonus ITS1–5.8S–ITS2 transcript exhibited several homologies with other eukaryotes, and 6 distinct clades were identified within Ichthyophonus. A single clade contained a majority (71 of 98) of parasite isolations. This ubiquitous Ichthyophonus type occurred in 13 marine and anadromous hosts and was associated with epizootics in Atlantic herring, Chinook salmon, and American shad. A second clade contained all isolates from aquaculture, despite great geographic separation of the freshwater hosts. Each of the 4 remaining clades contained isolates from single host species. This study is the first to evaluate the genetic relationships among Ichthyophonus species across a significant portion of their host and geographic range. Additionally, parasite infection prevalence is reported in 16 fish species.

  15. Cortical structure predicts success in performing musical transformation judgments.

    PubMed

    Foster, Nicholas E V; Zatorre, Robert J

    2010-10-15

    Recognizing melodies by their interval structure, or "relative pitch," is a fundamental aspect of musical perception. By using relative pitch, we are able to recognize tunes regardless of the key in which they are played. We sought to determine the cortical areas important for relative pitch processing using two morphometric techniques. Cortical differences have been reported in musicians within right auditory cortex (AC), a region considered important for pitch-based processing, and we have previously reported a functional correlation between relative pitch processing in the anterior intraparietal sulcus (IPS). We addressed the hypothesis that regional variation of cortical structure within AC and IPS is related to relative pitch ability using two anatomical techniques, cortical thickness (CT) analysis and voxel-based morphometry (VBM) of magnetic resonance imaging data. Persons with variable amounts of formal musical training were tested on a melody transposition task, as well as two musical control tasks and a speech control task. We found that gray matter concentration and cortical thickness in right Heschl's sulcus and bilateral IPS both predicted relative pitch task performance and correlated to a lesser extent with performance on the two musical control tasks. After factoring out variance explained by musical training, only relative pitch performance was predicted by cortical structure in these regions. These results directly demonstrate the functional relevance of previously reported anatomical differences in the auditory cortex of musicians. The findings in the IPS provide further support for the existence of a multimodal network for systematic transformation of stimulus information in this region. PMID:20600982

  16. Comparative modeling: the state of the art and protein drug target structure prediction.

    PubMed

    Liu, Tianyun; Tang, Grace W; Capriotti, Emidio

    2011-07-01

    The goal of computational protein structure prediction is to provide three-dimensional (3D) structures with resolution comparable to experimental results. Comparative modeling, which predicts the 3D structure of a protein based on its sequence similarity to homologous structures, is the most accurate computational method for structure prediction. In the last two decades, significant progress has been made on comparative modeling methods. Using the large number of protein structures deposited in the Protein Data Bank (~65,000), automatic prediction pipelines are generating a tremendous number of models (~1.9 million) for sequences whose structures have not been experimentally determined. Accurate models are suitable for a wide range of applications, such as prediction of protein binding sites, prediction of the effect of protein mutations, and structure-guided virtual screening. In particular, comparative modeling has enabled structure-based drug design against protein targets with unknown structures. In this review, we describe the theoretical basis of comparative modeling, the available automatic methods and databases, and the algorithms to evaluate the accuracy of predicted structures. Finally, we discuss relevant applications in the prediction of important drug target proteins, focusing on the G protein-coupled receptor (GPCR) and protein kinase families.

  17. Predicting crystal structure by merging data mining with quantum mechanics.

    PubMed

    Fischer, Christopher C; Tibbetts, Kevin J; Morgan, Dane; Ceder, Gerbrand

    2006-08-01

    Modern methods of quantum mechanics have proved to be effective tools to understand and even predict materials properties. An essential element of the materials design process, relevant to both new materials and the optimization of existing ones, is knowing which crystal structures will form in an alloy system. Crystal structure can only be predicted effectively with quantum mechanics if an algorithm to direct the search through the large space of possible structures is found. We present a new approach to the prediction of structure that rigorously mines correlations embodied within experimental data and uses them to direct quantum mechanical techniques efficiently towards the stable crystal structure of materials.

  18. Docking-based three-dimensional quantitative structure-activity relationship (3D-QSAR) predicts binding affinities to aryl hydrocarbon receptor for polychlorinated dibenzodioxins, dibenzofurans, and biphenyls.

    PubMed

    Yuan, Jintao; Pu, Yuepu; Yin, Lihong

    2013-07-01

    Polychlorinated dibenzodioxins (PCDDs), polychlorinated dibenzofurans (PCDFs), and polychlorinated biphenyls (PCBs) cause toxic effects after binding to an intracellular cytosolic receptor called the aryl hydrocarbon receptor (AhR). Thymic atrophy, weight loss, immunotoxicity, acute lethality, and induction of cytochrome P4501A1 have all been correlated with the binding affinity to AhR. To study the key molecular features for determining binding affinity to AhR, a homology model of AhR ligand-binding domains was developed, a molecular docking approach was employed to obtain docking-based conformations of all molecules in the whole set, and 3-dimensional quantitative structure-activity relationship (3D-QSAR) methodology, namely, comparative molecular field analysis (CoMFA), was applied. A partial least square analysis was performed, and QSAR models were generated for a training set of 59 compounds. The generated QSAR model showed good internal and external statistical reliability, and in a comparison with other reported CoMFA models using different alignment methods, the docking-based CoMFA model showed some advantages.

  19. Evaluation of the information content of RNA structure mapping data for secondary structure prediction.

    PubMed

    Quarrier, Scott; Martin, Joshua S; Davis-Neulander, Lauren; Beauregard, Arthur; Laederach, Alain

    2010-06-01

    Structure mapping experiments (using probes such as dimethyl sulfate [DMS], kethoxal, and T1 and V1 RNases) are used to determine the secondary structures of RNA molecules. The process is iterative, combining the results of several probes with constrained minimum free-energy calculations to produce a model of the structure. We aim to evaluate whether particular probes provide more structural information, and specifically, how noise in the data affects the predictions. Our approach involves generating "decoy" RNA structures (using the sFold Boltzmann sampling procedure) and evaluating whether we are able to identify the correct structure from this ensemble of structures. We show that with perfect information, we are always able to identify the optimal structure for five RNAs of known structure. We then collected orthogonal structure mapping data (DMS and RNase T1 digest) under several solution conditions using our high-throughput capillary automated footprinting analysis (CAFA) technique on two group I introns of known structure. Analysis of these data reveals the error rates in the data under optimal (low salt) and suboptimal solution conditions (high MgCl(2)). We show that despite these errors, our computational approach is less sensitive to experimental noise than traditional constraint-based structure prediction algorithms. Finally, we propose a novel approach for visualizing the interaction of chemical and enzymatic mapping data with RNA structure. We project the data onto the first two dimensions of a multidimensional scaling of the sFold-generated decoy structures. We are able to directly visualize the structural information content of structure mapping data and reconcile multiple data sets.

  20. Evaluation of the information content of RNA structure mapping data for secondary structure prediction.

    PubMed

    Quarrier, Scott; Martin, Joshua S; Davis-Neulander, Lauren; Beauregard, Arthur; Laederach, Alain

    2010-06-01

    Structure mapping experiments (using probes such as dimethyl sulfate [DMS], kethoxal, and T1 and V1 RNases) are used to determine the secondary structures of RNA molecules. The process is iterative, combining the results of several probes with constrained minimum free-energy calculations to produce a model of the structure. We aim to evaluate whether particular probes provide more structural information, and specifically, how noise in the data affects the predictions. Our approach involves generating "decoy" RNA structures (using the sFold Boltzmann sampling procedure) and evaluating whether we are able to identify the correct structure from this ensemble of structures. We show that with perfect information, we are always able to identify the optimal structure for five RNAs of known structure. We then collected orthogonal structure mapping data (DMS and RNase T1 digest) under several solution conditions using our high-throughput capillary automated footprinting analysis (CAFA) technique on two group I introns of known structure. Analysis of these data reveals the error rates in the data under optimal (low salt) and suboptimal solution conditions (high MgCl(2)). We show that despite these errors, our computational approach is less sensitive to experimental noise than traditional constraint-based structure prediction algorithms. Finally, we propose a novel approach for visualizing the interaction of chemical and enzymatic mapping data with RNA structure. We project the data onto the first two dimensions of a multidimensional scaling of the sFold-generated decoy structures. We are able to directly visualize the structural information content of structure mapping data and reconcile multiple data sets. PMID:20413617

  1. Structural coding versus free-energy predictive coding.

    PubMed

    van der Helm, Peter A

    2016-06-01

    Focusing on visual perceptual organization, this article contrasts the free-energy (FE) version of predictive coding (a recent Bayesian approach) to structural coding (a long-standing representational approach). Both use free-energy minimization as metaphor for processing in the brain, but their formal elaborations of this metaphor are fundamentally different. FE predictive coding formalizes it by minimization of prediction errors, whereas structural coding formalizes it by minimization of the descriptive complexity of predictions. Here, both sides are evaluated. A conclusion regarding competence is that FE predictive coding uses a powerful modeling technique, but that structural coding has more explanatory power. A conclusion regarding performance is that FE predictive coding-though more detailed in its account of neurophysiological data-provides a less compelling cognitive architecture than that of structural coding, which, for instance, supplies formal support for the computationally powerful role it attributes to neuronal synchronization.

  2. Predicting Career Advancement with Structural Equation Modelling

    ERIC Educational Resources Information Center

    Heimler, Ronald; Rosenberg, Stuart; Morote, Elsa-Sofia

    2012-01-01

    Purpose: The purpose of this paper is to use the authors' prior findings concerning basic employability skills in order to determine which skills best predict career advancement potential. Design/methodology/approach: Utilizing survey responses of human resource managers, the employability skills showing the largest relationships to career…

  3. Predicting Learned Helplessness Based on Personality

    ERIC Educational Resources Information Center

    Maadikhah, Elham; Erfani, Nasrollah

    2014-01-01

    Learned helplessness as a negative motivational state can latently underlie repeated failures and create negative feelings toward the education as well as depression in students and other members of a society. The purpose of this paper is to predict learned helplessness based on students' personality traits. The research is a predictive…

  4. Prediction of rigid silica based insulation conductivity

    NASA Technical Reports Server (NTRS)

    Williams, Stanley D.; Curry, Donald M.

    1993-01-01

    A method is presented for predicting the thermal conductivity of low density, silica based fibrous insulators. It is shown that the method can be used to extend data values to the upper material temperature limits from those obtained from the test data. It is demonstrated that once the conductivity is accurately determined by the analytical model the conductivity for other atmospheres can be predicted. The method is similar to that presented by previous investigators, but differs significantly in the contribution due to gas and internal radiation.

  5. Genome-wide Membrane Protein Structure Prediction

    PubMed Central

    Piccoli, Stefano; Suku, Eda; Garonzi, Marianna; Giorgetti, Alejandro

    2013-01-01

    Transmembrane proteins allow cells to extensively communicate with the external world in a very accurate and specific way. They form principal nodes in several signaling pathways and attract large interest in therapeutic intervention, as the majority pharmaceutical compounds target membrane proteins. Thus, according to the current genome annotation methods, a detailed structural/functional characterization at the protein level of each of the elements codified in the genome is also required. The extreme difficulty in obtaining high-resolution three-dimensional structures, calls for computational approaches. Here we review to which extent the efforts made in the last few years, combining the structural characterization of membrane proteins with protein bioinformatics techniques, could help describing membrane proteins at a genome-wide scale. In particular we analyze the use of comparative modeling techniques as a way of overcoming the lack of high-resolution three-dimensional structures in the human membrane proteome. PMID:24403851

  6. Neural network definitions of highly predictable protein secondary structure classes

    SciTech Connect

    Lapedes, A. |; Steeg, E.; Farber, R.

    1994-02-01

    We use two co-evolving neural networks to determine new classes of protein secondary structure which are significantly more predictable from local amino sequence than the conventional secondary structure classification. Accurate prediction of the conventional secondary structure classes: alpha helix, beta strand, and coil, from primary sequence has long been an important problem in computational molecular biology. Neural networks have been a popular method to attempt to predict these conventional secondary structure classes. Accuracy has been disappointingly low. The algorithm presented here uses neural networks to similtaneously examine both sequence and structure data, and to evolve new classes of secondary structure that can be predicted from sequence with significantly higher accuracy than the conventional classes. These new classes have both similarities to, and differences with the conventional alpha helix, beta strand and coil.

  7. Factors Influencing Progressive Failure Analysis Predictions for Laminated Composite Structure

    NASA Technical Reports Server (NTRS)

    Knight, Norman F., Jr.

    2008-01-01

    Progressive failure material modeling methods used for structural analysis including failure initiation and material degradation are presented. Different failure initiation criteria and material degradation models are described that define progressive failure formulations. These progressive failure formulations are implemented in a user-defined material model for use with a nonlinear finite element analysis tool. The failure initiation criteria include the maximum stress criteria, maximum strain criteria, the Tsai-Wu failure polynomial, and the Hashin criteria. The material degradation model is based on the ply-discounting approach where the local material constitutive coefficients are degraded. Applications and extensions of the progressive failure analysis material model address two-dimensional plate and shell finite elements and three-dimensional solid finite elements. Implementation details are described in the present paper. Parametric studies for laminated composite structures are discussed to illustrate the features of the progressive failure modeling methods that have been implemented and to demonstrate their influence on progressive failure analysis predictions.

  8. The sequential structure of brain activation predicts skill.

    PubMed

    Anderson, John R; Bothell, Daniel; Fincham, Jon M; Moon, Jungaa

    2016-01-29

    In an fMRI study, participants were trained to play a complex video game. They were scanned early and then again after substantial practice. While better players showed greater activation in one region (right dorsal striatum) their relative skill was better diagnosed by considering the sequential structure of whole brain activation. Using a cognitive model that played this game, we extracted a characterization of the mental states that are involved in playing a game and the statistical structure of the transitions among these states. There was a strong correspondence between this measure of sequential structure and the skill of different players. Using multi-voxel pattern analysis, it was possible to recognize, with relatively high accuracy, the cognitive states participants were in during particular scans. We used the sequential structure of these activation-recognized states to predict the skill of individual players. These findings indicate that important features about information-processing strategies can be identified from a model-based analysis of the sequential structure of brain activation. PMID:26707716

  9. Structural prediction for scandium carbide monolayer sheet

    NASA Astrophysics Data System (ADS)

    Ma, Hong-Man; Wang, Jing; Zhao, Hui-Yan; Zhang, Dong-Bo; Liu, Ying

    2016-09-01

    A two-dimensional tetragonal scandium carbide monolayer sheet has been constructed and studied using density functional theory. The results show that the scandium carbide sheet is stable and exhibits a novel tetracoordinated quasiplanar structure, as favored by the hybridization between Sc-3d orbitals and C-2p orbitals. Calculations of the phonon dispersion as well as molecular dynamics simulations also demonstrate the structural stability of this scandium carbide monolayer sheet. Electronic properties show that the scandium carbide monolayer sheet is metallic and non-magnetic.

  10. Saliency-based gaze prediction based on head direction.

    PubMed

    Nakashima, Ryoichi; Fang, Yu; Hatori, Yasuhiro; Hiratani, Akinori; Matsumiya, Kazumichi; Kuriki, Ichiro; Shioiri, Satoshi

    2015-12-01

    Despite decades of attempts to create a model for predicting gaze locations by using saliency maps, a highly accurate gaze prediction model for general conditions has yet to be devised. In this study, we propose a gaze prediction method based on head direction that can improve the accuracy of any model. We used a probability distribution of eye position based on head direction (static eye-head coordination) and added this information to a model of saliency-based visual attention. Using empirical data on eye and head directions while observers were viewing natural scenes, we estimated a probability distribution of eye position. We then combined the relationship between eye position and head direction with visual saliency to predict gaze locations. The model showed that information on head direction improved the prediction accuracy. Further, there was no difference in the gaze prediction accuracy between the two models using information on head direction with and without eye-head coordination. Therefore, information on head direction is useful for predicting gaze location when it is available. Furthermore, this gaze prediction model can be applied relatively easily to many daily situations such as during walking.

  11. Optimal Mutation Sites for PRE Data Collection and Membrane Protein Structure Prediction

    PubMed Central

    Chen, Huiling; Ji, Fei; Olman, Victor; Mobley, Charles K.; Liu, Yizhou; Zhou, Yunpeng; Bushweller, John H.; Prestegard, James H.; Xu, Ying

    2011-01-01

    Summary NMR paramagnetic relaxation enhancement (PRE) measures long-range distances to isotopically labeled residues, providing useful constraints for protein structure prediction. The method usually requires labor-intensive conjugation of nitroxide labels to multiple locations on the protein, one at a time. Here a computational procedure, based on protein sequence and simple secondary structure models, is presented to facilitate optimal placement of a minimum number of labels needed to determine the correct topology of a helical transmembrane protein. Test on DsbB (4 helices) using just one label leads to correct topology prediction in four of five cases, with the predicted structures <6Å to the native structure. Benchmark results using simulated PRE data show we can generally predict correct topology for five and six-to-seven helices using two and three labels, respectively, with an average success rate of 76% and structures of similar precision, showing promises in facilitating experimentally constrained structure prediction of membrane proteins. PMID:21481772

  12. Structural brain MRI trait polygenic score prediction of cognitive abilities

    PubMed Central

    Luciano, Michelle; Marioni, Riccardo E; Hernández, Maria Valdés; Maniega, Susana Munoz; Hamilton, Iona F; Royle, Natalie A.; Scotland, Generation; Chauhan, Ganesh; Bis, Joshua C.; Debette, Stephanie; DeCarli, Charles; Fornage, Myriam; Schmidt, Reinhold; Ikram, M. Arfan; Launer, Lenore J.; Seshadri, Sudha; Bastin, Mark E.; Porteous, David J.; Wardlaw, Joanna; Deary, Ian J

    2016-01-01

    Structural brain magnetic resonance imaging (MRI) traits share part of their genetic variance with cognitive traits. Here, we use genetic association results from large meta-analytic studies of genome-wide association for brain infarcts, white matter hyperintensities, intracranial, hippocampal and total brain volumes to estimate polygenic scores for these traits in three Scottish samples: Generation Scotland: Scottish Family Health Study (GS:SFHS), and the Lothian Birth Cohorts of 1936 (LBC1936) and 1921 (LBC1921). These five brain MRI trait polygenic scores were then used to 1) predict corresponding MRI traits in the LBC1936 (numbers ranged 573 to 630 across traits) and 2) predict cognitive traits in all three cohorts (in 8,115 to 8,250 persons). In the LBC1936, all MRI phenotypic traits were correlated with at least one cognitive measure; and polygenic prediction of MRI traits was observed for intracranial volume. Meta-analysis of the correlations between MRI polygenic scores and cognitive traits revealed a significant negative correlation (maximal r=0.08) between the hippocampal volume polygenic score and measures of global cognitive ability collected in childhood and in old age in the Lothian Birth Cohorts. The lack of association to a related general cognitive measure when including the GS:SFHS points to either type 1 error or the importance of using prediction samples that closely match the demographics of the genome-wide association samples from which prediction is based. Ideally, these analyses should be repeated in larger samples with data on both MRI and cognition, and using MRI GWA results from even larger meta-analysis studies. PMID:26427786

  13. Bimetric structure formation: Non-Gaussian predictions

    SciTech Connect

    Magueijo, Joao; Noller, Johannes; Piazza, Federico

    2010-08-15

    The minimal bimetric theory employing a disformal transformation between matter and gravity metrics is known to produce exactly scale-invariant fluctuations. It has a purely equilateral non-Gaussian signal, with an amplitude smaller than that of Dirac Born Infeld inflation (with opposite sign) but larger than standard inflation. We consider nonminimal bimetric models, where the coupling B appearing in the disformal transformation g-circumflex{sub {mu}{nu}}=g{sub {mu}{nu}}-B{partial_derivative}{sub {mu}{phi}{partial_derivative}{nu}{phi}} can run with {phi}. For power-law B({phi}) these models predict tilted spectra. For each value of the spectral index, a distinctive distortion to the equilateral property can be found. The constraint between this distortion and the spectral index can be seen as a 'consistency relation' for nonminimal bimetric models.

  14. Bimetric structure formation: Non-Gaussian predictions

    NASA Astrophysics Data System (ADS)

    Magueijo, João; Noller, Johannes; Piazza, Federico

    2010-08-01

    The minimal bimetric theory employing a disformal transformation between matter and gravity metrics is known to produce exactly scale-invariant fluctuations. It has a purely equilateral non-Gaussian signal, with an amplitude smaller than that of Dirac Born Infeld inflation (with opposite sign) but larger than standard inflation. We consider nonminimal bimetric models, where the coupling B appearing in the disformal transformation g^μν=gμν-B∂μϕ∂νϕ can run with ϕ. For power-law B(ϕ) these models predict tilted spectra. For each value of the spectral index, a distinctive distortion to the equilateral property can be found. The constraint between this distortion and the spectral index can be seen as a “consistency relation” for nonminimal bimetric models.

  15. Structure Prediction and Analysis of Neuraminidase Sequence Variants

    ERIC Educational Resources Information Center

    Thayer, Kelly M.

    2016-01-01

    Analyzing protein structure has become an integral aspect of understanding systems of biochemical import. The laboratory experiment endeavors to introduce protein folding to ascertain structures of proteins for which the structure is unavailable, as well as to critically evaluate the quality of the prediction obtained. The model system used is the…

  16. Practical theories for service life prediction of critical aerospace structural components

    NASA Technical Reports Server (NTRS)

    Ko, William L.; Monaghan, Richard C.; Jackson, Raymond H.

    1992-01-01

    A new second-order theory was developed for predicting the service lives of aerospace structural components. The predictions based on this new theory were compared with those based on the Ko first-order theory and the classical theory of service life predictions. The new theory gives very accurate service life predictions. An equivalent constant-amplitude stress cycle method was proposed for representing the random load spectrum for crack growth calculations. This method predicts the most conservative service life. The proposed use of minimum detectable crack size, instead of proof load established crack size as an initial crack size for crack growth calculations, could give a more realistic service life.

  17. Evaluating the accuracy of SHAPE-directed RNA secondary structure predictions.

    PubMed

    Sükösd, Zsuzsanna; Swenson, M Shel; Kjems, Jørgen; Heitsch, Christine E

    2013-03-01

    Recent advances in RNA structure determination include using data from high-throughput probing experiments to improve thermodynamic prediction accuracy. We evaluate the extent and nature of improvements in data-directed predictions for a diverse set of 16S/18S ribosomal sequences using a stochastic model of experimental SHAPE data. The average accuracy for 1000 data-directed predictions always improves over the original minimum free energy (MFE) structure. However, the amount of improvement varies with the sequence, exhibiting a correlation with MFE accuracy. Further analysis of this correlation shows that accurate MFE base pairs are typically preserved in a data-directed prediction, whereas inaccurate ones are not. Thus, the positive predictive value of common base pairs is consistently higher than the directed prediction accuracy. Finally, we confirm sequence dependencies in the directability of thermodynamic predictions and investigate the potential for greater accuracy improvements in the worst performing test sequence.

  18. RNAalifold: improved consensus structure prediction for RNA alignments

    PubMed Central

    Bernhart, Stephan H; Hofacker, Ivo L; Will, Sebastian; Gruber, Andreas R; Stadler, Peter F

    2008-01-01

    Background The prediction of a consensus structure for a set of related RNAs is an important first step for subsequent analyses. RNAalifold, which computes the minimum energy structure that is simultaneously formed by a set of aligned sequences, is one of the oldest and most widely used tools for this task. In recent years, several alternative approaches have been advocated, pointing to several shortcomings of the original RNAalifold approach. Results We show that the accuracy of RNAalifold predictions can be improved substantially by introducing a different, more rational handling of alignment gaps, and by replacing the rather simplistic model of covariance scoring with more sophisticated RIBOSUM-like scoring matrices. These improvements are achieved without compromising the computational efficiency of the algorithm. We show here that the new version of RNAalifold not only outperforms the old one, but also several other tools recently developed, on different datasets. Conclusion The new version of RNAalifold not only can replace the old one for almost any application but it is also competitive with other approaches including those based on SCFGs, maximum expected accuracy, or hierarchical nearest neighbor classifiers. PMID:19014431

  19. Statistical mechanical modeling of RNA folding: from free energy landscape to tertiary structural prediction

    PubMed Central

    CAO, Song; CHEN, Shi-Jie

    2016-01-01

    In spite of the success of computational methods for predicting RNA secondary structure, the problem of predicting RNA tertiary structure folding remains. Low-resolution structural models show promise as they allow for rigorous statistical mechanical computation for the conformational entropies, free energies, and the coarse-grained structures of tertiary folds. Molecular dynamics refinement of coarse-grained structures leads to all-atom 3D structures. Modeling based on statistical mechanics principles also has the unique advantage of predicting the full free energy landscape, including local minima and the global free energy minimum. The energy landscapes combined with the 3D structures form the basis for quantitative predictions of RNA functions. In this chapter, we present an overview of statistical mechanical models for RNA folding and then focus on a recently developed RNA statistical mechanical model -- the Vfold model. The main emphasis is placed on the physics underpinning the models, the computational strategies, and the connections to RNA biology. PMID:27293312

  20. Superfamily Assignments for the Yeast Proteome through Integration of Structure Prediction with the Gene Ontology

    PubMed Central

    Malmström, Lars; Riffle, Michael; Strauss, Charlie E. M; Chivian, Dylan; Davis, Trisha N; Bonneau, Richard; Baker, David

    2007-01-01

    Saccharomyces cerevisiae is one of the best-studied model organisms, yet the three-dimensional structure and molecular function of many yeast proteins remain unknown. Yeast proteins were parsed into 14,934 domains, and those lacking sequence similarity to proteins of known structure were folded using the Rosetta de novo structure prediction method on the World Community Grid. This structural data was integrated with process, component, and function annotations from the Saccharomyces Genome Database to assign yeast protein domains to SCOP superfamilies using a simple Bayesian approach. We have predicted the structure of 3,338 putative domains and assigned SCOP superfamily annotations to 581 of them. We have also assigned structural annotations to 7,094 predicted domains based on fold recognition and homology modeling methods. The domain predictions and structural information are available in an online database at http://rd.plos.org/10.1371_journal.pbio.0050076_01. PMID:17373854

  1. Superfamily assignments for the yeast proteome through integration of structure prediction with the gene ontology.

    PubMed

    Malmström, Lars; Riffle, Michael; Strauss, Charlie E M; Chivian, Dylan; Davis, Trisha N; Bonneau, Richard; Baker, David

    2007-04-01

    Saccharomyces cerevisiae is one of the best-studied model organisms, yet the three-dimensional structure and molecular function of many yeast proteins remain unknown. Yeast proteins were parsed into 14,934 domains, and those lacking sequence similarity to proteins of known structure were folded using the Rosetta de novo structure prediction method on the World Community Grid. This structural data was integrated with process, component, and function annotations from the Saccharomyces Genome Database to assign yeast protein domains to SCOP superfamilies using a simple Bayesian approach. We have predicted the structure of 3,338 putative domains and assigned SCOP superfamily annotations to 581 of them. We have also assigned structural annotations to 7,094 predicted domains based on fold recognition and homology modeling methods. The domain predictions and structural information are available in an online database at http://rd.plos.org/10.1371_journal.pbio.0050076_01.

  2. Pfold: RNA secondary structure prediction using stochastic context-free grammars.

    PubMed

    Knudsen, Bjarne; Hein, Jotun

    2003-07-01

    RNA secondary structures are important in many biological processes and efficient structure prediction can give vital directions for experimental investigations. Many available programs for RNA secondary structure prediction only use a single sequence at a time. This may be sufficient in some applications, but often it is possible to obtain related RNA sequences with conserved secondary structure. These should be included in structural analyses to give improved results. This work presents a practical way of predicting RNA secondary structure that is especially useful when related sequences can be obtained. The method improves a previous algorithm based on an explicit evolutionary model and a probabilistic model of structures. Predictions can be done on a web server at http://www.daimi.au.dk/~compbio/pfold.

  3. Computational methods in sequence and structure prediction

    NASA Astrophysics Data System (ADS)

    Lang, Caiyi

    This dissertation is organized into two parts. In the first part, we will discuss three computational methods for cis-regulatory element recognition in three different gene regulatory networks as the following: (a) Using a comprehensive "Phylogenetic Footprinting Comparison" method, we will investigate the promoter sequence structures of three enzymes (PAL, CHS and DFR) that catalyze sequential steps in the pathway from phenylalanine to anthocyanins in plants. Our result shows there exists a putative cis-regulatory element "AC(C/G)TAC(C)" in the upstream of these enzyme genes. We propose this cis-regulatory element to be responsible for the genetic regulation of these three enzymes and this element, might also be the binding site for MYB class transcription factor PAP1. (b) We will investigate the role of the Arabidopsis gene glutamate receptor 1.1 (AtGLR1.1) in C and N metabolism by utilizing the microarray data we obtained from AtGLR1.1 deficient lines (antiAtGLR1.1). We focus our investigation on the putatively co-regulated transcript profile of 876 genes we have collected in antiAtGLR1.1 lines. By (a) scanning the occurrence of several groups of known abscisic acid (ABA) related cisregulatory elements in the upstream regions of 876 Arabidopsis genes; and (b) exhaustive scanning of all possible 6-10 bps motif occurrence in the upstream regions of the same set of genes, we are able to make a quantative estimation on the enrichment level of each of the cis-regulatory element candidates. We finally conclude that one specific cis-regulatory element group, called "ABRE" elements, are statistically highly enriched within the 876-gene group as compared to their occurrence within the genome. (c) We will introduce a new general purpose algorithm, called "fuzzy REDUCE1", which we have developed recently for automated cis-regulatory element identification. In the second part, we will discuss our newly devised protein design framework. With this framework we have developed

  4. Structure of orbitofrontal cortex predicts social influence

    PubMed Central

    Campbell-Meiklejohn, Daniel K.; Kanai, Ryota; Bahrami, Bahador; Bach, Dominik R.; Dolan, Raymond J.; Roepstorff, Andreas; Frith, Chris D.

    2012-01-01

    Summary Some people conform more than others. Across different contexts, this tendency is a fairly stable trait [1]. This stability suggests that the tendency to conform might have an anatomical correlate [2]. Values that one associates with available options, from foods to political candidates, help to guide choices and behaviour. These values can often be updated by the expressed preferences of other people as much as by independent experience. In this correspondence, we report a linear relationship between grey matter volume (GM) in a region of lateral orbitofrontal cortex (lOFCGM) and the tendency to shift reported desire for objects toward values expressed by other people. This effect was found in precisely the same region in each brain hemisphere. lOFCGM also predicted the functional hemodynamic response in the middle frontal gyrus to discovering that someone else's values contrast with one's own. These findings indicate that the tendency to conform one's values to those expressed by other people has an anatomical correlate in the human brain. PMID:22361146

  5. Streamflow Prediction based on Chaos Theory

    NASA Astrophysics Data System (ADS)

    Li, X.; Wang, X.; Babovic, V. M.

    2015-12-01

    Chaos theory is a popular method in hydrologic time series prediction. Local model (LM) based on this theory utilizes time-delay embedding to reconstruct the phase-space diagram. For this method, its efficacy is dependent on the embedding parameters, i.e. embedding dimension, time lag, and nearest neighbor number. The optimal estimation of these parameters is thus critical to the application of Local model. However, these embedding parameters are conventionally estimated using Average Mutual Information (AMI) and False Nearest Neighbors (FNN) separately. This may leads to local optimization and thus has limitation to its prediction accuracy. Considering about these limitation, this paper applies a local model combined with simulated annealing (SA) to find the global optimization of embedding parameters. It is also compared with another global optimization approach of Genetic Algorithm (GA). These proposed hybrid methods are applied in daily and monthly streamflow time series for examination. The results show that global optimization can contribute to the local model to provide more accurate prediction results compared with local optimization. The LM combined with SA shows more advantages in terms of its computational efficiency. The proposed scheme here can also be applied to other fields such as prediction of hydro-climatic time series, error correction, etc.

  6. A set of nearest neighbor parameters for predicting the enthalpy change of RNA secondary structure formation

    PubMed Central

    Lu, Zhi John; Turner, Douglas H.; Mathews, David H.

    2006-01-01

    A complete set of nearest neighbor parameters to predict the enthalpy change of RNA secondary structure formation was derived. These parameters can be used with available free energy nearest neighbor parameters to extend the secondary structure prediction of RNA sequences to temperatures other than 37°C. The parameters were tested by predicting the secondary structures of sequences with known secondary structure that are from organisms with known optimal growth temperatures. Compared with the previous set of enthalpy nearest neighbor parameters, the sensitivity of base pair prediction improved from 65.2 to 68.9% at optimal growth temperatures ranging from 10 to 60°C. Base pair probabilities were predicted with a partition function and the positive predictive value of structure prediction is 90.4% when considering the base pairs in the lowest free energy structure with pairing probability of 0.99 or above. Moreover, a strong correlation is found between the predicted melting temperatures of RNA sequences and the optimal growth temperatures of the host organism. This indicates that organisms that live at higher temperatures have evolved RNA sequences with higher melting temperatures. PMID:16982646

  7. JPred4: a protein secondary structure prediction server.

    PubMed

    Drozdetskiy, Alexey; Cole, Christian; Procter, James; Barton, Geoffrey J

    2015-07-01

    JPred4 (http://www.compbio.dundee.ac.uk/jpred4) is the latest version of the popular JPred protein secondary structure prediction server which provides predictions by the JNet algorithm, one of the most accurate methods for secondary structure prediction. In addition to protein secondary structure, JPred also makes predictions of solvent accessibility and coiled-coil regions. The JPred service runs up to 94 000 jobs per month and has carried out over 1.5 million predictions in total for users in 179 countries. The JPred4 web server has been re-implemented in the Bootstrap framework and JavaScript to improve its design, usability and accessibility from mobile devices. JPred4 features higher accuracy, with a blind three-state (α-helix, β-strand and coil) secondary structure prediction accuracy of 82.0% while solvent accessibility prediction accuracy has been raised to 90% for residues <5% accessible. Reporting of results is enhanced both on the website and through the optional email summaries and batch submission results. Predictions are now presented in SVG format with options to view full multiple sequence alignments with and without gaps and insertions. Finally, the help-pages have been updated and tool-tips added as well as step-by-step tutorials. PMID:25883141

  8. JPred4: a protein secondary structure prediction server

    PubMed Central

    Drozdetskiy, Alexey; Cole, Christian; Procter, James; Barton, Geoffrey J.

    2015-01-01

    JPred4 (http://www.compbio.dundee.ac.uk/jpred4) is the latest version of the popular JPred protein secondary structure prediction server which provides predictions by the JNet algorithm, one of the most accurate methods for secondary structure prediction. In addition to protein secondary structure, JPred also makes predictions of solvent accessibility and coiled-coil regions. The JPred service runs up to 94 000 jobs per month and has carried out over 1.5 million predictions in total for users in 179 countries. The JPred4 web server has been re-implemented in the Bootstrap framework and JavaScript to improve its design, usability and accessibility from mobile devices. JPred4 features higher accuracy, with a blind three-state (α-helix, β-strand and coil) secondary structure prediction accuracy of 82.0% while solvent accessibility prediction accuracy has been raised to 90% for residues <5% accessible. Reporting of results is enhanced both on the website and through the optional email summaries and batch submission results. Predictions are now presented in SVG format with options to view full multiple sequence alignments with and without gaps and insertions. Finally, the help-pages have been updated and tool-tips added as well as step-by-step tutorials. PMID:25883141

  9. A predictive structural model for bulk metallic glasses

    PubMed Central

    Laws, K. J.; Miracle, D. B.; Ferry, M.

    2015-01-01

    Great progress has been made in understanding the atomic structure of metallic glasses, but there is still no clear connection between atomic structure and glass-forming ability. Here we give new insights into perhaps the most important question in the field of amorphous metals: how can glass-forming ability be predicted from atomic structure? We give a new approach to modelling metallic glass atomic structures by solving three long-standing problems: we discover a new family of structural defects that discourage glass formation; we impose efficient local packing around all atoms simultaneously; and we enforce structural self-consistency. Fewer than a dozen binary structures satisfy these constraints, but extra degrees of freedom in structures with three or more different atom sizes significantly expand the number of relatively stable, ‘bulk' metallic glasses. The present work gives a new approach towards achieving the long-sought goal of a predictive capability for bulk metallic glasses. PMID:26370667

  10. Predicting RNA secondary structures from sequence and probing data.

    PubMed

    Lorenz, Ronny; Wolfinger, Michael T; Tanzer, Andrea; Hofacker, Ivo L

    2016-07-01

    RNA secondary structures have proven essential for understanding the regulatory functions performed by RNA such as microRNAs, bacterial small RNAs, or riboswitches. This success is in part due to the availability of efficient computational methods for predicting RNA secondary structures. Recent advances focus on dealing with the inherent uncertainty of prediction by considering the ensemble of possible structures rather than the single most stable one. Moreover, the advent of high-throughput structural probing has spurred the development of computational methods that incorporate such experimental data as auxiliary information.

  11. Predicting carcinogenicity of organic compounds based on CPDB.

    PubMed

    Wu, Xiuchao; Zhang, Qingzhu; Wang, Hui; Hu, Jingtian

    2015-11-01

    Cancer is a major killer of human health and predictions for the carcinogenicity of chemicals are of great importance. In this article, predictive models for the carcinogenicity of organic compounds using QSAR methods for rats and mice were developed based on the data from CPDB. The models was developed based on the data of specific target site liver and classified according to sex of rats and mice. Meanwhile, models were also classified according to whether there is a ring in the molecular structure in order to reduce the diversity of molecular structure. Therefore, eight local models were developed in the final. Taking into account the complexity of carcinogenesis and in order to obtain as much information, DRAGON descriptors were selected as the variables used to develop models. Fitting ability, robustness and predictive power of the models were assessed according to the OECD principles. The external predictive coefficients for validation sets of each model were in the range of 0.711-0.906, and for the whole data in each model were all greater than 0.8, which represents that all models have good predictivity. In order to study the mechanism of carcinogenesis, standardized regression coefficients were calculated for all predictor variables. In addition, the effect of animal sex on carcinogenesis was compared and a trend that female showed stronger tolerance for cancerogen than male in both species was appeared. PMID:26070146

  12. Methods for evaluating the predictive accuracy of structural dynamic models

    NASA Technical Reports Server (NTRS)

    Hasselman, Timothy K.; Chrostowski, Jon D.

    1991-01-01

    Modeling uncertainty is defined in terms of the difference between predicted and measured eigenvalues and eigenvectors. Data compiled from 22 sets of analysis/test results was used to create statistical databases for large truss-type space structures and both pretest and posttest models of conventional satellite-type space structures. Modeling uncertainty is propagated through the model to produce intervals of uncertainty on frequency response functions, both amplitude and phase. This methodology was used successfully to evaluate the predictive accuracy of several structures, including the NASA CSI Evolutionary Structure tested at Langley Research Center. Test measurements for this structure were within + one-sigma intervals of predicted accuracy for the most part, demonstrating the validity of the methodology and computer code.

  13. CRYSTALP2: sequence-based protein crystallization propensity prediction

    PubMed Central

    Kurgan, Lukasz; Razib, Ali A; Aghakhani, Sara; Dick, Scott; Mizianty, Marcin; Jahandideh, Samad

    2009-01-01

    Background Current protocols yield crystals for <30% of known proteins, indicating that automatically identifying crystallizable proteins may improve high-throughput structural genomics efforts. We introduce CRYSTALP2, a kernel-based method that predicts the propensity of a given protein sequence to produce diffraction-quality crystals. This method utilizes the composition and collocation of amino acids, isoelectric point, and hydrophobicity, as estimated from the primary sequence, to generate predictions. CRYSTALP2 extends its predecessor, CRYSTALP, by enabling predictions for sequences of unrestricted size and provides improved prediction quality. Results A significant majority of the collocations used by CRYSTALP2 include residues with high conformational entropy, or low entropy and high potential to mediate crystal contacts; notably, such residues are utilized by surface entropy reduction methods. We show that the collocations provide complementary information to the hydrophobicity and isoelectric point. Tests on four datasets show that CRYSTALP2 outperforms several existing sequence-based predictors (CRYSTALP, OB-score, and SECRET). CRYSTALP2's accuracy, MCC, and AROC range between 69.3 and 77.5%, 0.39 and 0.55, and 0.72 and 0.79, respectively. Our predictions are similar in quality and are complementary to the predictions of the most recent ParCrys and XtalPred methods. Our results also suggest that, as work in protein crystallization continues (thereby enlarging the population of proteins with known crystallization propensities), the prediction quality of the CRYSTALP2 method should increase. The prediction model and the datasets used in this contribution can be downloaded from . Conclusion CRYSTALP2 provides relatively accurate crystallization propensity predictions for a given protein chain that either outperform or complement the existing approaches. The proposed method can be used to support current efforts towards improving the success rate in obtaining

  14. Dst Prediction Based on Solar Wind Parameters

    NASA Astrophysics Data System (ADS)

    Park, Yoon-Kyung; Ahn, Byung-Ho

    2009-12-01

    We reevaluate the Burton equation (Burton et al. 1975) of predicting Dst index using high quality hourly solar wind data supplied by the ACE satellite for the period from 1998 to 2006. Sixty magnetic storms with monotonously decreasing main phase are selected. In order to determine the injection term (Q) and the decay time (tau) of the equation, we examine the relationships between Dst^ast and VB_s, Delta Dst^ast and VB_s, and Delta Dst^ast and Dst^ast during the magnetic storms. For this analysis, we take into account one hour of the propagation time from the ACE satellite to the magnetopause, and a half hour of the response time of the magnetosphere/ring current to the solar wind forcing. The injection term is found to be Q({nT}/h)=-3.56VB_s for VB_s>0.5mV/m and Q({nT}/h)=0 for VB_s leq0.5mV/m. The tau (hour) is estimated as 0.060 Dst^ast + 16.65 for Dst^ast>-175nT and 6.15 hours for Dst^ast leq -175nT. Based on these empirical relationships, we predict the 60 magnetic storms and find that the correlation coefficient between the observed and predicted Dst^ast is 0.88. To evaluate the performance of our prediction scheme, the 60 magnetic storms are predicted again using the models by Burton et al. (1975) and O'Brien & McPherron (2000a). The correlation coefficients thus obtained are 0.85, the same value for both of the two models. In this respect, our model is slightly improved over the other two models as far as the correlation coefficients is concerned. Particularly our model does a better job than the other two models in predicting intense magnetic storms (Dst^ast lesssim -200nT).

  15. Analysis and prediction of RNA-binding residues using sequence, evolutionary conservation, and predicted secondary structure and solvent accessibility.

    PubMed

    Zhang, Tuo; Zhang, Hua; Chen, Ke; Ruan, Jishou; Shen, Shiyi; Kurgan, Lukasz

    2010-11-01

    Identification and prediction of RNA-binding residues (RBRs) provides valuable insights into the mechanisms of protein-RNA interactions. We analyzed the contributions of a wide range of factors including amino acid sequence, evolutionary conservation, secondary structure and solvent accessibility, to the prediction/characterization of RBRs. Five feature sets were designed and feature selection was performed to find and investigate relevant features. We demonstrate that (1) interactions with positively charged amino acids Arg and Lys are preferred by the egatively charged nucleotides; (2) Gly provides flexibility for the RNA binding sites; (3) Glu with negatively charged side chain and several hydrophobic residues such as Leu, Val, Ala and Phe are disfavored in the RNA-binding sites; (4) coil residues, especially in long segments, are more flexible (than other secondary structures) and more likely to interact with RNA; (5) helical residues are more rigid and consequently they are less likely to bind RNA; and (6) residues partially exposed to the solvent are more likely to form RNA-binding sites. We introduce a novel sequence-based predictor of RBRs, RBRpred, which utilizes the selected features. RBRpred is comprehensively tested on three datasets with varied atom distance cutoffs by performing both five-fold cross validation and jackknife tests and achieves Matthew's correlation coefficient (MCC) of 0.51, 0.48 and 0.42, respectively. The quality is comparable to or better than that for state-of-the-art predictors that apply the distancebased cutoff definition. We show that the most important factor for RBRs prediction is evolutionary conservation, followed by the amino acid sequence, predicted secondary structure and predicted solvent accessibility. We also investigate the impact of using native vs. predicted secondary structure and solvent accessibility. The predictions are sufficient for the RBR prediction and the knowledge of the actual solvent accessibility

  16. Evolving networks-Using past structure to predict the future

    NASA Astrophysics Data System (ADS)

    Shang, Ke-ke; Yan, Wei-sheng; Small, Michael

    2016-08-01

    Many previous studies on link prediction have focused on using common neighbors to predict the existence of links between pairs of nodes. More broadly, research into the structural properties of evolving temporal networks and temporal link prediction methods have recently attracted increasing attention. In this study, for the first time, we examine the use of links between a pair of nodes to predict their common neighbors and analyze the relationship between the weight and the structure in static networks, evolving networks, and in the corresponding randomized networks. We propose both new unweighted and weighted prediction methods and use six kinds of real networks to test our algorithms. In unweighted networks, we find that if a pair of nodes connect to each other in the current network, they will have a higher probability to connect common nodes both in the current and the future networks-and the probability will decrease with the increase of the number of neighbors. Furthermore, we find that the original networks have their particular structure and statistical characteristics which benefit link prediction. In weighted networks, the prediction algorithm performance of networks which are dominated by human factors decrease with the decrease of weight and are in general better in static networks. Furthermore, we find that geographical position and link weight both have significant influence on the transport network. Moreover, the evolving financial network has the lowest predictability. In addition, we find that the structure of non-social networks has more robustness than social networks. The structure of engineering networks has both best predictability and also robustness.

  17. Quasi-Steady-State Analysis based on Structural Modules and Timed Petri Net Predict System’s Dynamics: The Life Cycle of the Insulin Receptor

    PubMed Central

    Scheidel, Jennifer; Lindauer, Klaus; Ackermann, Jörg; Koch, Ina

    2015-01-01

    The insulin-dependent activation and recycling of the insulin receptor play an essential role in the regulation of the energy metabolism, leading to a special interest for pharmaceutical applications. Thus, the recycling of the insulin receptor has been intensively investigated, experimentally as well as theoretically. We developed a time-resolved, discrete model to describe stochastic dynamics and study the approximation of non-linear dynamics in the context of timed Petri nets. Additionally, using a graph-theoretical approach, we analyzed the structure of the regulatory system and demonstrated the close interrelation of structural network properties with the kinetic behavior. The transition invariants decomposed the model into overlapping subnetworks of various sizes, which represent basic functional modules. Moreover, we computed the quasi-steady states of these subnetworks and demonstrated that they are fundamental to understand the dynamic behavior of the system. The Petri net approach confirms the experimental results of insulin-stimulated degradation of the insulin receptor, which represents a common feature of insulin-resistant, hyperinsulinaemic states. PMID:26694479

  18. Quasi-Steady-State Analysis based on Structural Modules and Timed Petri Net Predict System's Dynamics: The Life Cycle of the Insulin Receptor.

    PubMed

    Scheidel, Jennifer; Lindauer, Klaus; Ackermann, Jörg; Koch, Ina

    2015-12-17

    The insulin-dependent activation and recycling of the insulin receptor play an essential role in the regulation of the energy metabolism, leading to a special interest for pharmaceutical applications. Thus, the recycling of the insulin receptor has been intensively investigated, experimentally as well as theoretically. We developed a time-resolved, discrete model to describe stochastic dynamics and study the approximation of non-linear dynamics in the context of timed Petri nets. Additionally, using a graph-theoretical approach, we analyzed the structure of the regulatory system and demonstrated the close interrelation of structural network properties with the kinetic behavior. The transition invariants decomposed the model into overlapping subnetworks of various sizes, which represent basic functional modules. Moreover, we computed the quasi-steady states of these subnetworks and demonstrated that they are fundamental to understand the dynamic behavior of the system. The Petri net approach confirms the experimental results of insulin-stimulated degradation of the insulin receptor, which represents a common feature of insulin-resistant, hyperinsulinaemic states.

  19. Quasi-Steady-State Analysis based on Structural Modules and Timed Petri Net Predict System's Dynamics: The Life Cycle of the Insulin Receptor.

    PubMed

    Scheidel, Jennifer; Lindauer, Klaus; Ackermann, Jörg; Koch, Ina

    2015-01-01

    The insulin-dependent activation and recycling of the insulin receptor play an essential role in the regulation of the energy metabolism, leading to a special interest for pharmaceutical applications. Thus, the recycling of the insulin receptor has been intensively investigated, experimentally as well as theoretically. We developed a time-resolved, discrete model to describe stochastic dynamics and study the approximation of non-linear dynamics in the context of timed Petri nets. Additionally, using a graph-theoretical approach, we analyzed the structure of the regulatory system and demonstrated the close interrelation of structural network properties with the kinetic behavior. The transition invariants decomposed the model into overlapping subnetworks of various sizes, which represent basic functional modules. Moreover, we computed the quasi-steady states of these subnetworks and demonstrated that they are fundamental to understand the dynamic behavior of the system. The Petri net approach confirms the experimental results of insulin-stimulated degradation of the insulin receptor, which represents a common feature of insulin-resistant, hyperinsulinaemic states. PMID:26694479

  20. Evaluation of aerial photography for predicting trends in structural attributes of Australian woodland including comparison with ground-based monitoring data.

    PubMed

    Fensham, Roderick J; Bray, Steven G; Fairfax, Russell J

    2007-06-01

    The accurate assessment of trends in the woody structure of savannas has important implications for greenhouse accounting and land-use industries such as pastoralism. Two recent assessments of live woody biomass change from north-east Australian eucalypt woodland between the 1980s and 1990s present divergent results. The first estimate is derived from a network of permanent monitoring plots and the second from woody cover assessments from aerial photography. The differences between the studies are reviewed and include sample density, spatial scale and design. Further analyses targeting potential biases in the indirect aerial photography technique are conducted including a comparison of basal area estimates derived from 28 permanent monitoring sites with basal area estimates derived by the aerial photography technique. It is concluded that the effect of photo-scale; or the failure to include appropriate back-transformation of biomass estimates in the aerial photography study are not likely to have contributed significantly to the discrepancy. However, temporal changes in the structure of woodlands, for example, woodlands maturing from many smaller trees to fewer larger trees or seasonal changes, which affect the relationship between cover and basal area could impact on the detection of trends using the aerial photography technique. It is also possible that issues concerning photo-quality may bias assessments through time, and that the limited sample of the permanent monitoring network may inadequately represent change at regional scales. PMID:16828220

  1. Predictability of sacral base levelness based on iliac crest measurements.

    PubMed

    Dott, G A; Hart, C L; McKay, C

    1994-05-01

    A level sacral base plane is necessary to allow normalization of complex lumbosacral mechanics. Palpatory examinations are often used to evaluate for leg length discrepancy and pelvic obliquity despite improved accuracy and consistency of radiographic techniques. Treatment based on palpatory examinations suppose a direct and consistent relationship between the pelvic bones (innominates) and the sacral base. To evaluate the relationship between iliac crest levelness and sacral base levelness, a radiographic postural survey in the upright, weight-bearing position was performed on 358 men and women thought to have pelvic obliquity. Of these subjects, 293 demonstrated unlevel iliac crest heights or sacral base > or = 3/16 inch (4.76 mm), with iliac crest heights accurately predicting sacral base position 62% of the time. At > or = 3/8 inch (9.53 mm), 68% of the cases were accurately predicted. When the criterion for unlevelness was increased to > or = 1/2 inch (12.70 mm), the predictive accuracy improved to 83%. Radiographic findings in this study demonstrate a significant difference between iliac crest heights and sacral base position. In cases of mild to moderate short leg syndromes, the iliac crest height is an unreliable predictor of the direction or degree of sacral base levelness. PMID:8056627

  2. Lessons from application of the UNRES force field to predictions of structures of CASP10 targets

    PubMed Central

    He, Yi; Mozolewska, Magdalena A.; Krupa, Paweł; Sieradzan, Adam K.; Wirecki, Tomasz K.; Liwo, Adam; Kachlishvili, Khatuna; Rackovsky, Shalom; Jagieła, Dawid; Ślusarz, Rafał; Czaplewski, Cezary R.; Ołdziej, Stanisław; Scheraga, Harold A.

    2013-01-01

    The performance of the physics-based protocol, whose main component is the United Residue (UNRES) physics-based coarse-grained force field, developed in our laboratory for the prediction of protein structure from amino acid sequence, is illustrated. Candidate models are selected, based on probabilities of the conformational families determined by multiplexed replica-exchange simulations, from the 10th Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP10). For target T0663, classified as a new fold, which consists of two domains homologous to those of known proteins, UNRES predicted the correct symmetry of packing, in which the domains are rotated with respect to each other by 180° in the experimental structure. By contrast, models obtained by knowledge-based methods, in which each domain is modeled very accurately but not rotated, resulted in incorrect packing. Two UNRES models of this target were featured by the assessors. Correct domain packing was also predicted by UNRES for the homologous target T0644, which has a similar structure to that of T0663, except that the two domains are not rotated. Predictions for two other targets, T0668 and T0684_D2, are among the best ones by global distance test score. These results suggest that our physics-based method has substantial predictive power. In particular, it has the ability to predict domain–domain orientations, which is a significant advance in the state of the art. PMID:23980156

  3. Text Prediction on Structured Data Entry in Healthcare

    PubMed Central

    Hua, L.; Wang, S.; Gong, Y.

    2014-01-01

    Summary Background Structured data entry pervades computerized patient safety event reporting systems and serves as a key component in collecting patient-related information in electronic health records. Clinicians would spend more time being with patients and arrive at a high probability of proper diagnosis and treatment, if data entry can be completed efficiently and effectively. Historically it has been proven text prediction holds potential for human performance regarding data entry in a variety of research areas. Objective This study aimed at examining a function of text prediction proposed for increasing efficiency and data quality in structured data entry. Methods We employed a two-group randomized design with fifty-two nurses in this usability study. Each participant was assigned the task of reporting patient falls by answering multiple choice questions either with or without the text prediction function. t-test statistics and linear regression model were applied to analyzing the results of the two groups. Results While both groups of participants exhibited a good capacity of accomplishing the assigned task, the results were an overall 13.0% time reduction and 3.9% increase of response accuracy for the group utilizing the prediction function. Conclusion As a primary attempt investigating the effectiveness of text prediction in healthcare, study findings validated the necessity of text prediction to structured date entry, and laid the ground for further research improving the effectiveness of text prediction in clinical settings. PMID:24734137

  4. Prediction of structural features and application to outer membrane protein identification

    NASA Astrophysics Data System (ADS)

    Yan, Renxiang; Wang, Xiaofeng; Huang, Lanqing; Yan, Feidi; Xue, Xiaoyu; Cai, Weiwen

    2015-06-01

    Protein three-dimensional (3D) structures provide insightful information in many fields of biology. One-dimensional properties derived from 3D structures such as secondary structure, residue solvent accessibility, residue depth and backbone torsion angles are helpful to protein function prediction, fold recognition and ab initio folding. Here, we predict various structural features with the assistance of neural network learning. Based on an independent test dataset, protein secondary structure prediction generates an overall Q3 accuracy of ~80%. Meanwhile, the prediction of relative solvent accessibility obtains the highest mean absolute error of 0.164, and prediction of residue depth achieves the lowest mean absolute error of 0.062. We further improve the outer membrane protein identification by including the predicted structural features in a scoring function using a simple profile-to-profile alignment. The results demonstrate that the accuracy of outer membrane protein identification can be improved by ~3% at a 1% false positive level when structural features are incorporated. Finally, our methods are available as two convenient and easy-to-use programs. One is PSSM-2-Features for predicting secondary structure, relative solvent accessibility, residue depth and backbone torsion angles, the other is PPA-OMP for identifying outer membrane proteins from proteomes.

  5. Structure activity relationships: their function in biological prediction

    SciTech Connect

    Schultz, T.W.

    1982-01-01

    Quantitative structure activity relationships provide a means of ranking or predicting biological effects based on chemical structure. For each compound used to formulate a structure activity model two kinds of quantitative information are required: (1) biological activity and (2) molecular properties. Molecular properties are of three types: (1) molecular shape, (2) physiochemical parameters, and (3) abstract quantitations of molecular structure. Currently the two best descriptors are the hydrophobic parameter, log 1-octanol/water partition coefficient (log P), and the /sup 1/X/sup v/(one-chi-v) molecular connectivity index. Biological responses can be divided into three main categories: (1) non-specific effects due to membrane perturbation, (2) non-specific effects due to interaction with functional groups of proteins, and (3) specific effects due to interaction with receptors. Twenty-six synthetic fossil fuel-related nitrogen-containing aromatic compounds were examined to determine the quantitative correlation between log P and /sup 1/X/sup v/ and population growth impairment of Tetrahymena pyriformis. Nitro-containing compounds are the most active, followed by amino-containing compounds and azaarenes. Within each analog series activity increases with alkyl substitution and ring addition. The planar model log BR = 0.5564 log P + 0.3000 /sup 1/X/sup v/ -2.0138 was determined using mono-nitrogen substituted compounds. Attempts to extrapolate this model to dinitrogen-containing molecules were, for the most part, unsuccessful because of a change in mode of action from membrane perturbation to uncoupling of oxidative phosphoralation.

  6. Distributed Prognostics based on Structural Model Decomposition

    NASA Technical Reports Server (NTRS)

    Daigle, Matthew J.; Bregon, Anibal; Roychoudhury, I.

    2014-01-01

    Within systems health management, prognostics focuses on predicting the remaining useful life of a system. In the model-based prognostics paradigm, physics-based models are constructed that describe the operation of a system and how it fails. Such approaches consist of an estimation phase, in which the health state of the system is first identified, and a prediction phase, in which the health state is projected forward in time to determine the end of life. Centralized solutions to these problems are often computationally expensive, do not scale well as the size of the system grows, and introduce a single point of failure. In this paper, we propose a novel distributed model-based prognostics scheme that formally describes how to decompose both the estimation and prediction problems into independent local subproblems whose solutions may be easily composed into a global solution. The decomposition of the prognostics problem is achieved through structural decomposition of the underlying models. The decomposition algorithm creates from the global system model a set of local submodels suitable for prognostics. Independent local estimation and prediction problems are formed based on these local submodels, resulting in a scalable distributed prognostics approach that allows the local subproblems to be solved in parallel, thus offering increases in computational efficiency. Using a centrifugal pump as a case study, we perform a number of simulation-based experiments to demonstrate the distributed approach, compare the performance with a centralized approach, and establish its scalability. Index Terms-model-based prognostics, distributed prognostics, structural model decomposition ABBREVIATIONS

  7. A life prediction model for laminated composite structural components

    NASA Technical Reports Server (NTRS)

    Allen, David H.

    1990-01-01

    A life prediction methodology for laminated continuous fiber composites subjected to fatigue loading conditions was developed. A summary is presented of research completed. A phenomenological damage evolution law was formulated for matrix cracking which is independent of stacking sequence. Mechanistic and physical support was developed for the phenomenological evolution law proposed above. The damage evolution law proposed above was implemented to a finite element computer program. And preliminary predictions were obtained for a structural component undergoing fatigue loading induced damage.

  8. Prediction of grain structures in various solidification processes

    SciTech Connect

    Rappaz, M.; Gandin, C.A.; Desbiolles, J.L.; Thevoz, P.

    1996-03-01

    Grain structure formation during solidification can be simulated via the use of stochastic models providing the physical mechanisms of nucleation and dendrite growth are accounted for. With this goal in mind, a physically based cellular automaton (CA) model has been coupled with finite element (FE) heat flow computations and implemented into the code 3-MOS. The CA enmeshment of the solidifying domain with small square cells is first generated automatically from the FE mesh. Within each time-step, the variation of enthalpy at each node of the FE mesh is calculated using an implicit scheme and a Newton-type linearization method. After interpolation of the explicit temperature and of the enthalpy variation at the cell location, the nucleation and growth of grains are simulated using the CA algorithm. This algorithm accounts for the heterogeneous nucleation in the bulk and at the surface of the ingot, for the growth and preferential growth directions of the dendrites, and for microsegregation. The variation of volume fraction of solid at the cell location are then summed up at the FE nodes in order to find the new temperatures. This CAFE model, which allows the prediction and the visualization of grain structures during and after solidification, is applied to various solidification processes: the investment casting of turbine blades, the continuous casting of rods, and the laser remelting or welding of plates. Because the CAFE model is yet two-dimensional (2-D), the simulation results are compared in a qualitative way with experimental findings.

  9. Cloud Prediction of Protein Structure and Function with PredictProtein for Debian

    PubMed Central

    Kaján, László; Yachdav, Guy; Vicedo, Esmeralda; Steinegger, Martin; Mirdita, Milot; Angermüller, Christof; Böhm, Ariane; Domke, Simon; Ertl, Julia; Mertes, Christian; Reisinger, Eva; Rost, Burkhard

    2013-01-01

    We report the release of PredictProtein for the Debian operating system and derivatives, such as Ubuntu, Bio-Linux, and Cloud BioLinux. The PredictProtein suite is available as a standard set of open source Debian packages. The release covers the most popular prediction methods from the Rost Lab, including methods for the prediction of secondary structure and solvent accessibility (profphd), nuclear localization signals (predictnls), and intrinsically disordered regions (norsnet). We also present two case studies that successfully utilize PredictProtein packages for high performance computing in the cloud: the first analyzes protein disorder for whole organisms, and the second analyzes the effect of all possible single sequence variants in protein coding regions of the human genome. PMID:23971032

  10. Cloud prediction of protein structure and function with PredictProtein for Debian.

    PubMed

    Kaján, László; Yachdav, Guy; Vicedo, Esmeralda; Steinegger, Martin; Mirdita, Milot; Angermüller, Christof; Böhm, Ariane; Domke, Simon; Ertl, Julia; Mertes, Christian; Reisinger, Eva; Staniewski, Cedric; Rost, Burkhard

    2013-01-01

    We report the release of PredictProtein for the Debian operating system and derivatives, such as Ubuntu, Bio-Linux, and Cloud BioLinux. The PredictProtein suite is available as a standard set of open source Debian packages. The release covers the most popular prediction methods from the Rost Lab, including methods for the prediction of secondary structure and solvent accessibility (profphd), nuclear localization signals (predictnls), and intrinsically disordered regions (norsnet). We also present two case studies that successfully utilize PredictProtein packages for high performance computing in the cloud: the first analyzes protein disorder for whole organisms, and the second analyzes the effect of all possible single sequence variants in protein coding regions of the human genome.

  11. Using theoretical descriptors in structure activity relationships: Validating toxicity predictions

    SciTech Connect

    Famini, G.R.; Wilson, L.Y.; Chester, N.A.; Sterling, P.A.

    1995-12-01

    Quantitative Structure Activity Relationships (QSAR) and Linear Free Energy Relationships (LFER) are very useful for correlating toxicological data, and in characterizating trends in terms of structural and electronic effects. Several years ago, we developed a series of equations correlating a number of toxicity tests with theoretically determined descriptors. One of these tests was the Microtox test, using the degradation in light from Photobacteriurn phosphoreum. Recently, several new compounds have been tested in our laboratory using the Microtox test, and compared against the predicted values. The agreement between experimental and theoretical results will be discussed, as will reasons for {open_quotes}good{close_quotes} or {open_quotes}poor{close_quotes} predictions.

  12. Protein structure prediction: combining de novo modeling with sparse experimental data.

    PubMed

    Latek, Dorota; Ekonomiuk, Dariusz; Kolinski, Andrzej

    2007-07-30

    Routine structure prediction of new folds is still a challenging task for computational biology. The challenge is not only in the proper determination of overall fold but also in building models of acceptable resolution, useful for modeling the drug interactions and protein-protein complexes. In this work we propose and test a comprehensive approach to protein structure modeling supported by sparse, and relatively easy to obtain, experimental data. We focus on chemical shift-based restraints from NMR, although other sparse restraints could be easily included. In particular, we demonstrate that combining the typical NMR software with artificial intelligence-based prediction of secondary structure enhances significantly the accuracy of the restraints for molecular modeling. The computational procedure is based on the reduced representation approach implemented in the CABS modeling software, which proved to be a versatile tool for protein structure prediction during the CASP (CASP stands for critical assessment of techniques for protein structure prediction) experiments (see http://predictioncenter/CASP6/org). The method is successfully tested on a small set of representative globular proteins of different size and topology, including the two CASP6 targets, for which the required NMR data already exist. The method is implemented in a semi-automated pipeline applicable to a large scale structural annotation of genomic data. Here, we limit the computations to relatively small set. This enabled, without a loss of generality, a detailed discussion of various factors determining accuracy of the proposed approach to the protein structure prediction.

  13. Hill-Climbing search and diversification within an evolutionary approach to protein structure prediction.

    PubMed

    Chira, Camelia; Horvath, Dragos; Dumitrescu, D

    2011-01-01

    Proteins are complex structures made of amino acids having a fundamental role in the correct functioning of living cells. The structure of a protein is the result of the protein folding process. However, the general principles that govern the folding of natural proteins into a native structure are unknown. The problem of predicting a protein structure with minimum-energy starting from the unfolded amino acid sequence is a highly complex and important task in molecular and computational biology. Protein structure prediction has important applications in fields such as drug design and disease prediction. The protein structure prediction problem is NP-hard even in simplified lattice protein models. An evolutionary model based on hill-climbing genetic operators is proposed for protein structure prediction in the hydrophobic - polar (HP) model. Problem-specific search operators are implemented and applied using a steepest-ascent hill-climbing approach. Furthermore, the proposed model enforces an explicit diversification stage during the evolution in order to avoid local optimum. The main features of the resulting evolutionary algorithm - hill-climbing mechanism and diversification strategy - are evaluated in a set of numerical experiments for the protein structure prediction problem to assess their impact to the efficiency of the search process. Furthermore, the emerging consolidated model is compared to relevant algorithms from the literature for a set of difficult bidimensional instances from lattice protein models. The results obtained by the proposed algorithm are promising and competitive with those of related methods.

  14. RNA:(guanine-N2) methyltransferases RsmC/RsmD and their homologs revisited – bioinformatic analysis and prediction of the active site based on the uncharacterized Mj0882 protein structure

    PubMed Central

    2002-01-01

    Background Escherichia coli guanine-N2 (m2G) methyltransferases (MTases) RsmC and RsmD modify nucleosides G1207 and G966 of 16S rRNA. They possess a common MTase domain in the C-terminus and a variable region in the N-terminus. Their C-terminal domain is related to the YbiN family of hypothetical MTases, but nothing is known about the structure or function of the N-terminal domain. Results Using a combination of sequence database searches and fold recognition methods it has been demonstrated that the N-termini of RsmC and RsmD are related to each other and that they represent a "degenerated" version of the C-terminal MTase domain. Novel members of the YbiN family from Archaea and Eukaryota were also indentified. It is inferred that YbiN and both domains of RsmC and RsmD are closely related to a family of putative MTases from Gram-positive bacteria and Archaea, typified by the Mj0882 protein from M. jannaschii (1dus in PDB). Based on the results of sequence analysis and structure prediction, the residues involved in cofactor binding, target recognition and catalysis were identified, and the mechanism of the guanine-N2 methyltransfer reaction was proposed. Conclusions Using the known Mj0882 structure, a comprehensive analysis of sequence-structure-function relationships in the family of genuine and putative m2G MTases was performed. The results provide novel insight into the mechanism of m2G methylation and will serve as a platform for experimental analysis of numerous uncharacterized N-MTases. PMID:11929612

  15. Temperature-based bioclimatic parameters can predict nematode metabolic footprints.

    PubMed

    Bhusal, Daya Ram; Tsiafouli, Maria A; Sgardelis, Stefanos P

    2015-09-01

    Nematode metabolic footprints (MFs) refer to the lifetime amount of metabolized carbon per individual, indicating a connection to soil food web functions and eventually to processes supporting ecosystem services. Estimating and managing these at a convenient scale requires information upscaling from the soil sample to the landscape level. We explore the feasibility of predicting nematode MFs from temperature-based bioclimatic parameters across a landscape. We assume that temperature effects are reflected in MFs, since temperature variations determine life processes ranging from enzyme activities to community structure. We use microclimate data recorded for 1 year from sites differing by orientation, altitude and vegetation cover. At the same sites we estimate MFs for each nematode trophic group. Our models show that bioclimatic parameters, specifically those accounting for temporal variations in temperature and extremities, predict most of the variation in nematode MFs. Higher fungivorous and lower bacterivorous nematode MFs are predicted for sites with high seasonality and low isothermality (sites of low vegetation, mostly at low altitudes), indicating differences in the relative contribution of the corresponding food web channels to the metabolism of carbon across the landscape. Higher plant-parasitic MFs were predicted for sites with high seasonality. The fitted models provide realistic predictions of unknown cases within the range of the predictor's values, allowing for the interpolation of MFs within the sampled region. We conclude that upscaling of the bioindication potential of nematode communities is feasible and can provide new perspectives not only in the field of soil ecology but other research areas as well.

  16. New Quantitative Structure-Activity Relationship Models Improve Predictability of Ames Mutagenicity for Aromatic Azo Compounds.

    PubMed

    Manganelli, Serena; Benfenati, Emilio; Manganaro, Alberto; Kulkarni, Sunil; Barton-Maclaren, Tara S; Honma, Masamitsu

    2016-10-01

    Existing Quantitative Structure-Activity Relationship (QSAR) models have limited predictive capabilities for aromatic azo compounds. In this study, 2 new models were built to predict Ames mutagenicity of this class of compounds. The first one made use of descriptors based on simplified molecular input-line entry system (SMILES), calculated with the CORAL software. The second model was based on the k-nearest neighbors algorithm. The statistical quality of the predictions from single models was satisfactory. The performance further improved when the predictions from these models were combined. The prediction results from other QSAR models for mutagenicity were also evaluated. Most of the existing models were found to be good at finding toxic compounds but resulted in many false positive predictions. The 2 new models specific for this class of compounds avoid this problem thanks to a larger set of related compounds as training set and improved algorithms.

  17. New Quantitative Structure-Activity Relationship Models Improve Predictability of Ames Mutagenicity for Aromatic Azo Compounds.

    PubMed

    Manganelli, Serena; Benfenati, Emilio; Manganaro, Alberto; Kulkarni, Sunil; Barton-Maclaren, Tara S; Honma, Masamitsu

    2016-10-01

    Existing Quantitative Structure-Activity Relationship (QSAR) models have limited predictive capabilities for aromatic azo compounds. In this study, 2 new models were built to predict Ames mutagenicity of this class of compounds. The first one made use of descriptors based on simplified molecular input-line entry system (SMILES), calculated with the CORAL software. The second model was based on the k-nearest neighbors algorithm. The statistical quality of the predictions from single models was satisfactory. The performance further improved when the predictions from these models were combined. The prediction results from other QSAR models for mutagenicity were also evaluated. Most of the existing models were found to be good at finding toxic compounds but resulted in many false positive predictions. The 2 new models specific for this class of compounds avoid this problem thanks to a larger set of related compounds as training set and improved algorithms. PMID:27413112

  18. An Integrated Theory for Predicting the Hydrothermomechanical Response of Advanced Composite Structural Components

    NASA Technical Reports Server (NTRS)

    Chamis, C. C.; Lark, R. F.; Sinclair, J. H.

    1977-01-01

    An integrated theory is developed for predicting the hydrothermomechanical (HDTM) response of fiber composite components. The integrated theory is based on a combined theoretical and experimental investigation. In addition to predicting the HDTM response of components, the theory is structured to assess the combined hydrothermal effects on the mechanical properties of unidirectional composites loaded along the material axis and off-axis, and those of angleplied laminates. The theory developed predicts values which are in good agreement with measured data at the micromechanics, macromechanics, laminate analysis and structural analysis levels.

  19. Process for predicting structural performance of mechanical systems

    DOEpatents

    Gardner, David R.; Hendrickson, Bruce A.; Plimpton, Steven J.; Attaway, Stephen W.; Heinstein, Martin W.; Vaughan, Courtenay T.

    1998-01-01

    A process for predicting the structural performance of a mechanical system represents the mechanical system by a plurality of surface elements. The surface elements are grouped according to their location in the volume occupied by the mechanical system so that contacts between surface elements can be efficiently located. The process is well suited for efficient practice on multiprocessor computers.

  20. Process for predicting structural performance of mechanical systems

    DOEpatents

    Gardner, D.R.; Hendrickson, B.A.; Plimpton, S.J.; Attaway, S.W.; Heinstein, M.W.; Vaughan, C.T.

    1998-05-19

    A process for predicting the structural performance of a mechanical system represents the mechanical system by a plurality of surface elements. The surface elements are grouped according to their location in the volume occupied by the mechanical system so that contacts between surface elements can be efficiently located. The process is well suited for efficient practice on multiprocessor computers. 12 figs.

  1. Structural Damage Prediction and Analysis for Hypervelocity Impact: Consulting

    NASA Technical Reports Server (NTRS)

    1995-01-01

    A portion of the contract NAS8-38856, 'Structural Damage Prediction and Analysis for Hypervelocity Impacts,' from NASA Marshall Space Flight Center (MSFC), included consulting which was to be documented in the final report. This attachment to the final report contains memos produced as part of that consulting.

  2. The Attribute for Hydrocarbon Prediction Based on Attenuation

    NASA Astrophysics Data System (ADS)

    Hermana, Maman; Harith, Z. Z. T.; Sum, C. W.; Ghosh, D. P.

    2014-03-01

    Hydrocarbon prediction is a crucial issue in the oil and gas industry. Currently, the prediction of pore fluid and lithology are based on amplitude interpretation which has the potential to produce pitfalls in certain conditions of reservoir. Motivated by this fact, this work is directed to find out other attributes that can be used to reduce the pitfalls in the amplitude interpretation. Some seismic attributes were examined and studies showed that the attenuation attribute is a better attribute for hydrocarbon prediction. Theoretically, the attenuation mechanism of wave propagation is associated with the movement of fluid in the pore; hence the existence of hydrocarbon in the pore will be represented by attenuation attribute directly. In this paper we evaluated the feasibility of the quality factor ratio of P-wave and S-wave (Qp/Qs) as hydrocarbon indicator using well data and also we developed a new attribute based on attenuation for hydrocarbon prediction -- Normalized Energy Reduction Stack (NERS). To achieve these goals, this work was divided into 3 main parts; estimating the Qp/Qs on well log data, testing the new attribute in the synthetic data and applying the new attribute on real data in Malay Basin data. The result show that the Qp/Qs is better than Poisson's ratio and Lamda over Mu as hydrocarbon indicator. The curve, trend analysis and contrast of Qp/Qs is more powerful at distinguishing pore fluid than Poisson ratio and Lamda over Mu. The NERS attribute was successful in distinguishing the hydrocarbon from brine on synthetic data. Applying this attribute on real data on Malay basin, the NERS attribute is qualitatively conformable with the structure and location where the gas is predicted. The quantitative interpretation of this attribute for hydrocarbon prediction needs to be investigated further.

  3. Protein structure validation by generalized linear model root-mean-square deviation prediction.

    PubMed

    Bagaria, Anurag; Jaravine, Victor; Huang, Yuanpeng J; Montelione, Gaetano T; Güntert, Peter

    2012-02-01

    Large-scale initiatives for obtaining spatial protein structures by experimental or computational means have accentuated the need for the critical assessment of protein structure determination and prediction methods. These include blind test projects such as the critical assessment of protein structure prediction (CASP) and the critical assessment of protein structure determination by nuclear magnetic resonance (CASD-NMR). An important aim is to establish structure validation criteria that can reliably assess the accuracy of a new protein structure. Various quality measures derived from the coordinates have been proposed. A universal structural quality assessment method should combine multiple individual scores in a meaningful way, which is challenging because of their different measurement units. Here, we present a method based on a generalized linear model (GLM) that combines diverse protein structure quality scores into a single quantity with intuitive meaning, namely the predicted coordinate root-mean-square deviation (RMSD) value between the present structure and the (unavailable) "true" structure (GLM-RMSD). For two sets of structural models from the CASD-NMR and CASP projects, this GLM-RMSD value was compared with the actual accuracy given by the RMSD value to the corresponding, experimentally determined reference structure from the Protein Data Bank (PDB). The correlation coefficients between actual (model vs. reference from PDB) and predicted (model vs. "true") heavy-atom RMSDs were 0.69 and 0.76, for the two datasets from CASD-NMR and CASP, respectively, which is considerably higher than those for the individual scores (-0.24 to 0.68). The GLM-RMSD can thus predict the accuracy of protein structures more reliably than individual coordinate-based quality scores.

  4. Automatic prediction of facial trait judgments: appearance vs. structural models.

    PubMed

    Rojas, Mario; Masip, David; Todorov, Alexander; Vitria, Jordi

    2011-01-01

    Evaluating other individuals with respect to personality characteristics plays a crucial role in human relations and it is the focus of attention for research in diverse fields such as psychology and interactive computer systems. In psychology, face perception has been recognized as a key component of this evaluation system. Multiple studies suggest that observers use face information to infer personality characteristics. Interactive computer systems are trying to take advantage of these findings and apply them to increase the natural aspect of interaction and to improve the performance of interactive computer systems. Here, we experimentally test whether the automatic prediction of facial trait judgments (e.g. dominance) can be made by using the full appearance information of the face and whether a reduced representation of its structure is sufficient. We evaluate two separate approaches: a holistic representation model using the facial appearance information and a structural model constructed from the relations among facial salient points. State of the art machine learning methods are applied to a) derive a facial trait judgment model from training data and b) predict a facial trait value for any face. Furthermore, we address the issue of whether there are specific structural relations among facial points that predict perception of facial traits. Experimental results over a set of labeled data (9 different trait evaluations) and classification rules (4 rules) suggest that a) prediction of perception of facial traits is learnable by both holistic and structural approaches; b) the most reliable prediction of facial trait judgments is obtained by certain type of holistic descriptions of the face appearance; and c) for some traits such as attractiveness and extroversion, there are relationships between specific structural features and social perceptions.

  5. Lattice-free prediction of three-dimensional structure of programmed DNA assemblies

    NASA Astrophysics Data System (ADS)

    Pan, Keyao; Kim, Do-Nyun; Zhang, Fei; Adendorff, Matthew R.; Yan, Hao; Bathe, Mark

    2014-12-01

    DNA can be programmed to self-assemble into high molecular weight 3D assemblies with precise nanometer-scale structural features. Although numerous sequence design strategies exist to realize these assemblies in solution, there is currently no computational framework to predict their 3D structures on the basis of programmed underlying multi-way junction topologies constrained by DNA duplexes. Here, we introduce such an approach and apply it to assemblies designed using the canonical immobile four-way junction. The procedure is used to predict the 3D structure of high molecular weight planar and spherical ring-like origami objects, a tile-based sheet-like ribbon, and a 3D crystalline tensegrity motif, in quantitative agreement with experiments. Our framework provides a new approach to predict programmed nucleic acid 3D structure on the basis of prescribed secondary structure motifs, with possible application to the design of such assemblies for use in biomolecular and materials science.

  6. Lattice-free prediction of three-dimensional structure of programmed DNA assemblies.

    PubMed

    Pan, Keyao; Kim, Do-Nyun; Zhang, Fei; Adendorff, Matthew R; Yan, Hao; Bathe, Mark

    2014-01-01

    DNA can be programmed to self-assemble into high molecular weight 3D assemblies with precise nanometer-scale structural features. Although numerous sequence design strategies exist to realize these assemblies in solution, there is currently no computational framework to predict their 3D structures on the basis of programmed underlying multi-way junction topologies constrained by DNA duplexes. Here, we introduce such an approach and apply it to assemblies designed using the canonical immobile four-way junction. The procedure is used to predict the 3D structure of high molecular weight planar and spherical ring-like origami objects, a tile-based sheet-like ribbon, and a 3D crystalline tensegrity motif, in quantitative agreement with experiments. Our framework provides a new approach to predict programmed nucleic acid 3D structure on the basis of prescribed secondary structure motifs, with possible application to the design of such assemblies for use in biomolecular and materials science. PMID:25470497

  7. Composite failure prediction of π-joint structures under bending

    NASA Astrophysics Data System (ADS)

    Huang, Hong-mei; Yuan, Shen-fang

    2012-03-01

    In this article, the composite -joint is investigated under bending loads. The "L" preform is the critical component regarding composite -joint failure. The study is presented in the failure detection of a carbon fiber composite -joint structure under bending loads using fiber Bragg grating (FBG) sensor. Firstly, based on the general finite element method (FEM) software, the 3-D finite element (FE) model of composite -joint is established, and the failure process and every lamina failure load of composite -joint are investigated by maximum stress criteria. Then, strain distributions along the length of FBG are extracted, and the reflection spectra of FBG are calculated according to the strain distribution. Finally, to verify the numerical results, a test scheme is performed and the experimental spectra of FBG are recorded. The experimental results indicate that the failure sequence and the corresponding critical loads of failure are consistent with the numerical predictions, and the computational error of failure load is less than 6.4%. Furthermore, it also verifies the feasibility of the damage detection system.

  8. Behavior predicts genes structure in a wild primate group.

    PubMed Central

    Altmann, J; Alberts, S C; Haines, S A; Dubach, J; Muruthi, P; Coote, T; Geffen, E; Cheesman, D J; Mututua, R S; Saiyalel, S N; Wayne, R K; Lacy, R C; Bruford, M W

    1996-01-01

    The predictability of genetic structure from social structure and differential mating success was tested in wild baboons. Baboon populations are subdivided into cohesive social groups that include multiple adults of both sexes. As in many mammals, males are the dispersing sex. Social structure and behavior successfully predicted molecular genetic measures of relatedness and variance in reproductive success. In the first quantitative test of the priority-of-access model among wild primates, the reproductive priority of dominant males was confirmed by molecular genetic analysis. However, the resultant high short-term variance in reproductive success did not translate into equally high long-term variance because male dominance status was unstable. An important consequence of high but unstable short-term variance is that age cohorts will tend to be paternal sibships and social groups will be genetically substructured by age. PMID:8650172

  9. A dynamic programming algorithm for RNA structure prediction including pseudoknots.

    PubMed

    Rivas, E; Eddy, S R

    1999-02-01

    We describe a dynamic programming algorithm for predicting optimal RNA secondary structure, including pseudoknots. The algorithm has a worst case complexity of O(N6) in time and O(N4) in storage. The description of the algorithm is complex, which led us to adopt a useful graphical representation (Feynman diagrams) borrowed from quantum field theory. We present an implementation of the algorithm that generates the optimal minimum energy structure for a single RNA sequence, using standard RNA folding thermodynamic parameters augmented by a few parameters describing the thermodynamic stability of pseudoknots. We demonstrate the properties of the algorithm by using it to predict structures for several small pseudoknotted and non-pseudoknotted RNAs. Although the time and memory demands of the algorithm are steep, we believe this is the first algorithm to be able to fold optimal (minimum energy) pseudoknotted RNAs with the accepted RNA thermodynamic model.

  10. Prediction of Mortality Based on Facial Characteristics

    PubMed Central

    Delorme, Arnaud; Pierce, Alan; Michel, Leena; Radin, Dean

    2016-01-01

    Recent studies have shown that characteristics of the face contain a wealth of information about health, age and chronic clinical conditions. Such studies involve objective measurement of facial features correlated with historical health information. But some individuals also claim to be adept at gauging mortality based on a glance at a person’s photograph. To test this claim, we invited 12 such individuals to see if they could determine if a person was alive or dead based solely on a brief examination of facial photographs. All photos used in the experiment were transformed into a uniform gray scale and then counterbalanced across eight categories: gender, age, gaze direction, glasses, head position, smile, hair color, and image resolution. Participants examined 404 photographs displayed on a computer monitor, one photo at a time, each shown for a maximum of 8 s. Half of the individuals in the photos were deceased, and half were alive at the time the experiment was conducted. Participants were asked to press a button if they thought the person in a photo was living or deceased. Overall mean accuracy on this task was 53.8%, where 50% was expected by chance (p < 0.004, two-tail). Statistically significant accuracy was independently obtained in 5 of the 12 participants. We also collected 32-channel electrophysiological recordings and observed a robust difference between images of deceased individuals correctly vs. incorrectly classified in the early event related potential (ERP) at 100 ms post-stimulus onset. Our results support claims of individuals who report that some as-yet unknown features of the face predict mortality. The results are also compatible with claims about clairvoyance warrants further investigation. PMID:27242466

  11. Multithreaded comparative RNA secondary structure prediction using stochastic context-free grammars

    PubMed Central

    2011-01-01

    Background The prediction of the structure of large RNAs remains a particular challenge in bioinformatics, due to the computational complexity and low levels of accuracy of state-of-the-art algorithms. The pfold model couples a stochastic context-free grammar to phylogenetic analysis for a high accuracy in predictions, but the time complexity of the algorithm and underflow errors have prevented its use for long alignments. Here we present PPfold, a multithreaded version of pfold, which is capable of predicting the structure of large RNA alignments accurately on practical timescales. Results We have distributed both the phylogenetic calculations and the inside-outside algorithm in PPfold, resulting in a significant reduction of runtime on multicore machines. We have addressed the floating-point underflow problems of pfold by implementing an extended-exponent datatype, enabling PPfold to be used for large-scale RNA structure predictions. We have also improved the user interface and portability: alongside standalone executable and Java source code of the program, PPfold is also available as a free plugin to the CLC Workbenches. We have evaluated the accuracy of PPfold using BRaliBase I tests, and demonstrated its practical use by predicting the secondary structure of an alignment of 24 complete HIV-1 genomes in 65 minutes on an 8-core machine and identifying several known structural elements in the prediction. Conclusions PPfold is the first parallelized comparative RNA structure prediction algorithm to date. Based on the pfold model, PPfold is capable of fast, high-quality predictions of large RNA secondary structures, such as the genomes of RNA viruses or long genomic transcripts. The techniques used in the parallelization of this algorithm may be of general applicability to other bioinformatics algorithms. PMID:21501497

  12. Compound Structure-Independent Activity Prediction in High-Dimensional Target Space.

    PubMed

    Balfer, Jenny; Hu, Ye; Bajorath, Jürgen

    2014-08-01

    Profiling of compound libraries against arrays of targets has become an important approach in pharmaceutical research. The prediction of multi-target compound activities also represents an attractive task for machine learning with potential for drug discovery applications. Herein, we have explored activity prediction in high-dimensional target space. Different types of models were derived to predict multi-target activities. The models included naïve Bayesian (NB) and support vector machine (SVM) classifiers based upon compound structure information and NB models derived on the basis of activity profiles, without considering compound structure. Because the latter approach can be applied to incomplete training data and principally depends on the feature independence assumption, SVM modeling was not applicable in this case. Furthermore, iterative hybrid NB models making use of both activity profiles and compound structure information were built. In high-dimensional target space, NB models utilizing activity profile data were found to yield more accurate activity predictions than structure-based NB and SVM models or hybrid models. An in-depth analysis of activity profile-based models revealed the presence of correlation effects across different targets and rationalized prediction accuracy. Taken together, the results indicate that activity profile information can be effectively used to predict the activity of test compounds against novel targets.

  13. RNA secondary structure prediction by centroids in a Boltzmann weighted ensemble.

    PubMed

    Ding, Ye; Chan, Chi Yu; Lawrence, Charles E

    2005-08-01

    Prediction of RNA secondary structure by free energy minimization has been the standard for over two decades. Here we describe a novel method that forsakes this paradigm for predictions based on Boltzmann-weighted structure ensemble. We introduce the notion of a centroid structure as a representative for a set of structures and describe a procedure for its identification. In comparison with the minimum free energy (MFE) structure using diverse types of structural RNAs, the centroid of the ensemble makes 30.0% fewer prediction errors as measured by the positive predictive value (PPV) with marginally improved sensitivity. The Boltzmann ensemble can be separated into a small number (3.2 on average) of clusters. Among the centroids of these clusters, the "best cluster centroid" as determined by comparison to the known structure simultaneously improves PPV by 46.5% and sensitivity by 21.7%. For 58% of the studied sequences for which the MFE structure is outside the cluster containing the best centroid, the improvements by the best centroid are 62.5% for PPV and 31.4% for sensitivity. These results suggest that the energy well containing the MFE structure under the current incomplete energy model is often different from the one for the unavailable complete model that presumably contains the unique native structure. Centroids are available on the Sfold server at http://sfold.wadsworth.org.

  14. Infants Generate Goal-Based Action Predictions

    ERIC Educational Resources Information Center

    Cannon, Erin N.; Woodward, Amanda L.

    2012-01-01

    Predicting the actions of others is critical to smooth social interactions. Prior work suggests that both understanding and anticipation of goal-directed actions appears early in development. In this study, on-line goal prediction was tested explicitly using an adaptation of Woodward's (1998) paradigm for an eye-tracking task. Twenty 11-month-olds…

  15. Improved prediction of RNA tertiary structure with insights into native state dynamics.

    PubMed

    Bida, John Paul; Maher, L James

    2012-03-01

    The importance of RNA tertiary structure is evident from the growing number of published high resolution NMR and X-ray crystallographic structures of RNA molecules. These structures provide insights into function and create a knowledge base that is leveraged by programs such as Assemble, ModeRNA, RNABuilder, NAST, FARNA, Mc-Sym, RNA2D3D, and iFoldRNA for tertiary structure prediction and design. While these methods sample native-like RNA structures during simulations, all struggle to capture the native RNA conformation after scoring. We propose RSIM, an improved RNA fragment assembly method that preserves RNA global secondary structure while sampling conformations. This approach enhances the quality of predicted RNA tertiary structure, provides insights into the native state dynamics, and generates a powerful visualization of the RNA conformational space. RSIM is available for download from http://www.github.com/jpbida/rsim.

  16. Identifying functional sites based on prediction of charged group behavior.

    PubMed

    Ondrechen, Mary Jo

    2004-09-01

    This protocol describes the implementation and interpretation of THEMATICS, a simple computational predictor of functional information for proteins from the three-dimensional structure. This method is based on the computation of the electrical potential function for the protein and the calculation of the predicted titration curves for each of the titratable groups in the protein. While most of the titratable residues in a protein have predicted titration behavior that fits the Henderson-Hasselbalch equation, the ionizable residues in the active site generally deviate dramatically from the typical behavior. From the calculated titration curves, one identifies those residues that deviate significantly from Henderson-Hasselbalch behavior. A cluster of two or more of such deviant titratable residues in physical proximity is a reliable predictor of active-site location.

  17. Free energy minimization to predict RNA secondary structures and computational RNA design.

    PubMed

    Churkin, Alexander; Weinbrand, Lina; Barash, Danny

    2015-01-01

    Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.

  18. Computational prediction of riboswitch tertiary structures including pseudoknots by RAGTOP: a hierarchical graph sampling approach.

    PubMed

    Kim, Namhee; Zahran, Mai; Schlick, Tamar

    2015-01-01

    The modular organization of RNA structure has been exploited in various computational and theoretical approaches to identify RNA tertiary (3D) motifs and assemble RNA structures. Riboswitches exemplify this modularity in terms of both structural and functional adaptability of RNA components. Here, we extend our computational approach based on tree graph sampling to the prediction of riboswitch topologies by defining additional edges to mimick pseudoknots. Starting from a secondary (2D) structure, we construct an initial graph deduced from predicted junction topologies by our data-mining algorithm RNAJAG trained on known RNAs; we sample these graphs in 3D space guided by knowledge-based statistical potentials derived from bending and torsion measures of internal loops as well as radii of gyration for known RNAs. We present graph sampling results for 10 representative riboswitches, 6 of them with pseudoknots, and compare our predictions to solved structures based on global and local RMSD measures. Our results indicate that the helical arrangements in riboswitches can be approximated using our combination of modified 3D tree graph representations for pseudoknots, junction prediction, graph moves, and scoring functions. Future challenges in the field of riboswitch prediction and design are also discussed. PMID:25726463

  19. Predicting three-dimensional structures of transmembrane domains of β-barrel membrane proteins

    PubMed Central

    Naveed, Hammad; Xu, Yun; Jackups, Ronald; Liang, Jie

    2012-01-01

    β-barrel membrane proteins are found in the outer membrane of gram-negative bacteria, mitochondria, and chloroplasts. They are important for pore formation, membrane anchoring, enzyme activity, and are often responsible for bacterial virulence. Due to difficulties in experimental structure determination, they are sparsely represented in the protein structure databank. We have developed a computational method for predicting structures of the trans-membrane (TM) domains of β-barrel membrane proteins. Our method based on key organization principles, can predict structures of the TM domain of β-barrel membrane proteins of novel topology, including those from eukaryotic mitochondria. Our method is based on a model of physical interactions, a discrete conformational state-space, an empirical potential function, as well as a model to account for interstrand loop entropy. We are able to construct three dimensional atomic structure of the TM-domains from sequences for a set of 23 non-homologous proteins (resolution 1.8 – 3.0 Å). The median RMSD of TM-domains containing 75–222 residues between predicted and measured structures is 3.9 Å for main chain atoms. In addition, stability determinants and protein-protein interaction sites can be predicted. Such predictions on eukaryotic mitochondria outer membrane protein Tom40 and VDAC are confirmed by independent mutagenesis and chemical cross-linking studies. These results suggest that our model captures key components of the organization principles of β-barrel membrane protein assembly. PMID:22148174

  20. Predicting aphasia type from brain damage measured with structural MRI.

    PubMed

    Yourganov, Grigori; Smith, Kimberly G; Fridriksson, Julius; Rorden, Chris

    2015-12-01

    Chronic aphasia is a common consequence of a left-hemisphere stroke. Since the early insights by Broca and Wernicke, studying the relationship between the loci of cortical damage and patterns of language impairment has been one of the concerns of aphasiology. We utilized multivariate classification in a cross-validation framework to predict the type of chronic aphasia from the spatial pattern of brain damage. Our sample consisted of 98 patients with five types of aphasia (Broca's, Wernicke's, global, conduction, and anomic), classified based on scores on the Western Aphasia Battery (WAB). Binary lesion maps were obtained from structural MRI scans (obtained at least 6 months poststroke, and within 2 days of behavioural assessment); after spatial normalization, the lesions were parcellated into a disjoint set of brain areas. The proportion of damage to the brain areas was used to classify patients' aphasia type. To create this parcellation, we relied on five brain atlases; our classifier (support vector machine - SVM) could differentiate between different kinds of aphasia using any of the five parcellations. In our sample, the best classification accuracy was obtained when using a novel parcellation that combined two previously published brain atlases, with the first atlas providing the segmentation of grey matter, and the second atlas used to segment the white matter. For each aphasia type, we computed the relative importance of different brain areas for distinguishing it from other aphasia types; our findings were consistent with previously published reports of lesion locations implicated in different types of aphasia. Overall, our results revealed that automated multivariate classification could distinguish between aphasia types based on damage to atlas-defined brain areas. PMID:26465238

  1. Full-length RNA structure prediction of the HIV-1 genome reveals a conserved core domain.

    PubMed

    Sükösd, Zsuzsanna; Andersen, Ebbe S; Seemann, Stefan E; Jensen, Mads Krogh; Hansen, Mathias; Gorodkin, Jan; Kjems, Jørgen

    2015-12-01

    A distance constrained secondary structural model of the ≈10 kb RNA genome of the HIV-1 has been predicted but higher-order structures, involving long distance interactions, are currently unknown. We present the first global RNA secondary structure model for the HIV-1 genome, which integrates both comparative structure analysis and information from experimental data in a full-length prediction without distance constraints. Besides recovering known structural elements, we predict several novel structural elements that are conserved in HIV-1 evolution. Our results also indicate that the structure of the HIV-1 genome is highly variable in most regions, with a limited number of stable and conserved RNA secondary structures. Most interesting, a set of long distance interactions form a core organizing structure (COS) that organize the genome into three major structural domains. Despite overlapping protein-coding regions the COS is supported by a particular high frequency of compensatory base changes, suggesting functional importance for this element. This new structural element potentially organizes the whole genome into three major domains protruding from a conserved core structure with potential roles in replication and evolution for the virus. PMID:26476446

  2. Predicting protein structures with a multiplayer online game.

    PubMed

    Cooper, Seth; Khatib, Firas; Treuille, Adrien; Barbero, Janos; Lee, Jeehyung; Beenen, Michael; Leaver-Fay, Andrew; Baker, David; Popović, Zoran; Players, Foldit

    2010-08-01

    People exert large amounts of problem-solving effort playing computer games. Simple image- and text-recognition tasks have been successfully 'crowd-sourced' through games, but it is not clear if more complex scientific problems can be solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search space. Here we describe Foldit, a multiplayer online game that engages non-scientists in solving hard prediction problems. Foldit players interact with protein structures using direct manipulation tools and user-friendly versions of algorithms from the Rosetta structure prediction methodology, while they compete and collaborate to optimize the computed energy. We show that top-ranked Foldit players excel at solving challenging structure refinement problems in which substantial backbone rearrangements are necessary to achieve the burial of hydrophobic residues. Players working collaboratively develop a rich assortment of new strategies and algorithms; unlike computational approaches, they explore not only the conformational space but also the space of possible search strategies. The integration of human visual problem-solving and strategy development capabilities with traditional computational algorithms through interactive multiplayer games is a powerful new approach to solving computationally-limited scientific problems.

  3. Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

    PubMed

    Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo

    2016-01-01

    Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility. PMID:26752681

  4. Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields

    NASA Astrophysics Data System (ADS)

    Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo

    2016-01-01

    Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.

  5. 3D-Fun: predicting enzyme function from structure.

    PubMed

    von Grotthuss, Marcin; Plewczynski, Dariusz; Vriend, Gert; Rychlewski, Leszek

    2008-07-01

    The 'omics' revolution is causing a flurry of data that all needs to be annotated for it to become useful. Sequences of proteins of unknown function can be annotated with a putative function by comparing them with proteins of known function. This form of annotation is typically performed with BLAST or similar software. Structural genomics is nowadays also bringing us three dimensional structures of proteins with unknown function. We present here software that can be used when sequence comparisons fail to determine the function of a protein with known structure but unknown function. The software, called 3D-Fun, is implemented as a server that runs at several European institutes and is freely available for everybody at all these sites. The 3D-Fun servers accept protein coordinates in the standard PDB format and compare them with all known protein structures by 3D structural superposition using the 3D-Hit software. If structural hits are found with proteins with known function, these are listed together with their function and some vital comparison statistics. This is conceptually very similar in 3D to what BLAST does in 1D. Additionally, the superposition results are displayed using interactive graphics facilities. Currently, the 3D-Fun system only predicts enzyme function but an expanded version with Gene Ontology predictions will be available soon. The server can be accessed at http://3dfun.bioinfo.pl/ or at http://3dfun.cmbi.ru.nl/.

  6. Aircraft Structural Mass Property Prediction Using Conceptual-Level Structural Analysis

    NASA Technical Reports Server (NTRS)

    Sexstone, Matthew G.

    1998-01-01

    This paper describes a methodology that extends the use of the Equivalent LAminated Plate Solution (ELAPS) structural analysis code from conceptual-level aircraft structural analysis to conceptual-level aircraft mass property analysis. Mass property analysis in aircraft structures has historically depended upon parametric weight equations at the conceptual design level and Finite Element Analysis (FEA) at the detailed design level ELAPS allows for the modeling of detailed geometry, metallic and composite materials, and non-structural mass coupled with analytical structural sizing to produce high-fidelity mass property analyses representing fully configured vehicles early in the design process. This capability is especially valuable for unusual configuration and advanced concept development where existing parametric weight equations are inapplicable and FEA is too time consuming for conceptual design. This paper contrasts the use of ELAPS relative to empirical weight equations and FEA. ELAPS modeling techniques are described and the ELAPS-based mass property analysis process is detailed Examples of mass property stochastic calculations produced during a recent systems study are provided This study involved the analysis of three remotely piloted aircraft required to carry scientific payloads to very high altitudes at subsonic speeds. Due to the extreme nature of this high-altitude flight regime,few existing vehicle designs are available for use in performance and weight prediction. ELAPS was employed within a concurrent engineering analysis process that simultaneously produces aerodynamic, structural, and static aeroelastic results for input to aircraft performance analyses. The ELAPS models produced for each concept were also used to provide stochastic analyses of wing structural mass properties. The results of this effort indicate that ELAPS is an efficient means to conduct multidisciplinary trade studies at the conceptual design level.

  7. Aircraft Structural Mass Property Prediction Using Conceptual-Level Structural Analysis

    NASA Technical Reports Server (NTRS)

    Sexstone, Matthew G.

    1998-01-01

    This paper describes a methodology that extends the use of the Equivalent LAminated Plate Solution (ELAPS) structural analysis code from conceptual-level aircraft structural analysis to conceptual-level aircraft mass property analysis. Mass property analysis in aircraft structures has historically depended upon parametric weight equations at the conceptual design level and Finite Element Analysis (FEA) at the detailed design level. ELAPS allows for the modeling of detailed geometry, metallic and composite materials, and non-structural mass coupled with analytical structural sizing to produce high-fidelity mass property analyses representing fully configured vehicles early in the design process. This capability is especially valuable for unusual configuration and advanced concept development where existing parametric weight equations are inapplicable and FEA is too time consuming for conceptual design. This paper contrasts the use of ELAPS relative to empirical weight equations and FEA. ELAPS modeling techniques are described and the ELAPS-based mass property analysis process is detailed. Examples of mass property stochastic calculations produced during a recent systems study are provided. This study involved the analysis of three remotely piloted aircraft required to carry scientific payloads to very high altitudes at subsonic speeds. Due to the extreme nature of this high-altitude flight regime, few existing vehicle designs are available for use in performance and weight prediction. ELAPS was employed within a concurrent engineering analysis process that simultaneously produces aerodynamic, structural, and static aeroelastic results for input to aircraft performance analyses. The ELAPS models produced for each concept were also used to provide stochastic analyses of wing structural mass properties. The results of this effort indicate that ELAPS is an efficient means to conduct multidisciplinary trade studies at the conceptual design level.

  8. Predicting mortality based on body composition analysis.

    PubMed Central

    Tellado, J M; Garcia-Sabrido, J L; Hanley, J A; Shizgal, H M; Christou, N V

    1989-01-01

    The role of the Nae/Ke ratio (the ratio of exchangeable sodium to exchangeable potassium) was examined as a nutritional marker in surgical patients in relation to anthropometrical and biochemical indexes by its ability to identify patients at risk for mortality after hospitalization. In 73 patients with sepsis and malnutrition (Training Group, Madrid) the following were determined: percentage of recent weight loss, triceps skin fold, midarm muscle circumference, serum albumin, serum transferrin, delayed hypersensitivity skin test response, total lymphocytes, and Nae/Ke ratio by multiple isotope dilution. The predictive power of Nae/Ke ratio was so strong (F = 105.1; p less than 0.00001) that it displaced anthropometric, biochemical, and immunologic variables from the linear equation derived from stepwise discriminant analysis using hospital mortality as the dependent variable. A theoretical curve of expected deaths was developed, based on an equation obtained by logistic regression analysis: Pr/death/ = 1/(1 + e[11.8-5.2 Nae/Ke]). Pre- and post-test probabilities on that curve allowed us to determine two cut-off values, Nae/Ke ratios of 1.5 and 2.5, which were markers for nonrisk and mortality, respectively. The model was tested in a heterogeneous data base of surgical patients (n = 417) in another hospital (Validation Group, Montreal). For patients exhibiting an abnormal Nae/Ke ratio (greater than 1.2) and a greater than 10% of probability of death, 54 deaths were expected and 53 observed (X2 = 1.8 NS). Two tests confirmed the basic agreement between the model and its performance, a G statistic of -0.704 and the area beneath the "receiver-operating-characteristic" (ROC) curve (Az = 0.904 + 0.0516 for the Madrid group vs. Az = 0.915 + 0.0349 for the Montreal group, NS). It was concluded from this analysis that, compared with the usual anthropometric measurements, the Nae/Ke ratio, if available, is the best method for identifying malnourished patients at risk of

  9. Efficient method for predicting crystal structures at finite temperature: variable box shape simulations.

    PubMed

    Filion, Laura; Marechal, Matthieu; van Oorschot, Bas; Pelt, Daniël; Smallenburg, Frank; Dijkstra, Marjolein

    2009-10-30

    We present an efficient and robust method based on Monte Carlo simulations for predicting crystal structures at finite temperature. We apply this method, which is surprisingly easy to implement, to a variety of systems, demonstrating its effectiveness for hard, attractive, and anisotropic interactions, binary mixtures, semi-long-range soft interactions, and truly long-range interactions where the truly long-range interactions are treated using Ewald sums. In the case of binary hard-sphere mixtures, star polymers, and binary Lennard-Jones mixtures, the crystal structures predicted by this algorithm are consistent with literature, providing confidence in the method. Finally, we predict new crystal structures for hard asymmetric dumbbell particles, bowl-like particles and hard oblate cylinders and present the phase diagram for the oblate cylinders based on full free energy calculations. PMID:19905838

  10. Virality Prediction and Community Structure in Social Networks

    PubMed Central

    Weng, Lilian; Menczer, Filippo; Ahn, Yong-Yeol

    2013-01-01

    How does network structure affect diffusion? Recent studies suggest that the answer depends on the type of contagion. Complex contagions, unlike infectious diseases (simple contagions), are affected by social reinforcement and homophily. Hence, the spread within highly clustered communities is enhanced, while diffusion across communities is hampered. A common hypothesis is that memes and behaviors are complex contagions. We show that, while most memes indeed spread like complex contagions, a few viral memes spread across many communities, like diseases. We demonstrate that the future popularity of a meme can be predicted by quantifying its early spreading pattern in terms of community concentration. The more communities a meme permeates, the more viral it is. We present a practical method to translate data about community structure into predictive knowledge about what information will spread widely. This connection contributes to our understanding in computational social science, social media analytics, and marketing applications. PMID:23982106

  11. Virality Prediction and Community Structure in Social Networks

    NASA Astrophysics Data System (ADS)

    Weng, Lilian; Menczer, Filippo; Ahn, Yong-Yeol

    2013-08-01

    How does network structure affect diffusion? Recent studies suggest that the answer depends on the type of contagion. Complex contagions, unlike infectious diseases (simple contagions), are affected by social reinforcement and homophily. Hence, the spread within highly clustered communities is enhanced, while diffusion across communities is hampered. A common hypothesis is that memes and behaviors are complex contagions. We show that, while most memes indeed spread like complex contagions, a few viral memes spread across many communities, like diseases. We demonstrate that the future popularity of a meme can be predicted by quantifying its early spreading pattern in terms of community concentration. The more communities a meme permeates, the more viral it is. We present a practical method to translate data about community structure into predictive knowledge about what information will spread widely. This connection contributes to our understanding in computational social science, social media analytics, and marketing applications.

  12. One Single Static Measurement Predicts Wave Localization in Complex Structures

    NASA Astrophysics Data System (ADS)

    Lefebvre, Gautier; Gondel, Alexane; Dubois, Marc; Atlan, Michael; Feppon, Florian; Labbé, Aimé; Gillot, Camille; Garelli, Alix; Ernoult, Maxence; Mayboroda, Svitlana; Filoche, Marcel; Sebbah, Patrick

    2016-08-01

    A recent theoretical breakthrough has brought a new tool, called the localization landscape, for predicting the localization regions of vibration modes in complex or disordered systems. Here, we report on the first experiment which measures the localization landscape and demonstrates its predictive power. Holographic measurement of the static deformation under uniform load of a thin plate with complex geometry provides direct access to the landscape function. When put in vibration, this system shows modes precisely confined within the subregions delineated by the landscape function. Also the maxima of this function match the measured eigenfrequencies, while the minima of the valley network gives the frequencies at which modes become extended. This approach fully characterizes the low frequency spectrum of a complex structure from a single static measurement. It paves the way for controlling and engineering eigenmodes in any vibratory system, especially where a structural or microscopic description is not accessible.

  13. Virality prediction and community structure in social networks.

    PubMed

    Weng, Lilian; Menczer, Filippo; Ahn, Yong-Yeol

    2013-01-01

    How does network structure affect diffusion? Recent studies suggest that the answer depends on the type of contagion. Complex contagions, unlike infectious diseases (simple contagions), are affected by social reinforcement and homophily. Hence, the spread within highly clustered communities is enhanced, while diffusion across communities is hampered. A common hypothesis is that memes and behaviors are complex contagions. We show that, while most memes indeed spread like complex contagions, a few viral memes spread across many communities, like diseases. We demonstrate that the future popularity of a meme can be predicted by quantifying its early spreading pattern in terms of community concentration. The more communities a meme permeates, the more viral it is. We present a practical method to translate data about community structure into predictive knowledge about what information will spread widely. This connection contributes to our understanding in computational social science, social media analytics, and marketing applications. PMID:23982106

  14. Virality prediction and community structure in social networks.

    PubMed

    Weng, Lilian; Menczer, Filippo; Ahn, Yong-Yeol

    2013-01-01

    How does network structure affect diffusion? Recent studies suggest that the answer depends on the type of contagion. Complex contagions, unlike infectious diseases (simple contagions), are affected by social reinforcement and homophily. Hence, the spread within highly clustered communities is enhanced, while diffusion across communities is hampered. A common hypothesis is that memes and behaviors are complex contagions. We show that, while most memes indeed spread like complex contagions, a few viral memes spread across many communities, like diseases. We demonstrate that the future popularity of a meme can be predicted by quantifying its early spreading pattern in terms of community concentration. The more communities a meme permeates, the more viral it is. We present a practical method to translate data about community structure into predictive knowledge about what information will spread widely. This connection contributes to our understanding in computational social science, social media analytics, and marketing applications.

  15. One Single Static Measurement Predicts Wave Localization in Complex Structures.

    PubMed

    Lefebvre, Gautier; Gondel, Alexane; Dubois, Marc; Atlan, Michael; Feppon, Florian; Labbé, Aimé; Gillot, Camille; Garelli, Alix; Ernoult, Maxence; Mayboroda, Svitlana; Filoche, Marcel; Sebbah, Patrick

    2016-08-12

    A recent theoretical breakthrough has brought a new tool, called the localization landscape, for predicting the localization regions of vibration modes in complex or disordered systems. Here, we report on the first experiment which measures the localization landscape and demonstrates its predictive power. Holographic measurement of the static deformation under uniform load of a thin plate with complex geometry provides direct access to the landscape function. When put in vibration, this system shows modes precisely confined within the subregions delineated by the landscape function. Also the maxima of this function match the measured eigenfrequencies, while the minima of the valley network gives the frequencies at which modes become extended. This approach fully characterizes the low frequency spectrum of a complex structure from a single static measurement. It paves the way for controlling and engineering eigenmodes in any vibratory system, especially where a structural or microscopic description is not accessible. PMID:27563967

  16. Structure-based identification of catalytic residues.

    PubMed

    Yahalom, Ran; Reshef, Dan; Wiener, Ayana; Frankel, Sagiv; Kalisman, Nir; Lerner, Boaz; Keasar, Chen

    2011-06-01

    The identification of catalytic residues is an essential step in functional characterization of enzymes. We present a purely structural approach to this problem, which is motivated by the difficulty of evolution-based methods to annotate structural genomics targets that have few or no homologs in the databases. Our approach combines a state-of-the-art support vector machine (SVM) classifier with novel structural features that augment structural clues by spatial averaging and Z scoring. Special attention is paid to the class imbalance problem that stems from the overwhelming number of non-catalytic residues in enzymes compared to catalytic residues. This problem is tackled by: (1) optimizing the classifier to maximize a performance criterion that considers both Type I and Type II errors in the classification of catalytic and non-catalytic residues; (2) under-sampling non-catalytic residues before SVM training; and (3) during SVM training, penalizing errors in learning catalytic residues more than errors in learning non-catalytic residues. Tested on four enzyme datasets, one specifically designed by us to mimic the structural genomics scenario and three previously evaluated datasets, our structure-based classifier is never inferior to similar structure-based classifiers and comparable to classifiers that use both structural and evolutionary features. In addition to the evaluation of the performance of catalytic residue identification, we also present detailed case studies on three proteins. This analysis suggests that many false positive predictions may correspond to binding sites and other functional residues. A web server that implements the method, our own-designed database, and the source code of the programs are publicly available at http://www.cs.bgu.ac.il/∼meshi/functionPrediction.

  17. TT2NE: a novel algorithm to predict RNA secondary structures with pseudoknots

    PubMed Central

    Bon, Michaël; Orland, Henri

    2011-01-01

    We present TT2NE, a new algorithm to predict RNA secondary structures with pseudoknots. The method is based on a classification of RNA structures according to their topological genus. TT2NE is guaranteed to find the minimum free energy structure regardless of pseudoknot topology. This unique proficiency is obtained at the expense of the maximum length of sequences that can be treated, but comparison with state-of-the-art algorithms shows that TT2NE significantly improves the quality of predictions. Analysis of TT2NE's incorrect predictions sheds light on the need to study how sterical constraints limit the range of pseudoknotted structures that can be formed from a given sequence. An implementation of TT2NE on a public server can be found at http://ipht.cea.fr/rna/tt2ne.php. PMID:21593129

  18. Effects of scale in predicting global structural response

    NASA Technical Reports Server (NTRS)

    Deo, R. B.; Kan, H. P.

    1991-01-01

    Analytical techniques for scale-up effects were reviewed. The advantages and limitations of applying the principles of similitude to composite structures is summarized and illustrated by simple examples. An analytical procedure was formulated to design scale models of an axially compressed composite cylinder. A building-block approach was outlined where each structural detail is analyzed independently and the probable failure sequence of a selected component is predicted, taking into account load redistribution subsequent to first element failure. Details of this building-block approach are under development.

  19. Image-Based Predictive Modeling of Heart Mechanics.

    PubMed

    Wang, V Y; Nielsen, P M F; Nash, M P

    2015-01-01

    Personalized biophysical modeling of the heart is a useful approach for noninvasively analyzing and predicting in vivo cardiac mechanics. Three main developments support this style of analysis: state-of-the-art cardiac imaging technologies, modern computational infrastructure, and advanced mathematical modeling techniques. In vivo measurements of cardiac structure and function can be integrated using sophisticated computational methods to investigate mechanisms of myocardial function and dysfunction, and can aid in clinical diagnosis and developing personalized treatment. In this article, we review the state-of-the-art in cardiac imaging modalities, model-based interpretation of 3D images of cardiac structure and function, and recent advances in modeling that allow personalized predictions of heart mechanics. We discuss how using such image-based modeling frameworks can increase the understanding of the fundamental biophysics behind cardiac mechanics, and assist with diagnosis, surgical guidance, and treatment planning. Addressing the challenges in this field will require a coordinated effort from both the clinical-imaging and modeling communities. We also discuss future directions that can be taken to bridge the gap between basic science and clinical translation.

  20. Evaluation of a universal flow-through model for predicting and designing phosphorus removal structures.

    PubMed

    Penn, Chad; Bowen, James; McGrath, Joshua; Nairn, Robert; Fox, Garey; Brown, Glenn; Wilson, Stuart; Gill, Clinton

    2016-05-01

    Phosphorus (P) removal structures have been shown to decrease dissolved P loss from agricultural and urban areas which may reduce the threat of eutrophication. In order to design or quantify performance of these structures, the relationship between discrete and cumulative removal with cumulative P loading must be determined, either by individual flow-through experiments or model prediction. A model was previously developed for predicting P removal with P sorption materials (PSMs) under flow-through conditions, as a function of inflow P concentration, retention time (RT), and PSM characteristics. The objective of this study was to compare model results to measured P removal data from several PSM under a range of conditions (P concentrations and RT) and scales ranging from laboratory to field. Materials tested included acid mine drainage residuals (AMDRs), treated and non-treated electric arc furnace (EAF) steel slag at different size fractions, and flue gas desulfurization (FGD) gypsum. Equations for P removal curves and cumulative P removed were not significantly different between predicted and actual values for any of the 23 scenarios examined. However, the model did tend to slightly over-predict cumulative P removal for calcium-based PSMs. The ability of the model to predict P removal for various materials, RTs, and P concentrations in both controlled settings and field structures validate its use in design and quantification of these structures. This ability to predict P removal without constant monitoring is vital to widespread adoption of P removal structures, especially for meeting discharge regulations and nutrient trading programs.

  1. Plasma-based accelerator structures

    SciTech Connect

    Schroeder, Carl B.

    1999-12-01

    Plasma-based accelerators have the ability to sustain extremely large accelerating gradients, with possible high-energy physics applications. This dissertation further develops the theory of plasma-based accelerators by addressing three topics: the performance of a hollow plasma channel as an accelerating structure, the generation of ultrashort electron bunches, and the propagation of laser pulses is underdense plasmas.

  2. SHM-Based Probabilistic Fatigue Life Prediction for Bridges Based on FE Model Updating.

    PubMed

    Lee, Young-Joo; Cho, Soojin

    2016-01-01

    Fatigue life prediction for a bridge should be based on the current condition of the bridge, and various sources of uncertainty, such as material properties, anticipated vehicle loads and environmental conditions, make the prediction very challenging. This paper presents a new approach for probabilistic fatigue life prediction for bridges using finite element (FE) model updating based on structural health monitoring (SHM) data. Recently, various types of SHM systems have been used to monitor and evaluate the long-term structural performance of bridges. For example, SHM data can be used to estimate the degradation of an in-service bridge, which makes it possible to update the initial FE model. The proposed method consists of three steps: (1) identifying the modal properties of a bridge, such as mode shapes and natural frequencies, based on the ambient vibration under passing vehicles; (2) updating the structural parameters of an initial FE model using the identified modal properties; and (3) predicting the probabilistic fatigue life using the updated FE model. The proposed method is demonstrated by application to a numerical model of a bridge, and the impact of FE model updating on the bridge fatigue life is discussed. PMID:26950125

  3. SHM-Based Probabilistic Fatigue Life Prediction for Bridges Based on FE Model Updating

    PubMed Central

    Lee, Young-Joo; Cho, Soojin

    2016-01-01

    Fatigue life prediction for a bridge should be based on the current condition of the bridge, and various sources of uncertainty, such as material properties, anticipated vehicle loads and environmental conditions, make the prediction very challenging. This paper presents a new approach for probabilistic fatigue life prediction for bridges using finite element (FE) model updating based on structural health monitoring (SHM) data. Recently, various types of SHM systems have been used to monitor and evaluate the long-term structural performance of bridges. For example, SHM data can be used to estimate the degradation of an in-service bridge, which makes it possible to update the initial FE model. The proposed method consists of three steps: (1) identifying the modal properties of a bridge, such as mode shapes and natural frequencies, based on the ambient vibration under passing vehicles; (2) updating the structural parameters of an initial FE model using the identified modal properties; and (3) predicting the probabilistic fatigue life using the updated FE model. The proposed method is demonstrated by application to a numerical model of a bridge, and the impact of FE model updating on the bridge fatigue life is discussed. PMID:26950125

  4. SHM-Based Probabilistic Fatigue Life Prediction for Bridges Based on FE Model Updating.

    PubMed

    Lee, Young-Joo; Cho, Soojin

    2016-03-02

    Fatigue life prediction for a bridge should be based on the current condition of the bridge, and various sources of uncertainty, such as material properties, anticipated vehicle loads and environmental conditions, make the prediction very challenging. This paper presents a new approach for probabilistic fatigue life prediction for bridges using finite element (FE) model updating based on structural health monitoring (SHM) data. Recently, various types of SHM systems have been used to monitor and evaluate the long-term structural performance of bridges. For example, SHM data can be used to estimate the degradation of an in-service bridge, which makes it possible to update the initial FE model. The proposed method consists of three steps: (1) identifying the modal properties of a bridge, such as mode shapes and natural frequencies, based on the ambient vibration under passing vehicles; (2) updating the structural parameters of an initial FE model using the identified modal properties; and (3) predicting the probabilistic fatigue life using the updated FE model. The proposed method is demonstrated by application to a numerical model of a bridge, and the impact of FE model updating on the bridge fatigue life is discussed.

  5. ASTRO-FOLD 2.0: an Enhanced Framework for Protein Structure Prediction.

    PubMed

    Subramani, A; Wei, Y; Floudas, C A

    2012-05-01

    The three-dimensional (3-D) structure prediction of proteins, given their amino acid sequence, is addressed using the first principles-based approach ASTRO-FOLD 2.0. The key features presented are: (1) Secondary structure prediction using a novel optimization-based consensus approach, (2) β-sheet topology prediction using mixed-integer linear optimization (MILP), (3) Residue-to-residue contact prediction using a high-resolution distance-dependent force field and MILP formulation, (4) Tight dihedral angle and distance bound generation for loop residues using dihedral angle clustering and non-linear optimization (NLP), (5) 3-D structure prediction using deterministic global optimization, stochastic conformational space annealing, and the full-atomistic ECEPP/3 potential, (6) Near-native structure selection using a traveling salesman problem-based clustering approach, ICON, and (7) Improved bound generation using chemical shifts of subsets of heavy atoms, generated by SPARTA and CS23D. Computational results of ASTRO-FOLD 2.0 on 47 blind targets of the recently concluded CASP9 experiment are presented.

  6. ASTRO-FOLD 2.0: an Enhanced Framework for Protein Structure Prediction

    PubMed Central

    Subramani, A.; Wei, Y.; Floudas, C. A.

    2011-01-01

    The three-dimensional (3-D) structure prediction of proteins, given their amino acid sequence, is addressed using the first principles–based approach ASTRO-FOLD 2.0. The key features presented are: (1) Secondary structure prediction using a novel optimization-based consensus approach, (2) β-sheet topology prediction using mixed-integer linear optimization (MILP), (3) Residue-to-residue contact prediction using a high-resolution distance-dependent force field and MILP formulation, (4) Tight dihedral angle and distance bound generation for loop residues using dihedral angle clustering and non-linear optimization (NLP), (5) 3-D structure prediction using deterministic global optimization, stochastic conformational space annealing, and the full-atomistic ECEPP/3 potential, (6) Near-native structure selection using a traveling salesman problem-based clustering approach, ICON, and (7) Improved bound generation using chemical shifts of subsets of heavy atoms, generated by SPARTA and CS23D. Computational results of ASTRO-FOLD 2.0 on 47 blind targets of the recently concluded CASP9 experiment are presented. PMID:23049093

  7. Prediction-based dynamic load-sharing heuristics

    NASA Technical Reports Server (NTRS)

    Goswami, Kumar K.; Devarakonda, Murthy; Iyer, Ravishankar K.

    1993-01-01

    The authors present dynamic load-sharing heuristics that use predicted resource requirements of processes to manage workloads in a distributed system. A previously developed statistical pattern-recognition method is employed for resource prediction. While nonprediction-based heuristics depend on a rapidly changing system status, the new heuristics depend on slowly changing program resource usage patterns. Furthermore, prediction-based heuristics can be more effective since they use future requirements rather than just the current system state. Four prediction-based heuristics, two centralized and two distributed, are presented. Using trace driven simulations, they are compared against random scheduling and two effective nonprediction based heuristics. Results show that the prediction-based centralized heuristics achieve up to 30 percent better response times than the nonprediction centralized heuristic, and that the prediction-based distributed heuristics achieve up to 50 percent improvements relative to their nonprediction counterpart.

  8. Sequence-only evolutionary and predicted structural features for the prediction of stability changes in protein mutants

    PubMed Central

    2013-01-01

    Background Even a single amino acid substitution in a protein sequence may result in significant changes in protein stability, structure, and therefore in protein function as well. In the post-genomic era, computational methods for predicting stability changes from only the sequence of a protein are of importance. While evolutionary relationships of protein mutations can be extracted from large protein databases holding millions of protein sequences, relevant evolutionary features for the prediction of stability changes have not been proposed. Also, the use of predicted structural features in situations when a protein structure is not available has not been explored. Results We proposed a number of evolutionary and predicted structural features for the prediction of stability changes and analysed which of them capture the determinants of protein stability the best. We trained and evaluated our machine learning method on a non-redundant data set of experimentally measured stability changes. When only the direction of the stability change was predicted, we found that the best performance improvement can be achieved by the combination of the evolutionary features mutation likelihood and SIFTscore in conjunction with the predicted structural feature secondary structure. The same two evolutionary features in the combination with the predicted structural feature accessible surface area achieved the lowest error when the prediction of actual values of stability changes was assessed. Compared to similar studies, our method achieved improvements in prediction performance. Conclusion Although the strongest feature for the prediction of stability changes appears to be the vector of amino acid identities in the sequential neighbourhood of the mutation, the most relevant combination of evolutionary and predicted structural features further improves prediction performance. Even the predicted structural features, which did not perform well on their own, turn out to be beneficial

  9. Mimicking the folding pathway to improve homology-free protein structure prediction

    NASA Astrophysics Data System (ADS)

    Freed, Karl; Debartolo, Joe; Colubri, Andres; Jha, Abhishek; Fitzgerald, James; Sosnick, Tobin

    2010-03-01

    Since demonstrating that a protein's sequence encodes its structure, the prediction of structure from sequence remains an outstanding problem that impacts numerous scientific disciplines including many genome projects. By iteratively fixing secondary structure assignments of residues during Monte Carlo simulations of folding, our coarse grained model without information concerning homology or explicit side chains outperforms current homology-based secondary structure prediction methods for many proteins. The computationally rapid algorithm using only single residue (phi, psi) dihedral angle moves also generates tertiary structures of comparable accuracy to existing all-atom methods for many small proteins, particularly ones with low homology. Hence, given appropriate search strategies and scoring functions, reduced representations can be used for accurately predicting secondary structure as well as providing three-dimensional structures, thereby increasing the size of proteins approachable by homology-free methods and the accuracy of template methods whose accuracy depends on the quality of the input secondary structure. Inclusion of information from evolutionarily related sequences enhances the statistics and the accuracy of the predictions.

  10. The neurobiology of memory based predictions.

    PubMed

    Eichenbaum, Howard; Fortin, Norbert J

    2009-05-12

    Recent findings indicate that, in humans, the hippocampal memory system is involved in the capacity to imagine the future as well as remember the past. Other studies have suggested that animals may also have the capacity to recall the past and plan for the future. Here, we will consider data that bridge between these sets of findings by assessing the role of the hippocampus in memory and prediction in rats. We will argue that animals have the capacity for recollection and that the hippocampus plays a central and selective role in binding information in the service of recollective memory. Then we will consider examples of transitive inference, a paradigm that requires the integration of overlapping memories and flexible use of the resulting relational memory networks for generating predictions in novel situations. Our data show that animals have the capacity for transitive inference and that the hippocampus plays a central role in the ability to predict outcomes of events that have not yet occurred.

  11. Optimal mutation sites for PRE data collection and membrane protein structure prediction.

    PubMed

    Chen, Huiling; Ji, Fei; Olman, Victor; Mobley, Charles K; Liu, Yizhou; Zhou, Yunpeng; Bushweller, John H; Prestegard, James H; Xu, Ying

    2011-04-13

    Nuclear magnetic resonance paramagnetic relaxation enhancement (PRE) measures long-range distances to isotopically labeled residues, providing useful constraints for protein structure prediction. The method usually requires labor-intensive conjugation of nitroxide labels to multiple locations on the protein, one at a time. Here a computational procedure, based on protein sequence and simple secondary structure models, is presented to facilitate optimal placement of a minimum number of labels needed to determine the correct topology of a helical transmembrane protein. Tests on DsbB (four helices) using just one label lead to correct topology predictions in four of five cases, with the predicted structures <6 Å to the native structure. Benchmark results using simulated PRE data show that we can generally predict the correct topology for five and six to seven helices using two and three labels, respectively, with an average success rate of 76% and structures of similar precision. The results show promise in facilitating experimentally constrained structure prediction of membrane proteins.

  12. A multilayer evaluation approach for protein structure prediction and model quality assessment.

    PubMed

    Zhang, Jingfen; Wang, Qingguo; Vantasin, Kittinun; Zhang, Jiong; He, Zhiquan; Kosztin, Ioan; Shang, Yi; Xu, Dong

    2011-01-01

    Protein tertiary structures are essential for studying functions of proteins at molecular level. An indispensable approach for protein structure solution is computational prediction. Most protein structure prediction methods generate candidate models first and select the best candidates by model quality assessment (QA). In many cases, good models can be produced, but the QA tools fail to select the best ones from the candidate model pool. Because of incomplete understanding of protein folding, each QA method only reflects partial facets of a structure model and thus has limited discerning power with no one consistently outperforming others. In this article, we developed a set of new QA methods, including two QA methods for evaluating target/template alignments, a molecular dynamics (MD)-based QA method, and three consensus QA methods with selected references to reveal new facets of protein structures complementary to the existing methods. Moreover, the underlying relationship among different QA methods were analyzed and then integrated into a multilayer evaluation approach to guide the model generation and model selection in prediction. All methods are integrated and implemented into an innovative and improved prediction system hereafter referred to as MUFOLD. In CASP8 and CASP9, MUFOLD has demonstrated the proof of the principles in terms of both QA discerning power and structure prediction accuracy. PMID:21997706

  13. A multilayer evaluation approach for protein structure prediction and model quality assessment.

    PubMed

    Zhang, Jingfen; Wang, Qingguo; Vantasin, Kittinun; Zhang, Jiong; He, Zhiquan; Kosztin, Ioan; Shang, Yi; Xu, Dong

    2011-01-01

    Protein tertiary structures are essential for studying functions of proteins at molecular level. An indispensable approach for protein structure solution is computational prediction. Most protein structure prediction methods generate candidate models first and select the best candidates by model quality assessment (QA). In many cases, good models can be produced, but the QA tools fail to select the best ones from the candidate model pool. Because of incomplete understanding of protein folding, each QA method only reflects partial facets of a structure model and thus has limited discerning power with no one consistently outperforming others. In this article, we developed a set of new QA methods, including two QA methods for evaluating target/template alignments, a molecular dynamics (MD)-based QA method, and three consensus QA methods with selected references to reveal new facets of protein structures complementary to the existing methods. Moreover, the underlying relationship among different QA methods were analyzed and then integrated into a multilayer evaluation approach to guide the model generation and model selection in prediction. All methods are integrated and implemented into an innovative and improved prediction system hereafter referred to as MUFOLD. In CASP8 and CASP9, MUFOLD has demonstrated the proof of the principles in terms of both QA discerning power and structure prediction accuracy.

  14. Using crystal structure prediction to rationalize the hydration propensities of substituted adamantane hydrochloride salts.

    PubMed

    Mohamed, Sharmarke; Karothu, Durga Prasad; Naumov, Panče

    2016-08-01

    The crystal energy landscapes of the salts of two rigid pharmaceutically active molecules reveal that the experimental structure of amantadine hydrochloride is the most stable structure with the majority of low-energy structures adopting a chain hydrogen-bond motif and packings that do not have solvent accessible voids. By contrast, memantine hydrochloride which differs in the substitution of two methyl groups on the adamantane ring has a crystal energy landscape where all structures within 10 kJ mol(-1) of the global minimum have solvent-accessible voids ranging from 3 to 14% of the unit-cell volume including the lattice energy minimum that was calculated after removing water from the hydrated memantine hydrochloride salt structure. The success in using crystal structure prediction (CSP) to rationalize the different hydration propensities of these substituted adamantane hydrochloride salts allowed us to extend the model to predict under blind test conditions the experimental crystal structures of the previously uncharacterized 1-(methylamino)adamantane base and its corresponding hydrochloride salt. Although the crystal structure of 1-(methylamino)adamantane was correctly predicted as the second ranked structure on the static lattice energy landscape, the crystallization of a Z' = 3 structure of 1-(methylamino)adamantane hydrochloride reveals the limits of applying CSP when the contents of the crystallographic asymmetric unit are unknown.

  15. Using crystal structure prediction to rationalize the hydration propensities of substituted adamantane hydrochloride salts.

    PubMed

    Mohamed, Sharmarke; Karothu, Durga Prasad; Naumov, Panče

    2016-08-01

    The crystal energy landscapes of the salts of two rigid pharmaceutically active molecules reveal that the experimental structure of amantadine hydrochloride is the most stable structure with the majority of low-energy structures adopting a chain hydrogen-bond motif and packings that do not have solvent accessible voids. By contrast, memantine hydrochloride which differs in the substitution of two methyl groups on the adamantane ring has a crystal energy landscape where all structures within 10 kJ mol(-1) of the global minimum have solvent-accessible voids ranging from 3 to 14% of the unit-cell volume including the lattice energy minimum that was calculated after removing water from the hydrated memantine hydrochloride salt structure. The success in using crystal structure prediction (CSP) to rationalize the different hydration propensities of these substituted adamantane hydrochloride salts allowed us to extend the model to predict under blind test conditions the experimental crystal structures of the previously uncharacterized 1-(methylamino)adamantane base and its corresponding hydrochloride salt. Although the crystal structure of 1-(methylamino)adamantane was correctly predicted as the second ranked structure on the static lattice energy landscape, the crystallization of a Z' = 3 structure of 1-(methylamino)adamantane hydrochloride reveals the limits of applying CSP when the contents of the crystallographic asymmetric unit are unknown. PMID:27484376

  16. Structure Prediction: New Insights into Decrypting Long Noncoding RNAs

    PubMed Central

    Yan, Kun; Arfat, Yasir; Li, Dijie; Zhao, Fan; Chen, Zhihao; Yin, Chong; Sun, Yulong; Hu, Lifang; Yang, Tuanmin; Qian, Airong

    2016-01-01

    Long noncoding RNAs (lncRNAs), which form a diverse class of RNAs, remain the least understood type of noncoding RNAs in terms of their nature and identification. Emerging evidence has revealed that a small number of newly discovered lncRNAs perform important and complex biological functions such as dosage compensation, chromatin regulation, genomic imprinting, and nuclear organization. However, understanding the wide range of functions of lncRNAs related to various processes of cellular networks remains a great experimental challenge. Structural versatility is critical for RNAs to perform various functions and provides new insights into probing the functions of lncRNAs. In recent years, the computational method of RNA structure prediction has been developed to analyze the structure of lncRNAs. This novel methodology has provided basic but indispensable information for the rapid, large-scale and in-depth research of lncRNAs. This review focuses on mainstream RNA structure prediction methods at the secondary and tertiary levels to offer an additional approach to investigating the functions of lncRNAs. PMID:26805815

  17. Functional structure of biological communities predicts ecosystem multifunctionality.

    PubMed

    Mouillot, David; Villéger, Sébastien; Scherer-Lorenzen, Michael; Mason, Norman W H

    2011-01-01

    The accelerating rate of change in biodiversity patterns, mediated by ever increasing human pressures and global warming, demands a better understanding of the relationship between the structure of biological communities and ecosystem functioning (BEF). Recent investigations suggest that the functional structure of communities, i.e. the composition and diversity of functional traits, is the main driver of ecological processes. However, the predictive power of BEF research is still low, the integration of all components of functional community structure as predictors is still lacking, and the multifunctionality of ecosystems (i.e. rates of multiple processes) must be considered. Here, using a multiple-processes framework from grassland biodiversity experiments, we show that functional identity of species and functional divergence among species, rather than species diversity per se, together promote the level of ecosystem multifunctionality with a predictive power of 80%. Our results suggest that primary productivity and decomposition rates, two key ecosystem processes upon which the global carbon cycle depends, are primarily sustained by specialist species, i.e. those that hold specialized combinations of traits and perform particular functions. Contrary to studies focusing on single ecosystem functions and considering species richness as the sole measure of biodiversity, we found a linear and non-saturating effect of the functional structure of communities on ecosystem multifunctionality. Thus, sustaining multiple ecological processes would require focusing on trait dominance and on the degree of community specialization, even in species-rich assemblages.

  18. Prediction of Solar Activity Based on Neuro-Fuzzy Modeling

    NASA Astrophysics Data System (ADS)

    Attia, Abdel-Fattah; Abdel-Hamid, Rabab; Quassim, Maha

    2005-03-01

    This paper presents an application of the neuro-fuzzy modeling to analyze the time series of solar activity, as measured through the relative Wolf number. The neuro-fuzzy structure is optimized based on the linear adapted genetic algorithm with controlling population size (LAGA-POP). Initially, the dimension of the time series characteristic attractor is obtained based on the smallest regularity criterion (RC) and the neuro-fuzzy model. Then the performance of the proposed approach, in forecasting yearly sunspot numbers, is favorably compared to that of other published methods. Finally, a comparison predictions for the remaining part of the 22nd and the whole 23rd cycle of the solar activity are presented.

  19. Reducing model structural uncertainty in predictions for ungauged basins via Bayesian approach.

    NASA Astrophysics Data System (ADS)

    Prieto, Cristina; Le Vine, Nataliya; Vitolo, Claudia; García, Eduardo; Medina, Raúl

    2016-04-01

    A catchment is a complex system where a multitude of interrelated energy, water and vegetation processes occur at different temporal and spatial scales. A rainfall-runoff model is a simplified representation of the system, and serves as a hypothesis about an inner catchment working. In predictions for ungauged basins, a common practice is to use a pre-selected assumed-to-be-perfect model structure to represent all catchments under analysis. However, it is unlikely that the same model structure is appropriate for diverse catchments due to the 'uniqueness of the place'. At the same time, there is no obvious justification to select a single model structure as a suitable description of the system. The contribution of this research is a move forward in the 'one size fits all' problem for predicting flows in ungauged basins. We present a statistical methodology, which allows regionalization that considers the information given by different hydrological model structures. First, the information to be regionalised is compactly represented via Principal Component Analysis. Second, the most significant principal components are regionalised using non-linear regionalisation method based on Random Forests. Third, a regionalisation error structure is derived based on the gauged catchments to be used in the Bayesian condition of the rainfall-runoff structures and their parameters. The methodological developments are demonstrated for predicting flows in ungauged basins of Northern Spain; and the results show that the methodology allows improving the flow prediction.

  20. EVO—Evolutionary algorithm for crystal structure prediction

    NASA Astrophysics Data System (ADS)

    Bahmann, Silvia; Kortus, Jens

    2013-06-01

    We present EVO—an evolution strategy designed for crystal structure search and prediction. The concept and main features of biological evolution such as creation of diversity and survival of the fittest have been transferred to crystal structure prediction. EVO successfully demonstrates its applicability to find crystal structures of the elements of the 3rd main group with their different spacegroups. For this we used the number of atoms in the conventional cell and multiples of it. Running EVO with different numbers of carbon atoms per unit cell yields graphite as the lowest energy structure as well as a diamond-like structure, both in one run. Our implementation also supports the search for 2D structures and was able to find a boron sheet with structural features so far not considered in literature. Program summaryProgram title: EVO Catalogue identifier: AEOZ_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AEOZ_v1_0.html Program obtainable from: CPC Program Library, Queen's University, Belfast, N. Ireland Licensing provisions: GNU General Public License version 3 No. of lines in distributed program, including test data, etc.: 23488 No. of bytes in distributed program, including test data, etc.: 1830122 Distribution format: tar.gz Programming language: Python. Computer: No limitations known. Operating system: Linux. RAM: Negligible compared to the requirements of the electronic structure programs used Classification: 7.8. External routines: Quantum ESPRESSO (http://www.quantum-espresso.org/), GULP (https://projects.ivec.org/gulp/) Nature of problem: Crystal structure search is a global optimisation problem in 3N+3 dimensions where N is the number of atoms in the unit cell. The high dimensional search space is accompanied by an unknown energy landscape. Solution method: Evolutionary algorithms transfer the main features of biological evolution to use them in global searches. The combination of the "survival of the fittest" (deterministic) and the

  1. Predicting hepatotoxicity using ToxCast in vitro bioactivity and chemical structure

    EPA Science Inventory

    Background: The U.S. EPA ToxCastTM program is screening thousands of environmental chemicals for bioactivity using hundreds of high-throughput in vitro assays to build predictive models of toxicity. We represented chemicals based on bioactivity and chemical structure descriptors ...

  2. Prediction of three-dimensional transmembrane helical protein structures

    NASA Astrophysics Data System (ADS)

    Barth, Patrick

    Membrane proteins are critical to living cells and their dysfunction can lead to serious diseases. High-resolution structures of these proteins would provide very valuable information for designing eficient therapies but membrane protein crystallization is a major bottleneck. As an important alternative approach, methods for predicting membrane protein structures have been developed in recent years. This chapter focuses on the problem of modeling the structure of transmembrane helical proteins, and describes recent advancements, current limitations, and future challenges facing de novo modeling, modeling with experimental constraints, and high-resolution comparative modeling of these proteins. Abbreviations: MP, membrane protein; SP, water-soluble protein; RMSD, root-mean square deviation; Cα RMSD, root-mean square deviation over Cα atoms; TM, transmembrane; TMH, transmembrane helix; GPCR, G protein-coupled receptor; 3D, three dimensional; NMR, nuclear magnetic resonance spectroscopy; EPR, electron paramagnetic resonance spectroscopy; FTIR, Fourier transform infrared spectroscopy.

  3. Predicting the stability of large structured food webs.

    PubMed

    Allesina, Stefano; Grilli, Jacopo; Barabás, György; Tang, Si; Aljadeff, Johnatan; Maritan, Amos

    2015-01-01

    The stability of ecological systems has been a long-standing focus of ecology. Recently, tools from random matrix theory have identified the main drivers of stability in ecological communities whose network structure is random. However, empirical food webs differ greatly from random graphs. For example, their degree distribution is broader, they contain few trophic cycles, and they are almost interval. Here we derive an approximation for the stability of food webs whose structure is generated by the cascade model, in which 'larger' species consume 'smaller' ones. We predict the stability of these food webs with great accuracy, and our approximation also works well for food webs whose structure is determined empirically or by the niche model. We find that intervality and broad degree distributions tend to stabilize food webs, and that average interaction strength has little influence on stability, compared with the effect of variance and correlation. PMID:26198207

  4. Predicting the stability of large structured food webs

    PubMed Central

    Allesina, Stefano; Grilli, Jacopo; Barabás, György; Tang, Si; Aljadeff, Johnatan; Maritan, Amos

    2015-01-01

    The stability of ecological systems has been a long-standing focus of ecology. Recently, tools from random matrix theory have identified the main drivers of stability in ecological communities whose network structure is random. However, empirical food webs differ greatly from random graphs. For example, their degree distribution is broader, they contain few trophic cycles, and they are almost interval. Here we derive an approximation for the stability of food webs whose structure is generated by the cascade model, in which ‘larger' species consume ‘smaller' ones. We predict the stability of these food webs with great accuracy, and our approximation also works well for food webs whose structure is determined empirically or by the niche model. We find that intervality and broad degree distributions tend to stabilize food webs, and that average interaction strength has little influence on stability, compared with the effect of variance and correlation. PMID:26198207

  5. Predicting the stability of large structured food webs.

    PubMed

    Allesina, Stefano; Grilli, Jacopo; Barabás, György; Tang, Si; Aljadeff, Johnatan; Maritan, Amos

    2015-07-22

    The stability of ecological systems has been a long-standing focus of ecology. Recently, tools from random matrix theory have identified the main drivers of stability in ecological communities whose network structure is random. However, empirical food webs differ greatly from random graphs. For example, their degree distribution is broader, they contain few trophic cycles, and they are almost interval. Here we derive an approximation for the stability of food webs whose structure is generated by the cascade model, in which 'larger' species consume 'smaller' ones. We predict the stability of these food webs with great accuracy, and our approximation also works well for food webs whose structure is determined empirically or by the niche model. We find that intervality and broad degree distributions tend to stabilize food webs, and that average interaction strength has little influence on stability, compared with the effect of variance and correlation.

  6. RPI-Pred: predicting ncRNA-protein interaction using sequence and structural information

    PubMed Central

    Suresh, V.; Liu, Liang; Adjeroh, Donald; Zhou, Xiaobo

    2015-01-01

    RNA-protein complexes are essential in mediating important fundamental cellular processes, such as transport and localization. In particular, ncRNA-protein interactions play an important role in post-transcriptional gene regulation like mRNA localization, mRNA stabilization, poly-adenylation, splicing and translation. The experimental methods to solve RNA-protein interaction prediction problem remain expensive and time-consuming. Here, we present the RPI-Pred (RNA-protein interaction predictor), a new support-vector machine-based method, to predict protein-RNA interaction pairs, based on both the sequences and structures. The results show that RPI-Pred can correctly predict RNA-protein interaction pairs with ∼94% prediction accuracy when using sequence and experimentally determined protein and RNA structures, and with ∼83% when using sequences and predicted protein and RNA structures. Further, our proposed method RPI-Pred was superior to other existing ones by predicting more experimentally validated ncRNA-protein interaction pairs from different organisms. Motivated by the improved performance of RPI-Pred, we further applied our method for reliable construction of ncRNA-protein interaction networks. The RPI-Pred is publicly available at: http://ctsb.is.wfubmc.edu/projects/rpi-pred. PMID:25609700

  7. Predicting activity approach based on new atoms similarity kernel function.

    PubMed

    Abu El-Atta, Ahmed H; Moussa, M I; Hassanien, Aboul Ella

    2015-07-01

    Drug design is a high cost and long term process. To reduce time and costs for drugs discoveries, new techniques are needed. Chemoinformatics field implements the informational techniques and computer science like machine learning and graph theory to discover the chemical compounds properties, such as toxicity or biological activity. This is done through analyzing their molecular structure (molecular graph). To overcome this problem there is an increasing need for algorithms to analyze and classify graph data to predict the activity of molecules. Kernels methods provide a powerful framework which combines machine learning with graph theory techniques. These kernels methods have led to impressive performance results in many several chemoinformatics problems like biological activity prediction. This paper presents a new approach based on kernel functions to solve activity prediction problem for chemical compounds. First we encode all atoms depending on their neighbors then we use these codes to find a relationship between those atoms each other. Then we use relation between different atoms to find similarity between chemical compounds. The proposed approach was compared with many other classification methods and the results show competitive accuracy with these methods.

  8. Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements.

    PubMed

    Guturu, Harendra; Doxey, Andrew C; Wenger, Aaron M; Bejerano, Gill

    2013-12-19

    Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and 'through-DNA' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.

  9. Protein design by fusion: implications for protein structure prediction and evolution

    SciTech Connect

    Skorupka, Katarzyna; Han, Seong Kyu; Nam, Hyun-Jun; Kim, Sanguk; Faham, Salem

    2013-11-19

    Domain fusion is a useful tool in protein design. Here, the structure of a fusion of the heterodimeric flagella-assembly proteins FliS and FliC is reported. Although the ability of the fusion protein to maintain the structure of the heterodimer may be apparent, threading-based structural predictions do not properly fuse the heterodimer. Additional examples of naturally occurring heterodimers that are homologous to full-length proteins were identified. These examples highlight that the designed protein was engineered by the same tools as used in the natural evolution of proteins and that heterodimeric structures contain a wealth of information, currently unused, that can improve structural predictions.

  10. Theoretical prediction of low-density hexagonal ZnO hollow structures

    NASA Astrophysics Data System (ADS)

    Tuoc, Vu Ngoc; Huan, Tran Doan; Thao, Nguyen Thi; Tuan, Le Manh

    2016-10-01

    Along with wurtzite and zinc blende, zinc oxide (ZnO) has been found in a large number of polymorphs with substantially different properties and, hence, applications. Therefore, predicting and synthesizing new classes of ZnO polymorphs are of great significance and have been gaining considerable interest. Herein, we perform a density functional theory based tight-binding study, predicting several new series of ZnO hollow structures using the bottom-up approach. The geometry of the building blocks allows for obtaining a variety of hexagonal, low-density nanoporous, and flexible ZnO hollow structures. Their stability is discussed by means of the free energy computed within the lattice-dynamics approach. Our calculations also indicate that all the reported hollow structures are wide band gap semiconductors in the same fashion with bulk ZnO. The electronic band structures of the ZnO hollow structures are finally examined in detail.

  11. On local prediction based reversible watermarking.

    PubMed

    Dragoi, Ioan-Catalin; Coltuc, Dinu

    2015-04-01

    The use of local prediction in difference expansion reversible watermarking provides very good results, but at the cost of computing for each pixel a least square predictor in a square block centered on the pixel. This correspondence investigates the reduction of the mathematical complexity by computing distinct predictors not for pixels, but for groups of pixels. The same predictors are recovered at detection. Experimental results for the case of prediction on the rhombus defined by the four horizontal and vertical neighbors are provided. It is shown that by computing a predictor for a pair of pixels, the computational cost is halved without any loss in performance. A small loss appears for groups of three and four pixels with the advantage of reducing the mathematical complexity to a third and a fourth, respectively.

  12. FOURIER ANALYSIS OF EXTENDED FINE STRUCTURE WITH AUTOREGRESSIVE PREDICTION

    SciTech Connect

    Barton, J.; Shirley, D.A.

    1985-01-01

    Autoregressive prediction is adapted to double the resolution of Angle-Resolved Photoemission Extended Fine Structure (ARPEFS) Fourier transforms. Even with the optimal taper (weighting function), the commonly used taper-and-transform Fourier method has limited resolution: it assumes the signal is zero beyond the limits of the measurement. By seeking the Fourier spectrum of an infinite extent oscillation consistent with the measurements but otherwise having maximum entropy, the errors caused by finite data range can be reduced. Our procedure developed to implement this concept applies autoregressive prediction to extrapolate the signal to an extent controlled by a taper width. Difficulties encountered when processing actual ARPEFS data are discussed. A key feature of this approach is the ability to convert improved measurements (signal-to-noise or point density) into improved Fourier resolution.

  13. Ab-initio crystal structure prediction. A case study: NaBH{sub 4}

    SciTech Connect

    Caputo, Riccarda; Tekin, Adem

    2011-07-15

    Crystal structure prediction from first principles is still one of the most challenging and interesting issue in condensed matter science. we explored the potential energy surface of NaBH{sub 4} by a combined ab-initio approach, based on global structure optimizations and quantum chemistry. In particular, we used simulated annealing (SA) and density functional theory (DFT) calculations. The methodology enabled the identification of several local minima, of which the global minimum corresponded to the tetragonal ground-state structure (P4{sub 2}/nmc), and the prediction of higher energy stable structures, among them a monoclinic (Pm) one was identified to be 22.75 kJ/mol above the ground-state at T=298 K. In between, orthorhombic and cubic structures were recovered, in particular those with Pnma and F4-bar 3m symmetries. - Graphical abstract: The total electron energy difference of the calculated stable structures. Here, the tetragonal (IT 137) and the monoclinic (IT 6) symmetry groups corresponded to the lowest and the highest energy structures, respectively. Highlights: > Potential energy surface of NaBH{sub 4} is investigated. > This is done a combination of global structure optimizations based on simulated annealing and density functional calculations. > We successfully reproduced experimentally found tetragonal and orthorhombic structures of NaBH{sub 4}. > Furthermore, we found a new stable high energy structure.

  14. ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles.

    PubMed

    Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G; Gelly, Jean-Christophe

    2016-01-01

    Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation -with Protein Blocks-, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the 'Hard' category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/. PMID:27319297

  15. ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles

    PubMed Central

    Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G.; Gelly, Jean-Christophe

    2016-01-01

    Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation —with Protein Blocks—, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the ‘Hard’ category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/. PMID:27319297

  16. ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles.

    PubMed

    Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G; Gelly, Jean-Christophe

    2016-06-20

    Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation -with Protein Blocks-, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the 'Hard' category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/.

  17. Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.

    PubMed

    Zhang, Wenxuan; Yang, Jianyi; He, Baoji; Walker, Sara Elizabeth; Zhang, Hongjiu; Govindarajoo, Brandon; Virtanen, Jouko; Xue, Zhidong; Shen, Hong-Bin; Zhang, Yang

    2016-09-01

    We tested two pipelines developed for template-free protein structure prediction in the CASP11 experiment. First, the QUARK pipeline constructs structure models by reassembling fragments of continuously distributed lengths excised from unrelated proteins. Five free-modeling (FM) targets have the model successfully constructed by QUARK with a TM-score above 0.4, including the first model of T0837-D1, which has a TM-score = 0.736 and RMSD = 2.9 Å to the native. Detailed analysis showed that the success is partly attributed to the high-resolution contact map prediction derived from fragment-based distance-profiles, which are mainly located between regular secondary structure elements and loops/turns and help guide the orientation of secondary structure assembly. In the Zhang-Server pipeline, weakly scoring threading templates are re-ordered by the structural similarity to the ab initio folding models, which are then reassembled by I-TASSER based structure assembly simulations; 60% more domains with length up to 204 residues, compared to the QUARK pipeline, were successfully modeled by the I-TASSER pipeline with a TM-score above 0.4. The robustness of the I-TASSER pipeline can stem from the composite fragment-assembly simulations that combine structures from both ab initio folding and threading template refinements. Despite the promising cases, challenges still exist in long-range beta-strand folding, domain parsing, and the uncertainty of secondary structure prediction; the latter of which was found to affect nearly all aspects of FM structure predictions, from fragment identification, target classification, structure assembly, to final model selection. Significant efforts are needed to solve these problems before real progress on FM could be made. Proteins 2016; 84(Suppl 1):76-86. © 2015 Wiley Periodicals, Inc.

  18. Tailor-made force fields for crystal-structure prediction.

    PubMed

    Neumann, Marcus A

    2008-08-14

    A general procedure is presented to derive a complete set of force-field parameters for flexible molecules in the crystalline state on a case-by-case basis. The force-field parameters are fitted to the electrostatic potential as well as to accurate energies and forces generated by means of a hybrid method that combines solid-state density functional theory (DFT) calculations with an empirical van der Waals correction. All DFT calculations are carried out with the VASP program. The mathematical structure of the force field, the generation of reference data, the choice of the figure of merit, the optimization algorithm, and the parameter-refinement strategy are discussed in detail. The approach is applied to cyclohexane-1,4-dione, a small flexible ring. The tailor-made force field obtained for cyclohexane-1,4-dione is used to search for low-energy crystal packings in all 230 space groups with one molecule per asymmetric unit, and the most stable crystal structures are reoptimized in a second step with the hybrid method. The experimental crystal structure is found as the most stable predicted crystal structure both with the tailor-made force field and the hybrid method. The same methodology has also been applied successfully to the four compounds of the fourth CCDC blind test on crystal-structure prediction. For the five aforementioned compounds, the root-mean-square deviations between lattice energies calculated with the tailor-made force fields and the hybrid method range from 0.024 to 0.053 kcal/mol per atom around an average value of 0.034 kcal/mol per atom.

  19. Tailor-made force fields for crystal-structure prediction.

    PubMed

    Neumann, Marcus A

    2008-08-14

    A general procedure is presented to derive a complete set of force-field parameters for flexible molecules in the crystalline state on a case-by-case basis. The force-field parameters are fitted to the electrostatic potential as well as to accurate energies and forces generated by means of a hybrid method that combines solid-state density functional theory (DFT) calculations with an empirical van der Waals correction. All DFT calculations are carried out with the VASP program. The mathematical structure of the force field, the generation of reference data, the choice of the figure of merit, the optimization algorithm, and the parameter-refinement strategy are discussed in detail. The approach is applied to cyclohexane-1,4-dione, a small flexible ring. The tailor-made force field obtained for cyclohexane-1,4-dione is used to search for low-energy crystal packings in all 230 space groups with one molecule per asymmetric unit, and the most stable crystal structures are reoptimized in a second step with the hybrid method. The experimental crystal structure is found as the most stable predicted crystal structure both with the tailor-made force field and the hybrid method. The same methodology has also been applied successfully to the four compounds of the fourth CCDC blind test on crystal-structure prediction. For the five aforementioned compounds, the root-mean-square deviations between lattice energies calculated with the tailor-made force fields and the hybrid method range from 0.024 to 0.053 kcal/mol per atom around an average value of 0.034 kcal/mol per atom. PMID:18642947

  20. Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

    PubMed

    Capra, John A; Laskowski, Roman A; Thornton, Janet M; Singh, Mona; Funkhouser, Thomas A

    2009-12-01

    Identifying a protein's functional sites is an important step towards characterizing its molecular function. Numerous structure- and sequence-based methods have been developed for this problem. Here we introduce ConCavity, a small molecule binding site prediction algorithm that integrates evolutionary sequence conservation estimates with structure-based methods for identifying protein surface cavities. In large-scale testing on a diverse set of single- and multi-chain protein structures, we show that ConCavity substantially outperforms existing methods for identifying both 3D ligand binding pockets and individual ligand binding residues. As part of our testing, we perform one of the first direct comparisons of conservation-based and structure-based methods. We find that the two approaches provide largely complementary information, which can be combined to improve upon either approach alone. We also demonstrate that ConCavity has state-of-the-art performance in predicting catalytic sites and drug binding pockets. Overall, the algorithms and analysis presented here significantly improve our ability to identify ligand binding sites and further advance our understanding of the relationship between evolutionary sequence conservation and structural and functional attributes of proteins. Data, source code, and prediction visualizations are available on the ConCavity web site (http://compbio.cs.princeton.edu/concavity/).

  1. Structure prediction and analysis of neuraminidase sequence variants.

    PubMed

    Thayer, Kelly M

    2016-07-01

    Analyzing protein structure has become an integral aspect of understanding systems of biochemical import. The laboratory experiment endeavors to introduce protein folding to ascertain structures of proteins for which the structure is unavailable, as well as to critically evaluate the quality of the prediction obtained. The model system used is the highly mutable influenza virus protein neuraminidase, which is the key target in the development of therapeutics. In light of recent pandemics, understanding how mutations confer drug resistance, which translates at the molecular level to understanding how different sequence variants differ, constitutes an area of great interest because of the ramifications in public health. This lab targets upper level undergraduate biochemistry students, and aims to introduce tools to be used to explore protein folding and protein visualization in the context of the neuraminidase case study. Students proceed to critically evaluate the folded models by comparison with crystallographic structures. When validity is established, they fold a neuraminidase sequence for which a structure is not available. Through structural alignment and visual inspection of the 150 loop, students gain molecular insight into two possible conformations of the protein, which are actively being studied. Folding the third chosen sequence mimics a true research environment in allowing students to generate a structure from a sequence for which a structure was not previously available, and to assess whether their particular variant has an open or closed loop. From this vantage, they are then challenged to speculate about the connection between loop conformation and drug susceptibility. © 2016 by The International Union of Biochemistry and Molecular Biology, 44(4):361-376, 2016. PMID:26900942

  2. Link prediction based on temporal similarity metrics using continuous action set learning automata

    NASA Astrophysics Data System (ADS)

    Moradabadi, Behnaz; Meybodi, Mohammad Reza

    2016-10-01

    Link prediction is a social network research area that tries to predict future links using network structure. The main approaches in this area are based on predicting future links using network structure at a specific period, without considering the links behavior through different periods. For example, a common traditional approach in link prediction calculates a chosen similarity metric for each non-connected link and outputs the links with higher similarity scores as the prediction result. In this paper, we propose a new link prediction method based on temporal similarity metrics and Continuous Action set Learning Automata (CALA). The proposed method takes advantage of using different similarity metrics as well as different time periods. In the proposed algorithm, we try to model the link prediction problem as a noisy optimization problem and use a team of CALAs to solve the noisy optimization problem. CALA is a reinforcement based optimization tool which tries to learn the optimal behavior from the environment feedbacks. To determine the importance of different periods and similarity metrics on the prediction result, we define a coefficient for each of different periods and similarity metrics and use a CALA for each coefficient. Each CALA tries to learn the true value of the corresponding coefficient. Final link prediction is obtained from a combination of different similarity metrics in different times based on the obtained coefficients. The link prediction results reported here show satisfactory of the proposed method for some social network data sets.

  3. New insights from cluster analysis methods for RNA secondary structure prediction.

    PubMed

    Rogers, Emily; Heitsch, Christine

    2016-05-01

    A widening gap exists between the best practices for RNA secondary structure prediction developed by computational researchers and the methods used in practice by experimentalists. Minimum free energy predictions, although broadly used, are outperformed by methods which sample from the Boltzmann distribution and data mine the results. In particular, moving beyond the single structure prediction paradigm yields substantial gains in accuracy. Furthermore, the largest improvements in accuracy and precision come from viewing secondary structures not at the base pair level but at lower granularity/higher abstraction. This suggests that random errors affecting precision and systematic ones affecting accuracy are both reduced by this 'fuzzier' view of secondary structures. Thus experimentalists who are willing to adopt a more rigorous, multilayered approach to secondary structure prediction by iterating through these levels of granularity will be much better able to capture fundamental aspects of RNA base pairing. WIREs RNA 2016, 7:278-294. doi: 10.1002/wrna.1334 For further resources related to this article, please visit the WIREs website.

  4. Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure

    PubMed Central

    Mathews, David H.; Disney, Matthew D.; Childs, Jessica L.; Schroeder, Susan J.; Zuker, Michael; Turner, Douglas H.

    2004-01-01

    A dynamic programming algorithm for prediction of RNA secondary structure has been revised to accommodate folding constraints determined by chemical modification and to include free energy increments for coaxial stacking of helices when they are either adjacent or separated by a single mismatch. Furthermore, free energy parameters are revised to account for recent experimental results for terminal mismatches and hairpin, bulge, internal, and multibranch loops. To demonstrate the applicability of this method, in vivo modification was performed on 5S rRNA in both Escherichia coli and Candida albicans with 1-cyclohexyl-3-(2-morpholinoethyl) carbodiimide metho-p-toluene sulfonate, dimethyl sulfate, and kethoxal. The percentage of known base pairs in the predicted structure increased from 26.3% to 86.8% for the E. coli sequence by using modification constraints. For C. albicans, the accuracy remained 87.5% both with and without modification data. On average, for these sequences and a set of 14 sequences with known secondary structure and chemical modification data taken from the literature, accuracy improves from 67% to 76%. This enhancement primarily reflects improvement for three sequences that are predicted with <40% accuracy on the basis of energetics alone. For these sequences, inclusion of chemical modification constraints improves the average accuracy from 28% to 78%. For the 11 sequences with <6% pseudoknotted base pairs, structures predicted with constraints from chemical modification contain on average 84% of known canonical base pairs. PMID:15123812

  5. Failure prediction of thin beryllium sheets used in spacecraft structures

    NASA Technical Reports Server (NTRS)

    Roschke, Paul N.; Papados, Photios; Mascorro, Edward

    1991-01-01

    In an attempt to predict failure for cross-rolled beryllium sheet structures, high order macroscopic failure criteria are used. These require the knowledge of in-plane uniaxial and shear strengths. Test results are included for in-plane biaxial tension, uniaxial compression for two different material orientations, and shear. All beryllium specimens have the same chemical composition. In addition, all experimental work was performed in a controlled laboratory environment. Numerical simulation complements these tests. A brief bibliography supplements references listed in a previous report.

  6. Computer-aided prediction of RNA secondary structures.

    PubMed Central

    Auron, P E; Rindone, W P; Vary, C P; Celentano, J J; Vournakis, J N

    1982-01-01

    A brief survey of computer algorithms that have been developed to generate predictions of the secondary structures of RNA molecules is presented. Two particular methods are described in some detail. The first utilizes a thermodynamic energy minimization algorithm that takes into account the likelihood that short-range folding tends to be favored over long-range interactions. The second utilizes an interactive computer graphic modelling algorithm that enables the user to consider thermodynamic criteria as well as structural data obtained by nuclease susceptibility, chemical reactivity and phylogenetic studies. Examples of structures for prokaryotic 16S and 23S ribosomal RNAs, several eukaryotic 5S ribosomal RNAs and rabbit beta-globin messenger RNA are presented as case studies in order to describe the two techniques. Anm argument is made for integrating the two approaches presented in this paper, enabling the user to generate proposed structures using thermodynamic criteria, allowing interactive refinement of these structures through the application of experimentally derived data. PMID:6174937

  7. Offspring social network structure predicts fitness in families

    PubMed Central

    Royle, Nick J.; Pike, Thomas W.; Heeb, Philipp; Richner, Heinz; Kölliker, Mathias

    2012-01-01

    Social structures such as families emerge as outcomes of behavioural interactions among individuals, and can evolve over time if families with particular types of social structures tend to leave more individuals in subsequent generations. The social behaviour of interacting individuals is typically analysed as a series of multiple dyadic (pair-wise) interactions, rather than a network of interactions among multiple individuals. However, in species where parents feed dependant young, interactions within families nearly always involve more than two individuals simultaneously. Such social networks of interactions at least partly reflect conflicts of interest over the provision of costly parental investment. Consequently, variation in family network structure reflects variation in how conflicts of interest are resolved among family members. Despite its importance in understanding the evolution of emergent properties of social organization such as family life and cooperation, nothing is currently known about how selection acts on the structure of social networks. Here, we show that the social network structure of broods of begging nestling great tits Parus major predicts fitness in families. Although selection at the level of the individual favours large nestlings, selection at the level of the kin-group primarily favours families that resolve conflicts most effectively. PMID:23097505

  8. Link prediction based on path entropy

    NASA Astrophysics Data System (ADS)

    Xu, Zhongqi; Pu, Cunlai; Yang, Jian

    2016-08-01

    Information theory has been taken as a prospective tool for quantifying the complexity of complex networks. In this paper, first we study the information entropy or uncertainty of a path using the information theory. After that, we apply the path entropy to the link prediction problem in real-world networks. Specifically, we propose a new similarity index, namely Path Entropy (PE) index, which considers the information entropies of shortest paths between node pairs with penalization to long paths. Empirical experiments demonstrate that PE index outperforms the mainstream of link predictors.

  9. A novel logic-based approach for quantitative toxicology prediction.

    PubMed

    Amini, Ata; Muggleton, Stephen H; Lodhi, Huma; Sternberg, Michael J E

    2007-01-01

    There is a pressing need for accurate in silico methods to predict the toxicity of molecules that are being introduced into the environment or are being developed into new pharmaceuticals. Predictive toxicology is in the realm of structure activity relationships (SAR), and many approaches have been used to derive such SAR. Previous work has shown that inductive logic programming (ILP) is a powerful approach that circumvents several major difficulties, such as molecular superposition, faced by some other SAR methods. The ILP approach reasons with chemical substructures within a relational framework and yields chemically understandable rules. Here, we report a general new approach, support vector inductive logic programming (SVILP), which extends the essentially qualitative ILP-based SAR to quantitative modeling. First, ILP is used to learn rules, the predictions of which are then used within a novel kernel to derive a support-vector generalization model. For a highly heterogeneous dataset of 576 molecules with known fathead minnow fish toxicity, the cross-validated correlation coefficients (R2CV) from a chemical descriptor method (CHEM) and SVILP are 0.52 and 0.66, respectively. The ILP, CHEM, and SVILP approaches correctly predict 55, 58, and 73%, respectively, of toxic molecules. In a set of 165 unseen molecules, the R2 values from the commercial software TOPKAT and SVILP are 0.26 and 0.57, respectively. In all calculations, SVILP showed significant improvements in comparison with the other methods. The SVILP approach has a major advantage in that it uses ILP automatically and consistently to derive rules, mostly novel, describing fragments that are toxicity alerts. The SVILP is a general machine-learning approach and has the potential of tackling many problems relevant to chemoinformatics including in silico drug design.

  10. Predicting RNA 3D structure using a coarse-grain helix-centered model.

    PubMed

    Kerpedjiev, Peter; Höner Zu Siederdissen, Christian; Hofacker, Ivo L

    2015-06-01

    A 3D model of RNA structure can provide information about its function and regulation that is not possible with just the sequence or secondary structure. Current models suffer from low accuracy and long running times and either neglect or presume knowledge of the long-range interactions which stabilize the tertiary structure. Our coarse-grained, helix-based, tertiary structure model operates with only a few degrees of freedom compared with all-atom models while preserving the ability to sample tertiary structures given a secondary structure. It strikes a balance between the precision of an all-atom tertiary structure model and the simplicity and effectiveness of a secondary structure representation. It provides a simplified tool for exploring global arrangements of helices and loops within RNA structures. We provide an example of a novel energy function relying only on the positions of stems and loops. We show that coupling our model to this energy function produces predictions as good as or better than the current state of the art tools. We propose that given the wide range of conformational space that needs to be explored, a coarse-grain approach can explore more conformations in less iterations than an all-atom model coupled to a fine-grain energy function. Finally, we emphasize the overarching theme of providing an ensemble of predicted structures, something which our tool excels at, rather than providing a handful of the lowest energy structures. PMID:25904133

  11. Predicting RNA 3D structure using a coarse-grain helix-centered model.

    PubMed

    Kerpedjiev, Peter; Höner Zu Siederdissen, Christian; Hofacker, Ivo L

    2015-06-01

    A 3D model of RNA structure can provide information about its function and regulation that is not possible with just the sequence or secondary structure. Current models suffer from low accuracy and long running times and either neglect or presume knowledge of the long-range interactions which stabilize the tertiary structure. Our coarse-grained, helix-based, tertiary structure model operates with only a few degrees of freedom compared with all-atom models while preserving the ability to sample tertiary structures given a secondary structure. It strikes a balance between the precision of an all-atom tertiary structure model and the simplicity and effectiveness of a secondary structure representation. It provides a simplified tool for exploring global arrangements of helices and loops within RNA structures. We provide an example of a novel energy function relying only on the positions of stems and loops. We show that coupling our model to this energy function produces predictions as good as or better than the current state of the art tools. We propose that given the wide range of conformational space that needs to be explored, a coarse-grain approach can explore more conformations in less iterations than an all-atom model coupled to a fine-grain energy function. Finally, we emphasize the overarching theme of providing an ensemble of predicted structures, something which our tool excels at, rather than providing a handful of the lowest energy structures.

  12. PiDNA: Predicting protein-DNA interactions with structural models.

    PubMed

    Lin, Chih-Kang; Chen, Chien-Yu

    2013-07-01

    Predicting binding sites of a transcription factor in the genome is an important, but challenging, issue in studying gene regulation. In the past decade, a large number of protein-DNA co-crystallized structures available in the Protein Data Bank have facilitated the understanding of interacting mechanisms between transcription factors and their binding sites. Recent studies have shown that both physics-based and knowledge-based potential functions can be applied to protein-DNA complex structures to deliver position weight matrices (PWMs) that are consistent with the experimental data. To further use the available structural models, the proposed Web server, PiDNA, aims at first constructing reliable PWMs by applying an atomic-level knowledge-based scoring function on numerous in silico mutated complex structures, and then using the PWM constructed by the structure models with small energy changes to predict the interaction between proteins and DNA sequences. With PiDNA, the users can easily predict the relative preference of all the DNA sequences with limited mutations from the native sequence co-crystallized in the model in a single run. More predictions on sequences with unlimited mutations can be realized by additional requests or file uploading. Three types of information can be downloaded after prediction: (i) the ranked list of mutated sequences, (ii) the PWM constructed by the favourable mutated structures, and (iii) any mutated protein-DNA complex structure models specified by the user. This study first shows that the constructed PWMs are similar to the annotated PWMs collected from databases or literature. Second, the prediction accuracy of PiDNA in detecting relatively high-specificity sites is evaluated by comparing the ranked lists against in vitro experiments from protein-binding microarrays. Finally, PiDNA is shown to be able to select the experimentally validated binding sites from 10,000 random sites with high accuracy. With PiDNA, the users can

  13. Predicting the structures of complexes between phosphoinositide 3-kinase (PI3K) and romidepsin-related compounds for the drug design of PI3K/histone deacetylase dual inhibitors using computational docking and the ligand-based drug design approach.

    PubMed

    Oda, Akifumi; Saijo, Ken; Ishioka, Chikashi; Narita, Koichi; Katoh, Tadashi; Watanabe, Yurie; Fukuyoshi, Shuichi; Takahashi, Ohgi

    2014-11-01

    Predictions of the three-dimensional (3D) structures of the complexes between phosphoinositide 3-kinase (PI3K) and two inhibitors were conducted using computational docking and the ligand-based drug design approach. The obtained structures were refined by structural optimizations and molecular dynamics (MD) simulations. The ligands were located deep inside the ligand binding pocket of the p110α subunit of PI3K, and the hydrogen bond formations and hydrophobic effects of the surrounding amino acids were predicted. Although rough structures were obtained for the PI3K-inhibitor complexes before the MD simulations, the refinement of the structures by these simulations clarified the hydrogen bonding patterns of the complexes.

  14. Predicting Monoamine Oxidase Inhibitory Activity through Ligand-Based Models

    PubMed Central

    Vilar, Santiago; Ferino, Giulio; Quezada, Elias; Santana, Lourdes; Friedman, Carol

    2013-01-01

    The evolution of bio- and cheminformatics associated with the development of specialized software and increasing computer power has produced a great interest in theoretical in silico methods applied in drug rational design. These techniques apply the concept that “similar molecules have similar biological properties” that has been exploited in Medicinal Chemistry for years to design new molecules with desirable pharmacological profiles. Ligand-based methods are not dependent on receptor structural data and take into account two and three-dimensional molecular properties to assess similarity of new compounds in regards to the set of molecules with the biological property under study. Depending on the complexity of the calculation, there are different types of ligand-based methods, such as QSAR (Quantitative Structure-Activity Relationship) with 2D and 3D descriptors, CoMFA (Comparative Molecular Field Analysis) or pharmacophoric approaches. This work provides a description of a series of ligand-based models applied in the prediction of the inhibitory activity of monoamine oxidase (MAO) enzymes. The controlled regulation of the enzymes’ function through the use of MAO inhibitors is used as a treatment in many psychiatric and neurological disorders, such as depression, anxiety, Alzheimer’s and Parkinson’s disease. For this reason, multiple scaffolds, such as substituted coumarins, indolylmethylamine or pyridazine derivatives were synthesized and assayed toward MAO-A and MAO-B inhibition. Our intention is to focus on the description of ligand-based models to provide new insights in the relationship between the MAO inhibitory activity and the molecular structure of the different inhibitors, and further study enzyme selectivity and possible mechanisms of action. PMID:23231398

  15. SPOT-Seq-RNA: predicting protein-RNA complex structure and RNA-binding function by fold recognition and binding affinity prediction.

    PubMed

    Yang, Yuedong; Zhao, Huiying; Wang, Jihua; Zhou, Yaoqi

    2014-01-01

    RNA-binding proteins (RBPs) play key roles in RNA metabolism and post-transcriptional regulation. Computational methods have been developed separately for prediction of RBPs and RNA-binding residues by machine-learning techniques and prediction of protein-RNA complex structures by rigid or semiflexible structure-to-structure docking. Here, we describe a template-based technique called SPOT-Seq-RNA that integrates prediction of RBPs, RNA-binding residues, and protein-RNA complex structures into a single package. This integration is achieved by combining template-based structure-prediction software, SPARKS X, with binding affinity prediction software, DRNA. This tool yields reasonable sensitivity (46 %) and high precision (84 %) for an independent test set of 215 RBPs and 5,766 non-RBPs. SPOT-Seq-RNA is computationally efficient for genome-scale prediction of RBPs and protein-RNA complex structures. Its application to human genome study has revealed a similar sensitivity and ability to uncover hundreds of novel RBPs beyond simple homology. The online server and downloadable version of SPOT-Seq-RNA are available at http://sparks-lab.org/server/SPOT-Seq-RNA/.

  16. Methods for evaluating the predictive accuracy of structural dynamic models

    NASA Technical Reports Server (NTRS)

    Hasselman, T. K.; Chrostowski, Jon D.

    1990-01-01

    Uncertainty of frequency response using the fuzzy set method and on-orbit response prediction using laboratory test data to refine an analytical model are emphasized with respect to large space structures. Two aspects of the fuzzy set approach were investigated relative to its application to large structural dynamics problems: (1) minimizing the number of parameters involved in computing possible intervals; and (2) the treatment of extrema which may occur in the parameter space enclosed by all possible combinations of the important parameters of the model. Extensive printer graphics were added to the SSID code to help facilitate model verification, and an application of this code to the LaRC Ten Bay Truss is included in the appendix to illustrate this graphics capability.

  17. Prediction of Halocarbon Toxicity from Structure: A Hierarchical QSAR Approach

    SciTech Connect

    Gute, B D; Balasubramanian, K; Geiss, K; Basak, S C

    2003-04-11

    Mathematical structural invariants and quantum theoretical descriptors have been used extensively in quantitative structure-activity relationships (QSARs) for the estimation of pharmaceutical activities, biological properties, physicochemical properties, and the toxicities of chemicals. Recently our research team has explored the relative importance of various levels of chemodescriptors, i.e., topostructural, topochemical, geometrical, and quantum theoretical descriptors, in property estimation. This study examines the contribution of chemodescriptors ranging from topostructural to quantum theoretic calculations up to the Gaussian STO-3G level in the prediction of the toxicity of a set of twenty halocarbons. We also report the results of experimental cell-level toxicity studies on these twenty halocarbons to validate our models.

  18. Exponential repulsion improves structural predictability of molecular docking.

    PubMed

    Bazgier, Václav; Berka, Karel; Otyepka, Michal; Banáš, Pavel

    2016-10-30

    Molecular docking is a powerful tool for theoretical prediction of the preferred conformation and orientation of small molecules within protein active sites. The obtained poses can be used for estimation of binding energies, which indicate the inhibition effect of designed inhibitors, and therefore might be used for in silico drug design. However, the evaluation of ligand binding affinity critically depends on successful prediction of the native binding mode. Contemporary docking methods are often based on scoring functions derived from molecular mechanical potentials. In such potentials, nonbonded interactions are typically represented by electrostatic interactions between atom-centered partial charges and standard 6-12 Lennard-Jones potential. Here, we present implementation and testing of a scoring function based on more physically justified exponential repulsion instead of the standard Lennard-Jones potential. We found that this scoring function significantly improved prediction of the native binding modes in proteins bearing narrow active sites such as serine proteases and kinases. © 2016 Wiley Periodicals, Inc. PMID:27620738

  19. Generic eukaryotic core promoter prediction using structural features of DNA.

    PubMed

    Abeel, Thomas; Saeys, Yvan; Bonnet, Eric; Rouzé, Pierre; Van de Peer, Yves

    2008-02-01

    Despite many recent efforts, in silico identification of promoter regions is still in its infancy. However, the accurate identification and delineation of promoter regions is important for several reasons, such as improving genome annotation and devising experiments to study and understand transcriptional regulation. Current methods to identify the core region of promoters require large amounts of high-quality training data and often behave like black box models that output predictions that are difficult to interpret. Here, we present a novel approach for predicting promoters in whole-genome sequences by using large-scale structural properties of DNA. Our technique requires no training, is applicable to many eukaryotic genomes, and performs extremely well in comparison with the best available promoter prediction programs. Moreover, it is fast, simple in design, and has no size constraints, and the results are easily interpretable. We compared our approach with 14 current state-of-the-art implementations using human gene and transcription start site data and analyzed the ENCODE region in more detail. We also validated our method on 12 additional eukaryotic genomes, including vertebrates, invertebrates, plants, fungi, and protists.

  20. Sequence-Based Prediction of Type III Secreted Proteins

    PubMed Central

    Arnold, Roland; Brandmaier, Stefan; Kleine, Frederick; Tischler, Patrick; Heinz, Eva; Behrens, Sebastian; Niinikoski, Antti; Mewes, Hans-Werner; Horn, Matthias; Rattei, Thomas

    2009-01-01

    The type III secretion system (TTSS) is a key mechanism for host cell interaction used by a variety of bacterial pathogens and symbionts of plants and animals including humans. The TTSS represents a molecular syringe with which the bacteria deliver effector proteins directly into the host cell cytosol. Despite the importance of the TTSS for bacterial pathogenesis, recognition and targeting of type III secreted proteins has up until now been poorly understood. Several hypotheses are discussed, including an mRNA-based signal, a chaperon-mediated process, or an N-terminal signal peptide. In this study, we systematically analyzed the amino acid composition and secondary structure of N-termini of 100 experimentally verified effector proteins. Based on this, we developed a machine-learning approach for the prediction of TTSS effector proteins, taking into account N-terminal sequence features such as frequencies of amino acids, short peptides, or residues with certain physico-chemical properties. The resulting computational model revealed a strong type III secretion signal in the N-terminus that can be used to detect effectors with sensitivity of ∼71% and selectivity of ∼85%. This signal seems to be taxonomically universal and conserved among animal pathogens and plant symbionts, since we could successfully detect effector proteins if the respective group was excluded from training. The application of our prediction approach to 739 complete bacterial and archaeal genome sequences resulted in the identification of between 0% and 12% putative TTSS effector proteins. Comparison of effector proteins with orthologs that are not secreted by the TTSS showed no clear pattern of signal acquisition by fusion, suggesting convergent evolutionary processes shaping the type III secretion signal. The newly developed program EffectiveT3 (http://www.chlamydiaedb.org) is the first universal in silico prediction program for the identification of novel TTSS effectors. Our findings will

  1. Phosphate-based glasses: Prediction of acoustical properties

    NASA Astrophysics Data System (ADS)

    El-Moneim, Amin Abd

    2016-04-01

    In this work, a comprehensive study has been carried out to predict the composition dependence of bulk modulus and ultrasonic attenuation coefficient in the phosphate-based glass systems PbO-P2O5, Li2O-TeO2-B2O3-P2O5, TiO2-Na2O-CaO-P2O5 and Cr2O3-doped Na2O-ZnO-P2O5 at room temperature. The prediction is based on (i) Makishima-Mackenzie theory, which correlates the bulk modulus with packing density and dissociation energy per unit volume, and (ii) Our recently presented semi-empirical formulas, which correlate the ultrasonic attenuation coefficient with the oxygen density, mean atomic ring size, first-order stretching force constant and experimental bulk modulus. Results revealed that our recently presented semi-empirical formulas can be applied successfully to predict changes of ultrasonic attenuation coefficient in binary PbO-P2O5 glasses at 10 MHz frequency and in quaternary Li2O-TeO2-B2O3-P2O5, TiO2-Na2O-CaO-P2O5 and Cr2O3-Na2O-ZnO-P2O5 glasses at 5 MHz frequency. Also, Makishima-Mackenzie theory appears to be valid for the studied glasses if the effect of the basic structural units that present in the glass network is taken into account.

  2. Towards Practical Carbonation Prediction and Modelling for Service Life Design of Reinforced Concrete Structures

    NASA Astrophysics Data System (ADS)

    Ekolu, O. S.

    2015-11-01

    Amongst the scientific community, the interest in durability of concrete structures has been high for quite a long time of over 40 years. Of the various causes of degradation of concrete structures, corrosion is the most widespread durability problem and carbonation is one of the two causes of steel reinforcement corrosion. While much scientific understanding has been gained from the numerous carbonation studies undertaken over the past years, it is still presently not possible to accurately predict carbonation and apply it in design of structures. This underscores the complex nature of the mechanisms as influenced by several interactive factors. Based on critical literature and some experience of the author, it is found that there still exist major challenges in establishing a mathematical constitutive relation for realistic carbonation prediction. While most current models employ permeability /diffusion as the main model property, analysis shows that the most practical material property would be compressive strength, which has a low coefficient of variation of 20% compared to 30 to 50% for permeability. This important characteristic of compressive strength, combined with its merit of simplicity and data availability at all stages of a structure's life, promote its potential use in modelling over permeability. By using compressive strength in carbonation prediction, the need for accelerated testing and permeability measurement can be avoided. This paper attempts to examine the issues associated with carbonation prediction, which could underlie the current lack of a sound established prediction method. Suggestions are then made for possible employment of different or alternative approaches.

  3. RNA tertiary structure prediction with ModeRNA.

    PubMed

    Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M

    2011-11-01

    Noncoding RNAs perform important roles in the cell. As their function is tightly connected with structure, and as experimental methods are time-consuming and expensive, the field of RNA structure prediction is developing rapidly. Here, we present a detailed study on using the ModeRNA software. The tool uses the comparative modeling approach and can be applied when a structural template is available and an alignment of reasonable quality can be performed. We guide the reader through the entire process of modeling Escherichia coli tRNA(Thr) in a conformation corresponding to the complex with an aminoacyl-tRNA synthetase (aaRS). We describe the choice of a template structure, preparation of input files, and explore three possible modeling strategies. In the end, we evaluate the resulting models using six alternative benchmarks. The ModeRNA software can be freely downloaded from http://iimcb.genesilico.pl/moderna/ under the conditions of the General Public License. It runs under LINUX, Windows and Mac OS. It is also available as a server at http://iimcb.genesilico.pl/modernaserver/. The models and the script to reproduce the study from this article are available at http://www.genesilico.pl/moderna/examples/.

  4. Antibody modeling using the prediction of immunoglobulin structure (PIGS) web server [corrected].

    PubMed

    Marcatili, Paolo; Olimpieri, Pier Paolo; Chailyan, Anna; Tramontano, Anna

    2014-12-01

    Antibodies (or immunoglobulins) are crucial for defending organisms from pathogens, but they are also key players in many medical, diagnostic and biotechnological applications. The ability to predict their structure and the specific residues involved in antigen recognition has several useful applications in all of these areas. Over the years, we have developed or collaborated in developing a strategy that enables researchers to predict the 3D structure of antibodies with a very satisfactory accuracy. The strategy is completely automated and extremely fast, requiring only a few minutes (∼10 min on average) to build a structural model of an antibody. It is based on the concept of canonical structures of antibody loops and on our understanding of the way light and heavy chains pack together.

  5. A windows based mechanistic subsidence prediction model for longwall mining

    SciTech Connect

    Begley, R.; Beheler, P.; Khair, A.W.

    1996-12-31

    The previously developed Mechanistic Subsidence Prediction Model (MSPM) has been incorporated into the graphical interface environment of MS Windows. MSPM has the unique capability of predicting maximum subsidence, angle of draw and the subsidence profile of a longwall panel at various locations for both the transverse and longitudinal orientations. The resultant enhanced model can be operated by individuals with little knowledge of subsidence prediction theories or little computer programming experience. In addition, predictions of subsidence can be made in a matter of seconds without the need to develop input data files or use the keyboard in some cases. The predictions are based upon the following input parameters: panel width, mining height, overburden depth, rock quality designation, and percent hard rock in the immediate roof, main roof and the entire overburden. The recently developed enhanced model has the capability to compare predictions in a graphical format for one half of the predicted subsidence profile based upon changes in input parameters easily and instantly on the same screen. In addition another screen can be obtained from a pull down menu where the operator can compare predictions for the entire subsidence profiles. This paper presents the background of the subsidence prediction model and the methodology of the enhanced model development. The paper also presents comparisons of subsidence predictions for several different sets of input parameters in addition to comparisons of the subsidence predictions with actual field data.

  6. Protein structure prediction with local adjust tabu search algorithm

    PubMed Central

    2014-01-01

    Background Protein folding structure prediction is one of the most challenging problems in the bioinformatics domain. Because of the complexity of the realistic protein structure, the simplified structure model and the computational method should be adopted in the research. The AB off-lattice model is one of the simplification models, which only considers two classes of amino acids, hydrophobic (A) residues and hydrophilic (B) residues. Results The main work of this paper is to discuss how to optimize the lowest energy configurations in 2D off-lattice model and 3D off-lattice model by using Fibonacci sequences and real protein sequences. In order to avoid falling into local minimum and faster convergence to the global minimum, we introduce a novel method (SATS) to the protein structure problem, which combines simulated annealing algorithm and tabu search algorithm. Various strategies, such as the new encoding strategy, the adaptive neighborhood generation strategy and the local adjustment strategy, are adopted successfully for high-speed searching the optimal conformation corresponds to the lowest energy of the protein sequences. Experimental results show that some of the results obtained by the improved SATS are better than those reported in previous literatures, and we can sure that the lowest energy folding state for short Fibonacci sequences have been found. Conclusions Although the off-lattice models is not very realistic, they can reflect some important characteristics of the realistic protein. It can be found that 3D off-lattice model is more like native folding structure of the realistic protein than 2D off-lattice model. In addition, compared with some previous researches, the proposed hybrid algorithm can more effectively and more quickly search the spatial folding structure of a protein chain. PMID:25474708

  7. Crystal structure prediction from first principles: The crystal structures of glycine

    NASA Astrophysics Data System (ADS)

    Lund, Albert M.; Pagola, Gabriel I.; Orendt, Anita M.; Ferraro, Marta B.; Facelli, Julio C.

    2015-04-01

    Here we present the results of our unbiased searches of glycine polymorphs obtained using the genetic algorithms search implemented in MGAC, modified genetic algorithm for crystals, coupled with the local optimization and energy evaluation provided by Quantum Espresso. We demonstrate that it is possible to predict the crystal structures of a biomedical molecule using solely first principles calculations. We were able to find all the ambient pressure stable glycine polymorphs, which are found in the same energetic ordering as observed experimentally and the agreement between the experimental and predicted structures is of such accuracy that the two are visually almost indistinguishable.

  8. Crystal Structure Prediction from First Principles: The Crystal Structures of Glycine

    PubMed Central

    Lund, Albert M.; Pagola, Gabriel I.; Orendt, Anita M.; Ferraro, Marta B.; Facelli, Julio C.

    2015-01-01

    Here we present the results of our unbiased searches of glycine polymorphs obtained using the Genetic Algorithms search implemented in Modified Genetic Algorithm for Crystals coupled with the local optimization and energy evaluation provided by Quantum Espresso. We demonstrate that it is possible to predict the crystal structures of a biomedical molecule using solely first principles calculations. We were able to find all the ambient pressure stable glycine polymorphs, which are found in the same energetic ordering as observed experimentally and the agreement between the experimental and predicted structures is of such accuracy that the two are visually almost indistinguishable. PMID:25843964

  9. A Structural Equation Model for Predicting Business Student Performance

    ERIC Educational Resources Information Center

    Pomykalski, James J.; Dion, Paul; Brock, James L.

    2008-01-01

    In this study, the authors developed a structural equation model that accounted for 79% of the variability of a student's final grade point average by using a sample size of 147 students. The model is based on student grades in 4 foundational business courses: introduction to business, macroeconomics, statistics, and using databases. Educators and…

  10. How evolutionary crystal structure prediction works--and why.

    PubMed

    Oganov, Artem R; Lyakhov, Andriy O; Valle, Mario

    2011-03-15

    Once the crystal structure of a chemical substance is known, many properties can be predicted reliably and routinely. Therefore if researchers could predict the crystal structure of a material before it is synthesized, they could significantly accelerate the discovery of new materials. In addition, the ability to predict crystal structures at arbitrary conditions of pressure and temperature is invaluable for the study of matter at extreme conditions, where experiments are difficult. Crystal structure prediction (CSP), the problem of finding the most stable arrangement of atoms given only the chemical composition, has long remained a major unsolved scientific problem. Two problems are entangled here: search, the efficient exploration of the multidimensional energy landscape, and ranking, the correct calculation of relative energies. For organic crystals, which contain a few molecules in the unit cell, search can be quite simple as long as a researcher does not need to include many possible isomers or conformations of the molecules; therefore ranking becomes the main challenge. For inorganic crystals, quantum mechanical methods often provide correct relative energies, making search the most critical problem. Recent developments provide useful practical methods for solving the search problem to a considerable extent. One can use simulated annealing, metadynamics, random sampling, basin hopping, minima hopping, and data mining. Genetic algorithms have been applied to crystals since 1995, but with limited success, which necessitated the development of a very different evolutionary algorithm. This Account reviews CSP using one of the major techniques, the hybrid evolutionary algorithm USPEX (Universal Structure Predictor: Evolutionary Xtallography). Using recent developments in the theory of energy landscapes, we unravel the reasons evolutionary techniques work for CSP and point out their limitations. We demonstrate that the energy landscapes of chemical systems have an

  11. A graph-theoretic approach for classification and structure prediction of transmembrane β-barrel proteins

    PubMed Central

    2012-01-01

    Background Transmembrane β-barrel proteins are a special class of transmembrane proteins which play several key roles in human body and diseases. Due to experimental difficulties, the number of transmembrane β-barrel proteins with known structures is very small. Over the years, a number of learning-based methods have been introduced for recognition and structure prediction of transmembrane β-barrel proteins. Most of these methods emphasize on homology search rather than any biological or chemical basis. Results We present a novel graph-theoretic model for classification and structure prediction of transmembrane β-barrel proteins. This model folds proteins based on energy minimization rather than a homology search, avoiding any assumption on availability of training dataset. The ab initio model presented in this paper is the first method to allow for permutations in the structure of transmembrane proteins and provides more structural information than any known algorithm. The model is also able to recognize β-barrels by assessing the pseudo free energy. We assess the structure prediction on 41 proteins gathered from existing databases on experimentally validated transmembrane β-barrel proteins. We show that our approach is quite accurate with over 90% F-score on strands and over 74% F-score on residues. The results are comparable to other algorithms suggesting that our pseudo-energy model is close to the actual physical model. We test our classification approach and show that it is able to reject α-helical bundles with 100% accuracy and β-barrel lipocalins with 97% accuracy. Conclusions We show that it is possible to design models for classification and structure prediction for transmembrane β-barrel proteins which do not depend essentially on training sets but on combinatorial properties of the structures to be proved. These models are fairly accurate, robust and can be run very efficiently on PC-like computers. Such models are useful for the genome

  12. Hypersonic vehicle structural weight prediction using parametric modeling, finite element modeling, and structural optimization

    NASA Astrophysics Data System (ADS)

    Ngo, Dung A.; Koshiba, David A.; Moses, Paul L.

    1993-04-01

    Detailed structural analysis/optimization is required in the conceptual design stage because of the combination of aerodynamic and aerothermodynamic environment. This is a time and manpower consuming activity which is exasperated by constant vehicle moldline changes as a configuration matures. A simple parametric math model is presented that takes into consideration static loads and the geometry and structural weight of a baseline hypersonic vehicle in predicting the structural weight of a new configuration scaled from the baseline. The approach in developing the math model was to consider a generic parametric cross-sectional geometry that could be used to approximate the baseline geometry and to predict the behavior of this baselne when it is scaled to provide performance and design benefits. This mathematical model, calibrated to finite element analysis and structural optimization sizing results, provides accurate weight prediction for a new configuration which has been moderately scaled from a thoroughly analyzed baseline configuration. This paper will present the structural optimization weight results and the math model weight predictions for a baseline configuration and 15 scaled configurations.

  13. Statistical potential for assessment and prediction of protein structures

    PubMed Central

    Shen, Min-yi; Sali, Andrej

    2006-01-01

    Protein structures in the Protein Data Bank provide a wealth of data about the interactions that determine the native states of proteins. Using the probability theory, we derive an atomic distance-dependent statistical potential from a sample of native structures that does not depend on any adjustable parameters (Discrete Optimized Protein Energy, or DOPE). DOPE is based on an improved reference state that corresponds to noninteracting atoms in a homogeneous sphere with the radius dependent on a sample native structure; it thus accounts for the finite and spherical shape of the native structures. The DOPE potential was extracted from a nonredundant set of 1472 crystallographic structures. We tested DOPE and five other scoring functions by the detection of the native state among six multiple target decoy sets, the correlation between the score and model error, and the identification of the most accurate non-native structure in the decoy set. For all decoy sets, DOPE is the best performing function in terms of all criteria, except for a tie in one criterion for one decoy set. To facilitate its use in various applications, such as model assessment, loop modeling, and fitting into cryo-electron microscopy mass density maps combined with comparative protein structure modeling, DOPE was incorporated into the modeling package MODELLER-8. PMID:17075131

  14. Principles for Predicting RNA Secondary Structure Design Difficulty.

    PubMed

    Anderson-Lee, Jeff; Fisker, Eli; Kosaraju, Vineet; Wu, Michelle; Kong, Justin; Lee, Jeehyung; Lee, Minjae; Zada, Mathew; Treuille, Adrien; Das, Rhiju

    2016-02-27

    Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications. PMID:26902426

  15. Principles for Predicting RNA Secondary Structure Design Difficulty.

    PubMed

    Anderson-Lee, Jeff; Fisker, Eli; Kosaraju, Vineet; Wu, Michelle; Kong, Justin; Lee, Jeehyung; Lee, Minjae; Zada, Mathew; Treuille, Adrien; Das, Rhiju

    2016-02-27

    Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications.

  16. Contact prediction for beta and alpha-beta proteins using integer linear optimization and its impact on the first principles 3D structure prediction method ASTRO-FOLD.

    PubMed

    Rajgaria, R; Wei, Y; Floudas, C A

    2010-06-01

    An integer linear optimization model is presented to predict residue contacts in beta, alpha + beta, and alpha/beta proteins. The total energy of a protein is expressed as sum of a C(alpha)-C(alpha) distance dependent contact energy contribution and a hydrophobic contribution. The model selects contact that assign lowest energy to the protein structure as satisfying a set of constraints that are included to enforce certain physically observed topological information. A new method based on hydrophobicity is proposed to find the beta-sheet alignments. These beta-sheet alignments are used as constraints for contacts between residues of beta-sheets. This model was tested on three independent protein test sets and CASP8 test proteins consisting of beta, alpha + beta, alpha/beta proteins and it was found to perform very well. The average accuracy of the predictions (separated by at least six residues) was approximately 61%. The average true positive and false positive distances were also calculated for each of the test sets and they are 7.58 A and 15.88 A, respectively. Residue contact prediction can be directly used to facilitate the protein tertiary structure prediction. This proposed residue contact prediction model is incorporated into the first principles protein tertiary structure prediction approach, ASTRO-FOLD. The effectiveness of the contact prediction model was further demonstrated by the improvement in the quality of the protein structure ensemble generated using the predicted residue contacts for a test set of 10 proteins.

  17. Contact Prediction for Beta and Alpha-Beta Proteins Using Integer Linear Optimization and its Impact on the First Principles 3D Structure Prediction Method ASTRO-FOLD

    PubMed Central

    Rajgaria, R.; Wei, Y.; Floudas, C. A.

    2010-01-01

    An integer linear optimization model is presented to predict residue contacts in β, α + β, and α/β proteins. The total energy of a protein is expressed as sum of a Cα – Cα distance dependent contact energy contribution and a hydrophobic contribution. The model selects contacts that assign lowest energy to the protein structure while satisfying a set of constraints that are included to enforce certain physically observed topological information. A new method based on hydrophobicity is proposed to find the β-sheet alignments. These β-sheet alignments are used as constraints for contacts between residues of β-sheets. This model was tested on three independent protein test sets and CASP8 test proteins consisting of β, α + β, α/β proteins and was found to perform very well. The average accuracy of the predictions (separated by at least six residues) was approximately 61%. The average true positive and false positive distances were also calculated for each of the test sets and they are 7.58 Å and 15.88 Å, respectively. Residue contact prediction can be directly used to facilitate the protein tertiary structure prediction. This proposed residue contact prediction model is incorporated into the first principles protein tertiary structure prediction approach, ASTRO-FOLD. The effectiveness of the contact prediction model was further demonstrated by the improvement in the quality of the protein structure ensemble generated using the predicted residue contacts for a test set of 10 proteins. PMID:20225257

  18. Copula-based prediction of economic movements

    NASA Astrophysics Data System (ADS)

    García, J. E.; González-López, V. A.; Hirsh, I. D.

    2016-06-01

    In this paper we model the discretized returns of two paired time series BM&FBOVESPA Dividend Index and BM&FBOVESPA Public Utilities Index using multivariate Markov models. The discretization corresponds to three categories, high losses, high profits and the complementary periods of the series. In technical terms, the maximal memory that can be considered for a Markov model, can be derived from the size of the alphabet and dataset. The number of parameters needed to specify a discrete multivariate Markov chain grows exponentially with the order and dimension of the chain. In this case the size of the database is not large enough for a consistent estimation of the model. We apply a strategy to estimate a multivariate process with an order greater than the order achieved using standard procedures. The new strategy consist on obtaining a partition of the state space which is constructed from a combination, of the partitions corresponding to the two marginal processes and the partition corresponding to the multivariate Markov chain. In order to estimate the transition probabilities, all the partitions are linked using a copula. In our application this strategy provides a significant improvement in the movement predictions.

  19. Transmission line icing prediction based on DWT feature extraction

    NASA Astrophysics Data System (ADS)

    Ma, T. N.; Niu, D. X.; Huang, Y. L.

    2016-08-01

    Transmission line icing prediction is the premise of ensuring the safe operation of the network as well as the very important basis for the prevention of freezing disasters. In order to improve the prediction accuracy of icing, a transmission line icing prediction model based on discrete wavelet transform (DWT) feature extraction was built. In this method, a group of high and low frequency signals were obtained by DWT decomposition, and were fitted and predicted by using partial least squares regression model (PLS) and wavelet least square support vector model (w-LSSVM). Finally, the final result of the icing prediction was obtained by adding the predicted values of the high and low frequency signals. The results showed that the method is effective and feasible in the prediction of transmission line icing.

  20. M3Ag17(SPh)12 Nanoparticles and Their Structure Prediction.

    PubMed

    Wickramasinghe, Sameera; Atnagulov, Aydar; Yoon, Bokwon; Barnett, Robert N; Griffith, Wendell P; Landman, Uzi; Bigioni, Terry P

    2015-09-16

    Although silver nanoparticles are of great fundamental and practical interest, only one structure has been determined thus far: M4Ag44(SPh)30, where M is a monocation, and SPh is an aromatic thiolate ligand. This is in part due to the fact that no other molecular silver nanoparticles have been synthesized with aromatic thiolate ligands. Here we report the synthesis of M3Ag17(4-tert-butylbenzene-thiol)12, which has good stability and an unusual optical spectrum. We also present a rational strategy for predicting the structure of this molecule. First-principles calculations support the structural model, predict a HOMO-LUMO energy gap of 1.77 eV, and predict a new "monomer mount" capping motif, Ag(SR)3, for Ag nanoparticles. The calculated optical absorption spectrum is in good correspondence with the measured spectrum. Heteroatom substitution was also used as a structural probe. First-principles calculations based on the structural model predicted a strong preference for a single Au atom substitution in agreement with experiment.

  1. The four ingredients of single-sequence RNA secondary structure prediction. A unifying perspective

    PubMed Central

    Rivas, Elena

    2013-01-01

    Any method for RNA secondary structure prediction is determined by four ingredients. The architecture is the choice of features implemented by the model (such as stacked basepairs, loop length distributions, etc.). The architecture determines the number of parameters in the model. The scoring scheme is the nature of those parameters (whether thermodynamic, probabilistic, or weights). The parameterization stands for the specific values assigned to the parameters. These three ingredients are referred to as “the model.” The fourth ingredient is the folding algorithms used to predict plausible secondary structures given the model and the sequence of a structural RNA. Here, I make several unifying observations drawn from looking at more than 40 years of methods for RNA secondary structure prediction in the light of this classification. As a final observation, there seems to be a performance ceiling that affects all methods with complex architectures, a ceiling that impacts all scoring schemes with remarkable similarity. This suggests that modeling RNA secondary structure by using intrinsic sequence-based plausible “foldability” will require the incorporation of other forms of information in order to constrain the folding space and to improve prediction accuracy. This could give an advantage to probabilistic scoring systems since a probabilistic framework is a natural platform to incorporate different sources of information into one single inference problem. PMID:23695796

  2. The four ingredients of single-sequence RNA secondary structure prediction. A unifying perspective.

    PubMed

    Rivas, Elena

    2013-07-01

    Any method for RNA secondary structure prediction is determined by four ingredients. The architecture is the choice of features implemented by the model (such as stacked basepairs, loop length distributions, etc.). The architecture determines the number of parameters in the model. The scoring scheme is the nature of those parameters (whether thermodynamic, probabilistic, or weights). The parameterization stands for the specific values assigned to the parameters. These three ingredients are referred to as "the model." The fourth ingredient is the folding algorithms used to predict plausible secondary structures given the model and the sequence of a structural RNA. Here, I make several unifying observations drawn from looking at more than 40 years of methods for RNA secondary structure prediction in the light of this classification. As a final observation, there seems to be a performance ceiling that affects all methods with complex architectures, a ceiling that impacts all scoring schemes with remarkable similarity. This suggests that modeling RNA secondary structure by using intrinsic sequence-based plausible "foldability" will require the incorporation of other forms of information in order to constrain the folding space and to improve prediction accuracy. This could give an advantage to probabilistic scoring systems since a probabilistic framework is a natural platform to incorporate different sources of information into one single inference problem.

  3. Prediction of the rodent carcinogenicity of organic compounds from their chemical structures using the FALS method.

    PubMed Central

    Moriguchi, I; Hirano, H; Hirono, S

    1996-01-01

    Fuzzy adaptive least-squares (FALS), a pattern recognition method recently developed in our laboratory for correlating structure with activity rating, was used to generate quantitative structure-activity relationship (QSAR) models on the carcinogenicity of organic compounds of several chemical classes. Using the predictive models obtained from the chemical class-based FALS QSAR approach, the rodent carcinogenicity or noncarcinogenicity of a group of organic chemicals currently being tested by the U.S. National Toxicology Program was estimated from their chemical structures. PMID:8933054

  4. Prediction of the three-dimensional structure of human interleukin-7 by homology modeling.

    PubMed

    Kroemer, R T; Doughty, S W; Robinson, A J; Richards, W G

    1996-06-01

    The three-dimensional structure of human interleukin (IL)-7 has been predicted based on homology to human IL-2, IL-4, granulocyte-macrophage colony stimulating factor and growth hormone. The model has a topology common to other cytokines and displays a unique disulfide pattern. Knowledge of the tertiary structure of IL-7 has implications for analysis of key binding regions, suggestions for mutagenesis experiments and design of (ant)agonists. In this context, the model is discussed and compared with other cytokine structures. PMID:8862549

  5. Prediction of the rodent carcinogenicity of organic compounds from their chemical structures using the FALS method

    SciTech Connect

    Moriguchi, Ikuo; Hirono, Shuichi; Hirano, Hiroyuki

    1996-10-01

    Fuzzy adaptive least-squares (FALS), a pattern recognition method recently developed in our laboratory for correlating structure with activity rating, was used to generate quantitative structure-activity relationship (QSAR) models on the carcinogenicity of organic compounds of several chemical classes. Using the predictive models obtained from the chemical class-based FALS QSAR approach, the rodent carcinogenicity or noncarcinogenicity of a group of organic chemicals currently being tested by the U.S. National Toxicology Program was estimated from their chemical structures. 12 refs., 4 tabs.

  6. Prediction of gene expression in embryonic structures of Drosophila melanogaster.

    PubMed

    Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis

    2007-07-01

    Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms.

  7. Prediction of spectral acceleration response ordinates based on PGA attenuation

    USGS Publications Warehouse

    Graizer, V.; Kalkan, E.

    2009-01-01

    Developed herein is a new peak ground acceleration (PGA)-based predictive model for 5% damped pseudospectral acceleration (SA) ordinates of free-field horizontal component of ground motion from shallow-crustal earthquakes. The predictive model of ground motion spectral shape (i.e., normalized spectrum) is generated as a continuous function of few parameters. The proposed model eliminates the classical exhausted matrix of estimator coefficients, and provides significant ease in its implementation. It is structured on the Next Generation Attenuation (NGA) database with a number of additions from recent Californian events including 2003 San Simeon and 2004 Parkfield earthquakes. A unique feature of the model is its new functional form explicitly integrating PGA as a scaling factor. The spectral shape model is parameterized within an approximation function using moment magnitude, closest distance to the fault (fault distance) and VS30 (average shear-wave velocity in the upper 30 m) as independent variables. Mean values of its estimator coefficients were computed by fitting an approximation function to spectral shape of each record using robust nonlinear optimization. Proposed spectral shape model is independent of the PGA attenuation, allowing utilization of various PGA attenuation relations to estimate the response spectrum of earthquake recordings.

  8. RNA-Puzzles: A CASP-like evaluation of RNA three-dimensional structure prediction

    PubMed Central

    Cruz, José Almeida; Blanchet, Marc-Frédérick; Boniecki, Michal; Bujnicki, Janusz M.; Chen, Shi-Jie; Cao, Song; Das, Rhiju; Ding, Feng; Dokholyan, Nikolay V.; Flores, Samuel Coulbourn; Huang, Lili; Lavender, Christopher A.; Lisi, Véronique; Major, François; Mikolajczak, Katarzyna; Patel, Dinshaw J.; Philips, Anna; Puton, Tomasz; Santalucia, John; Sijenyi, Fredrick; Hermann, Thomas; Rother, Kristian; Rother, Magdalena; Serganov, Alexander; Skorupski, Marcin; Soltysinski, Tomasz; Sripakdeevong, Parin; Tuszynska, Irina; Weeks, Kevin M.; Waldsich, Christina; Wildauer, Michael; Leontis, Neocles B.; Westhof, Eric

    2012-01-01

    We report the results of a first, collective, blind experiment in RNA three-dimensional (3D) structure prediction, encompassing three prediction puzzles. The goals are to assess the leading edge of RNA structure prediction techniques; compare existing methods and tools; and evaluate their relative strengths, weaknesses, and limitations in terms of sequence length and structural complexity. The results should give potential users insight into the suitability of available methods for different applications and facilitate efforts in the RNA structure prediction community in ongoing efforts to improve prediction tools. We also report the creation of an automated evaluation pipeline to facilitate the analysis of future RNA structure prediction exercises. PMID:22361291

  9. Target Highlights in CASP9: Experimental Target Structures for the Critical Assessment of Techniques for Protein Structure Prediction

    PubMed Central

    Kryshtafovych, Andriy; Moult, John; Bartual, Sergio G.; Bazan, J. Fernando; Berman, Helen; Casteel, Darren E.; Christodoulou, Evangelos; Everett, John K.; Hausmann, Jens; Heidebrecht, Tatjana; Hills, Tanya; Hui, Raymond; Hunt, John F.; Jayaraman, Seetharaman; Joachimiak, Andrzej; Kennedy, Michael A.; Kim, Choel; Lingel, Andreas; Michalska, Karolina; Montelione, Gaetano T.; Otero, José M.; Perrakis, Anastassis; Pizarro, Juan C.; van Raaij, Mark J.; Ramelot, Theresa A.; Rousseau, Francois; Tong, Liang; Wernimont, Amy K.; Young, Jasmine; Schwede, Torsten

    2011-01-01

    One goal of the CASP Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction is to identify the current state of the art in protein structure prediction and modeling. A fundamental principle of CASP is blind prediction on a set of relevant protein targets, i.e. the participating computational methods are tested on a common set of experimental target proteins, for which the experimental structures are not known at the time of modeling. Therefore, the CASP experiment would not have been possible without broad support of the experimental protein structural biology community. In this manuscript, several experimental groups discuss the structures of the proteins which they provided as prediction targets for CASP9, highlighting structural and functional peculiarities of these structures: the long tail fibre protein gp37 from bacteriophage T4, the cyclic GMP-dependent protein kinase Iβ (PKGIβ) dimerization/docking domain, the ectodomain of the JTB (Jumping Translocation Breakpoint) transmembrane receptor, Autotaxin (ATX) in complex with an inhibitor, the DNA-Binding J-Binding Protein 1 (JBP1) domain essential for biosynthesis and maintenance of DNA base-J (β-D-glucosyl-hydroxymethyluracil) in Trypanosoma and Leishmania, an so far uncharacterized 73 residue domain from Ruminococcus gnavus with a fold typical for PDZ-like domains, a domain from the Phycobilisome (PBS) core-membrane linker (LCM) phycobiliprotein ApcE from Synechocystis, the Heat shock protein 90 (Hsp90) activators PFC0360w and PFC0270w from Plasmodium falciparum, and 2-oxo-3-deoxygalactonate kinase from Klebsiella pneumoniae. PMID:22020785

  10. The extended evolutionary synthesis: its structure, assumptions and predictions

    PubMed Central

    Laland, Kevin N.; Uller, Tobias; Feldman, Marcus W.; Sterelny, Kim; Müller, Gerd B.; Moczek, Armin; Jablonka, Eva; Odling-Smee, John

    2015-01-01

    Scientific activities take place within the structured sets of ideas and assumptions that define a field and its practices. The conceptual framework of evolutionary biology emerged with the Modern Synthesis in the early twentieth century and has since expanded into a highly successful research program to explore the processes of diversification and adaptation. Nonetheless, the ability of that framework satisfactorily to accommodate the rapid advances in developmental biology, genomics and ecology has been questioned. We review some of these arguments, focusing on literatures (evo-devo, developmental plasticity, inclusive inheritance and niche construction) whose implications for evolution can be interpreted in two ways—one that preserves the internal structure of contemporary evolutionary theory and one that points towards an alternative conceptual framework. The latter, which we label the ‘extended evolutionary synthesis' (EES), retains the fundaments of evolutionary theory, but differs in its emphasis on the role of constructive processes in development and evolution, and reciprocal portrayals of causation. In the EES, developmental processes, operating through developmental bias, inclusive inheritance and niche construction, share responsibility for the direction and rate of evolution, the origin of character variation and organism–environment complementarity. We spell out the structure, core assumptions and novel predictions of the EES, and show how it can be deployed to stimulate and advance research in those fields that study or use evolutionary biology. PMID:26246559

  11. The extended evolutionary synthesis: its structure, assumptions and predictions.

    PubMed

    Laland, Kevin N; Uller, Tobias; Feldman, Marcus W; Sterelny, Kim; Müller, Gerd B; Moczek, Armin; Jablonka, Eva; Odling-Smee, John

    2015-08-22

    Scientific activities take place within the structured sets of ideas and assumptions that define a field and its practices. The conceptual framework of evolutionary biology emerged with the Modern Synthesis in the early twentieth century and has since expanded into a highly successful research program to explore the processes of diversification and adaptation. Nonetheless, the ability of that framework satisfactorily to accommodate the rapid advances in developmental biology, genomics and ecology has been questioned. We review some of these arguments, focusing on literatures (evo-devo, developmental plasticity, inclusive inheritance and niche construction) whose implications for evolution can be interpreted in two ways—one that preserves the internal structure of contemporary evolutionary theory and one that points towards an alternative conceptual framework. The latter, which we label the 'extended evolutionary synthesis' (EES), retains the fundaments of evolutionary theory, but differs in its emphasis on the role of constructive processes in development and evolution, and reciprocal portrayals of causation. In the EES, developmental processes, operating through developmental bias, inclusive inheritance and niche construction, share responsibility for the direction and rate of evolution, the origin of character variation and organism-environment complementarity. We spell out the structure, core assumptions and novel predictions of the EES, and show how it can be deployed to stimulate and advance research in those fields that study or use evolutionary biology.

  12. Displacement based multilevel structural optimization

    NASA Technical Reports Server (NTRS)

    Striz, Alfred G.

    1995-01-01

    Multidisciplinary design optimization (MDO) is expected to play a major role in the competitive transportation industries of tomorrow, i.e., in the design of aircraft and spacecraft, of high speed trains, boats, and automobiles. All of these vehicles require maximum performance at minimum weight to keep fuel consumption low and conserve resources. Here, MDO can deliver mathematically based design tools to create systems with optimum performance subject to the constraints of disciplines such as structures, aerodynamics, controls, etc. Although some applications of MDO are beginning to surface, the key to a widespread use of this technology lies in the improvement of its efficiency. This aspect is investigated here for the MDO subset of structural optimization, i.e., for the weight minimization of a given structure under size, strength, and displacement constraints. Specifically, finite element based multilevel optimization of structures (here, statically indeterminate trusses and beams for proof of concept) is performed. In the system level optimization, the design variables are the coefficients of assumed displacement functions, and the load unbalance resulting from the solution of the stiffness equations is minimized. Constraints are placed on the deflection amplitudes and the weight of the structure. In the subsystems level optimizations, the weight of each element is minimized under the action of stress constraints, with the cross sectional dimensions as design variables. This approach is expected to prove very efficient, especially for complex structures, since the design task is broken down into a large number of small and efficiently handled subtasks, each with only a small number of variables. This partitioning will also allow for the use of parallel computing, first, by sending the system and subsystems level computations to two different processors, ultimately, by performing all subsystems level optimizations in a massively parallel manner on separate

  13. The myth of science-based predictive modeling.

    SciTech Connect

    Hemez, F. M.

    2004-01-01

    A key aspect of science-based predictive modeling is the assessment of prediction credibility. This publication argues that the credibility of a family of models and their predictions must combine three components: (1) the fidelity of predictions to test data; (2) the robustness of predictions to variability, uncertainty, and lack-of-knowledge; and (3) the prediction accuracy of models in cases where measurements are not available. Unfortunately, these three objectives are antagonistic. A recently published Theorem that demonstrates the irrevocable trade-offs between fidelity-to-data, robustness-to-uncertainty, and confidence in prediction is summarized. High-fidelity models cannot be made increasingly robust to uncertainty and lack-of-knowledge. Similarly, robustness-to-uncertainty can only be improved at the cost of reducing the confidence in prediction. The concept of confidence in prediction relies on a metric for total uncertainty, capable of aggregating different representations of uncertainty (probabilistic or not). The discussion is illustrated with an engineering application where a family of models is developed to predict the acceleration levels obtained when impacts of varying levels propagate through layers of crushable hyper-foam material of varying thicknesses. Convex modeling is invoked to represent a severe lack-of-knowledge about the constitutive material behavior. The analysis produces intervals of performance metrics from which the total uncertainty and confidence levels are estimated. Finally, performance, robustness and confidence are extrapolated throughout the validation domain to assess the predictive power of the family of models away from tested configurations.

  14. Predictive Multiscale Modeling of Nanocellulose Based Materials and Systems

    NASA Astrophysics Data System (ADS)

    Kovalenko, Andriy

    2014-08-01

    Cellulose Nanocrysals (CNC) is a renewable biodegradable biopolymer with outstanding mechanical properties made from highly abundant natural source, and therefore is very attractive as reinforcing additive to replace petroleum-based plastics in biocomposite materials, foams, and gels. Large-scale applications of CNC are currently limited due to its low solubility in non-polar organic solvents used in existing polymerization technologies. The solvation properties of CNC can be improved by chemical modification of its surface. Development of effective surface modifications has been rather slow because extensive chemical modifications destabilize the hydrogen bonding network of cellulose and deteriorate the mechanical properties of CNC. We employ predictive multiscale theory, modeling, and simulation to gain a fundamental insight into the effect of CNC surface modifications on hydrogen bonding, CNC crystallinity, solvation thermodynamics, and CNC compatibilization with the existing polymerization technologies, so as to rationally design green nanomaterials with improved solubility in non-polar solvents, controlled liquid crystal ordering and optimized extrusion properties. An essential part of this multiscale modeling approach is the statistical- mechanical 3D-RISM-KH molecular theory of solvation, coupled with quantum mechanics, molecular mechanics, and multistep molecular dynamics simulation. The 3D-RISM-KH theory provides predictive modeling of both polar and non-polar solvents, solvent mixtures, and electrolyte solutions in a wide range of concentrations and thermodynamic states. It properly accounts for effective interactions in solution such as steric effects, hydrophobicity and hydrophilicity, hydrogen bonding, salt bridges, buffer, co-solvent, and successfully predicts solvation effects and processes in bulk liquids, solvation layers at solid surface, and in pockets and other inner spaces of macromolecules and supramolecular assemblies. This methodology

  15. VfoldCPX Server: Predicting RNA-RNA Complex Structure and Stability

    PubMed Central

    Xu, Xiaojun; Chen, Shi-Jie

    2016-01-01

    RNA-RNA interactions are essential for genomic RNA dimerization, mRNA splicing, and many RNA-related gene expression and regulation processes. The prediction of the structure and folding stability of RNA-RNA complexes is a problem of significant biological importance and receives substantial interest in the biological community. The VfoldCPX server provides a new web interface to predict the two-dimensional (2D) structures of RNA-RNA complexes from the nucleotide sequences. The VfoldCPX server has several novel advantages including the ability to treat RNAs with tertiary contacts (crossing base pairs) such as loop-loop kissing interactions and the use of physical loop entropy parameters. Based on a partition function-based algorithm, the server enables prediction for structure with and without tertiary contacts. Furthermore, the server outputs a set of energetically stable structures, ranked by their stabilities. The results allow users to gain extensive physical insights into RNA-RNA interactions and their roles in RNA function. The web server is freely accessible at “http://rna.physics.missouri.edu/vfoldCPX”. PMID:27657918

  16. Lifetime Reliability Prediction of Ceramic Structures Under Transient Thermomechanical Loads

    NASA Technical Reports Server (NTRS)

    Nemeth, Noel N.; Jadaan, Osama J.; Gyekenyesi, John P.

    2005-01-01

    An analytical methodology is developed to predict the probability of survival (reliability) of ceramic components subjected to harsh thermomechanical loads that can vary with time (transient reliability analysis). This capability enables more accurate prediction of ceramic component integrity against fracture in situations such as turbine startup and shutdown, operational vibrations, atmospheric reentry, or other rapid heating or cooling situations (thermal shock). The transient reliability analysis methodology developed herein incorporates the following features: fast-fracture transient analysis (reliability analysis without slow crack growth, SCG); transient analysis with SCG (reliability analysis with time-dependent damage due to SCG); a computationally efficient algorithm to compute the reliability for components subjected to repeated transient loading (block loading); cyclic fatigue modeling using a combined SCG and Walker fatigue law; proof testing for transient loads; and Weibull and fatigue parameters that are allowed to vary with temperature or time. Component-to-component variation in strength (stochastic strength response) is accounted for with the Weibull distribution, and either the principle of independent action or the Batdorf theory is used to predict the effect of multiaxial stresses on reliability. The reliability analysis can be performed either as a function of the component surface (for surface-distributed flaws) or component volume (for volume-distributed flaws). The transient reliability analysis capability has been added to the NASA CARES/ Life (Ceramic Analysis and Reliability Evaluation of Structures/Life) code. CARES/Life was also updated to interface with commercially available finite element analysis software, such as ANSYS, when used to model the effects of transient load histories. Examples are provided to demonstrate the features of the methodology as implemented in the CARES/Life program.

  17. PETcofold: predicting conserved interactions and structures of two multiple alignments of RNA sequences

    PubMed Central

    Seemann, Stefan E.; Richter, Andreas S.; Gesell, Tanja; Backofen, Rolf; Gorodkin, Jan

    2011-01-01

    Motivation: Predicting RNA–RNA interactions is essential for determining the function of putative non-coding RNAs. Existing methods for the prediction of interactions are all based on single sequences. Since comparative methods have already been useful in RNA structure determination, we assume that conserved RNA–RNA interactions also imply conserved function. Of these, we further assume that a non-negligible amount of the existing RNA–RNA interactions have also acquired compensating base changes throughout evolution. We implement a method, PETcofold, that can take covariance information in intra-molecular and inter-molecular base pairs into account to predict interactions and secondary structures of two multiple alignments of RNA sequences. Results: PETcofold's ability to predict RNA–RNA interactions was evaluated on a carefully curated dataset of 32 bacterial small RNAs and their targets, which was manually extracted from the literature. For evaluation of both RNA–RNA interaction and structure prediction, we were able to extract only a few high-quality examples: one vertebrate small nucleolar RNA and four bacterial small RNAs. For these we show that the prediction can be improved by our comparative approach. Furthermore, PETcofold was evaluated on controlled data with phylogenetically simulated sequences enriched for covariance patterns at the interaction sites. We observed increased performance with increased amounts of covariance. Availability: The program PETcofold is available as source code and can be downloaded from http://rth.dk/resources/petcofold. Contact: gorodkin@rth.dk; backofen@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21088024

  18. Detecting and representing predictable structure during auditory scene analysis.

    PubMed

    Sohoglu, Ediz; Chait, Maria

    2016-01-01

    We use psychophysics and MEG to test how sensitivity to input statistics facilitates auditory-scene-analysis (ASA). Human subjects listened to 'scenes' comprised of concurrent tone-pip streams (sources). On occasional trials a new source appeared partway. Listeners were more accurate and quicker to detect source appearance in scenes comprised of temporally-regular (REG), rather than random (RAND), sources. MEG in passive listeners and those actively detecting appearance events revealed increased sustained activity in auditory and parietal cortex in REG relative to RAND scenes, emerging ~400 ms of scene-onset. Over and above this, appearance in REG scenes was associated with increased responses relative to RAND scenes. The effect of temporal structure on appearance-evoked responses was delayed when listeners were focused on the scenes relative to when listening passively, consistent with the notion that attention reduces 'surprise'. Overall, the results implicate a mechanism that tracks predictability of multiple concurrent sources to facilitate active and passive ASA.

  19. Structural Acoustic Prediction and Interior Noise Control Technology

    NASA Technical Reports Server (NTRS)

    Mathur, G. P.; Chin, C. L.; Simpson, M. A.; Lee, J. T.; Palumbo, Daniel L. (Technical Monitor)

    2001-01-01

    This report documents the results of Task 14, "Structural Acoustic Prediction and Interior Noise Control Technology". The task was to evaluate the performance of tuned foam elements (termed Smart Foam) both analytically and experimentally. Results taken from a three-dimensional finite element model of an active, tuned foam element are presented. Measurements of sound absorption and sound transmission loss were taken using the model. These results agree well with published data. Experimental performance data were taken in Boeing's Interior Noise Test Facility where 12 smart foam elements were applied to a 757 sidewall. Several configurations were tested. Noise reductions of 5-10 dB were achieved over the 200-800 Hz bandwidth of the controller. Accelerometers mounted on the panel provided a good reference for the controller. Configurations with far-field error microphones outperformed near-field cases.

  20. High Precision Prediction of Functional Sites in Protein Structures

    PubMed Central

    Buturovic, Ljubomir; Wong, Mike; Tang, Grace W.; Altman, Russ B.; Petkovic, Dragutin

    2014-01-01

    We address the problem of assigning biological function to solved protein structures. Computational tools play a critical role in identifying potential active sites and informing screening decisions for further lab analysis. A critical parameter in the practical application of computational methods is the precision, or positive predictive value. Precision measures the level of confidence the user should have in a particular computed functional assignment. Low precision annotations lead to futile laboratory investigations and waste scarce research resources. In this paper we describe an advanced version of the protein function annotation system FEATURE, which achieved 99% precision and average recall of 95% across 20 representative functional sites. The system uses a Support Vector Machine classifier operating on the microenvironment of physicochemical features around an amino acid. We also compared performance of our method with state-of-the-art sequence-level annotator Pfam in terms of precision, recall and localization. To our knowledge, no other functional site annotator has been rigorously evaluated against these key criteria. The software and predictive models are incorporated into the WebFEATURE service at http://feature.stanford.edu/wf4.0-beta. PMID:24632601

  1. Engineering Property Prediction Tools for Tailored Polymer Composite Structures

    SciTech Connect

    Nguyen, Ba Nghiep; Foss, Peter; Wyzgoski, Michael; Trantina, Gerry; Kunc, Vlastimil; Schutte, Carol; Smith, Mark T.

    2009-12-23

    This report summarizes our FY 2009 research activities for the project titled:"Engineering Property Prediction Tools for Tailored Polymer Composite Structures." These activities include (i) the completion of the development of a fiber length attrition model for injection-molded long-fiber thermoplastics (LFTs), (ii) development of the a fatigue damage model for LFTs and its implementation in ABAQUS, (iii) development of an impact damage model for LFTs and its implementation in ABAQUS, (iv) development of characterization methods for fatigue testing, (v) characterization of creep and fatigue responses of glass-fiber/polyamide (PA6,6) and glass-fiber/polypropylene (PP), (vi) characterization of fiber length distribution along the flow length of glass/PA6,6 and glass-fiber/PP, and (vii) characterization of impact responses of glass-fiber/PA6,6. The fiber length attrition model accurately captures the fiber length distribution along the flow length of the studied glass-fiber/PP material. The fatigue damage model is able to predict the S-N and stiffness reduction data which are valuable to the fatigue design of LFTs. The impact damage model correctly captures damage accumulation observed in experiments of glass-fiber/PA6,6 plaques.Further work includes validations of these models for representative LFT materials and a complex LFT part.

  2. Can we predict lattice energy from molecular structure?

    PubMed

    Ouvrard, Carole; Mitchell, John B O

    2003-10-01

    By using simply the numbers of occurrences of different atom types as descriptors, a conceptually transparent and remarkably accurate model for the prediction of the enthalpies of sublimation of organic compounds has been generated. The atom types are defined on the basis of atomic number, hybridization state and bonded environment. Models of this kind were applied firstly to aliphatic hydrocarbons, secondly to both aliphatic and aromatic hydrocarbons, thirdly to a wide range of non-hydrogen-bonding molecules, and finally to a set of 226 organic compounds including 70 containing hydrogen-bond donors and acceptors. The final model gives squared correlation coefficients of 0.925 for the 226 compounds in the training set and 0.937 for an independent test set of 35 compounds. The success of such a simple model implies that the enthalpy of sublimation can be predicted accurately without knowledge of the crystal packing. This hypothesis is in turn consistent with the idea that, rather than being determined by the particular features of the lowest-energy packing, the lattice energy is similar for a number of hypothetical alternative crystal structures of a molecule.

  3. Multiscale model for predicting shear zone structure and permeability in deforming rock

    NASA Astrophysics Data System (ADS)

    Cleary, Paul W.; Pereira, Gerald G.; Lemiale, Vincent; Piane, Claudio Delle; Clennell, M. Ben

    2016-04-01

    A novel multiscale model is proposed for the evolution of faults in rocks, which predicts their internal properties and permeability as strain increases. The macroscale model, based on smoothed particle hydrodynamics (SPH), predicts system scale deformation by a pressure-dependent elastoplastic representation of the rock and shear zone. Being a continuum method, SPH contains no intrinsic information on the grain scale structure or behaviour of the shear zone, so a series of discrete element method microscale shear cell models are embedded into the macroscale model at specific locations. In the example used here, the overall geometry and kinematics of a direct shear test on a block of intact rock is simulated. Deformation is imposed by a macroscale model where stresses and displacement rates are applied at the shear cell walls in contact with the rock. Since the microscale models within the macroscale block of deforming rock now include representations of the grains, the structure of the shear zone, the evolution of the size and shape distribution of these grains, and the dilatancy of the shear zone can all be predicted. The microscale dilatancy can be used to vary the macroscale model dilatancy both spatially and temporally to give a full two-way coupling between the spatial scales. The ability of this model to predict shear zone structure then allows the prediction of the shear zone permeability using the Lattice-Boltzmann method.

  4. Evaluation of a universal flow-through model for predicting and designing phosphorus removal structures.

    PubMed

    Penn, Chad; Bowen, James; McGrath, Joshua; Nairn, Robert; Fox, Garey; Brown, Glenn; Wilson, Stuart; Gill, Clinton

    2016-05-01

    Phosphorus (P) removal structures have been shown to decrease dissolved P loss from agricultural and urban areas which may reduce the threat of eutrophication. In order to design or quantify performance of these structures, the relationship between discrete and cumulative removal with cumulative P loading must be determined, either by individual flow-through experiments or model prediction. A model was previously developed for predicting P removal with P sorption materials (PSMs) under flow-through conditions, as a function of inflow P concentration, retention time (RT), and PSM characteristics. The objective of this study was to compare model results to measured P removal data from several PSM under a range of conditions (P concentrations and RT) and scales ranging from laboratory to field. Materials tested included acid mine drainage residuals (AMDRs), treated and non-treated electric arc furnace (EAF) steel slag at different size fractions, and flue gas desulfurization (FGD) gypsum. Equations for P removal curves and cumulative P removed were not significantly different between predicted and actual values for any of the 23 scenarios examined. However, the model did tend to slightly over-predict cumulative P removal for calcium-based PSMs. The ability of the model to predict P removal for various materials, RTs, and P concentrations in both controlled settings and field structures validate its use in design and quantification of these structures. This ability to predict P removal without constant monitoring is vital to widespread adoption of P removal structures, especially for meeting discharge regulations and nutrient trading programs. PMID:26950026

  5. A Fully Bayesian Approach to Improved Calibration and Prediction of Groundwater Models With Structure Error

    NASA Astrophysics Data System (ADS)

    Xu, T.; Valocchi, A. J.

    2014-12-01

    Effective water resource management typically relies on numerical models to analyse groundwater flow and solute transport processes. These models are usually subject to model structure error due to simplification and/or misrepresentation of the real system. As a result, the model outputs may systematically deviate from measurements, thus violating a key assumption for traditional regression-based calibration and uncertainty analysis. On the other hand, model structure error induced bias can be described statistically in an inductive, data-driven way based on historical model-to-measurement misfit. We adopt a fully Bayesian approach that integrates a Gaussian process error model to account for model structure error to the calibration, prediction and uncertainty analysis of groundwater models. The posterior distributions of parameters of the groundwater model and the Gaussian process error model are jointly inferred using DREAM, an efficient Markov chain Monte Carlo sampler. We test the usefulness of the fully Bayesian approach towards a synthetic case study of surface-ground water interaction under changing pumping conditions. We first illustrate through this example that traditional least squares regression without accounting for model structure error yields biased parameter estimates due to parameter compensation as well as biased predictions. In contrast, the Bayesian approach gives less biased parameter estimates. Moreover, the integration of a Gaussian process error model significantly reduces predictive bias and leads to prediction intervals that are more consistent with observations. The results highlight the importance of explicit treatment of model structure error especially in circumstances where subsequent decision-making and risk analysis require accurate prediction and uncertainty quantification. In addition, the data-driven error modelling approach is capable of extracting more information from observation data than using a groundwater model alone.

  6. Prediction of turn types in protein structure by machine-learning classifiers.

    PubMed

    Meissner, Michael; Koch, Oliver; Klebe, Gerhard; Schneider, Gisbert

    2009-02-01

    We present machine learning approaches for turn prediction from the amino acid sequence. Different turn classes and types were considered based on a novel turn classification scheme. We trained an unsupervised (self-organizing map) and two kernel-based classifiers, namely the support vector machine and a probabilistic neural network. Turn versus non-turn classification was carried out for turn families containing intramolecular hydrogen bonds and three to six residues. Support vector machine classifiers yielded a Matthews correlation coefficient (mcc) of approximately 0.6 and a prediction accuracy of 80%. Probabilistic neural networks were developed for beta-turn type prediction. The method was able to distinguish between five types of beta-turns yielding mcc > 0.5 and at least 80% overall accuracy. We conclude that the proposed new turn classification is distinct and well-defined, and machine learning classifiers are suited for sequence-based turn prediction. Their potential for sequence-based prediction of turn structures is discussed.

  7. Model-based Heart rate prediction during Lokomat walking.

    PubMed

    Koenig, Alexander C; Somaini, Luca; Pulfer, Michael; Holenstein, Thomas; Omlin, Ximena; Wieser, Martin; Riener, Robert

    2009-01-01

    We implemented a model for prediction of heart rate during Lokomat walking. Using this model, we can predict potential overstressing of the patient and adapt the physical load accordingly. Current models for treadmill based heart rate control neglect the fact that the interaction torques between Lokomat and human can have a significant effect on heart rate. Tests with five healthy subjects lead to a model of sixth order with walking speed and power expenditure as inputs and heart rate prediction as output. Recordings with five different subjects were used for model validation. Future work includes model identification and predictive heart rate control with spinal cord injured and stroke patients. PMID:19963765

  8. Prediction of the Fundamental Period of Infilled RC Frame Structures Using Artificial Neural Networks

    PubMed Central

    Asteris, Panagiotis G.; Tsaris, Athanasios K.; Cavaleri, Liborio; Repapis, Constantinos C.; Papalou, Angeliki; Di Trapani, Fabio; Karypidis, Dimitrios F.

    2016-01-01

    The fundamental period is one of the most critical parameters for the seismic design of structures. There are several literature approaches for its estimation which often conflict with each other, making their use questionable. Furthermore, the majority of these approaches do not take into account the presence of infill walls into the structure despite the fact that infill walls increase the stiffness and mass of structure leading to significant changes in the fundamental period. In the present paper, artificial neural networks (ANNs) are used to predict the fundamental period of infilled reinforced concrete (RC) structures. For the training and the validation of the ANN, a large data set is used based on a detailed investigation of the parameters that affect the fundamental period of RC structures. The comparison of the predicted values with analytical ones indicates the potential of using ANNs for the prediction of the fundamental period of infilled RC frame structures taking into account the crucial parameters that influence its value. PMID:27066069

  9. HAAD: A quick algorithm for accurate prediction of hydrogen atoms in protein structures.

    PubMed

    Li, Yunqi; Roy, Ambrish; Zhang, Yang

    2009-08-20

    Hydrogen constitutes nearly half of all atoms in proteins and their positions are essential for analyzing hydrogen-bonding interactions and refining atomic-level structures. However, most protein structures determined by experiments or computer prediction lack hydrogen coordinates. We present a new algorithm, HAAD, to predict the positions of hydrogen atoms based on the positions of heavy atoms. The algorithm is built on the basic rules of orbital hybridization followed by the optimization of steric repulsion and electrostatic interactions. We tested the algorithm using three independent data sets: ultra-high-resolution X-ray structures, structures determined by neutron diffraction, and NOE proton-proton distances. Compared with the widely used programs CHARMM and REDUCE, HAAD has a significantly higher accuracy, with the average RMSD of the predicted hydrogen atoms to the X-ray and neutron diffraction structures decreased by 26% and 11%, respectively. Furthermore, hydrogen atoms placed by HAAD have more matches with the NOE restraints and fewer clashes with heavy atoms. The average CPU cost by HAAD is 18 and 8 times lower than that of CHARMM and REDUCE, respectively. The significant advantage of HAAD in both the accuracy and the speed of the hydrogen additions should make HAAD a useful tool for the detailed study of protein structure and function. Both an executable and the source code of HAAD are freely available at http://zhang.bioinformatics.ku.edu/HAAD.

  10. Protein-Based Urine Test Predicts Kidney Transplant Outcomes

    MedlinePlus

    ... News Releases News Release Thursday, August 22, 2013 Protein-based urine test predicts kidney transplant outcomes NIH- ... supporting development of noninvasive tests. Levels of a protein in the urine of kidney transplant recipients can ...

  11. BetaSCPWeb: side-chain prediction for protein structures using Voronoi diagrams and geometry prioritization

    PubMed Central

    Ryu, Joonghyun; Lee, Mokwon; Cha, Jehyun; Laskowski, Roman A.; Ryu, Seong Eon; Kim, Deok-Soo

    2016-01-01

    Many applications, such as protein design, homology modeling, flexible docking, etc. require the prediction of a protein's optimal side-chain conformations from just its amino acid sequence and backbone structure. Side-chain prediction (SCP) is an NP-hard energy minimization problem. Here, we present BetaSCPWeb which efficiently computes a conformation close to optimal using a geometry-prioritization method based on the Voronoi diagram of spherical atoms. Its outputs are visual, textual and PDB file format. The web server is free and open to all users at http://voronoi.hanyang.ac.kr/betascpweb with no login requirement. PMID:27151195

  12. Flow structure generated by perpendicular blade vortex interaction and implications for helicopter noise predictions

    NASA Technical Reports Server (NTRS)

    Devenport, William J.; Glegg, Stewart A. L.

    1994-01-01

    Activities carried out in support of research on flow structure generated by perpendicular blade vortex interaction and implications for helicopter noise prediction are summarized. Progress in the following areas is described: (1) construction of 8 inch-chord NACA 0012 full-span blade; (2) Acquisition of two full-span blades; (3) preparation for hot wire measurements; (4) related work on a modified Betz's theory; and (5) work related to helicopter noise prediction. In addition, a list of publications based on the results of prior experimentation is presented.

  13. QuaBingo: A Prediction System for Protein Quaternary Structure Attributes Using Block Composition.

    PubMed

    Tung, Chi-Hua; Chen, Chi-Wei; Guo, Ren-Chao; Ng, Hui-Fuang; Chu, Yen-Wei

    2016-01-01

    Background. Quaternary structures of proteins are closely relevant to gene regulation, signal transduction, and many other biological functions of proteins. In the current study, a new method based on protein-conserved motif composition in block format for feature extraction is proposed, which is termed block composition. Results. The protein quaternary assembly states prediction system which combines blocks with functional domain composition, called QuaBingo, is constructed by three layers of classifiers that can categorize quaternary structural attributes of monomer, homooligomer, and heterooligomer. The building of the first layer classifier uses support vector machines (SVM) based on blocks and functional domains of proteins, and the second layer SVM was utilized to process the outputs of the first layer. Finally, the result is determined by the Random Forest of the third layer. We compared the effectiveness of the combination of block composition, functional domain composition, and pseudoamino acid composition of the model. In the 11 kinds of functional protein families, QuaBingo is 23% of Matthews Correlation Coefficient (MCC) higher than the existing prediction system. The results also revealed the biological characterization of the top five block compositions. Conclusions. QuaBingo provides better predictive ability for predicting the quaternary structural attributes of proteins. PMID:27610389

  14. QuaBingo: A Prediction System for Protein Quaternary Structure Attributes Using Block Composition.

    PubMed

    Tung, Chi-Hua; Chen, Chi-Wei; Guo, Ren-Chao; Ng, Hui-Fuang; Chu, Yen-Wei

    2016-01-01

    Background. Quaternary structures of proteins are closely relevant to gene regulation, signal transduction, and many other biological functions of proteins. In the current study, a new method based on protein-conserved motif composition in block format for feature extraction is proposed, which is termed block composition. Results. The protein quaternary assembly states prediction system which combines blocks with functional domain composition, called QuaBingo, is constructed by three layers of classifiers that can categorize quaternary structural attributes of monomer, homooligomer, and heterooligomer. The building of the first layer classifier uses support vector machines (SVM) based on blocks and functional domains of proteins, and the second layer SVM was utilized to process the outputs of the first layer. Finally, the result is determined by the Random Forest of the third layer. We compared the effectiveness of the combination of block composition, functional domain composition, and pseudoamino acid composition of the model. In the 11 kinds of functional protein families, QuaBingo is 23% of Matthews Correlation Coefficient (MCC) higher than the existing prediction system. The results also revealed the biological characterization of the top five block compositions. Conclusions. QuaBingo provides better predictive ability for predicting the quaternary structural attributes of proteins.

  15. QuaBingo: A Prediction System for Protein Quaternary Structure Attributes Using Block Composition

    PubMed Central

    Tung, Chi-Hua; Chen, Chi-Wei; Guo, Ren-Chao; Ng, Hui-Fuang

    2016-01-01

    Background. Quaternary structures of proteins are closely relevant to gene regulation, signal transduction, and many other biological functions of proteins. In the current study, a new method based on protein-conserved motif composition in block format for feature extraction is proposed, which is termed block composition. Results. The protein quaternary assembly states prediction system which combines blocks with functional domain composition, called QuaBingo, is constructed by three layers of classifiers that can categorize quaternary structural attributes of monomer, homooligomer, and heterooligomer. The building of the first layer classifier uses support vector machines (SVM) based on blocks and functional domains of proteins, and the second layer SVM was utilized to process the outputs of the first layer. Finally, the result is determined by the Random Forest of the third layer. We compared the effectiveness of the combination of block composition, functional domain composition, and pseudoamino acid composition of the model. In the 11 kinds of functional protein families, QuaBingo is 23% of Matthews Correlation Coefficient (MCC) higher than the existing prediction system. The results also revealed the biological characterization of the top five block compositions. Conclusions. QuaBingo provides better predictive ability for predicting the quaternary structural attributes of proteins. PMID:27610389

  16. QuaBingo: A Prediction System for Protein Quaternary Structure Attributes Using Block Composition

    PubMed Central

    Tung, Chi-Hua; Chen, Chi-Wei; Guo, Ren-Chao; Ng, Hui-Fuang

    2016-01-01

    Background. Quaternary structures of proteins are closely relevant to gene regulation, signal transduction, and many other biological functions of proteins. In the current study, a new method based on protein-conserved motif composition in block format for feature extraction is proposed, which is termed block composition. Results. The protein quaternary assembly states prediction system which combines blocks with functional domain composition, called QuaBingo, is constructed by three layers of classifiers that can categorize quaternary structural attributes of monomer, homooligomer, and heterooligomer. The building of the first layer classifier uses support vector machines (SVM) based on blocks and functional domains of proteins, and the second layer SVM was utilized to process the outputs of the first layer. Finally, the result is determined by the Random Forest of the third layer. We compared the effectiveness of the combination of block composition, functional domain composition, and pseudoamino acid composition of the model. In the 11 kinds of functional protein families, QuaBingo is 23% of Matthews Correlation Coefficient (MCC) higher than the existing prediction system. The results also revealed the biological characterization of the top five block compositions. Conclusions. QuaBingo provides better predictive ability for predicting the quaternary structural attributes of proteins.

  17. Prediction of HIV drug resistance from genotype with encoded three-dimensional protein structure

    PubMed Central

    2014-01-01

    Background Drug resistance has become a severe challenge for treatment of HIV infections. Mutations accumulate in the HIV genome and make certain drugs ineffective. Prediction of resistance from genotype data is a valuable guide in choice of drugs for effective therapy. Results In order to improve the computational prediction of resistance from genotype data we have developed a unified encoding of the protein sequence and three-dimensional protein structure of the drug target for classification and regression analysis. The method was tested on genotype-resistance data for mutants of HIV protease and reverse transcriptase. Our graph based sequence-structure approach gives high accuracy with a new sparse dictionary classification method, as well as support vector machine and artificial neural networks classifiers. Cross-validated regression analysis with the sparse dictionary gave excellent correlation between predicted and observed resistance. Conclusion The approach of encoding the protein structure and sequence as a 210-dimensional vector, based on Delaunay triangulation, has promise as an accurate method for predicting resistance from sequence for drugs inhibiting HIV protease and reverse transcriptase. PMID:25081370

  18. Structure-dynamics relationships in bursting neuronal networks revealed using a prediction framework.

    PubMed

    Mäki-Marttunen, Tuomo; Aćimović, Jugoslava; Ruohonen, Keijo; Linne, Marja-Leena

    2013-01-01

    The question of how the structure of a neuronal network affects its functionality has gained a lot of attention in neuroscience. However, the vast majority of the studies on structure-dynamics relationships consider few types of network structures and assess limited numbers of structural measures. In this in silico study, we employ a wide diversity of network topologies and search among many possibilities the aspects of structure that have the greatest effect on the network excitability. The network activity is simulated using two point-neuron models, where the neurons are activated by noisy fluctuation of the membrane potential and their connections are described by chemical synapse models, and statistics on the number and quality of the emergent network bursts are collected for each network type. We apply a prediction framework to the obtained data in order to find out the most relevant aspects of network structure. In this framework, predictors that use different sets of graph-theoretic measures are trained to estimate the activity properties, such as burst count or burst length, of the networks. The performances of these predictors are compared with each other. We show that the best performance in prediction of activity properties for networks with sharp in-degree distribution is obtained when the prediction is based on clustering coefficient. By contrast, for networks with broad in-degree distribution, the maximum eigenvalue of the connectivity graph gives the most accurate prediction. The results shown for small ([Formula: see text]) networks hold with few exceptions when different neuron models, different choices of neuron population and different average degrees are applied. We confirm our conclusions using larger ([Formula: see text]) networks as well. Our findings reveal the relevance of different aspects of network structure from the viewpoint of network excitability, and our integrative method could serve as a general framework for structure

  19. Ocean circulation model predicts high genetic structure observed in a long-lived pelagic developer.

    PubMed

    Sunday, J M; Popovic, I; Palen, W J; Foreman, M G G; Hart, M W

    2014-10-01

    Understanding the movement of genes and individuals across marine seascapes is a long-standing challenge in marine ecology and can inform our understanding of local adaptation, the persistence and movement of populations, and the spatial scale of effective management. Patterns of gene flow in the ocean are often inferred based on population genetic analyses coupled with knowledge of species' dispersive life histories. However, genetic structure is the result of time-integrated processes and may not capture present-day connectivity between populations. Here, we use a high-resolution oceanographic circulation model to predict larval dispersal along the complex coastline of western Canada that includes the transition between two well-studied zoogeographic provinces. We simulate dispersal in a benthic sea star with a 6-10 week pelagic larval phase and test predictions of this model against previously observed genetic structure including a strong phylogeographic break within the zoogeographical transition zone. We also test predictions with new genetic sampling in a site within the phylogeographic break. We find that the coupled genetic and circulation model predicts the high degree of genetic structure observed in this species, despite its long pelagic duration. High genetic structure on this complex coastline can thus be explained through ocean circulation patterns, which tend to retain passive larvae within 20-50 km of their parents, suggesting a necessity for close-knit design of Marine Protected Area networks.

  20. Plant genetics predicts intra-annual variation in phytochemistry and arthropod community structure.

    PubMed

    Wimp, G M; Wooley, S; Bangert, R K; Young, W P; Martinsen, G D; Keim, P; Rehill, B; Lindroth, R L; Whitham, T G

    2007-12-01

    With the emerging field of community genetics, it is important to quantify the key mechanisms that link genetics and community structure. We studied cottonwoods in common gardens and in natural stands and examined the potential for plant chemistry to be a primary mechanism linking plant genetics and arthropod communities. If plant chemistry drives the relationship between plant genetics and arthropod community structure, then several predictions followed. We would find (i) the strongest correlation between plant genetic composition and chemical composition; (ii) an intermediate correlation between plant chemical composition and arthropod community composition; and (iii) the weakest relationship between plant genetic composition and arthropod community composition. Our results supported our first prediction: plant genetics and chemistry had the strongest correlation in the common garden and the wild. Our results largely supported our second prediction, but varied across space, seasonally, and according to arthropod feeding group. Plant chemistry played a larger role in structuring common garden arthropod communities relative to wild communities, free-living arthropods relative to leaf and stem modifiers, and early-season relative to late-season arthropods. Our results did not support our last prediction, as host plant genetics was at least as tightly linked to arthropod community structure as plant chemistry, if not more so. Our results demonstrate the consistency of the relationship between plant genetics and biodiversity. Additionally, plant chemistry can be an important mechanism by which plant genetics affects arthropod community composition, but other genetic-based factors are likely involved that remain to be measured.

  1. Ocean circulation model predicts high genetic structure observed in a long-lived pelagic developer.

    PubMed

    Sunday, J M; Popovic, I; Palen, W J; Foreman, M G G; Hart, M W

    2014-10-01

    Understanding the movement of genes and individuals across marine seascapes is a long-standing challenge in marine ecology and can inform our understanding of local adaptation, the persistence and movement of populations, and the spatial scale of effective management. Patterns of gene flow in the ocean are often inferred based on population genetic analyses coupled with knowledge of species' dispersive life histories. However, genetic structure is the result of time-integrated processes and may not capture present-day connectivity between populations. Here, we use a high-resolution oceanographic circulation model to predict larval dispersal along the complex coastline of western Canada that includes the transition between two well-studied zoogeographic provinces. We simulate dispersal in a benthic sea star with a 6-10 week pelagic larval phase and test predictions of this model against previously observed genetic structure including a strong phylogeographic break within the zoogeographical transition zone. We also test predictions with new genetic sampling in a site within the phylogeographic break. We find that the coupled genetic and circulation model predicts the high degree of genetic structure observed in this species, despite its long pelagic duration. High genetic structure on this complex coastline can thus be explained through ocean circulation patterns, which tend to retain passive larvae within 20-50 km of their parents, suggesting a necessity for close-knit design of Marine Protected Area networks. PMID:25231198

  2. In silico prediction of the β-cyclodextrin complexation based on Monte Carlo method.

    PubMed

    Veselinović, Aleksandar M; Veselinović, Jovana B; Toropov, Andrey A; Toropova, Alla P; Nikolić, Goran M

    2015-11-10

    In this study QSPR models were developed to predict the complexation of structurally diverse compounds with β-cyclodextrin based on SMILES notation optimal descriptors using Monte Carlo method. The predictive potential of the applied approach was tested with three random splits into the sub-training, calibration, test and validation sets and with different statistical methods. Obtained results demonstrate that Monte Carlo method based modeling is a very promising computational method in the QSPR studies for predicting the complexation of structurally diverse compounds with β-cyclodextrin. The SMILES attributes (structural features both local and global), defined as molecular fragments, which are promoters of the increase/decrease of molecular binding constants were identified. These structural features were correlated to the complexation process and their identification helped to improve the understanding for the complexation mechanisms of the host molecules.

  3. Local-prediction-based difference expansion reversible watermarking.

    PubMed

    Dragoi, Ioan-Catalin; Coltuc, Dinu

    2014-04-01

    This paper investigates the use of local prediction in difference expansion reversible watermarking. For each pixel, a least square predictor is computed on a square block centered on the pixel and the corresponding prediction error is expanded. The same predictor is recovered at detection without any additional information. The proposed local prediction is general and it applies regardless of the predictor order or the prediction context. For the particular cases of least square predictors with the same context as the median edge detector, gradient-adjusted predictor or the simple rhombus neighborhood, the local prediction-based reversible watermarking clearly outperforms the state-of-the-art schemes based on the classical counterparts. Experimental results are provided.

  4. Genotyping Oral Commensal Bacteria to Predict Social Contact and Structure

    PubMed Central

    Wallace, Amelia D.; Riley, Lee W.

    2016-01-01

    Social network structure is a fundamental determinant of human health, from infectious to chronic diseases. However, quantitative and unbiased approaches to measuring social network structure are lacking. We hypothesized that genetic relatedness of oral commensal bacteria could be used to infer social contact between humans, just as genetic relatedness of pathogens can be used to determine transmission chains of pathogens. We used a traditional, questionnaire survey-based method to characterize the contact network of the School of Public Health at a large research university. We then collected saliva from a subset of individuals to analyze their oral microflora using a modified deep sequencing multilocus sequence typing (MLST) procedure. We examined micro-evolutionary changes in the S. viridans group to uncover transmission patterns reflecting social network structure. We amplified seven housekeeping gene loci from the Streptococcus viridans group, a group of ubiquitous commensal bacteria, and sequenced the PCR products using next-generation sequencing. By comparing the generated S. viridans reads between pairs of individuals, we reconstructed the social network of the sampled individuals and compared it to the network derived from the questionnaire survey-based method. The genetic relatedness significantly (p-value < 0.001) correlated with social distance in the questionnaire-based network, and the reconstructed network closely matched the network derived from the questionnaire survey-based method. Oral commensal bacterial are thus likely transmitted through routine physical contact or shared environment. Their genetic relatedness can be used to represent a combination of social contact and shared physical space, therefore reconstructing networks of contact. This study provides the first step in developing a method to measure direct social contact based on commensal organism genotyping, potentially capable of unmasking hidden social networks that contribute to

  5. Vertical structure of predictability and information transport over the Northern Hemisphere

    NASA Astrophysics Data System (ADS)

    Feng, Ai-Xia; Wang, Qi-Gang; Gong, Zhi-Qiang; Feng, Guo-Lin

    2014-02-01

    Based on nonlinear prediction and information theory, vertical heterogeneity of predictability and information loss rate in geopotential height field are obtained over the Northern Hemisphere. On a seasonal-to-interannual time scale, the predictability is low in the lower troposphere and high in the mid-upper troposphere. However, within mid-upper troposphere over the subtropics ocean area, there is a relatively poor predictability. These conclusions also fit the seasonal time scale. Moving to the interannual time scale, the predictability becomes high in the lower troposphere and low in the mid-upper troposphere, contrary to the former case. On the whole the interannual trend is more predictable than the seasonal trend. The average information loss rate is low over the mid-east Pacific, west of North America, Atlantic and Eurasia, and the atmosphere over other places has a relatively high information loss rate on all-time scales. Two channels are found steadily over the Pacific Ocean and Atlantic Ocean in subtropics. There are also unstable channels. The four-season influence on predictability and information communication are studied. The predictability is low, no matter which season data are removed and each season plays an important role in the existence of the channels, except for the winter. The predictability and teleconnections are paramount issues in atmospheric science, and the teleconnections may be established by communication channels. So, this work is interesting since it reveals the vertical structure of predictability distribution, channel locations, and the contributions of different time scales to them and their variations under different seasons.

  6. Automated antibody structure prediction using Accelrys tools: Results and best practices

    PubMed Central

    Fasnacht, Marc; Butenhof, Ken; Goupil-Lamy, Anne; Hernandez-Guzman, Francisco; Huang, Hongwei; Yan, Lisa

    2014-01-01

    We describe the methodology and results from our participation in the second Antibody Modeling Assessment experiment. During the experiment we predicted the structure of eleven unpublished antibody Fv fragments. Our prediction methods centered on template-based modeling; potential templates were selected from an antibody database based on their sequence similarity to the target in the framework regions. Depending on the quality of the templates, we constructed models of the antibody framework regions either using a single, chimeric or multiple template approach. The hypervariable loop regions in the initial models were rebuilt by grafting the corresponding regions from suitable templates onto the model. For the H3 loop region, we further refined models using ab initio methods. The final models were subjected to constrained energy minimization to resolve severe local structural problems. The analysis of the models submitted show that Accelrys tools allow for the construction of quite accurate models for the framework and the canonical CDR regions, with RMSDs to the X-ray structure on average below 1 Å for most of these regions. The results show that accurate prediction of the H3 hypervariable loops remains a challenge. Furthermore, model quality assessment of the submitted models show that the models are of quite high quality, with local geometry assessment scores similar to that of the target X-ray structures. Proteins 2014; 82:1583–1598. © 2014 The Authors. Proteins published by Wiley Periodicals, Inc. PMID:24833271

  7. Hazard of pharmaceuticals for aquatic environment: Prioritization by structural approaches and prediction of ecotoxicity.