assigning large proteins: Topics by Science.gov

Sample records for assigning large proteins

Mixed pyruvate labeling enables backbone resonance assignment of large proteins using a single experiment.

PubMed

Robson, Scott A; Takeuchi, Koh; Boeszoermenyi, Andras; Coote, Paul W; Dubey, Abhinav; Hyberts, Sven; Wagner, Gerhard; Arthanari, Haribabu

2018-01-24

Backbone resonance assignment is a critical first step in the investigation of proteins by NMR. This is traditionally achieved with a standard set of experiments, most of which are not optimal for large proteins. Of these, HNCA is the most sensitive experiment that provides sequential correlations. However, this experiment suffers from chemical shift degeneracy problems during the assignment procedure. We present a strategy that increases the effective resolution of HNCA and enables near-complete resonance assignment using this single HNCA experiment. We utilize a combination of 2- 13 C and 3- 13 C pyruvate as the carbon source for isotope labeling, which suppresses the one bond ( 1 J αβ ) coupling providing enhanced resolution for the Cα resonance and amino acid-specific peak shapes that arise from the residual coupling. Using this approach, we can obtain near-complete (>85%) backbone resonance assignment of a 42 kDa protein using a single HNCA experiment.
Practical aspects of NMR signal assignment in larger and challenging proteins

PubMed Central

Frueh, Dominique P.

2014-01-01

NMR has matured into a technique routinely employed for studying proteins in near physiological conditions. However, applications to larger proteins are impeded by the complexity of the various correlation maps necessary to assign NMR signals. This article reviews the data analysis techniques traditionally employed for resonance assignment and describes alternative protocols necessary for overcoming challenges in large protein spectra. In particular, simultaneous analysis of multiple spectra may help overcome ambiguities or may reveal correlations in an indirect manner. Similarly, visualization of orthogonal planes in a multidimensional spectrum can provide alternative assignment procedures. We describe examples of such strategies for assignment of backbone, methyl, and nOe resonances. We describe experimental aspects of data acquisition for the related experiments and provide guidelines for preliminary studies. Focus is placed on large folded monomeric proteins and examples are provided for 37, 48, 53, and 81 kDa proteins. PMID:24534088
Protein Side-Chain Resonance Assignment and NOE Assignment Using RDC-Defined Backbones without TOCSY Data3

PubMed Central

Zeng, Jianyang; Zhou, Pei; Donald, Bruce Randall

2011-01-01

One bottleneck in NMR structure determination lies in the laborious and time-consuming process of side-chain resonance and NOE assignments. Compared to the well-studied backbone resonance assignment problem, automated side-chain resonance and NOE assignments are relatively less explored. Most NOE assignment algorithms require nearly complete side-chain resonance assignments from a series of through-bond experiments such as HCCH-TOCSY or HCCCONH. Unfortunately, these TOCSY experiments perform poorly on large proteins. To overcome this deficiency, we present a novel algorithm, called NASCA (NOE Assignment and Side-Chain Assignment), to automate both side-chain resonance and NOE assignments and to perform high-resolution protein structure determination in the absence of any explicit through-bond experiment to facilitate side-chain resonance assignment, such as HCCH-TOCSY. After casting the assignment problem into a Markov Random Field (MRF), NASCA extends and applies combinatorial protein design algorithms to compute optimal assignments that best interpret the NMR data. The MRF captures the contact map information of the protein derived from NOESY spectra, exploits the backbone structural information determined by RDCs, and considers all possible side-chain rotamers. The complexity of the combinatorial search is reduced by using a dead-end elimination (DEE) algorithm, which prunes side-chain resonance assignments that are provably not part of the optimal solution. Then an A* search algorithm is employed to find a set of optimal side-chain resonance assignments that best fit the NMR data. These side-chain resonance assignments are then used to resolve the NOE assignment ambiguity and compute high-resolution protein structures. Tests on five proteins show that NASCA assigns resonances for more than 90% of side-chain protons, and achieves about 80% correct assignments. The final structures computed using the NOE distance restraints assigned by NASCA have backbone RMSD 0
Fast and accurate resonance assignment of small-to-large proteins by combining automated and manual approaches.

PubMed

Niklasson, Markus; Ahlner, Alexandra; Andresen, Cecilia; Marsh, Joseph A; Lundström, Patrik

2015-01-01

The process of resonance assignment is fundamental to most NMR studies of protein structure and dynamics. Unfortunately, the manual assignment of residues is tedious and time-consuming, and can represent a significant bottleneck for further characterization. Furthermore, while automated approaches have been developed, they are often limited in their accuracy, particularly for larger proteins. Here, we address this by introducing the software COMPASS, which, by combining automated resonance assignment with manual intervention, is able to achieve accuracy approaching that from manual assignments at greatly accelerated speeds. Moreover, by including the option to compensate for isotope shift effects in deuterated proteins, COMPASS is far more accurate for larger proteins than existing automated methods. COMPASS is an open-source project licensed under GNU General Public License and is available for download from http://www.liu.se/forskning/foass/tidigare-foass/patrik-lundstrom/software?l=en. Source code and binaries for Linux, Mac OS X and Microsoft Windows are available.
Fast and Accurate Resonance Assignment of Small-to-Large Proteins by Combining Automated and Manual Approaches

PubMed Central

Niklasson, Markus; Ahlner, Alexandra; Andresen, Cecilia; Marsh, Joseph A.; Lundström, Patrik

2015-01-01

The process of resonance assignment is fundamental to most NMR studies of protein structure and dynamics. Unfortunately, the manual assignment of residues is tedious and time-consuming, and can represent a significant bottleneck for further characterization. Furthermore, while automated approaches have been developed, they are often limited in their accuracy, particularly for larger proteins. Here, we address this by introducing the software COMPASS, which, by combining automated resonance assignment with manual intervention, is able to achieve accuracy approaching that from manual assignments at greatly accelerated speeds. Moreover, by including the option to compensate for isotope shift effects in deuterated proteins, COMPASS is far more accurate for larger proteins than existing automated methods. COMPASS is an open-source project licensed under GNU General Public License and is available for download from http://www.liu.se/forskning/foass/tidigare-foass/patrik-lundstrom/software?l=en. Source code and binaries for Linux, Mac OS X and Microsoft Windows are available. PMID:25569628
Covariance NMR Processing and Analysis for Protein Assignment.

PubMed

Harden, Bradley J; Frueh, Dominique P

2018-01-01

During NMR resonance assignment it is often necessary to relate nuclei to one another indirectly, through their common correlations to other nuclei. Covariance NMR has emerged as a powerful technique to correlate such nuclei without relying on error-prone peak peaking. However, false-positive artifacts in covariance spectra have impeded a general application to proteins. We recently introduced pre- and postprocessing steps to reduce the prevalence of artifacts in covariance spectra, allowing for the calculation of a variety of 4D covariance maps obtained from diverse combinations of pairs of 3D spectra, and we have employed them to assign backbone and sidechain resonances in two large and challenging proteins. In this chapter, we present a detailed protocol describing how to (1) properly prepare existing 3D spectra for covariance, (2) understand and apply our processing script, and (3) navigate and interpret the resulting 4D spectra. We also provide solutions to a number of errors that may occur when using our script, and we offer practical advice when assigning difficult signals. We believe such 4D spectra, and covariance NMR in general, can play an integral role in the assignment of NMR signals.
Assignment of protein sequences to existing domain and family classification systems: Pfam and the PDB.

PubMed

Xu, Qifang; Dunbrack, Roland L

2012-11-01

Automating the assignment of existing domain and protein family classifications to new sets of sequences is an important task. Current methods often miss assignments because remote relationships fail to achieve statistical significance. Some assignments are not as long as the actual domain definitions because local alignment methods often cut alignments short. Long insertions in query sequences often erroneously result in two copies of the domain assigned to the query. Divergent repeat sequences in proteins are often missed. We have developed a multilevel procedure to produce nearly complete assignments of protein families of an existing classification system to a large set of sequences. We apply this to the task of assigning Pfam domains to sequences and structures in the Protein Data Bank (PDB). We found that HHsearch alignments frequently scored more remotely related Pfams in Pfam clans higher than closely related Pfams, thus, leading to erroneous assignment at the Pfam family level. A greedy algorithm allowing for partial overlaps was, thus, applied first to sequence/HMM alignments, then HMM-HMM alignments and then structure alignments, taking care to join partial alignments split by large insertions into single-domain assignments. Additional assignment of repeat Pfams with weaker E-values was allowed after stronger assignments of the repeat HMM. Our database of assignments, presented in a database called PDBfam, contains Pfams for 99.4% of chains >50 residues. The Pfam assignment data in PDBfam are available at http://dunbrack2.fccc.edu/ProtCid/PDBfam, which can be searched by PDB codes and Pfam identifiers. They will be updated regularly.
An Algorithm for Protein Helix Assignment Using Helix Geometry

PubMed Central

Cao, Chen; Xu, Shutan; Wang, Lincong

2015-01-01

Helices are one of the most common and were among the earliest recognized secondary structure elements in proteins. The assignment of helices in a protein underlies the analysis of its structure and function. Though the mathematical expression for a helical curve is simple, no previous assignment programs have used a genuine helical curve as a model for helix assignment. In this paper we present a two-step assignment algorithm. The first step searches for a series of bona fide helical curves each one best fits the coordinates of four successive backbone Cα atoms. The second step uses the best fit helical curves as input to make helix assignment. The application to the protein structures in the PDB (protein data bank) proves that the algorithm is able to assign accurately not only regular α-helix but also 310 and π helices as well as their left-handed versions. One salient feature of the algorithm is that the assigned helices are structurally more uniform than those by the previous programs. The structural uniformity should be useful for protein structure classification and prediction while the accurate assignment of a helix to a particular type underlies structure-function relationship in proteins. PMID:26132394
Assignment of protein sequences to existing domain and family classification systems: Pfam and the PDB

PubMed Central

Dunbrack, Roland L.

2012-01-01

Motivation: Automating the assignment of existing domain and protein family classifications to new sets of sequences is an important task. Current methods often miss assignments because remote relationships fail to achieve statistical significance. Some assignments are not as long as the actual domain definitions because local alignment methods often cut alignments short. Long insertions in query sequences often erroneously result in two copies of the domain assigned to the query. Divergent repeat sequences in proteins are often missed. Results: We have developed a multilevel procedure to produce nearly complete assignments of protein families of an existing classification system to a large set of sequences. We apply this to the task of assigning Pfam domains to sequences and structures in the Protein Data Bank (PDB). We found that HHsearch alignments frequently scored more remotely related Pfams in Pfam clans higher than closely related Pfams, thus, leading to erroneous assignment at the Pfam family level. A greedy algorithm allowing for partial overlaps was, thus, applied first to sequence/HMM alignments, then HMM–HMM alignments and then structure alignments, taking care to join partial alignments split by large insertions into single-domain assignments. Additional assignment of repeat Pfams with weaker E-values was allowed after stronger assignments of the repeat HMM. Our database of assignments, presented in a database called PDBfam, contains Pfams for 99.4% of chains >50 residues. Availability: The Pfam assignment data in PDBfam are available at http://dunbrack2.fccc.edu/ProtCid/PDBfam, which can be searched by PDB codes and Pfam identifiers. They will be updated regularly. Contact: Roland.Dunbracks@fccc.edu PMID:22942020
A Markov Random Field Framework for Protein Side-Chain Resonance Assignment

NASA Astrophysics Data System (ADS)

Zeng, Jianyang; Zhou, Pei; Donald, Bruce Randall

Nuclear magnetic resonance (NMR) spectroscopy plays a critical role in structural genomics, and serves as a primary tool for determining protein structures, dynamics and interactions in physiologically-relevant solution conditions. The current speed of protein structure determination via NMR is limited by the lengthy time required in resonance assignment, which maps spectral peaks to specific atoms and residues in the primary sequence. Although numerous algorithms have been developed to address the backbone resonance assignment problem [68,2,10,37,14,64,1,31,60], little work has been done to automate side-chain resonance assignment [43, 48, 5]. Most previous attempts in assigning side-chain resonances depend on a set of NMR experiments that record through-bond interactions with side-chain protons for each residue. Unfortunately, these NMR experiments have low sensitivity and limited performance on large proteins, which makes it difficult to obtain enough side-chain resonance assignments. On the other hand, it is essential to obtain almost all of the side-chain resonance assignments as a prerequisite for high-resolution structure determination. To overcome this deficiency, we present a novel side-chain resonance assignment algorithm based on alternative NMR experiments measuring through-space interactions between protons in the protein, which also provide crucial distance restraints and are normally required in high-resolution structure determination. We cast the side-chain resonance assignment problem into a Markov Random Field (MRF) framework, and extend and apply combinatorial protein design algorithms to compute the optimal solution that best interprets the NMR data. Our MRF framework captures the contact map information of the protein derived from NMR spectra, and exploits the structural information available from the backbone conformations determined by orientational restraints and a set of discretized side-chain conformations (i.e., rotamers). A Hausdorff
ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects.

PubMed

Zhang, Yaoyang; Xu, Tao; Shan, Bing; Hart, Jonathan; Aslanian, Aaron; Han, Xuemei; Zong, Nobel; Li, Haomin; Choi, Howard; Wang, Dong; Acharya, Lipi; Du, Lisa; Vogt, Peter K; Ping, Peipei; Yates, John R

2015-11-03

Shotgun proteomics generates valuable information from large-scale and target protein characterizations, including protein expression, protein quantification, protein post-translational modifications (PTMs), protein localization, and protein-protein interactions. Typically, peptides derived from proteolytic digestion, rather than intact proteins, are analyzed by mass spectrometers because peptides are more readily separated, ionized and fragmented. The amino acid sequences of peptides can be interpreted by matching the observed tandem mass spectra to theoretical spectra derived from a protein sequence database. Identified peptides serve as surrogates for their proteins and are often used to establish what proteins were present in the original mixture and to quantify protein abundance. Two major issues exist for assigning peptides to their originating protein. The first issue is maintaining a desired false discovery rate (FDR) when comparing or combining multiple large datasets generated by shotgun analysis and the second issue is properly assigning peptides to proteins when homologous proteins are present in the database. Herein we demonstrate a new computational tool, ProteinInferencer, which can be used for protein inference with both small- or large-scale data sets to produce a well-controlled protein FDR. In addition, ProteinInferencer introduces confidence scoring for individual proteins, which makes protein identifications evaluable. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.
DSSPcont: continuous secondary structure assignments for proteins

PubMed Central

Carter, Phil; Andersen, Claus A. F.; Rost, Burkhard

2003-01-01

The DSSP program automatically assigns the secondary structure for each residue from the three-dimensional co-ordinates of a protein structure to one of eight states. However, discrete assignments are incomplete in that they cannot capture the continuum of thermal fluctuations. Therefore, DSSPcont (http://cubic.bioc.columbia.edu/services/DSSPcont) introduces a continuous assignment of secondary structure that replaces ‘static’ by ‘dynamic’ states. Technically, the continuum results from calculating weighted averages over 10 discrete DSSP assignments with different hydrogen bond thresholds. A DSSPcont assignment for a particular residue is a percentage likelihood of eight secondary structure states, derived from a weighted average of the ten DSSP assignments. The continuous assignments have two important features: (i) they reflect the structural variations due to thermal fluctuations as detected by NMR spectroscopy; and (ii) they reproduce the structural variation between many NMR models from one single model. Therefore, functionally important variation can be extracted from a single X-ray structure using the continuous assignment procedure. PMID:12824310
More reliable protein NMR peak assignment via improved 2-interval scheduling.

PubMed

Chen, Zhi-Zhong; Lin, Guohui; Rizzi, Romeo; Wen, Jianjun; Xu, Dong; Xu, Ying; Jiang, Tao

2005-03-01

Protein NMR peak assignment refers to the process of assigning a group of "spin systems" obtained experimentally to a protein sequence of amino acids. The automation of this process is still an unsolved and challenging problem in NMR protein structure determination. Recently, protein NMR peak assignment has been formulated as an interval scheduling problem (ISP), where a protein sequence P of amino acids is viewed as a discrete time interval I (the amino acids on P one-to-one correspond to the time units of I), each subset S of spin systems that are known to originate from consecutive amino acids from P is viewed as a "job" j(s), the preference of assigning S to a subsequence P of consecutive amino acids on P is viewed as the profit of executing job j(s) in the subinterval of I corresponding to P, and the goal is to maximize the total profit of executing the jobs (on a single machine) during I. The interval scheduling problem is max SNP-hard in general; but in the real practice of protein NMR peak assignment, each job j(s) usually requires at most 10 consecutive time units, and typically the jobs that require one or two consecutive time units are the most difficult to assign/schedule. In order to solve these most difficult assignments, we present an efficient 13/7-approximation algorithm for the special case of the interval scheduling problem where each job takes one or two consecutive time units. Combining this algorithm with a greedy filtering strategy for handling long jobs (i.e., jobs that need more than two consecutive time units), we obtain a new efficient heuristic for protein NMR peak assignment. Our experimental study shows that the new heuristic produces the best peak assignment in most of the cases, compared with the NMR peak assignment algorithms in the recent literature. The above algorithm is also the first approximation algorithm for a nontrivial case of the well-known interval scheduling problem that breaks the ratio 2 barrier.
Rapid NMR Assignments of Proteins by Using Optimized Combinatorial Selective Unlabeling.

PubMed

Dubey, Abhinav; Kadumuri, Rajashekar Varma; Jaipuria, Garima; Vadrevu, Ramakrishna; Atreya, Hanudatta S

2016-02-15

A new approach for rapid resonance assignments in proteins based on amino acid selective unlabeling is presented. The method involves choosing a set of multiple amino acid types for selective unlabeling and identifying specific tripeptides surrounding the labeled residues from specific 2D NMR spectra in a combinatorial manner. The methodology directly yields sequence specific assignments, without requiring a contiguously stretch of amino acid residues to be linked, and is applicable to deuterated proteins. We show that a 2D [(15) N,(1) H] HSQC spectrum with two 2D spectra can result in ∼50 % assignments. The methodology was applied to two proteins: an intrinsically disordered protein (12 kDa) and the 29 kDa (268 residue) α-subunit of Escherichia coli tryptophan synthase, which presents a challenging case with spectral overlaps and missing peaks. The method can augment existing approaches and will be useful for applications such as identifying active-site residues involved in ligand binding, phosphorylation, or protein-protein interactions, even prior to complete resonance assignments. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Automated sequence-specific protein NMR assignment using the memetic algorithm MATCH.

PubMed

Volk, Jochen; Herrmann, Torsten; Wüthrich, Kurt

2008-07-01

MATCH (Memetic Algorithm and Combinatorial Optimization Heuristics) is a new memetic algorithm for automated sequence-specific polypeptide backbone NMR assignment of proteins. MATCH employs local optimization for tracing partial sequence-specific assignments within a global, population-based search environment, where the simultaneous application of local and global optimization heuristics guarantees high efficiency and robustness. MATCH thus makes combined use of the two predominant concepts in use for automated NMR assignment of proteins. Dynamic transition and inherent mutation are new techniques that enable automatic adaptation to variable quality of the experimental input data. The concept of dynamic transition is incorporated in all major building blocks of the algorithm, where it enables switching between local and global optimization heuristics at any time during the assignment process. Inherent mutation restricts the intrinsically required randomness of the evolutionary algorithm to those regions of the conformation space that are compatible with the experimental input data. Using intact and artificially deteriorated APSY-NMR input data of proteins, MATCH performed sequence-specific resonance assignment with high efficiency and robustness.
Amino acid selective unlabeling for sequence specific resonance assignments in proteins

PubMed Central

Krishnarjuna, B.; Jaipuria, Garima; Thakur, Anushikha

2010-01-01

Sequence specific resonance assignment constitutes an important step towards high-resolution structure determination of proteins by NMR and is aided by selective identification and assignment of amino acid types. The traditional approach to selective labeling yields only the chemical shifts of the particular amino acid being selected and does not help in establishing a link between adjacent residues along the polypeptide chain, which is important for sequential assignments. An alternative approach is the method of amino acid selective ‘unlabeling’ or reverse labeling, which involves selective unlabeling of specific amino acid types against a uniformly 13C/15N labeled background. Based on this method, we present a novel approach for sequential assignments in proteins. The method involves a new NMR experiment named, {12COi–15Ni+1}-filtered HSQC, which aids in linking the 1HN/15N resonances of the selectively unlabeled residue, i, and its C-terminal neighbor, i + 1, in HN-detected double and triple resonance spectra. This leads to the assignment of a tri-peptide segment from the knowledge of the amino acid types of residues: i − 1, i and i + 1, thereby speeding up the sequential assignment process. The method has the advantage of being relatively inexpensive, applicable to 2H labeled protein and can be coupled with cell-free synthesis and/or automated assignment approaches. A detailed survey involving unlabeling of different amino acid types individually or in pairs reveals that the proposed approach is also robust to misincorporation of 14N at undesired sites. Taken together, this study represents the first application of selective unlabeling for sequence specific resonance assignments and opens up new avenues to using this methodology in protein structural studies. Electronic supplementary material The online version of this article (doi:10.1007/s10858-010-9459-z) contains supplementary material, which is available to authorized users. PMID:21153044
Organic Chemistry YouTube Writing Assignment for Large Lecture Classes

ERIC Educational Resources Information Center

Franz, Annaliese K.

2012-01-01

This work describes efforts to incorporate and evaluate the use of a YouTube writing assignment in large lecture classes to personalize learning and improve conceptual understanding of chemistry through peer- and self-explanation strategies. Although writing assignments can be a method to incorporate peer- and self-explanation strategies, this…
Facilitated assignment of large protein NMR signals with covariance sequential spectra using spectral derivatives.

PubMed

Harden, Bradley J; Nichols, Scott R; Frueh, Dominique P

2014-09-24

Nuclear magnetic resonance (NMR) studies of larger proteins are hampered by difficulties in assigning NMR resonances. Human intervention is typically required to identify NMR signals in 3D spectra, and subsequent procedures depend on the accuracy of this so-called peak picking. We present a method that provides sequential connectivities through correlation maps constructed with covariance NMR, bypassing the need for preliminary peak picking. We introduce two novel techniques to minimize false correlations and merge the information from all original 3D spectra. First, we take spectral derivatives prior to performing covariance to emphasize coincident peak maxima. Second, we multiply covariance maps calculated with different 3D spectra to destroy erroneous sequential correlations. The maps are easy to use and can readily be generated from conventional triple-resonance experiments. Advantages of the method are demonstrated on a 37 kDa nonribosomal peptide synthetase domain subject to spectral overlap.
On the problem of resonance assignments in solid state NMR of uniformly 15N, 13C-labeled proteins

NASA Astrophysics Data System (ADS)

Tycko, Robert

2015-04-01

Determination of accurate resonance assignments from multidimensional chemical shift correlation spectra is one of the major problems in biomolecular solid state NMR, particularly for relative large proteins with less-than-ideal NMR linewidths. This article investigates the difficulty of resonance assignment, using a computational Monte Carlo/simulated annealing (MCSA) algorithm to search for assignments from artificial three-dimensional spectra that are constructed from the reported isotropic 15N and 13C chemical shifts of two proteins whose structures have been determined by solution NMR methods. The results demonstrate how assignment simulations can provide new insights into factors that affect the assignment process, which can then help guide the design of experimental strategies. Specifically, simulations are performed for the catalytic domain of SrtC (147 residues, primarily β-sheet secondary structure) and the N-terminal domain of MLKL (166 residues, primarily α-helical secondary structure). Assuming unambiguous residue-type assignments and four ideal three-dimensional data sets (NCACX, NCOCX, CONCA, and CANCA), uncertainties in chemical shifts must be less than 0.4 ppm for assignments for SrtC to be unique, and less than 0.2 ppm for MLKL. Eliminating CANCA data has no significant effect, but additionally eliminating CONCA data leads to more stringent requirements for chemical shift precision. Introducing moderate ambiguities in residue-type assignments does not have a significant effect.
Stereospecific assignment of the asparagine and glutamine sidechain amide protons in proteins from chemical shift analysis.

PubMed

Harsch, Tobias; Schneider, Philipp; Kieninger, Bärbel; Donaubauer, Harald; Kalbitzer, Hans Robert

2017-02-01

Side chain amide protons of asparagine and glutamine residues in random-coil peptides are characterized by large chemical shift differences and can be stereospecifically assigned on the basis of their chemical shift values only. The bimodal chemical shift distributions stored in the biological magnetic resonance data bank (BMRB) do not allow such an assignment. However, an analysis of the BMRB shows, that a substantial part of all stored stereospecific assignments is not correct. We show here that in most cases stereospecific assignment can also be done for folded proteins using an unbiased artificial chemical shift data base (UACSB). For a separation of the chemical shifts of the two amide resonance lines with differences ≥0.40 ppm for asparagine and differences ≥0.42 ppm for glutamine, the downfield shifted resonance lines can be assigned to H δ21 and H ε21 , respectively, at a confidence level >95%. A classifier derived from UASCB can also be used to correct the BMRB data. The program tool AssignmentChecker implemented in AUREMOL calculates the Bayesian probability for a given stereospecific assignment and automatically corrects the assignments for a given list of chemical shifts.

Rapid analysis of protein backbone resonance assignments using cryogenic probes, a distributed Linux-based computing architecture, and an integrated set of spectral analysis tools.

PubMed

Monleón, Daniel; Colson, Kimberly; Moseley, Hunter N B; Anklin, Clemens; Oswald, Robert; Szyperski, Thomas; Montelione, Gaetano T

2002-01-01

Rapid data collection, spectral referencing, processing by time domain deconvolution, peak picking and editing, and assignment of NMR spectra are necessary components of any efficient integrated system for protein NMR structure analysis. We have developed a set of software tools designated AutoProc, AutoPeak, and AutoAssign, which function together with the data processing and peak-picking programs NMRPipe and Sparky, to provide an integrated software system for rapid analysis of protein backbone resonance assignments. In this paper we demonstrate that these tools, together with high-sensitivity triple resonance NMR cryoprobes for data collection and a Linux-based computer cluster architecture, can be combined to provide nearly complete backbone resonance assignments and secondary structures (based on chemical shift data) for a 59-residue protein in less than 30 hours of data collection and processing time. In this optimum case of a small protein providing excellent spectra, extensive backbone resonance assignments could also be obtained using less than 6 hours of data collection and processing time. These results demonstrate the feasibility of high throughput triple resonance NMR for determining resonance assignments and secondary structures of small proteins, and the potential for applying NMR in large scale structural proteomics projects.
Protein domain assignment from the recurrence of locally similar structures

PubMed Central

Tai, Chin-Hsien; Sam, Vichetra; Gibrat, Jean-Francois; Garnier, Jean; Munson, Peter J.

2010-01-01

Domains are basic units of protein structure and essential for exploring protein fold space and structure evolution. With the structural genomics initiative, the number of protein structures in the Protein Databank (PDB) is increasing dramatically and domain assignments need to be done automatically. Most existing structural domain assignment programs define domains using the compactness of the domains and/or the number and strength of intra-domain versus inter-domain contacts. Here we present a different approach based on the recurrence of locally similar structural pieces (LSSPs) found by one-against-all structure comparisons with a dataset of 6,373 protein chains from the PDB. Residues of the query protein are clustered using LSSPs via three different procedures to define domains. This approach gives results that are comparable to several existing programs that use geometrical and other structural information explicitly. Remarkably, most of the proteins that contribute the LSSPs defining a domain do not themselves contain the domain of interest. This study shows that domains can be defined by a collection of relatively small locally similar structural pieces containing, on average, four secondary structure elements. In addition, it indicates that domains are indeed made of recurrent small structural pieces that are used to build protein structures of many different folds as suggested by recent studies. PMID:21287617
Protein assignments without peak lists using higher-order spectra.

PubMed

Benison, Gregory; Berkholz, Donald S; Barbar, Elisar

2007-12-01

Despite advances in automating the generation and manipulation of peak lists for assigning biomolecules, there are well-known advantages to working directly with spectra: the eye is still superior to computer algorithms when it comes to picking out peak relationships from contour plots in the presence of confounding factors such as noise, overlap, and spectral artifacts. Here, we present constructs called higher-order spectra for identifying, through direct visual examination, many of the same relationships typically identified by searching peak lists, making them another addition to the set of tools (alongside peak picking and automated assignment) that can be used to solve the assignment problem. The technique is useful for searching for correlated peaks in any spectrum type. Application of this technique to novel, complete sequential assignment of two proteins (AhpFn and IC74(84-143)) is demonstrated. The program "burrow-owl" for the generation and display of higher-order spectra is available at (http://sourceforge.net/projects/burrow-owl) or from the authors.
From protein interaction profile to functional assignment: the human protein Ki-1/57 is associated with pre-mRNA splicing events.

PubMed

Bressan, Gustavo Costa; Kobarg, Jörg

2010-01-01

The mapping of protein-protein interactions of a determined organism is considered fundamental to assign protein function in the post-genomic era. As part of this effort, screenings for pairwise interactions by yeast two-hybrid system have been used popularly to reveal protein interaction networks in different biological systems. Through the identification of protein interaction partners we have successfully obtained interesting functional clues for Ki-1/57, a human protein with no previous functional annotation, in the context of RNA metabolism. We briefly discuss the way we approached protein-protein interaction data to conduct and interpret further molecular biological and cellular studies as well as structural analyses on this protein. Our data suggest that Ki-1/57 belongs to the family of intrinsically unstructured proteins and that the structural flexibility may be crucial for its capacity to interact with many different proteins. A large fraction of these proteins are involved in pre-mRNA splicing control. Finally, Ki-1/57 is localized to several subnuclear domains, all of which have been described to splicing and other RNA processing events.
Resonance assignment of the NMR spectra of disordered proteins using a multi-objective non-dominated sorting genetic algorithm.

PubMed

Yang, Yu; Fritzsching, Keith J; Hong, Mei

2013-11-01

A multi-objective genetic algorithm is introduced to predict the assignment of protein solid-state NMR (SSNMR) spectra with partial resonance overlap and missing peaks due to broad linewidths, molecular motion, and low sensitivity. This non-dominated sorting genetic algorithm II (NSGA-II) aims to identify all possible assignments that are consistent with the spectra and to compare the relative merit of these assignments. Our approach is modeled after the recently introduced Monte-Carlo simulated-annealing (MC/SA) protocol, with the key difference that NSGA-II simultaneously optimizes multiple assignment objectives instead of searching for possible assignments based on a single composite score. The multiple objectives include maximizing the number of consistently assigned peaks between multiple spectra ("good connections"), maximizing the number of used peaks, minimizing the number of inconsistently assigned peaks between spectra ("bad connections"), and minimizing the number of assigned peaks that have no matching peaks in the other spectra ("edges"). Using six SSNMR protein chemical shift datasets with varying levels of imperfection that was introduced by peak deletion, random chemical shift changes, and manual peak picking of spectra with moderately broad linewidths, we show that the NSGA-II algorithm produces a large number of valid and good assignments rapidly. For high-quality chemical shift peak lists, NSGA-II and MC/SA perform similarly well. However, when the peak lists contain many missing peaks that are uncorrelated between different spectra and have chemical shift deviations between spectra, the modified NSGA-II produces a larger number of valid solutions than MC/SA, and is more effective at distinguishing good from mediocre assignments by avoiding the hazard of suboptimal weighting factors for the various objectives. These two advantages, namely diversity and better evaluation, lead to a higher probability of predicting the correct assignment for a
Quality Control Test for Sequence-Phenotype Assignments

PubMed Central

Ortiz, Maria Teresa Lara; Rosario, Pablo Benjamín Leon; Luna-Nevarez, Pablo; Gamez, Alba Savin; Martínez-del Campo, Ana; Del Rio, Gabriel

2015-01-01

Relating a gene mutation to a phenotype is a common task in different disciplines such as protein biochemistry. In this endeavour, it is common to find false relationships arising from mutations introduced by cells that may be depurated using a phenotypic assay; yet, such phenotypic assays may introduce additional false relationships arising from experimental errors. Here we introduce the use of high-throughput DNA sequencers and statistical analysis aimed to identify incorrect DNA sequence-phenotype assignments and observed that 10–20% of these false assignments are expected in large screenings aimed to identify critical residues for protein function. We further show that this level of incorrect DNA sequence-phenotype assignments may significantly alter our understanding about the structure-function relationship of proteins. We have made available an implementation of our method at http://bis.ifc.unam.mx/en/software/chispas. PMID:25700273
Specific labeling and assignment strategies of valine methyl groups for NMR studies of high molecular weight proteins.

PubMed

Mas, Guillaume; Crublet, Elodie; Hamelin, Olivier; Gans, Pierre; Boisbouvier, Jérôme

2013-11-01

The specific protonation of valine and leucine methyl groups in proteins is typically achieved by overexpressing proteins in M9/D2O medium supplemented with either labeled α-ketoisovalerate for the labeling of the four prochiral methyl groups or with 2-acetolactate for the stereospecific labeling of the valine and leucine side chains. However, when these labeling schemes are applied to large protein assemblies, significant overlap between the correlations of the valine and leucine methyl groups occurs, hampering the analysis of 2D methyl-TROSY spectra. Analysis of the leucine and valine biosynthesis pathways revealed that the incorporation of labeled precursors in the leucine pathway can be inhibited by the addition of exogenous l-leucine-d10. We exploited this property to label stereospecifically the pro-R and pro-S methyl groups of valine with minimal scrambling to the leucine residues. This new labeling protocol was applied to the 468 kDa homododecameric peptidase TET2 to decrease the complexity of its NMR spectra. All of the pro-S valine methyl resonances of TET2 were assigned by combining mutagenesis with this innovative labeling approach. The assignments were transferred to the pro-R groups using an optimally labeled sample and a set of triple resonance experiments. This improved labeling scheme enables us to overcome the main limitation of overcrowding in the NMR spectra of prochiral methyl groups, which is a prerequisite for the site-specific measurement of the structural and dynamic parameters or for the study of interactions in very large protein assemblies.
An ensemble approach to protein fold classification by integration of template-based assignment and support vector machine classifier.

PubMed

Xia, Jiaqi; Peng, Zhenling; Qi, Dawei; Mu, Hongbo; Yang, Jianyi

2017-03-15

Protein fold classification is a critical step in protein structure prediction. There are two possible ways to classify protein folds. One is through template-based fold assignment and the other is ab-initio prediction using machine learning algorithms. Combination of both solutions to improve the prediction accuracy was never explored before. We developed two algorithms, HH-fold and SVM-fold for protein fold classification. HH-fold is a template-based fold assignment algorithm using the HHsearch program. SVM-fold is a support vector machine-based ab-initio classification algorithm, in which a comprehensive set of features are extracted from three complementary sequence profiles. These two algorithms are then combined, resulting to the ensemble approach TA-fold. We performed a comprehensive assessment for the proposed methods by comparing with ab-initio methods and template-based threading methods on six benchmark datasets. An accuracy of 0.799 was achieved by TA-fold on the DD dataset that consists of proteins from 27 folds. This represents improvement of 5.4-11.7% over ab-initio methods. After updating this dataset to include more proteins in the same folds, the accuracy increased to 0.971. In addition, TA-fold achieved >0.9 accuracy on a large dataset consisting of 6451 proteins from 184 folds. Experiments on the LE dataset show that TA-fold consistently outperforms other threading methods at the family, superfamily and fold levels. The success of TA-fold is attributed to the combination of template-based fold assignment and ab-initio classification using features from complementary sequence profiles that contain rich evolution information. http://yanglab.nankai.edu.cn/TA-fold/. yangjy@nankai.edu.cn or mhb-506@163.com. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
An ambiguity principle for assigning protein structural domains.

PubMed

Postic, Guillaume; Ghouzam, Yassine; Chebrek, Romain; Gelly, Jean-Christophe

2017-01-01

Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object-in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our "multipartitioning" approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules.
An ambiguity principle for assigning protein structural domains

PubMed Central

Postic, Guillaume; Ghouzam, Yassine; Chebrek, Romain; Gelly, Jean-Christophe

2017-01-01

Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object—in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our “multipartitioning” approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules. PMID:28097215
Reliable resonance assignments of selected residues of proteins with known structure based on empirical NMR chemical shift prediction

NASA Astrophysics Data System (ADS)

Li, Da-Wei; Meng, Dan; Brüschweiler, Rafael

2015-05-01

A robust NMR resonance assignment method is introduced for proteins whose 3D structure has previously been determined by X-ray crystallography. The goal of the method is to obtain a subset of correct assignments from a parsimonious set of 3D NMR experiments of 15N, 13C labeled proteins. Chemical shifts of sequential residue pairs are predicted from static protein structures using PPM_One, which are then compared with the corresponding experimental shifts. Globally optimized weighted matching identifies the assignments that are robust with respect to small changes in NMR cross-peak positions. The method, termed PASSPORT, is demonstrated for 4 proteins with 100-250 amino acids using 3D NHCA and a 3D CBCA(CO)NH experiments as input producing correct assignments with high reliability for 22% of the residues. The method, which works best for Gly, Ala, Ser, and Thr residues, provides assignments that serve as anchor points for additional assignments by both manual and semi-automated methods or they can be directly used for further studies, e.g. on ligand binding, protein dynamics, or post-translational modification, such as phosphorylation.
Reliable Resonance Assignments of Selected Residues of Proteins with Known Structure Based on Empirical NMR Chemical Shift Prediction

PubMed Central

Li, Da-Wei; Meng, Dan; Brüschweiler, Rafael

2015-01-01

A robust NMR resonance assignment method is introduced for proteins whose 3D structure has previously been determined by X-ray crystallography. The goal of the method is to obtain a subset of correct assignments from a parsimonious set of 3D NMR experiments of 15N, 13C labeled proteins. Chemical shifts of sequential residue pairs are predicted from static protein structures using PPM_One, which are then compared with the corresponding experimental shifts. Globally optimized weighted matching identifies the assignments that are robust with respect to small changes in NMR cross-peak positions. The method, termed PASSPORT, is demonstrated for 4 proteins with 100 – 250 amino acids using 3D NHCA and a 3D CBCA(CO)NH experiments as input producing correct assignments with high reliability for 22% of the residues. The method, which works best for Gly, Ala, Ser, and Thr residues, provides assignments that serve as anchor points for additional assignments by both manual and semi-automated methods or they can be directly used for further studies, e.g. on ligand binding, protein dynamics, or post-translational modification, such as phosphorylation. PMID:25863893
Normalized Cut Algorithm for Automated Assignment of Protein Domains

NASA Technical Reports Server (NTRS)

Samanta, M. P.; Liang, S.; Zha, H.; Biegel, Bryan A. (Technical Monitor)

2002-01-01

We present a novel computational method for automatic assignment of protein domains from structural data. At the core of our algorithm lies a recently proposed clustering technique that has been very successful for image-partitioning applications. This grap.,l-theory based clustering method uses the notion of a normalized cut to partition. an undirected graph into its strongly-connected components. Computer implementation of our method tested on the standard comparison set of proteins from the literature shows a high success rate (84%), better than most existing alternative In addition, several other features of our algorithm, such as reliance on few adjustable parameters, linear run-time with respect to the size of the protein and reduced complexity compared to other graph-theory based algorithms, would make it an attractive tool for structural biologists.
Investigations of Protein Structure and Function Using the Scientific Literature: An Assignment for an Undergraduate Cell Physiology Course

PubMed Central

Mulnix, Amy B.

2003-01-01

Undergraduate biology curricula are being modified to model and teach the activities of scientists better. The assignment described here, one that investigates protein structure and function, was designed for use in a sophomore-level cell physiology course at Earlham College. Students work in small groups to read and present in poster format on the content of a single research article reporting on the structure and/or function of a protein. Goals of the assignment include highlighting the interdependence of protein structure and function; asking students to review, integrate, and apply previously acquired knowledge; and helping students see protein structure/function in a context larger than cell physiology. The assignment also is designed to build skills in reading scientific literature, oral and written communication, and collaboration among peers. Assessment of student perceptions of the assignment in two separate offerings indicates that the project successfully achieves these goals. Data specifically show that students relied heavily on their peers to understand their article. The assignment was also shown to require students to read articles more carefully than previously. In addition, the data suggest that the assignment could be modified and used successfully in other courses and at other institutions. PMID:14673490
Investigations of protein structure and function using the scientific literature: an assignment for an undergraduate cell physiology course.

PubMed

Mulnix, Amy B

2003-01-01

Undergraduate biology curricula are being modified to model and teach the activities of scientists better. The assignment described here, one that investigates protein structure and function, was designed for use in a sophomore-level cell physiology course at Earlham College. Students work in small groups to read and present in poster format on the content of a single research article reporting on the structure and/or function of a protein. Goals of the assignment include highlighting the interdependence of protein structure and function; asking students to review, integrate, and apply previously acquired knowledge; and helping students see protein structure/function in a context larger than cell physiology. The assignment also is designed to build skills in reading scientific literature, oral and written communication, and collaboration among peers. Assessment of student perceptions of the assignment in two separate offerings indicates that the project successfully achieves these goals. Data specifically show that students relied heavily on their peers to understand their article. The assignment was also shown to require students to read articles more carefully than previously. In addition, the data suggest that the assignment could be modified and used successfully in other courses and at other institutions.
Novel 2D Triple-Resonance NMR Experiments for Sequential Resonance Assignments of Proteins

NASA Astrophysics Data System (ADS)

Ding, Keyang; Gronenborn, Angela M.

2002-06-01

We present 2D versions of the popular triple resonance HN(CO) CACB, HN(COCA)CACB, HN(CO)CAHA, and HN(COCA) CAHA experiments, commonly used for sequential resonance assignments of proteins. These experiments provide information about correlations between amino proton and nitrogen chemical shifts and the α- and β-carbon and α-proton chemical shifts within and between amino acid residues. Using these 2D spectra, sequential resonance assignments of H N, N, C α, C β, and H α nuclei are easily achieved. The resolution of these spectra is identical to the well-resolved 2D 15N- 1H HSQC and H(NCO)CA spectra, with slightly reduced sensitivity compared to their 3D and 4D versions. These types of spectra are ideally suited for exploitation in automated assignment procedures and thereby constitute a fast and efficient means for NMR structural determination of small and medium-sized proteins in solution in structural genomics programs.
Backbone dynamics of a model membrane protein: assignment of the carbonyl carbon /sup 13/C NMR resonances in detergent-solubilized M13 coat protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Henry, G.D.; Weiner, J.H.; Sykes, B.D.

The major coat protein of the filamentous bacteriophage M13 is a 50-residue amphiphilic polypeptide which is inserted, as an integral membrane-spanning protein, in the inner membrane of the Escherichia coli host during infection. /sup 13/C was incorporated biosynthetically into a total of 23 of the peptide carbonyls using labeled amino acids (alanine, glycine, lysine, phenylalanine, and proline). The structure and dynamics of carbonyl-labeled M13 coat protein were monitored by /sup 13/C nuclear magnetic resonance (NMR) spectroscopy. Assignment of many resonances was achieved by using protease digestion, pH titration, or labeling of the peptide bond with both /sup 13/C and /supmore » 15/N. The carbonyl region of the natural-abundance /sup 13/C NMR spectrum of M13 coat protein in sodium dodecyl sulfate solution shows approximately eight backbone carbonyl resonances with line widths much narrower than the rest. Three of these more mobile residues correspond to assigned peaks (glycine-3, lysine-48, and alanine-49) in the individual amino acid spectra, and another almost certainly arises from glutamic acid-2. A ninth residue, alanine-1, also gives rise to a very narrow carbonyl resonance if the pH is well above or below the pK/sub a/ of the terminal amino group. These data suggest that only about four residues at either end of the protein experience large-amplitude spatial fluctuations; the rest of the molecule is essentially rigid on the time scale of the overall rotational tumbling of the protein-detergent complex. The relative exposure of different regions of detergent-bound protein was monitored by limited digestion with proteinase K. Comparable spectra and digestion patterns were obtained when the protein was solubilized in sodium deoxycholate, suggesting that the coat protein binds both amphiphiles in a similar fashion.« less
Coming to Grips with Ambiguity: Ion Mobility-Mass Spectrometry for Protein Quaternary Structure Assignment

NASA Astrophysics Data System (ADS)

Eschweiler, Joseph D.; Frank, Aaron T.; Ruotolo, Brandon T.

2017-10-01

Multiprotein complexes are central to our understanding of cellular biology, as they play critical roles in nearly every biological process. Despite many impressive advances associated with structural characterization techniques, large and highly-dynamic protein complexes are too often refractory to analysis by conventional, high-resolution approaches. To fill this gap, ion mobility-mass spectrometry (IM-MS) methods have emerged as a promising approach for characterizing the structures of challenging assemblies due in large part to the ability of these methods to characterize the composition, connectivity, and topology of large, labile complexes. In this Critical Insight, we present a series of bioinformatics studies aimed at assessing the information content of IM-MS datasets for building models of multiprotein structure. Our computational data highlights the limits of current coarse-graining approaches, and compelled us to develop an improved workflow for multiprotein topology modeling, which we benchmark against a subset of the multiprotein complexes within the PDB. This improved workflow has allowed us to ascertain both the minimal experimental restraint sets required for generation of high-confidence multiprotein topologies, and quantify the ambiguity in models where insufficient IM-MS information is available. We conclude by projecting the future of IM-MS in the context of protein quaternary structure assignment, where we predict that a more complete knowledge of the ultimate information content and ambiguity within such models will undoubtedly lead to applications for a broader array of challenging biomolecular assemblies. [Figure not available: see fulltext.
NMR assignments of juvenile hormone binding protein in complex with JH III.

PubMed

Suzuki, Rintaro; Tase, Akira; Fujimoto, Zui; Shiotsuki, Takahiro; Yamazaki, Toshimasa

2009-06-01

A hemolymph juvenile hormone binding protein (JHBP) shuttles hydrophobic JH, a key hormone in regulation of the insect life cycle, from the site of the JH biosynthesis to the cells of target organs. We report complete NMR chemical shift assignments of Bombyx mori JHBP in the JH III-bound state.
Mass spectrometry-based protein identification with accurate statistical significance assignment.

PubMed

Alves, Gelio; Yu, Yi-Kuo

2015-03-01

Assigning statistical significance accurately has become increasingly important as metadata of many types, often assembled in hierarchies, are constructed and combined for further biological analyses. Statistical inaccuracy of metadata at any level may propagate to downstream analyses, undermining the validity of scientific conclusions thus drawn. From the perspective of mass spectrometry-based proteomics, even though accurate statistics for peptide identification can now be achieved, accurate protein level statistics remain challenging. We have constructed a protein ID method that combines peptide evidences of a candidate protein based on a rigorous formula derived earlier; in this formula the database P-value of every peptide is weighted, prior to the final combination, according to the number of proteins it maps to. We have also shown that this protein ID method provides accurate protein level E-value, eliminating the need of using empirical post-processing methods for type-I error control. Using a known protein mixture, we find that this protein ID method, when combined with the Sorić formula, yields accurate values for the proportion of false discoveries. In terms of retrieval efficacy, the results from our method are comparable with other methods tested. The source code, implemented in C++ on a linux system, is available for download at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbp/qmbp_ms/RAId/RAId_Linux_64Bit. Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.

A New Secondary Structure Assignment Algorithm Using Cα Backbone Fragments

PubMed Central

Cao, Chen; Wang, Guishen; Liu, An; Xu, Shutan; Wang, Lincong; Zou, Shuxue

2016-01-01

The assignment of secondary structure elements in proteins is a key step in the analysis of their structures and functions. We have developed an algorithm, SACF (secondary structure assignment based on Cα fragments), for secondary structure element (SSE) assignment based on the alignment of Cα backbone fragments with central poses derived by clustering known SSE fragments. The assignment algorithm consists of three steps: First, the outlier fragments on known SSEs are detected. Next, the remaining fragments are clustered to obtain the central fragments for each cluster. Finally, the central fragments are used as a template to make assignments. Following a large-scale comparison of 11 secondary structure assignment methods, SACF, KAKSI and PROSS are found to have similar agreement with DSSP, while PCASSO agrees with DSSP best. SACF and PCASSO show preference to reducing residues in N and C cap regions, whereas KAKSI, P-SEA and SEGNO tend to add residues to the terminals when DSSP assignment is taken as standard. Moreover, our algorithm is able to assign subtle helices (310-helix, π-helix and left-handed helix) and make uniform assignments, as well as to detect rare SSEs in β-sheets or long helices as outlier fragments from other programs. The structural uniformity should be useful for protein structure classification and prediction, while outlier fragments underlie the structure–function relationship. PMID:26978354
AssignFit: a program for simultaneous assignment and structure refinement from solid-state NMR spectra

PubMed Central

Tian, Ye; Schwieters, Charles D.; Opella, Stanley J.; Marassi, Francesca M.

2011-01-01

AssignFit is a computer program developed within the XPLOR-NIH package for the assignment of dipolar coupling (DC) and chemical shift anisotropy (CSA) restraints derived from the solid-state NMR spectra of protein samples with uniaxial order. The method is based on minimizing the difference between experimentally observed solid-state NMR spectra and the frequencies back calculated from a structural model. Starting with a structural model and a set of DC and CSA restraints grouped only by amino acid type, as would be obtained by selective isotopic labeling, AssignFit generates all of the possible assignment permutations and calculates the corresponding atomic coordinates oriented in the alignment frame, together with the associated set of NMR frequencies, which are then compared with the experimental data for best fit. Incorporation of AssignFit in a simulated annealing refinement cycle provides an approach for simultaneous assignment and structure refinement (SASR) of proteins from solid-state NMR orientation restraints. The methods are demonstrated with data from two integral membrane proteins, one α-helical and one β-barrel, embedded in phospholipid bilayer membranes. PMID:22036904
Application of a fast sorting algorithm to the assignment of mass spectrometric cross-linking data.

PubMed

Petrotchenko, Evgeniy V; Borchers, Christoph H

2014-09-01

Cross-linking combined with MS involves enzymatic digestion of cross-linked proteins and identifying cross-linked peptides. Assignment of cross-linked peptide masses requires a search of all possible binary combinations of peptides from the cross-linked proteins' sequences, which becomes impractical with increasing complexity of the protein system and/or if digestion enzyme specificity is relaxed. Here, we describe the application of a fast sorting algorithm to search large sequence databases for cross-linked peptide assignments based on mass. This same algorithm has been used previously for assigning disulfide-bridged peptides (Choi et al., ), but has not previously been applied to cross-linking studies. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Two-dimensional sup 1 H NMR studies on HPr protein from Staphylococcus aureus: Complete sequential assignments and secondary structure

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kalbitzer, H.R.; Neidig, K.P.; Hengstenberg, W.

1991-11-19

Complete sequence-specific assignments of the {sup 1}H NMR spectrum of HPr protein from Staphylococcus aureus were obtained by two-dimensional NMR methods. Important secondary structure elements that can be derived from the observed nuclear Overhauser effects are a large antiparallel {beta}-pleated sheet consisting of four strands, A, B, C, D, a segment S{sub AB} consisting of an extended region around the active-center histidine (His-15) and an {alpha}-helix, a half-turn between strands B and C, a segment S{sub CD} which shows no typical secondary structure, and the {alpha}-helical, C-terminal segment S{sub term}. These general structural features are similar to those found earliermore » in HPr proteins from different microorganisms such as Escherichia coli, Bacillus subtilis, and Streptococcus faecalis.« less
Genetic assignment of large seizures of elephant ivory reveals Africa’s major poaching hotspots

PubMed Central

Wasser, S. K.; Brown, L.; Mailand, C.; Mondol, S.; Clark, W.; Laurie, C.; Weir, B. S.

2017-01-01

Poaching of elephants is now occurring at rates that threaten African populations with extinction. Identifying the number and location of Africa’s major poaching hotspots may assist efforts to end poaching and facilitate recovery of elephant populations. We genetically assign origin to 28 large ivory seizures (≥0.5 metric tons) made between 1996 and 2014, also testing assignment accuracy. Results suggest that the major poaching hotspots in Africa may be currently concentrated in as few as two areas. Increasing law enforcement in these two hotspots could help curtail future elephant losses across Africa and disrupt this organized transnational crime. PMID:26089357
Improved Peak Detection and Deconvolution of Native Electrospray Mass Spectra from Large Protein Complexes.

PubMed

Lu, Jonathan; Trnka, Michael J; Roh, Soung-Hun; Robinson, Philip J J; Shiau, Carrie; Fujimori, Danica Galonic; Chiu, Wah; Burlingame, Alma L; Guan, Shenheng

2015-12-01

Native electrospray-ionization mass spectrometry (native MS) measures biomolecules under conditions that preserve most aspects of protein tertiary and quaternary structure, enabling direct characterization of large intact protein assemblies. However, native spectra derived from these assemblies are often partially obscured by low signal-to-noise as well as broad peak shapes because of residual solvation and adduction after the electrospray process. The wide peak widths together with the fact that sequential charge state series from highly charged ions are closely spaced means that native spectra containing multiple species often suffer from high degrees of peak overlap or else contain highly interleaved charge envelopes. This situation presents a challenge for peak detection, correct charge state and charge envelope assignment, and ultimately extraction of the relevant underlying mass values of the noncovalent assemblages being investigated. In this report, we describe a comprehensive algorithm developed for addressing peak detection, peak overlap, and charge state assignment in native mass spectra, called PeakSeeker. Overlapped peaks are detected by examination of the second derivative of the raw mass spectrum. Charge state distributions of the molecular species are determined by fitting linear combinations of charge envelopes to the overall experimental mass spectrum. This software is capable of deconvoluting heterogeneous, complex, and noisy native mass spectra of large protein assemblies as demonstrated by analysis of (1) synthetic mononucleosomes containing severely overlapping peaks, (2) an RNA polymerase II/α-amanitin complex with many closely interleaved ion signals, and (3) human TriC complex containing high levels of background noise. Graphical Abstract ᅟ.
Selective excitation for spectral editing and assignment in separated local field experiments of oriented membrane proteins

NASA Astrophysics Data System (ADS)

Koroloff, Sophie N.; Nevzorov, Alexander A.

2017-01-01

Spectroscopic assignment of NMR spectra for oriented uniformly labeled membrane proteins embedded in their native-like bilayer environment is essential for their structure determination. However, sequence-specific assignment in oriented-sample (OS) NMR is often complicated by insufficient resolution and spectral crowding. Therefore, the assignment process is usually done by a laborious and expensive "shotgun" method involving multiple selective labeling of amino acid residues. Presented here is a strategy to overcome poor spectral resolution in crowded regions of 2D spectra by selecting resolved "seed" residues via soft Gaussian pulses inserted into spin-exchange separated local-field experiments. The Gaussian pulse places the selected polarization along the z-axis while dephasing the other signals before the evolution of the 1H-15N dipolar couplings. The transfer of magnetization is accomplished via mismatched Hartmann-Hahn conditions to the nearest-neighbor peaks via the proton bath. By optimizing the length and amplitude of the Gaussian pulse, one can also achieve a phase inversion of the closest peaks, thus providing an additional phase contrast. From the superposition of the selective spin-exchanged SAMPI4 onto the fully excited SAMPI4 spectrum, the 15N sites that are directly adjacent to the selectively excited residues can be easily identified, thereby providing a straightforward method for initiating the assignment process in oriented membrane proteins.
Backbone resonance assignments for G protein α(i3) subunit in the GDP-bound state.

PubMed

Mase, Yoko; Yokogawa, Mariko; Osawa, Masanori; Shimada, Ichio

2014-10-01

Guanine-nucleotide binding proteins (G proteins) serve as molecular switches in signaling pathways, by coupling the activation of G protein-coupled receptors (GPCRs) at the cell surface to intracellular responses. In the resting state, G protein forms a heterotrimer, consisting of the G protein α subunit with GDP (Gα·GDP) and the G protein βγ subunit (Gβγ). Ligand binding to GPCRs promotes the GDP-GTP exchange on Gα, leading to the dissociation of the GTP-bound form of Gα (Gα·GTP) and Gβγ. Then, Gα·GTP and Gβγ bind to their downstream effector enzymes or ion channels and regulate their activities, leading to a variety of cellular responses. Finally, Gα hydrolyzes the bound GTP to GDP and returns to the resting state by re-associating with Gβγ. The G proteins are classified with four major families based on the amino acid sequences of Gα: i/o, s, q/11, and 12/13. Here, we established the backbone resonance assignments of human Gαi3, a member of the i/o family with a molecular weight of 41 K, in complex with GDP. The chemical shifts were compared with those of Gα(i3) in complex with a GTP-analogue, GTPγS, which we recently reported, indicating that the residues with significant chemical shift differences are mostly consistent with the regions with the structural differences between the GDP- and GTPγS-bound states, as indicated in the crystal structures. The assignments of Gα(i3)·GDP would be useful for the analyses of the dynamics of Gα(i3) and its interactions with various target molecules.
CONSERVATION. Genetic assignment of large seizures of elephant ivory reveals Africa's major poaching hotspots.

PubMed

Wasser, S K; Brown, L; Mailand, C; Mondol, S; Clark, W; Laurie, C; Weir, B S

2015-07-03

Poaching of elephants is now occurring at rates that threaten African populations with extinction. Identifying the number and location of Africa's major poaching hotspots may assist efforts to end poaching and facilitate recovery of elephant populations. We genetically assign origin to 28 large ivory seizures (≥0.5 metric tons) made between 1996 and 2014, also testing assignment accuracy. Results suggest that the major poaching hotspots in Africa may be currently concentrated in as few as two areas. Increasing law enforcement in these two hotspots could help curtail future elephant losses across Africa and disrupt this organized transnational crime. Copyright © 2015, American Association for the Advancement of Science.
CONNJUR R: An annotation strategy for fostering reproducibility in bio-NMR: protein spectral assignment

PubMed Central

Fenwick, Matthew; Hoch, Jeffrey C.; Ulrich, Eldon; Gryk, Michael R.

2015-01-01

Reproducibility is a cornerstone of the scientific method, essential for validation of results by independent laboratories and the sine qua non of scientific progress. A key step toward reproducibility of biomolecular NMR studies was the establishment of public data repositories (PDB and BMRB). Nevertheless, bio-NMR studies routinely fall short of the requirement for reproducibility that all the data needed to reproduce the results are published. A key limitation is that considerable metadata goes unpublished, notably manual interventions that are typically applied during the assignment of multidimensional NMR spectra. A general solution to this problem has been elusive, in part because of the wide range of approaches and software packages employed in the analysis of protein NMR spectra. Here we describe an approach for capturing missing metadata during the assignment of protein NMR spectra that can be generalized to arbitrary workflows, different software packages, other biomolecules, or other stages of data analysis in bio-NMR. We also present extensions to the NMR-STAR data dictionary that enable machine archival and retrieval of the “missing” metadata. PMID:26253947
iHADAMAC: A complementary tool for sequential resonance assignment of globular and highly disordered proteins

NASA Astrophysics Data System (ADS)

Feuerstein, Sophie; Plevin, Michael J.; Willbold, Dieter; Brutscher, Bernhard

2012-01-01

An experiment, iHADAMAC, is presented that yields information on the amino-acid type of individual residues in a protein by editing the 1H- 15N correlations into seven different 2D spectra, each corresponding to a different class of amino-acid types. Amino-acid type discrimination is realized via a Hadamard encoding scheme based on four different spin manipulations as recently introduced in the context of the sequential HADAMAC experiment. Both sequential and intra-residue HADAMAC experiments yield highly complementary information that greatly facilitate resonance assignment of proteins with high frequency degeneracy, as demonstrated here for a 188-residue intrinsically disordered protein fragment of the hepatitis C virus protein NS5A.
Resonance assignment of disordered protein with repetitive and overlapping sequence using combinatorial approach reveals initial structural propensities and local restrictions in the denatured state.

PubMed

Malik, Nikita; Kumar, Ashutosh

2016-09-01

NMR resonance assignment of intrinsically disordered proteins poses a challenge because of the limited dispersion of amide proton chemical shifts. This becomes even more complex with the increase in the size of the system. Residue specific selective labeling/unlabeling experiments have been used to resolve the overlap, but require multiple sample preparations. Here, we demonstrate an assignment strategy requiring only a single sample of uniformly labeled (13)C,(15)N-protein. We have used a combinatorial approach, involving 3D-HNN, CC(CO)NH and 2D-MUSIC, which allowed us to assign a denatured centromeric protein Cse4 of 229 residues. Further, we show that even the less sensitive experiments, when used in an efficient manner can lead to the complete assignment of a complex system without the use of specialized probes in a relatively short time frame. The assignment of the amino acids discloses the presence of local structural propensities even in the denatured state accompanied by restricted motion in certain regions that provides insights into the early folding events of the protein.
Solid-state NMR spectroscopy of 18.5 kDa myelin basic protein reconstituted with lipid vesicles: spectroscopic characterisation and spectral assignments of solvent-exposed protein fragments.

PubMed

Zhong, Ligang; Bamm, Vladimir V; Ahmed, Mumdooh A M; Harauz, George; Ladizhansky, Vladimir

2007-12-01

Myelin basic protein (MBP, 18.5 kDa isoform) is a peripheral membrane protein that is essential for maintaining the structural integrity of the multilamellar myelin sheath of the central nervous system. Reconstitution of the most abundant 18.5 kDa MBP isoform with lipid vesicles yields an aggregated assembly mimicking the protein's natural environment, but which is not amenable to standard solution NMR spectroscopy. On the other hand, the mobility of MBP in such a system is variable, depends on the local strength of the protein-lipid interaction, and in general is of such a time scale that the dipolar interactions are averaged out. Here, we used a combination of solution and solid-state NMR (ssNMR) approaches: J-coupling-driven polarization transfers were combined with magic angle spinning and high-power decoupling to yield high-resolution spectra of the mobile fragments of 18.5 kDa murine MBP in membrane-associated form. To partially circumvent the problem of short transverse relaxation, we implemented three-dimensional constant-time correlation experiments (NCOCX, NCACX, CONCACX, and CAN(CO)CX) that were able to provide interresidue and intraresidue backbone correlations. These experiments resulted in partial spectral assignments for mobile fragments of the protein. Additional nuclear Overhauser effect spectroscopy (NOESY)-based experiments revealed that the mobile fragments were exposed to solvent and were likely located outside the lipid bilayer, or in its hydrophilic portion. Chemical shift index analysis showed that the fragments were largely disordered under these conditions. These combined approaches are applicable to ssNMR investigations of other peripheral membrane proteins reconstituted with lipids.
Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment

PubMed Central

Xu, Dong; Zhang, Yang

2013-01-01

Genome-wide protein structure prediction and structure-based function annotation have been a long-term goal in molecular biology but not yet become possible due to difficulties in modeling distant-homology targets. We developed a hybrid pipeline combining ab initio folding and template-based modeling for genome-wide structure prediction applied to the Escherichia coli genome. The pipeline was tested on 43 known sequences, where QUARK-based ab initio folding simulation generated models with TM-score 17% higher than that by traditional comparative modeling methods. For 495 unknown hard sequences, 72 are predicted to have a correct fold (TM-score > 0.5) and 321 have a substantial portion of structure correctly modeled (TM-score > 0.35). 317 sequences can be reliably assigned to a SCOP fold family based on structural analogy to existing proteins in PDB. The presented results, as a case study of E. coli, represent promising progress towards genome-wide structure modeling and fold family assignment using state-of-the-art ab initio folding algorithms. PMID:23719418
43 CFR 2521.3 - Assignment.

Code of Federal Regulations, 2011 CFR

2011-10-01

... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused to...
43 CFR 2521.3 - Assignment.

Code of Federal Regulations, 2013 CFR

2013-10-01

... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused to...
43 CFR 2521.3 - Assignment.

Code of Federal Regulations, 2014 CFR

2014-10-01

... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused to...
43 CFR 2521.3 - Assignment.

Code of Federal Regulations, 2012 CFR

2012-10-01

... OF THE INTERIOR LAND RESOURCE MANAGEMENT (2000) DESERT-LAND ENTRIES Procedures § 2521.3 Assignment...), assignments of desert-land entries were recognized, the Department of the Interior, largely for administrative reasons, held that a desert-land entry might be assigned as a whole or in its entirety, but refused to...
Reduced dimensionality tailored HN(C)N experiments for facile backbone resonance assignment of proteins through unambiguous identification of sequential HSQC peaks

NASA Astrophysics Data System (ADS)

Kumar, Dinesh

2013-12-01

Two novel reduced dimensionality (RD) tailored HN(C)N [S.C. Panchal, N.S. Bhavesh, R.V. Hosur, Improved 3D triple resonance experiments, HNN and HN(C)N, for HN and 15N sequential correlations in (13C, 15N) labeled proteins: application to unfolded proteins, J. Biomol. NMR 20 (2001) 135-147] experiments are proposed to facilitate the backbone resonance assignment of proteins both in terms of its accuracy and speed. These experiments - referred here as (4,3)D-hNCOcaNH and (4,3)D-hNcoCANH - exploit the linear combination of backbone 15N and 13C‧/13Cα chemical shifts simultaneously to achieve higher peak dispersion and randomness along their respective F1 dimensions. Simply, this has been achieved by modulating the backbone 15N(i) chemical shifts with that of 13C‧ (i - 1)/13Cα (i - 1) spins following the established reduced dimensionality NMR approach [T. Szyperski, D.C. Yeh, D.K. Sukumaran, H.N. Moseley, G.T. Montelione, Reduced-dimensionality NMR spectroscopy for high-throughput protein resonance assignment, Proc. Natl. Acad. Sci. USA 99 (2002) 8009-8014]. Though the modification is simple it has resulted an ingenious improvement of HN(C)N both in terms of peak dispersion and easiness of establishing the sequential connectivities. The increased dispersion along F1 dimension solves two purposes here: (i) resolves the ambiguities arising because of degenerate 15N chemical shifts and (ii) reduces the signal overlap in F2(15N)-F3(1H) planes (an important requisite in HN(C)N based assignment protocol for facile and unambiguous identification of sequentially connected HSQC peaks). The performance of both these experiments and the assignment protocol has been demonstrated using bovine apo Calbindin-d9k (75 aa) and urea denatured UNC60B (a 152 amino acid ADF/cofilin family protein of Caenorhabditis elegans), as representatives of folded and unfolded protein systems, respectively.
A new algorithm for reliable and general NMR resonance assignment.

PubMed

Schmidt, Elena; Güntert, Peter

2012-08-01

The new FLYA automated resonance assignment algorithm determines NMR chemical shift assignments on the basis of peak lists from any combination of multidimensional through-bond or through-space NMR experiments for proteins. Backbone and side-chain assignments can be determined. All experimental data are used simultaneously, thereby exploiting optimally the redundancy present in the input peak lists and circumventing potential pitfalls of assignment strategies in which results obtained in a given step remain fixed input data for subsequent steps. Instead of prescribing a specific assignment strategy, the FLYA resonance assignment algorithm requires only experimental peak lists and the primary structure of the protein, from which the peaks expected in a given spectrum can be generated by applying a set of rules, defined in a straightforward way by specifying through-bond or through-space magnetization transfer pathways. The algorithm determines the resonance assignment by finding an optimal mapping between the set of expected peaks that are assigned by definition but have unknown positions and the set of measured peaks in the input peak lists that are initially unassigned but have a known position in the spectrum. Using peak lists obtained by purely automated peak picking from the experimental spectra of three proteins, FLYA assigned correctly 96-99% of the backbone and 90-91% of all resonances that could be assigned manually. Systematic studies quantified the impact of various factors on the assignment accuracy, namely the extent of missing real peaks and the amount of additional artifact peaks in the input peak lists, as well as the accuracy of the peak positions. Comparing the resonance assignments from FLYA with those obtained from two other existing algorithms showed that using identical experimental input data these other algorithms yielded significantly (40-142%) more erroneous assignments than FLYA. The FLYA resonance assignment algorithm thus has the

Protein complex prediction in large ontology attributed protein-protein interaction networks.

PubMed

Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Li, Yanpeng; Xu, Bo

2013-01-01

Protein complexes are important for unraveling the secrets of cellular organization and function. Many computational approaches have been developed to predict protein complexes in protein-protein interaction (PPI) networks. However, most existing approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology (GO) annotation information. In this paper, we constructed ontology attributed PPI networks with PPI data and GO resource. After constructing ontology attributed networks, we proposed a novel approach called CSO (clustering based on network structure and ontology attribute similarity). Structural information and GO attribute information are complementary in ontology attributed networks. CSO can effectively take advantage of the correlation between frequent GO annotation sets and the dense subgraph for protein complex prediction. Our proposed CSO approach was applied to four different yeast PPI data sets and predicted many well-known protein complexes. The experimental results showed that CSO was valuable in predicting protein complexes and achieved state-of-the-art performance.
Classifying proteins into functional groups based on all-versus-all BLAST of 10 million proteins.

PubMed

Kolker, Natali; Higdon, Roger; Broomall, William; Stanberry, Larissa; Welch, Dean; Lu, Wei; Haynes, Winston; Barga, Roger; Kolker, Eugene

2011-01-01

To address the monumental challenge of assigning function to millions of sequenced proteins, we completed the first of a kind all-versus-all sequence alignments using BLAST for 9.9 million proteins in the UniRef100 database. Microsoft Windows Azure produced over 3 billion filtered records in 6 days using 475 eight-core virtual machines. Protein classification into functional groups was then performed using Hive and custom jars implemented on top of Apache Hadoop utilizing the MapReduce paradigm. First, using the Clusters of Orthologous Genes (COG) database, a length normalized bit score (LNBS) was determined to be the best similarity measure for classification of proteins. LNBS achieved sensitivity and specificity of 98% each. Second, out of 5.1 million bacterial proteins, about two-thirds were assigned to significantly extended COG groups, encompassing 30 times more assigned proteins. Third, the remaining proteins were classified into protein functional groups using an innovative implementation of a single-linkage algorithm on an in-house Hadoop compute cluster. This implementation significantly reduces the run time for nonindexed queries and optimizes efficient clustering on a large scale. The performance was also verified on Amazon Elastic MapReduce. This clustering assigned nearly 2 million proteins to approximately half a million different functional groups. A similar approach was applied to classify 2.8 million eukaryotic sequences resulting in over 1 million proteins being assign to existing KOG groups and the remainder clustered into 100,000 functional groups.
Optimizing a realistic large-scale frequency assignment problem using a new parallel evolutionary approach

NASA Astrophysics Data System (ADS)

Chaves-González, José M.; Vega-Rodríguez, Miguel A.; Gómez-Pulido, Juan A.; Sánchez-Pérez, Juan M.

2011-08-01

This article analyses the use of a novel parallel evolutionary strategy to solve complex optimization problems. The work developed here has been focused on a relevant real-world problem from the telecommunication domain to verify the effectiveness of the approach. The problem, known as frequency assignment problem (FAP), basically consists of assigning a very small number of frequencies to a very large set of transceivers used in a cellular phone network. Real data FAP instances are very difficult to solve due to the NP-hard nature of the problem, therefore using an efficient parallel approach which makes the most of different evolutionary strategies can be considered as a good way to obtain high-quality solutions in short periods of time. Specifically, a parallel hyper-heuristic based on several meta-heuristics has been developed. After a complete experimental evaluation, results prove that the proposed approach obtains very high-quality solutions for the FAP and beats any other result published.
Automated Assignment of MS/MS Cleavable Cross-Links in Protein 3D-Structure Analysis

NASA Astrophysics Data System (ADS)

Götze, Michael; Pettelkau, Jens; Fritzsche, Romy; Ihling, Christian H.; Schäfer, Mathias; Sinz, Andrea

2015-01-01

CID-MS/MS cleavable cross-linkers hold an enormous potential for an automated analysis of cross-linked products, which is essential for conducting structural proteomics studies. The created characteristic fragment ion patterns can easily be used for an automated assignment and discrimination of cross-linked products. To date, there are only a few software solutions available that make use of these properties, but none allows for an automated analysis of cleavable cross-linked products. The MeroX software fills this gap and presents a powerful tool for protein 3D-structure analysis in combination with MS/MS cleavable cross-linkers. We show that MeroX allows an automatic screening of characteristic fragment ions, considering static and variable peptide modifications, and effectively scores different types of cross-links. No manual input is required for a correct assignment of cross-links and false discovery rates are calculated. The self-explanatory graphical user interface of MeroX provides easy access for an automated cross-link search platform that is compatible with commonly used data file formats, enabling analysis of data originating from different instruments. The combination of an MS/MS cleavable cross-linker with a dedicated software tool for data analysis provides an automated workflow for 3D-structure analysis of proteins. MeroX is available at www.StavroX.com .
Optimal processor assignment for pipeline computations

NASA Technical Reports Server (NTRS)

Nicol, David M.; Simha, Rahul; Choudhury, Alok N.; Narahari, Bhagirath

1991-01-01

The availability of large scale multitasked parallel architectures introduces the following processor assignment problem for pipelined computations. Given a set of tasks and their precedence constraints, along with their experimentally determined individual responses times for different processor sizes, find an assignment of processor to tasks. Two objectives are of interest: minimal response given a throughput requirement, and maximal throughput given a response time requirement. These assignment problems differ considerably from the classical mapping problem in which several tasks share a processor; instead, it is assumed that a large number of processors are to be assigned to a relatively small number of tasks. Efficient assignment algorithms were developed for different classes of task structures. For a p processor system and a series parallel precedence graph with n constituent tasks, an O(np2) algorithm is provided that finds the optimal assignment for the response time optimization problem; it was found that the assignment optimizing the constrained throughput in O(np2log p) time. Special cases of linear, independent, and tree graphs are also considered.
Predicting protein functions from redundancies in large-scale protein interaction networks

NASA Technical Reports Server (NTRS)

Samanta, Manoj Pratim; Liang, Shoudan

2003-01-01

Interpreting data from large-scale protein interaction experiments has been a challenging task because of the widespread presence of random false positives. Here, we present a network-based statistical algorithm that overcomes this difficulty and allows us to derive functions of unannotated proteins from large-scale interaction data. Our algorithm uses the insight that if two proteins share significantly larger number of common interaction partners than random, they have close functional associations. Analysis of publicly available data from Saccharomyces cerevisiae reveals >2,800 reliable functional associations, 29% of which involve at least one unannotated protein. By further analyzing these associations, we derive tentative functions for 81 unannotated proteins with high certainty. Our method is not overly sensitive to the false positives present in the data. Even after adding 50% randomly generated interactions to the measured data set, we are able to recover almost all (approximately 89%) of the original associations.
Multidimensional oriented solid-state NMR experiments enable the sequential assignment of uniformly 15N labeled integral membrane proteins in magnetically aligned lipid bilayers.

PubMed

Mote, Kaustubh R; Gopinath, T; Traaseth, Nathaniel J; Kitchen, Jason; Gor'kov, Peter L; Brey, William W; Veglia, Gianluigi

2011-11-01

Oriented solid-state NMR is the most direct methodology to obtain the orientation of membrane proteins with respect to the lipid bilayer. The method consists of measuring (1)H-(15)N dipolar couplings (DC) and (15)N anisotropic chemical shifts (CSA) for membrane proteins that are uniformly aligned with respect to the membrane bilayer. A significant advantage of this approach is that tilt and azimuthal (rotational) angles of the protein domains can be directly derived from analytical expression of DC and CSA values, or, alternatively, obtained by refining protein structures using these values as harmonic restraints in simulated annealing calculations. The Achilles' heel of this approach is the lack of suitable experiments for sequential assignment of the amide resonances. In this Article, we present a new pulse sequence that integrates proton driven spin diffusion (PDSD) with sensitivity-enhanced PISEMA in a 3D experiment ([(1)H,(15)N]-SE-PISEMA-PDSD). The incorporation of 2D (15)N/(15)N spin diffusion experiments into this new 3D experiment leads to the complete and unambiguous assignment of the (15)N resonances. The feasibility of this approach is demonstrated for the membrane protein sarcolipin reconstituted in magnetically aligned lipid bicelles. Taken with low electric field probe technology, this approach will propel the determination of sequential assignment as well as structure and topology of larger integral membrane proteins in aligned lipid bilayers. © Springer Science+Business Media B.V. 2011
Contact replacement for NMR resonance assignment.

PubMed

Xiong, Fei; Pandurangan, Gopal; Bailey-Kellogg, Chris

2008-07-01

Complementing its traditional role in structural studies of proteins, nuclear magnetic resonance (NMR) spectroscopy is playing an increasingly important role in functional studies. NMR dynamics experiments characterize motions involved in target recognition, ligand binding, etc., while NMR chemical shift perturbation experiments identify and localize protein-protein and protein-ligand interactions. The key bottleneck in these studies is to determine the backbone resonance assignment, which allows spectral peaks to be mapped to specific atoms. This article develops a novel approach to address that bottleneck, exploiting an available X-ray structure or homology model to assign the entire backbone from a set of relatively fast and cheap NMR experiments. We formulate contact replacement for resonance assignment as the problem of computing correspondences between a contact graph representing the structure and an NMR graph representing the data; the NMR graph is a significantly corrupted, ambiguous version of the contact graph. We first show that by combining connectivity and amino acid type information, and exploiting the random structure of the noise, one can provably determine unique correspondences in polynomial time with high probability, even in the presence of significant noise (a constant number of noisy edges per vertex). We then detail an efficient randomized algorithm and show that, over a variety of experimental and synthetic datasets, it is robust to typical levels of structural variation (1-2 AA), noise (250-600%) and missings (10-40%). Our algorithm achieves very good overall assignment accuracy, above 80% in alpha-helices, 70% in beta-sheets and 60% in loop regions. Our contact replacement algorithm is implemented in platform-independent Python code. The software can be freely obtained for academic use by request from the authors.
Functional assignment to JEV proteins using SVM.

PubMed

Sahoo, Ganesh Chandra; Dikhit, Manas Ranjan; Das, Pradeep

2008-01-01

Identification of different protein functions facilitates a mechanistic understanding of Japanese encephalitis virus (JEV) infection and opens novel means for drug development. Support vector machines (SVM), useful for predicting the functional class of distantly related proteins, is employed to ascribe a possible functional class to Japanese encephalitis virus protein. Our study from SVMProt and available JE virus sequences suggests that structural and nonstructural proteins of JEV genome possibly belong to diverse protein functions, are expected to occur in the life cycle of JE virus. Protein functions common to both structural and non-structural proteins are iron-binding, metal-binding, lipid-binding, copper-binding, transmembrane, outer membrane, channels/Pores - Pore-forming toxins (proteins and peptides) group of proteins. Non-structural proteins perform functions like actin binding, zinc-binding, calcium-binding, hydrolases, Carbon-Oxygen Lyases, P-type ATPase, proteins belonging to major facilitator family (MFS), secreting main terminal branch (MTB) family, phosphotransfer-driven group translocators and ATP-binding cassette (ABC) family group of proteins. Whereas structural proteins besides belonging to same structural group of proteins (capsid, structural, envelope), they also perform functions like nuclear receptor, antibiotic resistance, RNA-binding, DNA-binding, magnesium-binding, isomerase (intra-molecular), oxidoreductase and participate in type II (general) secretory pathway (IISP).
Functional assignment to JEV proteins using SVM

PubMed Central

Sahoo, Ganesh Chandra; Dikhit, Manas Ranjan; Das, Pradeep

2008-01-01

Identification of different protein functions facilitates a mechanistic understanding of Japanese encephalitis virus (JEV) infection and opens novel means for drug development. Support vector machines (SVM), useful for predicting the functional class of distantly related proteins, is employed to ascribe a possible functional class to Japanese encephalitis virus protein. Our study from SVMProt and available JE virus sequences suggests that structural and nonstructural proteins of JEV genome possibly belong to diverse protein functions, are expected to occur in the life cycle of JE virus. Protein functions common to both structural and non-structural proteins are iron-binding, metal-binding, lipid-binding, copper-binding, transmembrane, outer membrane, channels/Pores - Pore-forming toxins (proteins and peptides) group of proteins. Non-structural proteins perform functions like actin binding, zinc-binding, calcium-binding, hydrolases, Carbon-Oxygen Lyases, P-type ATPase, proteins belonging to major facilitator family (MFS), secreting main terminal branch (MTB) family, phosphotransfer-driven group translocators and ATP-binding cassette (ABC) family group of proteins. Whereas structural proteins besides belonging to same structural group of proteins (capsid, structural, envelope), they also perform functions like nuclear receptor, antibiotic resistance, RNA-binding, DNA-binding, magnesium-binding, isomerase (intra-molecular), oxidoreductase and participate in type II (general) secretory pathway (IISP). PMID:19052658
CoMoDo: identifying dynamic protein domains based on covariances of motion.

PubMed

Wieninger, Silke A; Ullmann, G Matthias

2015-06-09

Most large proteins are built of several domains, compact units which enable functional protein motions. Different domain assignment approaches exist, which mostly rely on concepts of stability, folding, and evolution. We describe the automatic assignment method CoMoDo, which identifies domains based on protein dynamics. Covariances of atomic fluctuations, here calculated by an Elastic Network Model, are used to group residues into domains of different hierarchical levels. The so-called dynamic domains facilitate the study of functional protein motions involved in biological processes like ligand binding and signal transduction. By applying CoMoDo to a large number of proteins, we demonstrate that dynamic domains exhibit features absent in the commonly assigned structural domains, which can deliver insight into the interactions between domains and between subunits of multimeric proteins. CoMoDo is distributed as free open source software at www.bisb.uni-bayreuth.de/CoMoDo.html .
Protein complex prediction for large protein protein interaction networks with the Core&Peel method.

PubMed

Pellegrini, Marco; Baglioni, Miriam; Geraci, Filippo

2016-11-08

Biological networks play an increasingly important role in the exploration of functional modularity and cellular organization at a systemic level. Quite often the first tools used to analyze these networks are clustering algorithms. We concentrate here on the specific task of predicting protein complexes (PC) in large protein-protein interaction networks (PPIN). Currently, many state-of-the-art algorithms work well for networks of small or moderate size. However, their performance on much larger networks, which are becoming increasingly common in modern proteome-wise studies, needs to be re-assessed. We present a new fast algorithm for clustering large sparse networks: Core&Peel, which runs essentially in time and storage O(a(G)m+n) for a network G of n nodes and m arcs, where a(G) is the arboricity of G (which is roughly proportional to the maximum average degree of any induced subgraph in G). We evaluated Core&Peel on five PPI networks of large size and one of medium size from both yeast and homo sapiens, comparing its performance against those of ten state-of-the-art methods. We demonstrate that Core&Peel consistently outperforms the ten competitors in its ability to identify known protein complexes and in the functional coherence of its predictions. Our method is remarkably robust, being quite insensible to the injection of random interactions. Core&Peel is also empirically efficient attaining the second best running time over large networks among the tested algorithms. Our algorithm Core&Peel pushes forward the state-of the-art in PPIN clustering providing an algorithmic solution with polynomial running time that attains experimentally demonstrable good output quality and speed on challenging large real networks.
1H, 15N, and 13C resonance assignments of the third domain from the S. aureus innate immune evasion protein Eap.

PubMed

Herrera, Alvaro I; Ploscariu, Nicoleta T; Geisbrecht, Brian V; Prakash, Om

2018-04-01

Staphylococcus aureus is a widespread and persistent pathogen of humans and livestock. The bacterium expresses a wide variety of virulence proteins, many of which serve to disrupt the host's innate immune system from recognizing and clearing bacteria with optimal efficiency. The extracellular adherence protein (Eap) is a multidomain protein that participates in various protein-protein interactions that inhibit the innate immune response, including both the complement system (Woehl et al in J Immunol 193:6161-6171, 2014) and Neutrophil Serine Proteases (NSPs) (Stapels et al in Proc Natl Acad Sci USA 111:13187-13192, 2014). The third domain of Eap, Eap3, is an ~ 11 kDa protein that was recently shown to bind complement component C4b (Woehl et al in Protein Sci 26:1595-1608, 2017) and therefore play an essential role in inhibiting the classical and lectin pathways of complement (Woehl et al in J Immunol 193:6161-6171, 2014). Since structural characterization of Eap3 is still incomplete, we acquired a series of 2D and 3D NMR spectra of Eap3 in solution. Here we report the backbone and side-chain 1 H, 15 N, and 13 C resonance assignments of Eap3 and its predicted secondary structure via the TALOS-N server. The assignment data have been deposited in the BMRB data bank under accession number 27087.
Automation of NMR structure determination of proteins.

PubMed

Altieri, Amanda S; Byrd, R Andrew

2004-10-01

The automation of protein structure determination using NMR is coming of age. The tedious processes of resonance assignment, followed by assignment of NOE (nuclear Overhauser enhancement) interactions (now intertwined with structure calculation), assembly of input files for structure calculation, intermediate analyses of incorrect assignments and bad input data, and finally structure validation are all being automated with sophisticated software tools. The robustness of the different approaches continues to deal with problems of completeness and uniqueness; nevertheless, the future is very bright for automation of NMR structure generation to approach the levels found in X-ray crystallography. Currently, near completely automated structure determination is possible for small proteins, and the prospect for medium-sized and large proteins is good. Copyright 2004 Elsevier Ltd.
NVR-BIP: Nuclear Vector Replacement using Binary Integer Programming for NMR Structure-Based Assignments.

PubMed

Apaydin, Mehmet Serkan; Çatay, Bülent; Patrick, Nicholas; Donald, Bruce R

2011-05-01

Nuclear magnetic resonance (NMR) spectroscopy is an important experimental technique that allows one to study protein structure and dynamics in solution. An important bottleneck in NMR protein structure determination is the assignment of NMR peaks to the corresponding nuclei. Structure-based assignment (SBA) aims to solve this problem with the help of a template protein which is homologous to the target and has applications in the study of structure-activity relationship, protein-protein and protein-ligand interactions. We formulate SBA as a linear assignment problem with additional nuclear overhauser effect constraints, which can be solved within nuclear vector replacement's (NVR) framework (Langmead, C., Yan, A., Lilien, R., Wang, L. and Donald, B. (2003) A Polynomial-Time Nuclear Vector Replacement Algorithm for Automated NMR Resonance Assignments. Proc. the 7th Annual Int. Conf. Research in Computational Molecular Biology (RECOMB) , Berlin, Germany, April 10-13, pp. 176-187. ACM Press, New York, NY. J. Comp. Bio. , (2004), 11, pp. 277-298; Langmead, C. and Donald, B. (2004) An expectation/maximization nuclear vector replacement algorithm for automated NMR resonance assignments. J. Biomol. NMR , 29, 111-138). Our approach uses NVR's scoring function and data types and also gives the option of using CH and NH residual dipolar coupling (RDCs), instead of NH RDCs which NVR requires. We test our technique on NVR's data set as well as on four new proteins. Our results are comparable to NVR's assignment accuracy on NVR's test set, but higher on novel proteins. Our approach allows partial assignments. It is also complete and can return the optimum as well as near-optimum assignments. Furthermore, it allows us to analyze the information content of each data type and is easily extendable to accept new forms of input data, such as additional RDCs.
Photoswitchable red fluorescent protein with a large Stokes shift

PubMed Central

Piatkevich, Kiryl D.; English, Brian P.; Malashkevich, Vladimir N.; Xiao, Hui; Almo, Steven C.; Singer, Robert H.; Verkhusha, Vladislav V.

2014-01-01

SUMMARY Subclass of fluorescent proteins, large Stokes shift fluorescent proteins, is characterized by their increased spread between the excitation and emission maxima. Here we report a photoswitchable variant of a red fluorescent protein with a large Stokes shift, PSLSSmKate, which initially exhibits excitation/emission at 445/622 nm, but irradiation with violet light photoswitches PSLSSmKate into a common red form with excitation/emission at 573/621 nm. We characterize spectral, photophysical and biochemical properties of PSLSSmKate in vitro and in mammalian cells, and determine its crystal structure in the large Stokes shift form. Mass-spectrometry, mutagenesis and spectroscopic analysis of PSLSSmKate allow us to propose molecular mechanisms for the large Stokes shift, pH dependence and light-induced chromophore transformation. We demonstrate applicability of PSLSSmKate to superresolution PALM microscopy and protein dynamics in live cells. Given its promising properties, we expect that PSLSSmKate-like phenotype will be further used for photoactivatable imaging and tracking multiple populations of intracellular objects. PMID:25242289
Insights into Hox protein function from a large scale combinatorial analysis of protein domains.

PubMed

Merabet, Samir; Litim-Mecheri, Isma; Karlsson, Daniel; Dixit, Richa; Saadaoui, Mehdi; Monier, Bruno; Brun, Christine; Thor, Stefan; Vijayraghavan, K; Perrin, Laurent; Pradel, Jacques; Graba, Yacine

2011-10-01

Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA), we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences.
Insights into Hox Protein Function from a Large Scale Combinatorial Analysis of Protein Domains

PubMed Central

Karlsson, Daniel; Dixit, Richa; Saadaoui, Mehdi; Monier, Bruno; Brun, Christine; Thor, Stefan; Vijayraghavan, K.; Perrin, Laurent; Pradel, Jacques; Graba, Yacine

2011-01-01

Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA), we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences. PMID:22046139
NMR assignments of the N-terminal domain of Nephila clavipes spidroin 1

PubMed Central

Parnham, Stuart; Gaines, William A.; Duggan, Brendan M.; Marcotte, William R.

2011-01-01

The building blocks of spider dragline silk are two fibrous proteins secreted from the major ampullate gland named spidroins 1 and 2 (MaSp1, MaSp2). These proteins consist of a large central domain composed of approximately 100 tandem copies of a 35–40 amino acid repeat sequence. Non-repetitive N and C-terminal domains, of which the C-terminal domain has been implicated to transition from soluble and insoluble states during spinning, flank the repetitive core. The N-terminal domain until recently has been largely unknown due to difficulties in cloning and expression. Here, we report nearly complete assignment for all 1H, 13C, and 15N resonances in the 14 kDa N-terminal domain of major ampullate spidroin 1 (MaSp1-N) of the golden orb-web spider Nephila clavipes. PMID:21152998
Nano-Mole Scale Side-Chain Signal Assignment by 1H-Detected Protein Solid-State NMR by Ultra-Fast Magic-Angle Spinning and Stereo-Array Isotope Labeling

PubMed Central

Nishiyama, Yusuke; Endo, Yuki; Nemoto, Takahiro; Yamauchi, Kazuo; Asakura, Tetsuo; Takeda, Mitsuhiro; Terauchi, Tsutomu; Kainosho, Masatsune; Ishii, Yoshitaka

2015-01-01

We present a general approach in 1H-detected 13C solid-state NMR (SSNMR) for side-chain signal assignments of 10-50 nmol quantities of proteins using a combination of a high magnetic field, ultra-fast magic-angle spinning (MAS) at ~80 kHz, and stereo-array-isotope-labeled (SAIL) proteins [Kainosho M. et al., Nature 440, 52–57, 2006]. First, we demonstrate that 1H indirect detection improves the sensitivity and resolution of 13C SSNMR of SAIL proteins for side-chain assignments in the ultra-fast MAS condition. 1H-detected SSNMR was performed for micro-crystalline ubiquitin (~55 nmol or ~0.5mg) that was SAIL-labeled at seven isoleucine (Ile) residues. Sensitivity was dramatically improved by 1H-detected 2D 1H/13C SSNMR by factors of 5.4-9.7 and 2.1-5.0, respectively, over 13C-detected 2D 1H/13C SSNMR and 1D 13C CPMAS, demonstrating that 2D 1H-detected SSNMR offers not only additional resolution but also sensitivity advantage over 1D 13C detection for the first time. High 1H resolution for the SAIL-labeled side-chain residues offered reasonable resolution even in the 2D data. A 1H-detected 3D 13C/13C/1H experiment on SAIL-ubiquitin provided nearly complete 1H and 13C assignments for seven Ile residues only within ~2.5 h. The results demonstrate the feasibility of side-chain signal assignment in this approach for as little as 10 nmol of a protein sample within ~3 days. The approach is likely applicable to a variety of proteins of biological interest without any requirements of highly efficient protein expression systems. PMID:25856081

SONAR Discovers RNA-Binding Proteins from Analysis of Large-Scale Protein-Protein Interactomes.

PubMed

Brannan, Kristopher W; Jin, Wenhao; Huelga, Stephanie C; Banks, Charles A S; Gilmore, Joshua M; Florens, Laurence; Washburn, Michael P; Van Nostrand, Eric L; Pratt, Gabriel A; Schwinn, Marie K; Daniels, Danette L; Yeo, Gene W

2016-10-20

RNA metabolism is controlled by an expanding, yet incomplete, catalog of RNA-binding proteins (RBPs), many of which lack characterized RNA binding domains. Approaches to expand the RBP repertoire to discover non-canonical RBPs are currently needed. Here, HaloTag fusion pull down of 12 nuclear and cytoplasmic RBPs followed by quantitative mass spectrometry (MS) demonstrates that proteins interacting with multiple RBPs in an RNA-dependent manner are enriched for RBPs. This motivated SONAR, a computational approach that predicts RNA binding activity by analyzing large-scale affinity precipitation-MS protein-protein interactomes. Without relying on sequence or structure information, SONAR identifies 1,923 human, 489 fly, and 745 yeast RBPs, including over 100 human candidate RBPs that contain zinc finger domains. Enhanced CLIP confirms RNA binding activity and identifies transcriptome-wide RNA binding sites for SONAR-predicted RBPs, revealing unexpected RNA binding activity for disease-relevant proteins and DNA binding proteins. Copyright © 2016 Elsevier Inc. All rights reserved.
Combining automated peak tracking in SAR by NMR with structure-based backbone assignment from 15N-NOESY

PubMed Central

2012-01-01

Background Chemical shift mapping is an important technique in NMR-based drug screening for identifying the atoms of a target protein that potentially bind to a drug molecule upon the molecule's introduction in increasing concentrations. The goal is to obtain a mapping of peaks with known residue assignment from the reference spectrum of the unbound protein to peaks with unknown assignment in the target spectrum of the bound protein. Although a series of perturbed spectra help to trace a path from reference peaks to target peaks, a one-to-one mapping generally is not possible, especially for large proteins, due to errors, such as noise peaks, missing peaks, missing but then reappearing, overlapped, and new peaks not associated with any peaks in the reference. Due to these difficulties, the mapping is typically done manually or semi-automatically, which is not efficient for high-throughput drug screening. Results We present PeakWalker, a novel peak walking algorithm for fast-exchange systems that models the errors explicitly and performs many-to-one mapping. On the proteins: hBclXL, UbcH5B, and histone H1, it achieves an average accuracy of over 95% with less than 1.5 residues predicted per target peak. Given these mappings as input, we present PeakAssigner, a novel combined structure-based backbone resonance and NOE assignment algorithm that uses just 15N-NOESY, while avoiding TOCSY experiments and 13C-labeling, to resolve the ambiguities for a one-to-one mapping. On the three proteins, it achieves an average accuracy of 94% or better. Conclusions Our mathematical programming approach for modeling chemical shift mapping as a graph problem, while modeling the errors directly, is potentially a time- and cost-effective first step for high-throughput drug screening based on limited NMR data and homologous 3D structures. PMID:22536902
AUTOBA: automation of backbone assignment from HN(C)N suite of experiments.

PubMed

Borkar, Aditi; Kumar, Dinesh; Hosur, Ramakrishna V

2011-07-01

Development of efficient strategies and automation represent important milestones of progress in rapid structure determination efforts in proteomics research. In this context, we present here an efficient algorithm named as AUTOBA (Automatic Backbone Assignment) designed to automate the assignment protocol based on HN(C)N suite of experiments. Depending upon the spectral dispersion, the user can record 2D or 3D versions of the experiments for assignment. The algorithm uses as inputs: (i) protein primary sequence and (ii) peak-lists from user defined HN(C)N suite of experiments. In the end, one gets H(N), (15)N, C(α) and C' assignments (in common BMRB format) for the individual residues along the polypeptide chain. The success of the algorithm has been demonstrated, not only with experimental spectra recorded on two small globular proteins: ubiquitin (76 aa) and M-crystallin (85 aa), but also with simulated spectra of 27 other proteins using assignment data from the BMRB.
Assigning protein functions by comparative genome analysis protein phylogenetic profiles

DOEpatents

Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

2003-05-13

A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.
A large-scale evaluation of computational protein function prediction

PubMed Central

Radivojac, Predrag; Clark, Wyatt T; Ronnen Oron, Tal; Schnoes, Alexandra M; Wittkop, Tobias; Sokolov, Artem; Graim, Kiley; Funk, Christopher; Verspoor, Karin; Ben-Hur, Asa; Pandey, Gaurav; Yunes, Jeffrey M; Talwalkar, Ameet S; Repo, Susanna; Souza, Michael L; Piovesan, Damiano; Casadio, Rita; Wang, Zheng; Cheng, Jianlin; Fang, Hai; Gough, Julian; Koskinen, Patrik; Törönen, Petri; Nokso-Koivisto, Jussi; Holm, Liisa; Cozzetto, Domenico; Buchan, Daniel W A; Bryson, Kevin; Jones, David T; Limaye, Bhakti; Inamdar, Harshal; Datta, Avik; Manjari, Sunitha K; Joshi, Rajendra; Chitale, Meghana; Kihara, Daisuke; Lisewski, Andreas M; Erdin, Serkan; Venner, Eric; Lichtarge, Olivier; Rentzsch, Robert; Yang, Haixuan; Romero, Alfonso E; Bhat, Prajwal; Paccanaro, Alberto; Hamp, Tobias; Kassner, Rebecca; Seemayer, Stefan; Vicedo, Esmeralda; Schaefer, Christian; Achten, Dominik; Auer, Florian; Böhm, Ariane; Braun, Tatjana; Hecht, Maximilian; Heron, Mark; Hönigschmid, Peter; Hopf, Thomas; Kaufmann, Stefanie; Kiening, Michael; Krompass, Denis; Landerer, Cedric; Mahlich, Yannick; Roos, Manfred; Björne, Jari; Salakoski, Tapio; Wong, Andrew; Shatkay, Hagit; Gatzmann, Fanny; Sommer, Ingolf; Wass, Mark N; Sternberg, Michael J E; Škunca, Nives; Supek, Fran; Bošnjak, Matko; Panov, Panče; Džeroski, Sašo; Šmuc, Tomislav; Kourmpetis, Yiannis A I; van Dijk, Aalt D J; ter Braak, Cajo J F; Zhou, Yuanpeng; Gong, Qingtian; Dong, Xinran; Tian, Weidong; Falda, Marco; Fontana, Paolo; Lavezzo, Enrico; Di Camillo, Barbara; Toppo, Stefano; Lan, Liang; Djuric, Nemanja; Guo, Yuhong; Vucetic, Slobodan; Bairoch, Amos; Linial, Michal; Babbitt, Patricia C; Brenner, Steven E; Orengo, Christine; Rost, Burkhard; Mooney, Sean D; Friedberg, Iddo

2013-01-01

Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based Critical Assessment of protein Function Annotation (CAFA) experiment. Fifty-four methods representing the state-of-the-art for protein function prediction were evaluated on a target set of 866 proteins from eleven organisms. Two findings stand out: (i) today’s best protein function prediction algorithms significantly outperformed widely-used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is significant need for improvement of currently available tools. PMID:23353650
Electron transfer dissociation (ETD): The mass spectrometric breakthrough essential for O-GlcNAc protein site assignments – A study of the O-GlcNAcylated protein Host Cell Factor C1

PubMed Central

Myers, Samuel A.; Daou, Salima; Affar, El Bachir; Burlingame, AL

2014-01-01

The development of electron-based, unimolecular dissociation mass spectrometric methods, i.e. electron capture and electron transfer dissociation (ECD and ETD, respectively), has greatly increased the speed and reliability of labile post-translational modification (PTM) site assignment. The field of intracellular O-GlcNAc (O-linked N-acetylglucosamine) signaling has especially advanced with the advent of ETD mass spectrometry. Only within the last five years have proteomic-scale experiments utilizing ETD allowed the assignment of hundreds of O-GlcNAc sites within cells and subcellular structures. Our ability to identify and unambiguously assign the site of O-GlcNAc modifications using ETD is rapidly increasing our understanding of this regulatory glycosylation and its potential interaction with other PTMs. Here, we discuss the advantages of using ETD, complimented with collisional-activation mass spectrometry (CID/CAD), in a study of O-GlcNAc modified peptides of the extensively O-GlcNAcylated protein Host Cell Factor C1 (HCF-1). HCF-1 is a transcriptional co-regulator, forms a stable complex with O-GlcNAc transferase and is involved in control of cell cycle progression. ETD, along with higher energy collisional dissociation (HCD) mass spectrometry, was employed to assign the PTMs of the HCF-1 protein isolated from HEK293T cells. These include nineteen sites of O-GlcNAcylation, two sites of phosphorylation and two sites bearing dimethylarginine, and showcase the residue-specific, PTM complexity of this regulator of cell proliferation. PMID:23335398
Factors Associated with Assignment of Therapeutic Homework in a Large Public Children's Mental Health System.

PubMed

Trask, Emily Velazquez; Barounis, Kya; Carlisle, Brandon L; Garland, Ann F; Aarons, Gregory A

2018-03-24

Therapeutic homework is a fundamental skill-building component of the majority of evidence-based therapies and is associated with better treatment outcomes. However, it is rarely utilized in public mental health settings. To determine barriers to homework use and identify predictors of clinicians' assignment of homework, an online survey was administered to 267 clinicians in a large diverse public mental health system. Clinicians who were younger, licensed, whose supervisors asked about homework and whose clients completed their homework more frequently were predictors of greater homework utilization. The survey results are discussed and a novel idea to increase the use of homework is introduced.
Fast large-scale clustering of protein structures using Gauss integrals.

PubMed

Harder, Tim; Borg, Mikael; Boomsma, Wouter; Røgen, Peter; Hamelryck, Thomas

2012-02-15

Clustering protein structures is an important task in structural bioinformatics. De novo structure prediction, for example, often involves a clustering step for finding the best prediction. Other applications include assigning proteins to fold families and analyzing molecular dynamics trajectories. We present Pleiades, a novel approach to clustering protein structures with a rigorous mathematical underpinning. The method approximates clustering based on the root mean square deviation by first mapping structures to Gauss integral vectors--which were introduced by Røgen and co-workers--and subsequently performing K-means clustering. Compared to current methods, Pleiades dramatically improves on the time needed to perform clustering, and can cluster a significantly larger number of structures, while providing state-of-the-art results. The number of low energy structures generated in a typical folding study, which is in the order of 50,000 structures, can be clustered within seconds to minutes.
BioPlex Display: An Interactive Suite for Large-Scale AP-MS Protein-Protein Interaction Data.

PubMed

Schweppe, Devin K; Huttlin, Edward L; Harper, J Wade; Gygi, Steven P

2018-01-05

The development of large-scale data sets requires a new means to display and disseminate research studies to large audiences. Knowledge of protein-protein interaction (PPI) networks has become a principle interest of many groups within the field of proteomics. At the confluence of technologies, such as cross-linking mass spectrometry, yeast two-hybrid, protein cofractionation, and affinity purification mass spectrometry (AP-MS), detection of PPIs can uncover novel biological inferences at a high-throughput. Thus new platforms to provide community access to large data sets are necessary. To this end, we have developed a web application that enables exploration and dissemination of the growing BioPlex interaction network. BioPlex is a large-scale interactome data set based on AP-MS of baits from the human ORFeome. The latest BioPlex data set release (BioPlex 2.0) contains 56 553 interactions from 5891 AP-MS experiments. To improve community access to this vast compendium of interactions, we developed BioPlex Display, which integrates individual protein querying, access to empirical data, and on-the-fly annotation of networks within an easy-to-use and mobile web application. BioPlex Display enables rapid acquisition of data from BioPlex and development of hypotheses based on protein interactions.
Isothermal chemical denaturation of large proteins: Path-dependence and irreversibility.

PubMed

Wafer, Lucas; Kloczewiak, Marek; Polleck, Sharon M; Luo, Yin

2017-12-15

State functions (e.g., ΔG) are path independent and quantitatively describe the equilibrium states of a thermodynamic system. Isothermal chemical denaturation (ICD) is often used to extrapolate state function parameters for protein unfolding in native buffer conditions. The approach is prudent when the unfolding/refolding processes are path independent and reversible, but may lead to erroneous results if the processes are not reversible. The reversibility was demonstrated in several early studies for smaller proteins, but was assumed in some reports for large proteins with complex structures. In this work, the unfolding/refolding of several proteins were systematically studied using an automated ICD instrument. It is shown that: (i) the apparent unfolding mechanism and conformational stability of large proteins can be denaturant-dependent, (ii) equilibration times for large proteins are non-trivial and may introduce significant error into calculations of ΔG, (iii) fluorescence emission spectroscopy may not correspond to other methods, such as circular dichroism, when used to measure protein unfolding, and (iv) irreversible unfolding and hysteresis can occur in the absence of aggregation. These results suggest that thorough confirmation of the state functions by, for example, performing refolding experiments or using additional denaturants, is needed when quantitatively studying the thermodynamics of protein unfolding using ICD. Copyright © 2017 Elsevier Inc. All rights reserved.
Towards Automated Structure-Based NMR Resonance Assignment

NASA Astrophysics Data System (ADS)

Jang, Richard; Gao, Xin; Li, Ming

We propose a general framework for solving the structure-based NMR backbone resonance assignment problem. The core is a novel 0-1 integer programming model that can start from a complete or partial assignment, generate multiple assignments, and model not only the assignment of spins to residues, but also pairwise dependencies consisting of pairs of spins to pairs of residues. It is still a challenge for automated resonance assignment systems to perform the assignment directly from spectra without any manual intervention. To test the feasibility of this for structure-based assignment, we integrated our system with our automated peak picking and sequence-based resonance assignment system to obtain an assignment for the protein TM1112 with 91% recall and 99% precision without manual intervention. Since using a known structure has the potential to allow one to use only N-labeled NMR data and avoid the added expense of using C-labeled data, we work towards the goal of automated structure-based assignment using only such labeled data. Our system reduced the assignment error of Xiong-Pandurangan-Bailey-Kellogg's contact replacement (CR) method, which to our knowledge is the most error-tolerant method for this problem, by 5 folds on average. By using an iterative algorithm, our system has the added capability of using the NOESY data to correct assignment errors due to errors in predicting the amino acid and secondary structure type of each spin system. On a publicly available data set for Ubiquitin, where the type prediction accuracy is 83%, we achieved 91% assignment accuracy, compared to the 59% accuracy that was obtained without correcting for typing errors.
Large-scale protein/antibody patterning with limiting unspecific adsorption

NASA Astrophysics Data System (ADS)

Fedorenko, Viktoriia; Bechelany, Mikhael; Janot, Jean-Marc; Smyntyna, Valentyn; Balme, Sebastien

2017-10-01

A simple synthetic route based on nanosphere lithography has been developed in order to design a large-scale nanoarray for specific control of protein anchoring. This technique based on two-dimensional (2D) colloidal crystals composed of polystyrene spheres allows the easy and inexpensive fabrication of large arrays (up to several centimeters) by reducing the cost. A silicon wafer coated with a thin adhesion layer of chromium (15 nm) and a layer of gold (50 nm) is used as a substrate. PS spheres are deposited on the gold surface using the floating-transferring technique. The PS spheres were then functionalized with PEG-biotin and the defects by self-assembly monolayer (SAM) PEG to prevent unspecific adsorption. Using epifluorescence microscopy, we show that after immersion of sample on target protein (avidin and anti-avidin) solution, the latter are specifically located on polystyrene spheres. Thus, these results are meaningful for exploration of devices based on a large-scale nanoarray of PS spheres and can be used for detection of target proteins or simply to pattern a surface with specific proteins.
A non-uniformly sampled 4D HCC(CO)NH-TOCSY experiment processed using maximum entropy for rapid protein sidechain assignment

PubMed Central

Mobli, Mehdi; Stern, Alan S.; Bermel, Wolfgang; King, Glenn F.; Hoch, Jeffrey C.

2010-01-01

One of the stiffest challenges in structural studies of proteins using NMR is the assignment of sidechain resonances. Typically, a panel of lengthy 3D experiments are acquired in order to establish connectivities and resolve ambiguities due to overlap. We demonstrate that these experiments can be replaced by a single 4D experiment that is time-efficient, yields excellent resolution, and captures unique carbon-proton connectivity information. The approach is made practical by the use of non-uniform sampling in the three indirect time dimensions and maximum entropy reconstruction of the corresponding 3D frequency spectrum. This 4D method will facilitate automated resonance assignment procedures and it should be particularly beneficial for increasing throughput in NMR-based structural genomics initiatives. PMID:20299257
1H, 13C, and 15N resonance assignments for the protein coded by gene locus BB0938 of Bordetella bronchiseptica

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rossi, Paolo; Ramelot, Theresa A.; Xiao, Rong

2005-11-01

The product of gene locus BB0938 from Bordetella bronchiseptica (Swiss-Prot ID: Q7WNU7-BORBR; NESG target ID: BoR11; Wunderlich et al., 2004; Pfam ID: PF03476) is a 128-residue protein of unknown function. This broadly conserved protein family is found in eubacteria and eukaryotes. Using triple resonance NMR techniques, we have determined 98% of backbone and 94% of side chain 1H, 13C, and 15N resonance assignments. The chemical shift and 3J(HN?Ha) scalar coupling data reveal a b topology with a seven-residue helical insert, ??????????. BMRB deposit with accession number 6693. Reference: Wunderlich et al. (2004) Proteins, 56, 181?187.
42 CFR 433.146 - Rights assigned; assignment method.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 42 Public Health 4 2013-10-01 2013-10-01 false Rights assigned; assignment method. 433.146 Section 433.146 Public Health CENTERS FOR MEDICARE & MEDICAID SERVICES, DEPARTMENT OF HEALTH AND HUMAN... Assignment of Rights to Benefits § 433.146 Rights assigned; assignment method. (a) Except as specified in...
42 CFR 433.146 - Rights assigned; assignment method.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 42 Public Health 4 2011-10-01 2011-10-01 false Rights assigned; assignment method. 433.146 Section 433.146 Public Health CENTERS FOR MEDICARE & MEDICAID SERVICES, DEPARTMENT OF HEALTH AND HUMAN... Assignment of Rights to Benefits § 433.146 Rights assigned; assignment method. (a) Except as specified in...
42 CFR 433.146 - Rights assigned; assignment method.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 42 Public Health 4 2010-10-01 2010-10-01 false Rights assigned; assignment method. 433.146 Section 433.146 Public Health CENTERS FOR MEDICARE & MEDICAID SERVICES, DEPARTMENT OF HEALTH AND HUMAN... Assignment of Rights to Benefits § 433.146 Rights assigned; assignment method. (a) Except as specified in...
Large-scale protein-protein interaction analysis in Arabidopsis mesophyll protoplasts by split firefly luciferase complementation.

PubMed

Li, Jian-Feng; Bush, Jenifer; Xiong, Yan; Li, Lei; McCormack, Matthew

2011-01-01

Protein-protein interactions (PPIs) constitute the regulatory network that coordinates diverse cellular functions. There are growing needs in plant research for creating protein interaction maps behind complex cellular processes and at a systems biology level. However, only a few approaches have been successfully used for large-scale surveys of PPIs in plants, each having advantages and disadvantages. Here we present split firefly luciferase complementation (SFLC) as a highly sensitive and noninvasive technique for in planta PPI investigation. In this assay, the separate halves of a firefly luciferase can come into close proximity and transiently restore its catalytic activity only when their fusion partners, namely the two proteins of interest, interact with each other. This assay was conferred with quantitativeness and high throughput potential when the Arabidopsis mesophyll protoplast system and a microplate luminometer were employed for protein expression and luciferase measurement, respectively. Using the SFLC assay, we could monitor the dynamics of rapamycin-induced and ascomycin-disrupted interaction between Arabidopsis FRB and human FKBP proteins in a near real-time manner. As a proof of concept for large-scale PPI survey, we further applied the SFLC assay to testing 132 binary PPIs among 8 auxin response factors (ARFs) and 12 Aux/IAA proteins from Arabidopsis. Our results demonstrated that the SFLC assay is ideal for in vivo quantitative PPI analysis in plant cells and is particularly powerful for large-scale binary PPI screens.
Calibrated peer review assignments for the earth sciences

USGS Publications Warehouse

Rudd, J.A.; Wang, V.Z.; Cervato, C.; Ridky, R.W.

2009-01-01

Calibrated Peer Review ??? (CPR), a web-based instructional tool developed as part of the National Science Foundation reform initiatives in undergraduate science education, allows instructors to incorporate multiple writing assignments in large courses without overwhelming the instructor. This study reports successful implementation of CPR in a large, introductory geology course and student learning of geoscience content. For each CPR assignment in this study, students studied web-based and paper resources, wrote an essay, and reviewed seven essays (three from the instructor, three from peers, and their own) on the topic. Although many students expressed negative attitudes and concerns, particularly about the peer review process of this innovative instructional approach, they also recognized the learning potential of completing CPR assignments. Comparing instruction on earthquakes and plate boundaries using a CPR assignment vs. an instructional video lecture and homework essay with extensive instructor feedback, students mastered more content via CPR instruction.
Negotiating Assignment Pathways: Students and Academic Assignments

ERIC Educational Resources Information Center

McDowell, Liz

2008-01-01

Existing research identifies that students' approaches to assignments are related to their general approaches to study. It is suggested that students need to better understand the requirements of assignments and acquire new concepts such as "argument". This fine-grained study proposes four qualitatively distinct assignment pathways: gathering,…

Scaffolding Assignments and Activities for Undergraduate Research Methods

ERIC Educational Resources Information Center

Fisher, Sarah; Justwan, Florian

2018-01-01

This article details assignments and lessons created for and tested in research methods courses at two different universities, a large state school and a small liberal arts college. Each assignment or activity utilized scaffolding. Students were asked to push beyond their comfort zone while utilizing concrete and/or creative examples,…
Assessment of a Diversity Assignment in a PR Principles Course

ERIC Educational Resources Information Center

Gallicano, Tiffany Derville; Stansberry, Kathleen

2012-01-01

This study assesses an assignment for incorporating diversity into the principles of public relations course. The assignment is tailored to the challenges of using an active learning approach in a large lecture class. For the assignment, students write a goal, objectives, strategies, an identification of tactics, and evaluation plans for either…
Large heat capacity change in a protein-monovalent cation interaction.

PubMed

Guinto, E R; Di Cera, E

1996-07-09

Current views about protein-ligand interactions state that electrostatic forces drive the binding of charged species and that burial of hydrophobic and polar surfaces controls the heat capacity change associated with the reaction. For the interaction of a protein with a monovalent cation the electrostatic components are expected to be significant due to the ionic nature of the ligand, whereas the heat capacity change is expected to be small due to the size of the surface area involved in the recognition event. The physiologically important interaction of Na+ with thrombin was studied over the temperature range from 5 to 45 degrees C and the ionic strength range from 50 to 800 mM. These measurements reveal an unanticipated result that bears quite generally on studies of molecular recognition and protein folding. Binding of Na+ to thrombin is characterized by a modest dependence on ionic strength but a large and negative heat capacity change of -1.1 +/- 0.1 kcal mol-1 K-1. The small electrostatic coupling can be explained in terms of a minimal perturbation of the ionic atmosphere of the protein upon Na+ binding. The large heat capacity change, however, is difficult to reconcile with current views on the origin of this effect from surface area changes or large folding transitions coupled to binding. It is proposed that this change is linked to burial of a large cluster of water molecules in the Na+ binding pocket upon Na+ binding. Due to their reduced mobility and highly ordered structure, water molecules sequestered in the interior of a protein must have a lower heat capacity compared to those on the surface of a protein or in the bulk solvent. Hence, a binding or folding event where water molecules are buried may result in significant heat capacity changes independent of changes in exposed hydrophobic surface or coupled conformational transitions.
Global Optimization of Emergency Evacuation Assignments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Han, Lee; Yuan, Fang; Chin, Shih-Miao

2006-01-01

Conventional emergency evacuation plans often assign evacuees to fixed routes or destinations based mainly on geographic proximity. Such approaches can be inefficient if the roads are congested, blocked, or otherwise dangerous because of the emergency. By not constraining evacuees to prespecified destinations, a one-destination evacuation approach provides flexibility in the optimization process. We present a framework for the simultaneous optimization of evacuation-traffic distribution and assignment. Based on the one-destination evacuation concept, we can obtain the optimal destination and route assignment by solving a one-destination traffic-assignment problem on a modified network representation. In a county-wide, large-scale evacuation case study, the one-destinationmore » model yields substantial improvement over the conventional approach, with the overall evacuation time reduced by more than 60 percent. More importantly, emergency planners can easily implement this framework by instructing evacuees to go to destinations that the one-destination optimization process selects.« less
Assignment of the zinc ligands in RsrA, a redox-sensing ZAS protein from Streptomyces coelicolor.

PubMed

Zdanowski, Konrad; Doughty, Phillip; Jakimowicz, Piotr; O'Hara, Liisa; Buttner, Mark J; Paget, Mark S B; Kleanthous, Colin

2006-07-11

ZAS proteins are widespread bacterial zinc-containing anti-sigma factors that regulate the activity of sigma factors in response to diverse cues. One of the best characterized ZAS proteins is RsrA from Streptomyces coelicolor, which responds to disulfide stress. Zn-RsrA binds and represses the transcriptional activity of sigmaR in the reducing environment of the cytoplasm but undergoes reversible, intramolecular disulfide bond formation during oxidative stress. This expels the single metal ion and causes dramatic structural changes in RsrA that result in its dissociation from sigmaR, leaving the sigma factor free to activate the transcription of antioxidant genes. We showed recently that Zn2+ serves a critical role in modulating the redox activity of RsrA thiols but uncertainty remains as to how the metal ion is coordinated in RsrA and related ZAS proteins. Using a combination of random and site-specific mutagenesis with zinc K-edge extended X-ray absorption fine structure (EXAFS) spectroscopy, we have assigned unambiguously the metal ligands in RsrA, thereby distinguishing between the different ligation models that have been proposed. The data show that the zinc site in RsrA is comprised of Cys11, His37, Cys41, and Cys44. Three of these residues are part of a conserved ZAS-specific sequence motif (H37xxxC41xxC44), with the fourth ligand, Cys11, found in a subset of ZAS proteins. Cys11 and Cys44 form the trigger disulfide in RsrA, explaining why the metal ion is expelled during oxidation. We discuss these data in the context of redox sensing by RsrA and the sensory mechanisms of other ZAS proteins.
Automation of large scale transient protein expression in mammalian cells

PubMed Central

Zhao, Yuguang; Bishop, Benjamin; Clay, Jordan E.; Lu, Weixian; Jones, Margaret; Daenke, Susan; Siebold, Christian; Stuart, David I.; Yvonne Jones, E.; Radu Aricescu, A.

2011-01-01

Traditional mammalian expression systems rely on the time-consuming generation of stable cell lines; this is difficult to accommodate within a modern structural biology pipeline. Transient transfections are a fast, cost-effective solution, but require skilled cell culture scientists, making man-power a limiting factor in a setting where numerous samples are processed in parallel. Here we report a strategy employing a customised CompacT SelecT cell culture robot allowing the large-scale expression of multiple protein constructs in a transient format. Successful protocols have been designed for automated transient transfection of human embryonic kidney (HEK) 293T and 293S GnTI− cells in various flask formats. Protein yields obtained by this method were similar to those produced manually, with the added benefit of reproducibility, regardless of user. Automation of cell maintenance and transient transfection allows the expression of high quality recombinant protein in a completely sterile environment with limited support from a cell culture scientist. The reduction in human input has the added benefit of enabling continuous cell maintenance and protein production, features of particular importance to structural biology laboratories, which typically use large quantities of pure recombinant proteins, and often require rapid characterisation of a series of modified constructs. This automated method for large scale transient transfection is now offered as a Europe-wide service via the P-cube initiative. PMID:21571074
Large-scale model quality assessment for improving protein tertiary structure prediction.

PubMed

Cao, Renzhi; Bhattacharya, Debswapna; Adhikari, Badri; Li, Jilong; Cheng, Jianlin

2015-06-15

Sampling structural models and ranking them are the two major challenges of protein structure prediction. Traditional protein structure prediction methods generally use one or a few quality assessment (QA) methods to select the best-predicted models, which cannot consistently select relatively better models and rank a large number of models well. Here, we develop a novel large-scale model QA method in conjunction with model clustering to rank and select protein structural models. It unprecedentedly applied 14 model QA methods to generate consensus model rankings, followed by model refinement based on model combination (i.e. averaging). Our experiment demonstrates that the large-scale model QA approach is more consistent and robust in selecting models of better quality than any individual QA method. Our method was blindly tested during the 11th Critical Assessment of Techniques for Protein Structure Prediction (CASP11) as MULTICOM group. It was officially ranked third out of all 143 human and server predictors according to the total scores of the first models predicted for 78 CASP11 protein domains and second according to the total scores of the best of the five models predicted for these domains. MULTICOM's outstanding performance in the extremely competitive 2014 CASP11 experiment proves that our large-scale QA approach together with model clustering is a promising solution to one of the two major problems in protein structure modeling. The web server is available at: http://sysbio.rnet.missouri.edu/multicom_cluster/human/. © The Author 2015. Published by Oxford University Press.
A Simple and Effective Protein Folding Activity Suitable for Large Lectures

ERIC Educational Resources Information Center

White, Brian

2006-01-01

This article describes a simple and inexpensive hands-on simulation of protein folding suitable for use in large lecture classes. This activity uses a minimum of parts, tools, and skill to simulate some of the fundamental principles of protein folding. The major concepts targeted are that proteins begin as linear polypeptides and fold to…
Online Assignments in Economics: A Test of Their Effectiveness

ERIC Educational Resources Information Center

Kennelly, Brendan; Considine, John; Flannery, Darragh

2011-01-01

This article compares the effectiveness of online and paper-based assignments and tutorials using summative assessment results. All of the students in a large managerial economics course at National University of Ireland, Galway were asked to do six assignments online using Aplia and to do two on paper. The authors examined whether a student's…
Development and Application of ANN Model for Worker Assignment into Virtual Cells of Large Sized Configurations

NASA Astrophysics Data System (ADS)

Murali, R. V.; Puri, A. B.; Fathi, Khalid

2010-10-01

This paper presents an extended version of study already undertaken on development of an artificial neural networks (ANNs) model for assigning workforce into virtual cells under virtual cellular manufacturing systems (VCMS) environments. Previously, the same authors have introduced this concept and applied it to virtual cells of two-cell configuration and the results demonstrated that ANNs could be a worth applying tool for carrying out workforce assignments. In this attempt, three-cell configurations problems are considered for worker assignment task. Virtual cells are formed under dual resource constraint (DRC) context in which the number of available workers is less than the total number of machines available. Since worker assignment tasks are quite non-linear and highly dynamic in nature under varying inputs & conditions and, in parallel, ANNs have the ability to model complex relationships between inputs and outputs and find similar patterns effectively, an attempt was earlier made to employ ANNs into the above task. In this paper, the multilayered perceptron with feed forward (MLP-FF) neural network model has been reused for worker assignment tasks of three-cell configurations under DRC context and its performance at different time periods has been analyzed. The previously proposed worker assignment model has been reconfigured and cell formation solutions available for three-cell configuration in the literature are used in combination to generate datasets for training ANNs framework. Finally, results of the study have been presented and discussed.
1H, 13C, and 15N resonance assignments for Escherichia coli ytfP, a member of the broadly conserved UPF0131 protein domain family

DOE Office of Scientific and Technical Information (OSTI.GOV)

Aramini, James M.; Swapna, G.V.T.; Huang, Yuanpeng

2005-11-01

Protein ytfP from Escherichia coli (Swiss-Prot ID: YTFP-ECOLI; NESG target ID: ER111; Wunderlich et al., 2004) is a 113-residue member of the UPF0131 protein family (Pfam ID: PF03674) of unknown function. This domain family is found in organisms from all three kingdoms, archaea, eubacteria and eukaryotes. Using triple resonance NMR techniques, we have determined 97% of backbone and 91% of side chain 1H, 13C, and 15N resonance assignments. The chemical shift and 3J(HN?Ha) scalar coupling data reveal a mixed a/b topology,????????. BMRB deposit with Accession No. 6448. Reference: Wunderlich et al. (2004) Proteins, 56, 181?187.
Protein homology model refinement by large-scale energy optimization.

PubMed

Park, Hahnbeom; Ovchinnikov, Sergey; Kim, David E; DiMaio, Frank; Baker, David

2018-03-20

Proteins fold to their lowest free-energy structures, and hence the most straightforward way to increase the accuracy of a partially incorrect protein structure model is to search for the lowest-energy nearby structure. This direct approach has met with little success for two reasons: first, energy function inaccuracies can lead to false energy minima, resulting in model degradation rather than improvement; and second, even with an accurate energy function, the search problem is formidable because the energy only drops considerably in the immediate vicinity of the global minimum, and there are a very large number of degrees of freedom. Here we describe a large-scale energy optimization-based refinement method that incorporates advances in both search and energy function accuracy that can substantially improve the accuracy of low-resolution homology models. The method refined low-resolution homology models into correct folds for 50 of 84 diverse protein families and generated improved models in recent blind structure prediction experiments. Analyses of the basis for these improvements reveal contributions from both the improvements in conformational sampling techniques and the energy function.
Vibrational entropy of a protein: large differences between distinct conformations.

PubMed

Goethe, Martin; Fita, Ignacio; Rubi, J Miguel

2015-01-13

In this article, it is investigated whether vibrational entropy (VE) is an important contribution to the free energy of globular proteins at ambient conditions. VE represents the major configurational-entropy contribution of these proteins. By definition, it is an average of the configurational entropies of the protein within single minima of the energy landscape, weighted by their occupation probabilities. Its large part originates from thermal motion of flexible torsion angles giving rise to the finite peak widths observed in torsion angle distributions. While VE may affect the equilibrium properties of proteins, it is usually neglected in numerical calculations as its consideration is difficult. Moreover, it is sometimes believed that all well-packed conformations of a globular protein have similar VE anyway. Here, we measure explicitly the VE for six different conformations from simulation data of a test protein. Estimates are obtained using the quasi-harmonic approximation for three coordinate sets, Cartesian, bond-angle-torsion (BAT), and a new set termed rotamer-degeneracy lifted BAT coordinates by us. The new set gives improved estimates as it overcomes a known shortcoming of the quasi-harmonic approximation caused by multiply populated rotamer states, and it may serve for VE estimation of macromolecules in a very general context. The obtained VE values depend considerably on the type of coordinates used. However, for all coordinate sets we find large entropy differences between the conformations, of the order of the overall stability of the protein. This result may have important implications on the choice of free energy expressions used in software for protein structure prediction, protein design, and NMR refinement.
Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks.

PubMed

Wang, Jian; Xie, Dong; Lin, Hongfei; Yang, Zhihao; Zhang, Yijia

2012-06-21

Many biological processes recognize in particular the importance of protein complexes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, high false-positive rate of PPIs leads to challenging identification. A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. The method detects protein complexes from large scale PPI networks by filtering GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective to identify attachment proteins of complexes.
Assignment Choice, Effort, and Assignment Completion: Does Work Ethic Predict Those Who Choose Higher-Effort Assignments?

ERIC Educational Resources Information Center

Parkhurst, John T.; Fleisher, Matthew S.; Skinner, Christopher H.; Woehr, David J.; Hawthorn-Embree, Meredith L.

2011-01-01

After completing the Multidimensional Work-Ethic Profile (MWEP), 98 college students were given a 20-problem math computation assignment and instructed to stop working on the assignment after completing 10 problems. Next, they were allowed to choose to finish either the partially completed assignment that had 10 problems remaining or a new…
Neutron protein crystallography: A complementary tool for locating hydrogens in proteins.

PubMed

O'Dell, William B; Bodenheimer, Annette M; Meilleur, Flora

2016-07-15

Neutron protein crystallography is a powerful tool for investigating protein chemistry because it directly locates hydrogen atom positions in a protein structure. The visibility of hydrogen and deuterium atoms arises from the strong interaction of neutrons with the nuclei of these isotopes. Positions can be unambiguously assigned from diffraction at resolutions typical of protein crystals. Neutrons have the additional benefit to structural biology of not inducing radiation damage in protein crystals. The same crystal could be measured multiple times for parametric studies. Here, we review the basic principles of neutron protein crystallography. The information that can be gained from a neutron structure is presented in balance with practical considerations. Methods to produce isotopically-substituted proteins and to grow large crystals are provided in the context of neutron structures reported in the literature. Available instruments for data collection and software for data processing and structure refinement are described along with technique-specific strategies including joint X-ray/neutron structure refinement. Examples are given to illustrate, ultimately, the unique scientific value of neutron protein crystal structures. Copyright © 2015 Elsevier Inc. All rights reserved.
Split luciferase complementation assay to detect regulated protein-protein interactions in rice protoplasts in a large-scale format

PubMed Central

2014-01-01

Background The rice interactome, in which a network of protein-protein interactions has been elucidated in rice, is a useful resource to identify functional modules of rice signal transduction pathways. Protein-protein interactions occur in cells in two ways, constitutive and regulative. While a yeast-based high-throughput method has been widely used to identify the constitutive interactions, a method to detect the regulated interactions is rarely developed for a large-scale analysis. Results A split luciferase complementation assay was applied to detect the regulated interactions in rice. A transformation method of rice protoplasts in a 96-well plate was first established for a large-scale analysis. In addition, an antibody that specifically recognizes a carboxyl-terminal fragment of Renilla luciferase was newly developed. A pair of antibodies that recognize amino- and carboxyl- terminal fragments of Renilla luciferase, respectively, was then used to monitor quality and quantity of interacting recombinant-proteins accumulated in the cells. For a proof-of-concept, the method was applied to detect the gibberellin-dependent interaction between GIBBERELLIN INSENSITIVE DWARF1 and SLENDER RICE 1. Conclusions A method to detect regulated protein-protein interactions was developed towards establishment of the rice interactome. PMID:24987490
Effortless assignment with 4D covariance sequential correlation maps

NASA Astrophysics Data System (ADS)

Harden, Bradley J.; Mishra, Subrata H.; Frueh, Dominique P.

2015-11-01

Traditional Nuclear Magnetic Resonance (NMR) assignment procedures for proteins rely on preliminary peak-picking to identify and label NMR signals. However, such an approach has severe limitations when signals are erroneously labeled or completely neglected. The consequences are especially grave for proteins with substantial peak overlap, and mistakes can often thwart entire projects. To overcome these limitations, we previously introduced an assignment technique that bypasses traditional pick peaking altogether. Covariance Sequential Correlation Maps (COSCOMs) transform the indirect connectivity information provided by multiple 3D backbone spectra into direct (H, N) to (H, N) correlations. Here, we present an updated method that utilizes a single four-dimensional spectrum rather than a suite of three-dimensional spectra. We demonstrate the advantages of 4D-COSCOMs relative to their 3D counterparts. We introduce improvements accelerating their calculation. We discuss practical considerations affecting their quality. And finally we showcase their utility in the context of a 52 kDa cyclization domain from a non-ribosomal peptide synthetase.
A sequential assignment procedure for proteins that have intermediate line widths in MAS NMR spectra: amyloid fibrils of human CA150.WW2.

PubMed

Becker, Johanna; Ferguson, Neil; Flinders, Jeremy; van Rossum, Barth-Jan; Fersht, Alan R; Oschkinat, Hartmut

2008-08-11

The second WW domain (WW2) of CA150, a human transcriptional activator, forms amyloid fibrils in vitro under physiological conditions. Based on experimental constraints from MAS NMR spectroscopy experiments, alanine scanning and electron microscopy, a structural model of CA150.WW2 amyloid fibrils was calculated earlier. Here, the assignment strategy is presented and suggested as a general approach for proteins that show intermediate line width. The (13)C,(13)C correlation experiments were recorded on fully or partially (13)C-labelled fibrils. The earlier (13)C assignment (26 residues) was extended to 34 of the 40 residues by direct (13)C-excitation experiments by using a deuterated sample that showed strongly improved line width. A 3D HNC-TEDOR (transferred-echo double-resonance) experiment with deuterated CA150.WW2 fibrils yielded 14 amide nitrogen and proton resonance assignments. The obtained chemical shifts were compared with the chemical shifts determined with the natively folded WW domain. TALOS (Torsion angle likelihood obtained from shift and sequence similarity) predictions confirmed that, under physiological conditions, the fibrillar form of CA150.WW2 adopts a significantly different beta structure than the native WW-domain fold.
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets

PubMed Central

Savitski, Mikhail M.; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus

2015-01-01

Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target–decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target–decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The “picked” protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The “picked” target–decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used “classic” protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily

Writing Assignments with a Metacognitive Component Enhance Learning in a Large Introductory Biology Course

PubMed Central

Mynlieff, Michelle; Manogaran, Anita L.; St. Maurice, Martin

2014-01-01

Writing assignments, including note taking and written recall, should enhance retention of knowledge, whereas analytical writing tasks with metacognitive aspects should enhance higher-order thinking. In this study, we assessed how certain writing-intensive “interventions,” such as written exam corrections and peer-reviewed writing assignments using Calibrated Peer Review and including a metacognitive component, improve student learning. We designed and tested the possible benefits of these approaches using control and experimental variables across and between our three-section introductory biology course. Based on assessment, students who corrected exam questions showed significant improvement on postexam assessment compared with their nonparticipating peers. Differences were also observed between students participating in written and discussion-based exercises. Students with low ACT scores benefited equally from written and discussion-based exam corrections, whereas students with midrange to high ACT scores benefited more from written than discussion-based exam corrections. Students scored higher on topics learned via peer-reviewed writing assignments relative to learning in an active classroom discussion or traditional lecture. However, students with low ACT scores (17–23) did not show the same benefit from peer-reviewed written essays as the other students. These changes offer significant student learning benefits with minimal additional effort by the instructors. PMID:26086661
METHODOLOGY AND CALCULATIONS FOR THE ASSIGNMENT OF WASTE GROUPS FOR THE LARGE UNDERGROUND WASTE STORAGE TANKS AT THE HANFORD SITE

DOE Office of Scientific and Technical Information (OSTI.GOV)

WEBER RA

2009-01-16

The Hanford Site contains 177 large underground radioactive waste storage tanks (28 double-shell tanks and 149 single-shell tanks). These tanks are categorized into one of three waste groups (A, B, and C) based on their waste and tank characteristics. These waste group assignments reflect a tank's propensity to retain a significant volume of flammable gases and the potential of the waste to release retained gas by a buoyant displacement gas release event. Assignments of waste groups to the 177 double-shell tanks and single-shell tanks, as reported in this document, are based on a Monte Carlo analysis of three criteria. Themore » first criterion is the headspace flammable gas concentration following release of retained gas. This criterion determines whether the tank contains sufficient retained gas such that the well-mixed headspace flammable gas concentration would reach 100% of the lower flammability limit if the entire tank's retained gas were released. If the volume of retained gas is not sufficient to reach 100% of the lower flammability limit, then flammable conditions cannot be reached and the tank is classified as a waste group C tank independent of the method the gas is released. The second criterion is the energy ratio and considers whether there is sufficient supernatant on top of the saturated solids such that gas-bearing solids have the potential energy required to break up the material and release gas. Tanks that are not waste group C tanks and that have an energy ratio < 3.0 do not have sufficient potential energy to break up material and release gas and are assigned to waste group B. These tanks are considered to represent a potential induced flammable gas release hazard, but no spontaneous buoyant displacement flammable gas release hazard. Tanks that are not waste group C tanks and have an energy ratio {ge} 3.0, but that pass the third criterion (buoyancy ratio < 1.0, see below) are also assigned to waste group B. Even though the designation as a
Correlated motion of protein subdomains and large-scale conformational flexibility of RecA protein filament

NASA Astrophysics Data System (ADS)

Yu, Garmay; A, Shvetsov; D, Karelov; D, Lebedev; A, Radulescu; M, Petukhov; V, Isaev-Ivanov

2012-02-01

Based on X-ray crystallographic data available at Protein Data Bank, we have built molecular dynamics (MD) models of homologous recombinases RecA from E. coli and D. radiodurans. Functional form of RecA enzyme, which is known to be a long helical filament, was approximated by a trimer, simulated in periodic water box. The MD trajectories were analyzed in terms of large-scale conformational motions that could be detectable by neutron and X-ray scattering techniques. The analysis revealed that large-scale RecA monomer dynamics can be described in terms of relative motions of 7 subdomains. Motion of C-terminal domain was the major contributor to the overall dynamics of protein. Principal component analysis (PCA) of the MD trajectories in the atom coordinate space showed that rotation of C-domain is correlated with the conformational changes in the central domain and N-terminal domain, that forms the monomer-monomer interface. Thus, even though C-terminal domain is relatively far from the interface, its orientation is correlated with large-scale filament conformation. PCA of the trajectories in the main chain dihedral angle coordinate space implicates a co-existence of a several different large-scale conformations of the modeled trimer. In order to clarify the relationship of independent domain orientation with large-scale filament conformation, we have performed analysis of independent domain motion and its implications on the filament geometry.
Large-scale serum protein biomarker discovery in Duchenne muscular dystrophy.

PubMed

Hathout, Yetrib; Brody, Edward; Clemens, Paula R; Cripe, Linda; DeLisle, Robert Kirk; Furlong, Pat; Gordish-Dressman, Heather; Hache, Lauren; Henricson, Erik; Hoffman, Eric P; Kobayashi, Yvonne Monique; Lorts, Angela; Mah, Jean K; McDonald, Craig; Mehler, Bob; Nelson, Sally; Nikrad, Malti; Singer, Britta; Steele, Fintan; Sterling, David; Sweeney, H Lee; Williams, Steve; Gold, Larry

2015-06-09

Serum biomarkers in Duchenne muscular dystrophy (DMD) may provide deeper insights into disease pathogenesis, suggest new therapeutic approaches, serve as acute read-outs of drug effects, and be useful as surrogate outcome measures to predict later clinical benefit. In this study a large-scale biomarker discovery was performed on serum samples from patients with DMD and age-matched healthy volunteers using a modified aptamer-based proteomics technology. Levels of 1,125 proteins were quantified in serum samples from two independent DMD cohorts: cohort 1 (The Parent Project Muscular Dystrophy-Cincinnati Children's Hospital Medical Center), 42 patients with DMD and 28 age-matched normal volunteers; and cohort 2 (The Cooperative International Neuromuscular Research Group, Duchenne Natural History Study), 51 patients with DMD and 17 age-matched normal volunteers. Forty-four proteins showed significant differences that were consistent in both cohorts when comparing DMD patients and healthy volunteers at a 1% false-discovery rate, a large number of significant protein changes for such a small study. These biomarkers can be classified by known cellular processes and by age-dependent changes in protein concentration. Our findings demonstrate both the utility of this unbiased biomarker discovery approach and suggest potential new diagnostic and therapeutic avenues for ameliorating the burden of DMD and, we hope, other rare and devastating diseases.
Accelerating large-scale protein structure alignments with graphics processing units

PubMed Central

2012-01-01

Background Large-scale protein structure alignment, an indispensable tool to structural bioinformatics, poses a tremendous challenge on computational resources. To ensure structure alignment accuracy and efficiency, efforts have been made to parallelize traditional alignment algorithms in grid environments. However, these solutions are costly and of limited accessibility. Others trade alignment quality for speedup by using high-level characteristics of structure fragments for structure comparisons. Findings We present ppsAlign, a parallel protein structure Alignment framework designed and optimized to exploit the parallelism of Graphics Processing Units (GPUs). As a general-purpose GPU platform, ppsAlign could take many concurrent methods, such as TM-align and Fr-TM-align, into the parallelized algorithm design. We evaluated ppsAlign on an NVIDIA Tesla C2050 GPU card, and compared it with existing software solutions running on an AMD dual-core CPU. We observed a 36-fold speedup over TM-align, a 65-fold speedup over Fr-TM-align, and a 40-fold speedup over MAMMOTH. Conclusions ppsAlign is a high-performance protein structure alignment tool designed to tackle the computational complexity issues from protein structural data. The solution presented in this paper allows large-scale structure comparisons to be performed using massive parallel computing power of GPU. PMID:22357132
An efficient randomized algorithm for contact-based NMR backbone resonance assignment.

PubMed

Kamisetty, Hetunandan; Bailey-Kellogg, Chris; Pandurangan, Gopal

2006-01-15

Backbone resonance assignment is a critical bottleneck in studies of protein structure, dynamics and interactions by nuclear magnetic resonance (NMR) spectroscopy. A minimalist approach to assignment, which we call 'contact-based', seeks to dramatically reduce experimental time and expense by replacing the standard suite of through-bond experiments with the through-space (nuclear Overhauser enhancement spectroscopy, NOESY) experiment. In the contact-based approach, spectral data are represented in a graph with vertices for putative residues (of unknown relation to the primary sequence) and edges for hypothesized NOESY interactions, such that observed spectral peaks could be explained if the residues were 'close enough'. Due to experimental ambiguity, several incorrect edges can be hypothesized for each spectral peak. An assignment is derived by identifying consistent patterns of edges (e.g. for alpha-helices and beta-sheets) within a graph and by mapping the vertices to the primary sequence. The key algorithmic challenge is to be able to uncover these patterns even when they are obscured by significant noise. This paper develops, analyzes and applies a novel algorithm for the identification of polytopes representing consistent patterns of edges in a corrupted NOESY graph. Our randomized algorithm aggregates simplices into polytopes and fixes inconsistencies with simple local modifications, called rotations, that maintain most of the structure already uncovered. In characterizing the effects of experimental noise, we employ an NMR-specific random graph model in proving that our algorithm gives optimal performance in expected polynomial time, even when the input graph is significantly corrupted. We confirm this analysis in simulation studies with graphs corrupted by up to 500% noise. Finally, we demonstrate the practical application of the algorithm on several experimental beta-sheet datasets. Our approach is able to eliminate a large majority of noise edges and to
XLinkDB 2.0: integrated, large-scale structural analysis of protein crosslinking data

PubMed Central

Schweppe, Devin K.; Zheng, Chunxiang; Chavez, Juan D.; Navare, Arti T.; Wu, Xia; Eng, Jimmy K.; Bruce, James E.

2016-01-01

Motivation: Large-scale chemical cross-linking with mass spectrometry (XL-MS) analyses are quickly becoming a powerful means for high-throughput determination of protein structural information and protein–protein interactions. Recent studies have garnered thousands of cross-linked interactions, yet the field lacks an effective tool to compile experimental data or access the network and structural knowledge for these large scale analyses. We present XLinkDB 2.0 which integrates tools for network analysis, Protein Databank queries, modeling of predicted protein structures and modeling of docked protein structures. The novel, integrated approach of XLinkDB 2.0 enables the holistic analysis of XL-MS protein interaction data without limitation to the cross-linker or analytical system used for the analysis. Availability and Implementation: XLinkDB 2.0 can be found here, including documentation and help: http://xlinkdb.gs.washington.edu/. Contact: jimbruce@uw.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153666
BACHSCORE. A tool for evaluating efficiently and reliably the quality of large sets of protein structures

NASA Astrophysics Data System (ADS)

Sarti, E.; Zamuner, S.; Cossio, P.; Laio, A.; Seno, F.; Trovato, A.

2013-12-01

In protein structure prediction it is of crucial importance, especially at the refinement stage, to score efficiently large sets of models by selecting the ones that are closest to the native state. We here present a new computational tool, BACHSCORE, that allows its users to rank different structural models of the same protein according to their quality, evaluated by using the BACH++ (Bayesian Analysis Conformation Hunt) scoring function. The original BACH statistical potential was already shown to discriminate with very good reliability the protein native state in large sets of misfolded models of the same protein. BACH++ features a novel upgrade in the solvation potential of the scoring function, now computed by adapting the LCPO (Linear Combination of Pairwise Orbitals) algorithm. This change further enhances the already good performance of the scoring function. BACHSCORE can be accessed directly through the web server: bachserver.pd.infn.it. Catalogue identifier: AEQD_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AEQD_v1_0.html Program obtainable from: CPC Program Library, Queen’s University, Belfast, N. Ireland Licensing provisions: GNU General Public License version 3 No. of lines in distributed program, including test data, etc.: 130159 No. of bytes in distributed program, including test data, etc.: 24 687 455 Distribution format: tar.gz Programming language: C++. Computer: Any computer capable of running an executable produced by a g++ compiler (4.6.3 version). Operating system: Linux, Unix OS-es. RAM: 1 073 741 824 bytes Classification: 3. Nature of problem: Evaluate the quality of a protein structural model, taking into account the possible “a priori” knowledge of a reference primary sequence that may be different from the amino-acid sequence of the model; the native protein structure should be recognized as the best model. Solution method: The contact potential scores the occurrence of any given type of residue pair in 5 possible
3D DOSY-TROSY to determine the translational diffusion coefficient of large protein complexes.

PubMed

Didenko, Tatiana; Boelens, Rolf; Rüdiger, Stefan G D

2011-01-01

The translational diffusion coefficient is a sensitive parameter to probe conformational changes in proteins and protein-protein interactions. Pulsed-field gradient NMR spectroscopy allows one to measure the translational diffusion with high accuracy. Two-dimensional (2D) heteronuclear NMR spectroscopy combined with diffusion-ordered spectroscopy (DOSY) provides improved resolution and therefore selectivity when compared with a conventional 1D readout. Here, we show that a combination of selective isotope labelling, 2D ¹H-¹³C methyl-TROSY (transverse relaxation-optimised spectroscopy) and DOSY allows one to study diffusion properties of large protein complexes. We propose that a 3D DOSY-heteronuclear multiple quantum coherence (HMQC) pulse sequence, that uses the TROSY effect of the HMQC sequence for ¹³C methyl-labelled proteins, is highly suitable for measuring the diffusion coefficient of large proteins. We used the 20 kDa co-chaperone p23 as model system to test this 3D DOSY-TROSY technique under various conditions. We determined the diffusion coefficient of p23 in viscous solutions, mimicking large complexes of up to 200 kDa. We found the experimental data to be in excellent agreement with theoretical predictions. To demonstrate the use for complex formation, we applied this technique to record the formation of a complex of p23 with the molecular chaperone Hsp90, which is around 200 kDa. We anticipate that 3D DOSY-TROSY will be a useful tool to study conformational changes in large protein complexes.
Human ESP1/CRP2, a member of the LIM domain protein family: Characterization of the cDNA and assignment of the gene locus to chromosome 14q32.3

DOE Office of Scientific and Technical Information (OSTI.GOV)

Karim, Mohammad Azharul; Ohta, Kohji; Matsuda, Ichiro

1996-01-15

The LIM domain is present in a wide variety of proteins with diverse functions and exhibits characteristic arrangements of Cys and His residues with a novel zinc-binding motif. LIM domain proteins have been implicated in development, cell regulation, and cell structure. A LIM domain protein was identified by screening a human cDNA library with rat cysteine-rich intestinal protein (CRIP) as a probe, under conditions of low stringency. Comparison of the predicted amino acid sequence with several LIM domain proteins revealed 93% of the residues to be identical to rat LIM domain protein, termed ESP1 or CRP2. Thus, the protein ismore » hereafter referred to as human ESP1/CRP2. The cDNA encompasses a 1171-base region, including 26, 624, and 521 bases in the 5{prime}-noncoding region, coding region, and 3{prime}-noncoding regions, respectively, and encodes the entire ESP1/CRP2 protein has two LIM domains, and each shares 35.1% and 77 or 79% identical residues with human cysteine-rich protein (CRP) and rat CRIP, respectively. Northern blot analysis of ESP1/CRP2 in various human tissues showed distinct tissue distributions compared with CRP and CRIP, suggesting that each might serve related but specific roles in tissue organization or function. Using a panel of human-rodent somatic cell hybrids, the ESP1/CRP2 locus was assigned to chromosome 14. Fluorescence in situ hybridization, using cDNA and a genome DNA fragment of the ESP1/CRP2 as probes, confirms this assignment and relegates regional localization to band 14q32.3 47 refs., 7 figs.« less
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.

PubMed

Savitski, Mikhail M; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus

2015-09-01

Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in
BTEC Integrative Assignments.

ERIC Educational Resources Information Center

Foot, G. E.

1992-01-01

To equip electrical engineering students with common and transferable work skills, a program of integrative assignments was created to develop communication and teamwork skills. Discusses assignment components; the log book, a personal account of each assignment; assessment; conversion of "common skills" to competence statements, and…
Investigating the Role of Large-Scale Domain Dynamics in Protein-Protein Interactions.

PubMed

Delaforge, Elise; Milles, Sigrid; Huang, Jie-Rong; Bouvier, Denis; Jensen, Malene Ringkjøbing; Sattler, Michael; Hart, Darren J; Blackledge, Martin

2016-01-01

Intrinsically disordered linkers provide multi-domain proteins with degrees of conformational freedom that are often essential for function. These highly dynamic assemblies represent a significant fraction of all proteomes, and deciphering the physical basis of their interactions represents a considerable challenge. Here we describe the difficulties associated with mapping the large-scale domain dynamics and describe two recent examples where solution state methods, in particular NMR spectroscopy, are used to investigate conformational exchange on very different timescales.
Investigating the Role of Large-Scale Domain Dynamics in Protein-Protein Interactions

PubMed Central

Delaforge, Elise; Milles, Sigrid; Huang, Jie-rong; Bouvier, Denis; Jensen, Malene Ringkjøbing; Sattler, Michael; Hart, Darren J.; Blackledge, Martin

2016-01-01

Intrinsically disordered linkers provide multi-domain proteins with degrees of conformational freedom that are often essential for function. These highly dynamic assemblies represent a significant fraction of all proteomes, and deciphering the physical basis of their interactions represents a considerable challenge. Here we describe the difficulties associated with mapping the large-scale domain dynamics and describe two recent examples where solution state methods, in particular NMR spectroscopy, are used to investigate conformational exchange on very different timescales. PMID:27679800
The Personal Response: A Novel Writing Assignment to Engage First Year Students in Large Human Biology Classes

ERIC Educational Resources Information Center

Moni, Roger W.; Moni, Karen B.; Poronnik, Philip

2007-01-01

The teaching of highly valued scientific writing skills in the first year of university is challenging. This report describes the design, implementation, and evaluation of a novel written assignment, "The Personal Response" and accompanying Peer Review, in the course, Human Biology (BIOL1015) at The University of Queensland. These assignments were…
Luminescent conjugated oligothiophenes for sensitive fluorescent assignment of protein inclusion bodies.

PubMed

Klingstedt, Therése; Blechschmidt, Cristiane; Nogalska, Anna; Prokop, Stefan; Häggqvist, Bo; Danielsson, Olof; Engel, W King; Askanas, Valerie; Heppner, Frank L; Nilsson, K Peter R

2013-03-18

Small hydrophobic ligands identifying intracellular protein deposits are of great interest, as protein inclusion bodies are the pathological hallmark of several degenerative diseases. Here we report that fluorescent amyloid ligands, termed luminescent conjugated oligothiophenes (LCOs), rapidly and with high sensitivity detect protein inclusion bodies in skeletal muscle tissue from patients with sporadic inclusion body myositis (s-IBM). LCOs having a conjugated backbone of at least five thiophene units emitted strong fluorescence upon binding, and showed co-localization with proteins reported to accumulate in s-IBM protein inclusion bodies. Compared with conventional amyloid ligands, LCOs identified a larger fraction of immunopositive inclusion bodies. When the conjugated thiophene backbone was extended with terminal carboxyl groups, the LCO revealed striking spectral differences between distinct protein inclusion bodies. We conclude that 1) LCOs are sensitive, rapid and powerful tools for identifying protein inclusion bodies and 2) LCOs identify a wider range of protein inclusion bodies than conventional amyloid ligands. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Luminescent Conjugated Oligothiophenes for Sensitive Fluorescent Assignment of Protein Inclusion Bodies

PubMed Central

Klingstedt, Therése; Blechschmidt, Cristiane; Nogalska, Anna; Prokop, Stefan; Häggqvist, Bo; Danielsson, Olof; Engel, W King; Askanas, Valerie; Heppner, Frank L; Nilsson, K Peter R

2013-01-01

Small hydrophobic ligands identifying intracellular protein deposits are of great interest, as protein inclusion bodies are the pathological hallmark of several degenerative diseases. Here we report that fluorescent amyloid ligands, termed luminescent conjugated oligothiophenes (LCOs), rapidly and with high sensitivity detect protein inclusion bodies in skeletal muscle tissue from patients with sporadic inclusion body myositis (s-IBM). LCOs having a conjugated backbone of at least five thiophene units emitted strong fluorescence upon binding, and showed co-localization with proteins reported to accumulate in s-IBM protein inclusion bodies. Compared with conventional amyloid ligands, LCOs identified a larger fraction of immunopositive inclusion bodies. When the conjugated thiophene backbone was extended with terminal carboxyl groups, the LCO revealed striking spectral differences between distinct protein inclusion bodies. We conclude that 1) LCOs are sensitive, rapid and powerful tools for identifying protein inclusion bodies and 2) LCOs identify a wider range of protein inclusion bodies than conventional amyloid ligands. PMID:23450708
Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides*

PubMed Central

Bogdanow, Boris; Zauber, Henrik; Selbach, Matthias

2016-01-01

The principle of shotgun proteomics is to use peptide mass spectra in order to identify corresponding sequences in a protein database. The quality of peptide and protein identification and quantification critically depends on the sensitivity and specificity of this assignment process. Many peptides in proteomic samples carry biochemical modifications, and a large fraction of unassigned spectra arise from modified peptides. Spectra derived from modified peptides can erroneously be assigned to wrong amino acid sequences. However, the impact of this problem on proteomic data has not yet been investigated systematically. Here we use combinations of different database searches to show that modified peptides can be responsible for 20–50% of false positive identifications in deep proteomic data sets. These false positive hits are particularly problematic as they have significantly higher scores and higher intensities than other false positive matches. Furthermore, these wrong peptide assignments lead to hundreds of false protein identifications and systematic biases in protein quantification. We devise a “cleaned search” strategy to address this problem and show that this considerably improves the sensitivity and specificity of proteomic data. In summary, we show that modified peptides cause systematic errors in peptide and protein identification and quantification and should therefore be considered to further improve the quality of proteomic data annotation. PMID:27215553
Automatic Assignment of Methyl-NMR Spectra of Supramolecular Machines Using Graph Theory.

PubMed

Pritišanac, Iva; Degiacomi, Matteo T; Alderson, T Reid; Carneiro, Marta G; Ab, Eiso; Siegal, Gregg; Baldwin, Andrew J

2017-07-19

Methyl groups are powerful probes for the analysis of structure, dynamics and function of supramolecular assemblies, using both solution- and solid-state NMR. Widespread application of the methodology has been limited due to the challenges associated with assigning spectral resonances to specific locations within a biomolecule. Here, we present Methyl Assignment by Graph Matching (MAGMA), for the automatic assignment of methyl resonances. A graph matching protocol examines all possibilities for each resonance in order to determine an exact assignment that includes a complete description of any ambiguity. MAGMA gives 100% accuracy in confident assignments when tested against both synthetic data, and 9 cross-validated examples using both solution- and solid-state NMR data. We show that this remarkable accuracy enables a user to distinguish between alternative protein structures. In a drug discovery application on HSP90, we show the method can rapidly and efficiently distinguish between possible ligand binding modes. By providing an exact and robust solution to methyl resonance assignment, MAGMA can facilitate significantly accelerated studies of supramolecular machines using methyl-based NMR spectroscopy.
Effortless assignment with 4D covariance sequential correlation maps.

PubMed

Harden, Bradley J; Mishra, Subrata H; Frueh, Dominique P

2015-11-01

Traditional Nuclear Magnetic Resonance (NMR) assignment procedures for proteins rely on preliminary peak-picking to identify and label NMR signals. However, such an approach has severe limitations when signals are erroneously labeled or completely neglected. The consequences are especially grave for proteins with substantial peak overlap, and mistakes can often thwart entire projects. To overcome these limitations, we previously introduced an assignment technique that bypasses traditional pick peaking altogether. Covariance Sequential Correlation Maps (COSCOMs) transform the indirect connectivity information provided by multiple 3D backbone spectra into direct (H, N) to (H, N) correlations. Here, we present an updated method that utilizes a single four-dimensional spectrum rather than a suite of three-dimensional spectra. We demonstrate the advantages of 4D-COSCOMs relative to their 3D counterparts. We introduce improvements accelerating their calculation. We discuss practical considerations affecting their quality. And finally we showcase their utility in the context of a 52 kDa cyclization domain from a non-ribosomal peptide synthetase. Copyright © 2015 Elsevier Inc. All rights reserved.

Transport of large breakdown products of dietary protein through the gut wall.

PubMed Central

Hemmings, W A; Williams, E W

1978-01-01

Ferritin or tritium labelled immunoglobulin G may, by electron microscopy, be demonstrated entering, within, and leaving the epithelial cells. Quantitative studies using various proteins labelled with radioiodine show that large amounts of protein bound radioactivity may be demonstrated in the tissues after feeding the labelled protein to adult rats by stomach tube. The molecular size of this material as determined by sugar gradient ultracentrifugation of tissue extracts ranges when IgG is fed from 50,000-20,000 Daltons. The material retains its ability to react as antigen with antisera specific to the original molecule: precipitation reactions may be obtained in gels and quantitative studies show that cnosiderable amounts of the protein-bound radioactivity are still specifically precipitable. Such studies have been carried out with alpha-gliadin as well as bovine IgG. At 100 days old rats may absorb as much as 40% of a dose of bovine IgG in the form of these large molecular breakdown products. PMID:680603
Light-fuelled transport of large dendrimers and proteins.

PubMed

Koskela, Jenni E; Liljeström, Ville; Lim, Jongdoo; Simanek, Eric E; Ras, Robin H A; Priimagi, Arri; Kostiainen, Mauri A

2014-05-14

This work presents a facile water-based supramolecular approach for light-induced surface patterning. The method is based upon azobenzene-functionalized high-molecular weight triazine dendrimers up to generation 9, demonstrating that even very large globular supramolecular complexes can be made to move in response to light. We also demonstrate light-fuelled macroscopic movements in native biomolecules, showing that complexes of apoferritin protein and azobenzene can effectively form light-induced surface patterns. Fundamentally, the results establish that thin films comprising both flexible and rigid globular particles of large diameter can be moved with light, whereas the presented material concepts offer new possibilities for the yet marginally explored biological applications of azobenzene surface patterning.
Gcn4-Mediator Specificity Is Mediated by a Large and Dynamic Fuzzy Protein-Protein Complex.

PubMed

Tuttle, Lisa M; Pacheco, Derek; Warfield, Linda; Luo, Jie; Ranish, Jeff; Hahn, Steven; Klevit, Rachel E

2018-03-20

Transcription activation domains (ADs) are inherently disordered proteins that often target multiple coactivator complexes, but the specificity of these interactions is not understood. Efficient transcription activation by yeast Gcn4 requires its tandem ADs and four activator-binding domains (ABDs) on its target, the Mediator subunit Med15. Multiple ABDs are a common feature of coactivator complexes. We find that the large Gcn4-Med15 complex is heterogeneous and contains nearly all possible AD-ABD interactions. Gcn4-Med15 forms via a dynamic fuzzy protein-protein interface, where ADs bind the ABDs in multiple orientations via hydrophobic regions that gain helicity. This combinatorial mechanism allows individual low-affinity and specificity interactions to generate a biologically functional, specific, and higher affinity complex despite lacking a defined protein-protein interface. This binding strategy is likely representative of many activators that target multiple coactivators, as it allows great flexibility in combinations of activators that can cooperate to regulate genes with variable coactivator requirements. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Site-directed DNA crosslinking of large multisubunit protein-DNA complexes.

PubMed

Persinger, Jim; Bartholomew, Blaine

2009-01-01

Several methods have been developed to site-specifically incorporate photoreactive nucleotide analogs into DNA for the purpose of identifying the proteins and their domains that are in contact with particular regions of DNA. The synthesis of several deoxynucleotide analogs that have a photoreactive group tethered to the nucleotide base and the incorporation of these analogs into DNA are described. In a second approach, oligonucleotide with a photoreactive group attached to the phosphate backbone is chemically synthesized. The photoreactive oligonucleotide is then enzymatically incorporated into DNA by annealing it to a complementary DNA template and extending with DNA polymerase. Both approaches have been effectively used to map protein-DNA interactions in large multisubunit complexes such as the eukaryotic transcription or ATP-dependent chromatin remodeling complexes. Not only do these techniques map the binding sites of the various subunits in these complexes, but when coupled with peptide mapping also determine the protein domain that is in close proximity to the different DNA sites. The strength of these techniques is the ability to scan a large number of potential sites by making combinations of different DNA probes and is facilitated by using an immobilized DNA template for synthesis.
An Interactive Introduction to Protein Structure

ERIC Educational Resources Information Center

Lee, W. Theodore

2004-01-01

To improve student understanding of protein structure and the significance of noncovalent interactions in protein structure and function, students are assigned a project to write a paper complemented with computer-generated images. The assignment provides an opportunity for students to select a protein structure that is of interest and detail…
Revealing the global map of protein folding space by large-scale simulations

NASA Astrophysics Data System (ADS)

Sinner, Claude; Lutz, Benjamin; Verma, Abhinav; Schug, Alexander

2015-12-01

The full characterization of protein folding is a remarkable long-standing challenge both for experiment and simulation. Working towards a complete understanding of this process, one needs to cover the full diversity of existing folds and identify the general principles driving the process. Here, we want to understand and quantify the diversity in folding routes for a large and representative set of protein topologies covering the full range from all alpha helical topologies towards beta barrels guided by the key question: Does the majority of the observed routes contribute to the folding process or only a particular route? We identified a set of two-state folders among non-homologous proteins with a sequence length of 40-120 residues. For each of these proteins, we ran native-structure based simulations both with homogeneous and heterogeneous contact potentials. For each protein, we simulated dozens of folding transitions in continuous uninterrupted simulations and constructed a large database of kinetic parameters. We investigate folding routes by tracking the formation of tertiary structure interfaces and discuss whether a single specific route exists for a topology or if all routes are equiprobable. These results permit us to characterize the complete folding space for small proteins in terms of folding barrier ΔG‡, number of routes, and the route specificity RT.
Using Clouds for MapReduce Measurement Assignments

ERIC Educational Resources Information Center

Rabkin, Ariel; Reiss, Charles; Katz, Randy; Patterson, David

2013-01-01

We describe our experiences teaching MapReduce in a large undergraduate lecture course using public cloud services and the standard Hadoop API. Using the standard API, students directly experienced the quality of industrial big-data tools. Using the cloud, every student could carry out scalability benchmarking assignments on realistic hardware,…
Large-scale protein-protein interactions detection by integrating big biosensing data with computational model.

PubMed

You, Zhu-Hong; Li, Shuai; Gao, Xin; Luo, Xin; Ji, Zhen

2014-01-01

Protein-protein interactions are the basis of biological functions, and studying these interactions on a molecular level is of crucial importance for understanding the functionality of a living cell. During the past decade, biosensors have emerged as an important tool for the high-throughput identification of proteins and their interactions. However, the high-throughput experimental methods for identifying PPIs are both time-consuming and expensive. On the other hand, high-throughput PPI data are often associated with high false-positive and high false-negative rates. Targeting at these problems, we propose a method for PPI detection by integrating biosensor-based PPI data with a novel computational model. This method was developed based on the algorithm of extreme learning machine combined with a novel representation of protein sequence descriptor. When performed on the large-scale human protein interaction dataset, the proposed method achieved 84.8% prediction accuracy with 84.08% sensitivity at the specificity of 85.53%. We conducted more extensive experiments to compare the proposed method with the state-of-the-art techniques, support vector machine. The achieved results demonstrate that our approach is very promising for detecting new PPIs, and it can be a helpful supplement for biosensor-based PPI data detection.
Comparing Looping Teacher-Assigned and Traditional Teacher-Assigned Student Achievement Scores

ERIC Educational Resources Information Center

Lloyd, Melissa C.

2014-01-01

A problem in many elementary schools is determining which teacher assignment strategy best promotes the academic progress of students. To find and implement educational practices that address the academic needs of all learners, schools need research-based data focusing on the 2 teacher assignment strategies: looping assignment (LA) and traditional…
¹H, ¹³C and ¹⁵N resonance assignment for the human K-Ras at physiological pH.

PubMed

Vo, Uybach; Embrey, Kevin J; Breeze, Alexander L; Golovanov, Alexander P

2013-10-01

K-Ras, a member of the Ras family of small GTPases, is involved in cell growth, proliferation, differentiation and apoptosis and is frequently mutated in cancer. The activity of Ras is mediated by the inter-conversion between GTP- and GDP- bound states. This conversion is regulated by binding of effector proteins such as guanine nucleotide exchange factors and GTPase activating proteins. Previously, NMR signals from these effector-binding regions of Ras often remained unassigned and largely unobservable due to conformational exchange and polysterism inherent to this protein. In this paper, we report the complete backbone and C(β), as well as partial H(α), H(β) and C(γ), NMR assignment for human K-Ras (residues 1-166) in the GDP-bound form at a physiological pH of 7.4. These data thereby make possible detailed monitoring of the functional cycle of Ras and its interactions with nucleotides and effector proteins through the observation of fingerprint signals from all the functionally important regions of the protein.
Grouping puts figure-ground assignment in context by constraining propagation of edge-assignment

PubMed Central

Brooks, Joseph L.; Driver, Jon

2010-01-01

Figure-ground organization involves assignment of edges to a figural shape on one or the other side of each dividing edge. Established visual cues for edge-assignment primarily concern relatively local rather than contextual factors. Here we show that assignment for a locally-unbiased edge can be affected by assignment of a remote contextual edge that has its own locally-biased assignment. We find that such propagation of edge-assignment from the biased remote context occurs only when the biased and unbiased edges are grouped. This new principle, whereby grouping constrains propagation of figural edge-assignment, emerges from both subjective reports and from an objective short-term edge-matching task. It generalizes from moving displays involving grouping by common fate and collinearity, to static displays with grouping by similarity of edge-contrast polarity, or apparent occlusion. Our results identify a new contextual influence upon edge-assignment. They also identify a new mechanistic relation between grouping and figure-ground processes, whereby grouping between remote elements can constrain propagation of edge-assignment between those elements. PMID:20436200
Large-Scale Validation of Mixed-Solvent Simulations to Assess Hotspots at Protein-Protein Interaction Interfaces.

PubMed

Ghanakota, Phani; van Vlijmen, Herman; Sherman, Woody; Beuming, Thijs

2018-04-23

The ability to target protein-protein interactions (PPIs) with small molecule inhibitors offers great promise in expanding the druggable target space and addressing a broad range of untreated diseases. However, due to their nature and function of interacting with protein partners, PPI interfaces tend to extend over large surfaces without the typical pockets of enzymes and receptors. These features present unique challenges for small molecule inhibitor design. As such, determining whether a particular PPI of interest could be pursued with a small molecule discovery strategy requires an understanding of the characteristics of the PPI interface and whether it has hotspots that can be leveraged by small molecules to achieve desired potency. Here, we assess the ability of mixed-solvent molecular dynamic (MSMD) simulations to detect hotspots at PPI interfaces. MSMD simulations using three cosolvents (acetonitrile, isopropanol, and pyrimidine) were performed on a large test set of 21 PPI targets that have been experimentally validated by small molecule inhibitors. We compare MSMD, which includes explicit solvent and full protein flexibility, to a simpler approach that does not include dynamics or explicit solvent (SiteMap) and find that MSMD simulations reveal additional information about the characteristics of these targets and the ability for small molecules to inhibit the PPI interface. In the few cases were MSMD simulations did not detect hotspots, we explore the shortcomings of this technique and propose future improvements. Finally, using Interleukin-2 as an example, we highlight the advantage of the MSMD approach for detecting transient cryptic druggable pockets that exists at PPI interfaces.
Large protein as a potential target for use in rabies diagnostics.

PubMed

Santos Katz, I S; Dias, M H; Lima, I F; Chaves, L B; Ribeiro, O G; Scheffer, K C; Iwai, L K

Rabies is a zoonotic viral disease that remains a serious threat to public health worldwide. The rabies lyssavirus (RABV) genome encodes five structural proteins, multifunctional and significant for pathogenicity. The large protein (L) presents well-conserved genomic regions, which may be a good alternative to generate informative datasets for development of new methods for rabies diagnosis. This paper describes the development of a technique for the identification of L protein in several RABV strains from different hosts, demonstrating that MS-based proteomics is a potential method for antigen identification and a good alternative for rabies diagnosis.
Large-scale purification and biochemical characterization of crystallization-grade porin protein P from Pseudomonas aeruginosa.

PubMed

Worobec, E A; Martin, N L; McCubbin, W D; Kay, C M; Brayer, G D; Hancock, R E

1988-04-07

A large-scale purification scheme was developed for lipopolysaccharide-free protein P, the phosphate-starvation-inducible outer-membrane porin from Pseudomonas aeruginosa. This highly purified protein P was used to successfully form hexagonal crystals in the presence of n-octyl-beta-glucopyranoside. Amino-acid analysis indicated that protein P had a similar composition to other bacterial outer membrane proteins, containing a high percentage (50%) of hydrophilic residues. The amino-terminal sequence of this protein, although not homologous to either outer membrane protein, PhoE or OmpF, of Escherichia coli, was found to have an analogous protein-folding pattern. Protein P in the native trimer form was capable of maintaining a stable functional trimer after proteinase cleavage. This suggested the existence of a strongly associated tertiary and quaternary structure. Circular dichroism studies confirmed these results in that a large proportion of the protein structure was determined to be beta-sheet and resistant to acid pH and heating in 0.1% sodium dodecyl sulphate.
TROSY of side-chain amides in large proteins

PubMed Central

Liu, Aizhuo; Yao, Lishan; Li, Yue; Yan, Honggao

2012-01-01

By using the mixed solvent of 50% H2O/50% D2O and employing deuterium decoupling, TROSY experiments exclusively detect NMR signals from semideuterated isotopomers of carboxamide groups with high sensitivities for proteins with molecular weights up to 80 kDa. This isotopomer-selective strategy extends TROSY experiments from exclusively detecting backbone to both backbone and side-chain amides, particularly in large proteins. Because of differences in both TROSY effect and dynamics between 15N–HE{DZ} and 15N–HZ{DE} isotopomers of the same carboxamide, the 15N transverse magnetization of the latter relaxes significantly faster than that of the former, which provides a direct and reliable stereospecific distinction between the two configurations. The TROSY effects on the 15N–HE{DZ} isotopomers of side-chain amides are as significant as on backbone amides. PMID:17347000
Grouping puts figure-ground assignment in context by constraining propagation of edge assignment.

PubMed

Brooks, Joseph L; Brook, Joseph L; Driver, Jon

2010-05-01

Figure-ground organization involves the assignment of edges to a figural shape on one or the other side of each dividing edge. Established visual cues for edge assignment primarily concern relatively local rather than contextual factors. In the present article, we show that an assignment for a locally unbiased edge can be affected by an assignment of a remote contextual edge that has its own locally biased assignment. We find that such propagation of edge assignment from the biased remote context occurs only when the biased and unbiased edges are grouped. This new principle, whereby grouping constrains the propagation of figural edge assignment, emerges from both subjective reports and an objective short-term edge-matching task. It generalizes from moving displays involving grouping by common fate and collinearity, to static displays with grouping by similarity of edge-contrast polarity, or apparent occlusion. Our results identify a new contextual influence on edge assignment. They also identify a new mechanistic relation between grouping and figure-ground processes, whereby grouping between remote elements can constrain the propagation of edge assignment between those elements. Supplemental materials for this article may be downloaded from http://app.psychonomic-journals.org/content/supplemental.
An Investigation of the Partial-Assignment Completion Effect on Students' Assignment Choice Behavior

ERIC Educational Resources Information Center

Hawthorn-Embree, Meredith L.; Skinner, Christopher H.; Parkhurst, John; Conley, Elisha

2011-01-01

This study was designed to investigate the partial assignment completion effect. Seventh-grade students were given a math assignment. After working for 5 min, they were interrupted and their partially completed assignments were collected. About 20 min later, students were given their partially completed assignment and a new, control assignment…
Large scale crystallization of protein pharmaceuticals in microgravity via temperature change

NASA Technical Reports Server (NTRS)

Long, Marianna M.

1992-01-01

The major objective of this research effort is the temperature driven growth of protein crystals in large batches in the microgravity environment of space. Pharmaceutical houses are developing protein products for patient care, for example, human insulin, human growth hormone, interferons, and tissue plasminogen activator or TPA, the clot buster for heart attack victims. Except for insulin, these are very high value products; they are extremely potent in small quantities and have a great value per gram of material. It is feasible that microgravity crystallization can be a cost recoverable, economically sound final processing step in their manufacture. Large scale protein crystal growth in microgravity has significant advantages from the basic science and the applied science standpoints. Crystal growth can proceed unhindered due to lack of surface effects. Dynamic control is possible and relatively easy. The method has the potential to yield large quantities of pure crystalline product. Crystallization is a time honored procedure for purifying organic materials and microgravity crystallization could be the final step to remove trace impurities from high value protein pharmaceuticals. In addition, microgravity grown crystals could be the final formulation for those medicines that need to be administered in a timed release fashion. Long lasting insulin, insulin lente, is such a product. Also crystalline protein pharmaceuticals are more stable for long-term storage. Temperature, as the initiation step, has certain advantages. Again, dynamic control of the crystallization process is possible and easy. A temperature step is non-invasive and is the most subtle way to control protein solubility and therefore crystallization. Seeding is not necessary. Changes in protein and precipitant concentrations and pH are not necessary. Finally, this method represents a new way to crystallize proteins in space that takes advantage of the unique microgravity environment. The results
Identification of a cytoplasmic interaction partner of the large regulatory proteins Rep78/Rep68 of adeno-associated virus type 2 (AAV-2)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Weger, Stefan; Hammer, Eva; Goetz, Anne

2007-05-25

Through yeast two-hybrid analysis and coimmunoprecipitation studies, we have identified a novel cellular AAV-2 Rep78/Rep68 interaction partner located predominantly in the cytoplasm. In public databases, it has been assigned as KCTD5, because of a region of high similarity to the cytoplasmic tetramerization domain of voltage-gated potassium channels. Whereas Rep/KCTD5 interaction relied on the region surrounding the Rep nuclear localization signal, nuclear accumulation of Rep was not required. Wildtype Rep78/Rep68 proteins induced the translocation of large portions of KCTD5 into the nucleus pointing to functional interactions both in the cytoplasm and the nucleus. In line with an anticipated functional interference inmore » the cytoplasm, KCTD5 overexpression completely abrogated Rep68-mediated posttranscriptional activation of a HIV-LTR driven luciferase reporter gene. Our study expands the panel of already identified nuclear Rep interaction partners to a cytoplasmic protein, which raises the awareness that important steps in the AAV life cycle may be regulated in this compartment.« less
Protein-Fragment Complementation Assays for Large-Scale Analysis, Functional Dissection, and Spatiotemporal Dynamic Studies of Protein-Protein Interactions in Living Cells.

PubMed

Michnick, Stephen W; Landry, Christian R; Levy, Emmanuel D; Diss, Guillaume; Ear, Po Hien; Kowarzyk, Jacqueline; Malleshaiah, Mohan K; Messier, Vincent; Tchekanda, Emmanuelle

2016-11-01

Protein-fragment complementation assays (PCAs) comprise a family of assays that can be used to study protein-protein interactions (PPIs), conformation changes, and protein complex dimensions. We developed PCAs to provide simple and direct methods for the study of PPIs in any living cell, subcellular compartments or membranes, multicellular organisms, or in vitro. Because they are complete assays, requiring no cell-specific components other than reporter fragments, they can be applied in any context. PCAs provide a general strategy for the detection of proteins expressed at endogenous levels within appropriate subcellular compartments and with normal posttranslational modifications, in virtually any cell type or organism under any conditions. Here we introduce a number of applications of PCAs in budding yeast, Saccharomyces cerevisiae These applications represent the full range of PPI characteristics that might be studied, from simple detection on a large scale to visualization of spatiotemporal dynamics. © 2016 Cold Spring Harbor Laboratory Press.

An automated framework for NMR resonance assignment through simultaneous slice picking and spin system forming.

PubMed

Abbas, Ahmed; Guo, Xianrong; Jing, Bing-Yi; Gao, Xin

2014-06-01

Despite significant advances in automated nuclear magnetic resonance-based protein structure determination, the high numbers of false positives and false negatives among the peaks selected by fully automated methods remain a problem. These false positives and negatives impair the performance of resonance assignment methods. One of the main reasons for this problem is that the computational research community often considers peak picking and resonance assignment to be two separate problems, whereas spectroscopists use expert knowledge to pick peaks and assign their resonances at the same time. We propose a novel framework that simultaneously conducts slice picking and spin system forming, an essential step in resonance assignment. Our framework then employs a genetic algorithm, directed by both connectivity information and amino acid typing information from the spin systems, to assign the spin systems to residues. The inputs to our framework can be as few as two commonly used spectra, i.e., CBCA(CO)NH and HNCACB. Different from the existing peak picking and resonance assignment methods that treat peaks as the units, our method is based on 'slices', which are one-dimensional vectors in three-dimensional spectra that correspond to certain ([Formula: see text]) values. Experimental results on both benchmark simulated data sets and four real protein data sets demonstrate that our method significantly outperforms the state-of-the-art methods while using a less number of spectra than those methods. Our method is freely available at http://sfb.kaust.edu.sa/Pages/Software.aspx.
Rewarding psychiatric aides for the behavioral improvement of assigned patients1

PubMed Central

Pomerleau, Ovide F.; Bobrove, Philip H.; Smith, Rita H.

1973-01-01

Different ways of modifying the aide-patient relationship to promote improvement in psychiatric patients were investigated. Psychiatric aides were given information about the behavior of assigned patients, cash awards based on the improvement of assigned patients, and different kinds of supervision by the psychology staff; the effects of these variables on a large number of psychiatrically relevant behaviors were measured. Appropriate behavior of patients increased when the aides were given quantitative information about the improvement of assigned patients. Cash awards for aides, which were not contingent on the behavior of patients had little effect, while cash awards contingent on the behavior of assigned patients were associated with more appropriate behavior. Direct supervision of aide-patient interactions was associated with an increase in appropriate behavior, while required consultation for the aides about assigned patients was not. Behavior of patients deteriorated when the program was terminated. PMID:16795420
Interactive visual exploration and refinement of cluster assignments.

PubMed

Kern, Michael; Lex, Alexander; Gehlenborg, Nils; Johnson, Chris R

2017-09-12

With ever-increasing amounts of data produced in biology research, scientists are in need of efficient data analysis methods. Cluster analysis, combined with visualization of the results, is one such method that can be used to make sense of large data volumes. At the same time, cluster analysis is known to be imperfect and depends on the choice of algorithms, parameters, and distance measures. Most clustering algorithms don't properly account for ambiguity in the source data, as records are often assigned to discrete clusters, even if an assignment is unclear. While there are metrics and visualization techniques that allow analysts to compare clusterings or to judge cluster quality, there is no comprehensive method that allows analysts to evaluate, compare, and refine cluster assignments based on the source data, derived scores, and contextual data. In this paper, we introduce a method that explicitly visualizes the quality of cluster assignments, allows comparisons of clustering results and enables analysts to manually curate and refine cluster assignments. Our methods are applicable to matrix data clustered with partitional, hierarchical, and fuzzy clustering algorithms. Furthermore, we enable analysts to explore clustering results in context of other data, for example, to observe whether a clustering of genomic data results in a meaningful differentiation in phenotypes. Our methods are integrated into Caleydo StratomeX, a popular, web-based, disease subtype analysis tool. We show in a usage scenario that our approach can reveal ambiguities in cluster assignments and produce improved clusterings that better differentiate genotypes and phenotypes.
Development of an automated large-scale protein-crystallization and monitoring system for high-throughput protein-structure analyses.

PubMed

Hiraki, Masahiko; Kato, Ryuichi; Nagai, Minoru; Satoh, Tadashi; Hirano, Satoshi; Ihara, Kentaro; Kudo, Norio; Nagae, Masamichi; Kobayashi, Masanori; Inoue, Michio; Uejima, Tamami; Oda, Shunichiro; Chavas, Leonard M G; Akutsu, Masato; Yamada, Yusuke; Kawasaki, Masato; Matsugaki, Naohiro; Igarashi, Noriyuki; Suzuki, Mamoru; Wakatsuki, Soichi

2006-09-01

Protein crystallization remains one of the bottlenecks in crystallographic analysis of macromolecules. An automated large-scale protein-crystallization system named PXS has been developed consisting of the following subsystems, which proceed in parallel under unified control software: dispensing precipitants and protein solutions, sealing crystallization plates, carrying robot, incubators, observation system and image-storage server. A sitting-drop crystallization plate specialized for PXS has also been designed and developed. PXS can set up 7680 drops for vapour diffusion per hour, which includes time for replenishing supplies such as disposable tips and crystallization plates. Images of the crystallization drops are automatically recorded according to a preprogrammed schedule and can be viewed by users remotely using web-based browser software. A number of protein crystals were successfully produced and several protein structures could be determined directly from crystals grown by PXS. In other cases, X-ray quality crystals were obtained by further optimization by manual screening based on the conditions found by PXS.
Thermodynamic database for proteins: features and applications.

PubMed

Gromiha, M Michael; Sarai, Akinori

2010-01-01

We have developed a thermodynamic database for proteins and mutants, ProTherm, which is a collection of a large number of thermodynamic data on protein stability along with the sequence and structure information, experimental methods and conditions, and literature information. This is a valuable resource for understanding/predicting the stability of proteins, and it can be accessible at http://www.gibk26.bse.kyutech.ac.jp/jouhou/Protherm/protherm.html . ProTherm has several features including various search, display, and sorting options and visualization tools. We have analyzed the data in ProTherm to examine the relationship among thermodynamics, structure, and function of proteins. We describe the progress on the development of methods for understanding/predicting protein stability, such as (i) relationship between the stability of protein mutants and amino acid properties, (ii) average assignment method, (iii) empirical energy functions, (iv) torsion, distance, and contact potentials, and (v) machine learning techniques. The list of online resources for predicting protein stability has also been provided.
NMR assignment of a PDZ domain in complex with a HPV51 E6 derived N-terminally pyroglutamic acid modified peptide.

PubMed

Mischo, André; Ohlenschläger, Oliver; Ramachandran, Ramadurai; Görlach, Matthias

2013-04-01

The resonance assignment of an amino-terminal pyroglutamic acid containing peptide derived from the E6 protein of human papillomavirus (HPV) type 51 in complex with PDZ domain 2 of hDlg/SAP-97 is reported. The assignments include (1)H, (13)C and (15)N resonances for the protein and peptide in the complex and all of the peptide's pyroglutamic acid nuclei.
Simultaneous display of two large proteins on the head and tail of bacteriophage lambda

PubMed Central

2013-01-01

Background Consistent progress in the development of bacteriophage lambda display platform as an alternative to filamentous phage display system was achieved in the recent years. The lambda phage has been engineered to display efficiently multiple copies of peptides or even large protein domains providing a powerful tool for screening libraries of peptides, proteins and cDNA. Results In the present work we describe an original method for dual display of large proteins on the surface of lambda particles. An anti-CEA single-chain antibody fragment and green fluorescent protein or alkaline phosphatase were simultaneously displayed by engineering both gpD and gpV lambda proteins. Conclusions Here we show that such modified phage particles can be used for the detection of target molecules in vitro and in vivo. Dual expression of functional moieties on the surface of the lambda phage might open the way to generation of a new class of diagnostic and therapeutic targeted nanoparticles. PMID:24073829
Simultaneous display of two large proteins on the head and tail of bacteriophage lambda.

PubMed

Pavoni, Emiliano; Vaccaro, Paola; D'Alessio, Valeria; De Santis, Rita; Minenkova, Olga

2013-09-30

Consistent progress in the development of bacteriophage lambda display platform as an alternative to filamentous phage display system was achieved in the recent years. The lambda phage has been engineered to display efficiently multiple copies of peptides or even large protein domains providing a powerful tool for screening libraries of peptides, proteins and cDNA. In the present work we describe an original method for dual display of large proteins on the surface of lambda particles. An anti-CEA single-chain antibody fragment and green fluorescent protein or alkaline phosphatase were simultaneously displayed by engineering both gpD and gpV lambda proteins. Here we show that such modified phage particles can be used for the detection of target molecules in vitro and in vivo. Dual expression of functional moieties on the surface of the lambda phage might open the way to generation of a new class of diagnostic and therapeutic targeted nanoparticles.
Monomeric red fluorescent proteins with a large Stokes shift.

PubMed

Piatkevich, Kiryl D; Hulit, James; Subach, Oksana M; Wu, Bin; Abdulla, Arian; Segall, Jeffrey E; Verkhusha, Vladislav V

2010-03-23

Two-photon microscopy has advanced fluorescence imaging of cellular processes in living animals. Fluorescent proteins in the blue-green wavelength range are widely used in two-photon microscopy; however, the use of red fluorescent proteins is limited by the low power output of Ti-Sapphire lasers above 1,000 nm. To overcome this limitation we have developed two red fluorescent proteins, LSS-mKate1 and LSS-mKate2, which possess large Stokes shifts with excitation/emission maxima at 463/624 and 460/605 nm, respectively. These LSS-mKates are characterized by high pH stability, photostability, rapid chromophore maturation, and monomeric behavior. They lack absorbance in the green region, providing an additional red color to the commonly used red fluorescent proteins. Substantial overlap between the two-photon excitation spectra of the LSS-mKates and blue-green fluorophores enables multicolor imaging using a single laser. We applied this approach to a mouse xenograft model of breast cancer to intravitally study the motility and Golgi-nucleus alignment of tumor cells as a function of their distance from blood vessels. Our data indicate that within 40 mum the breast cancer cells show significant polarization towards vessels in living mice.
Aromatic Cluster Sensor of Protein Folding: Near-UV Electronic Circular Dichroism Bands Assigned to Fold Compactness.

PubMed

Farkas, Viktor; Jákli, Imre; Tóth, Gábor K; Perczel, András

2016-09-19

Both far- and near-UV electronic circular dichroism (ECD) spectra have bands sensitive to thermal unfolding of Trp and Tyr residues containing proteins. Beside spectral changes at 222 nm reporting secondary structural variations (far-UV range), L b bands (near-UV range) are applicable as 3D-fold sensors of protein's core structure. In this study we show that both L b (Tyr) and L b (Trp) ECD bands could be used as sensors of fold compactness. ECD is a relative method and thus requires NMR referencing and cross-validation, also provided here. The ensemble of 204 ECD spectra of Trp-cage miniproteins is analysed as a training set for "calibrating" Trp↔Tyr folded systems of known NMR structure. While in the far-UV ECD spectra changes are linear as a function of the temperature, near-UV ECD data indicate a non-linear and thus, cooperative unfolding mechanism of these proteins. Ensemble of ECD spectra deconvoluted gives both conformational weights and insight to a protein folding↔unfolding mechanism. We found that the L b 293 band is reporting on the 3D-structure compactness. In addition, the pure near-UV ECD spectrum of the unfolded state is described here for the first time. Thus, ECD folding information now validated can be applied with confidence in a large thermal window (5≤T≤85 °C) compared to NMR for studying the unfolding of Trp↔Tyr residue pairs. In conclusion, folding propensities of important proteins (RNA polymerase II, ubiquitin protein ligase, tryptase-inhibitor etc.) can now be analysed with higher confidence. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Simultaneous acquisition of 2D and 3D solid-state NMR experiments for sequential assignment of oriented membrane protein samples.

PubMed

Gopinath, T; Mote, Kaustubh R; Veglia, Gianluigi

2015-05-01

We present a new method called DAISY (Dual Acquisition orIented ssNMR spectroScopY) for the simultaneous acquisition of 2D and 3D oriented solid-state NMR experiments for membrane proteins reconstituted in mechanically or magnetically aligned lipid bilayers. DAISY utilizes dual acquisition of sine and cosine dipolar or chemical shift coherences and long living (15)N longitudinal polarization to obtain two multi-dimensional spectra, simultaneously. In these new experiments, the first acquisition gives the polarization inversion spin exchange at the magic angle (PISEMA) or heteronuclear correlation (HETCOR) spectra, the second acquisition gives PISEMA-mixing or HETCOR-mixing spectra, where the mixing element enables inter-residue correlations through (15)N-(15)N homonuclear polarization transfer. The analysis of the two 2D spectra (first and second acquisitions) enables one to distinguish (15)N-(15)N inter-residue correlations for sequential assignment of membrane proteins. DAISY can be implemented in 3D experiments that include the polarization inversion spin exchange at magic angle via I spin coherence (PISEMAI) sequence, as we show for the simultaneous acquisition of 3D PISEMAI-HETCOR and 3D PISEMAI-HETCOR-mixing experiments.
Heteronuclear NMR assignments and secondary structure of the coiled coil trimerization domain from cartilage matrix protein in oxidized and reduced forms.

PubMed Central

Wiltscheck, R.; Kammerer, R. A.; Dames, S. A.; Schulthess, T.; Blommers, M. J.; Engel, J.; Alexandrescu, A. T.

1997-01-01

The C-terminal oligomerization domain of chicken cartilage matrix protein is a trimeric coiled coil comprised of three identical 43-residue chains. NMR spectra of the protein show equivalent magnetic environments for each monomer, indicating a parallel coiled coil structure with complete threefold symmetry. Sequence-specific assignments for 1H-, 15N-, and 13C-NMR resonances have been obtained from 2D 1H NOESY and TOCSY spectra, and from 3D HNCA, 15N NOESY-HSQC, and HCCH-TOCSY spectra. A stretch of alpha-helix encompassing five heptad repeats (35 residues) has been identified from intra-chain HN-HN and HN-H alpha NOE connectivities. 3JHNH alpha coupling constants, and chemical shift indices. The alpha-helix begins immediately downstream of inter-chain disulfide bonds between residues Cys 5 and Cys 7, and extends to near the C-terminus of the molecule. The threefold symmetry of the molecule is maintained when the inter-chain disulfide bonds that flank the N-terminus of the coiled coil are reduced. Residues Ile 21 through Glu 36 show conserved chemical shifts and NOE connectivities, as well as strong protection from solvent exchange in the oxidized and reduced forms of the protein. By contrast, residues Ile 10 through Val 17 show pronounced chemical shift differences between the oxidized and reduced protein. Strong chemical exchange NOEs between HN resonances and water indicate solvent exchange on time scales faster than 10 s, and suggests a dynamic fraying of the N-terminus of the coiled coil upon reduction of the disulfide bonds. Possible roles for the disulfide crosslinks of the oligomerization domain in the function of cartilage matrix protein are proposed. PMID:9260286
Expanded microbial genome coverage and improved protein family annotation in the COG database

PubMed Central

Galperin, Michael Y.; Makarova, Kira S.; Wolf, Yuri I.; Koonin, Eugene V.

2015-01-01

Microbial genome sequencing projects produce numerous sequences of deduced proteins, only a small fraction of which have been or will ever be studied experimentally. This leaves sequence analysis as the only feasible way to annotate these proteins and assign to them tentative functions. The Clusters of Orthologous Groups of proteins (COGs) database (http://www.ncbi.nlm.nih.gov/COG/), first created in 1997, has been a popular tool for functional annotation. Its success was largely based on (i) its reliance on complete microbial genomes, which allowed reliable assignment of orthologs and paralogs for most genes; (ii) orthology-based approach, which used the function(s) of the characterized member(s) of the protein family (COG) to assign function(s) to the entire set of carefully identified orthologs and describe the range of potential functions when there were more than one; and (iii) careful manual curation of the annotation of the COGs, aimed at detailed prediction of the biological function(s) for each COG while avoiding annotation errors and overprediction. Here we present an update of the COGs, the first since 2003, and a comprehensive revision of the COG annotations and expansion of the genome coverage to include representative complete genomes from all bacterial and archaeal lineages down to the genus level. This re-analysis of the COGs shows that the original COG assignments had an error rate below 0.5% and allows an assessment of the progress in functional genomics in the past 12 years. During this time, functions of many previously uncharacterized COGs have been elucidated and tentative functional assignments of many COGs have been validated, either by targeted experiments or through the use of high-throughput methods. A particularly important development is the assignment of functions to several widespread, conserved proteins many of which turned out to participate in translation, in particular rRNA maturation and tRNA modification. The new version of the
Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase).

PubMed

Odronitz, Florian; Kollmar, Martin

2006-11-29

Annotation of protein sequences of eukaryotic organisms is crucial for the understanding of their function in the cell. Manual annotation is still by far the most accurate way to correctly predict genes. The classification of protein sequences, their phylogenetic relation and the assignment of function involves information from various sources. This often leads to a collection of heterogeneous data, which is hard to track. Cytoskeletal and motor proteins consist of large and diverse superfamilies comprising up to several dozen members per organism. Up to date there is no integrated tool available to assist in the manual large-scale comparative genomic analysis of protein families. Pfarao (Protein Family Application for Retrieval, Analysis and Organisation) is a database driven online working environment for the analysis of manually annotated protein sequences and their relationship. Currently, the system can store and interrelate a wide range of information about protein sequences, species, phylogenetic relations and sequencing projects as well as links to literature and domain predictions. Sequences can be imported from multiple sequence alignments that are generated during the annotation process. A web interface allows to conveniently browse the database and to compile tabular and graphical summaries of its content. We implemented a protein sequence-centric web application to store, organize, interrelate, and present heterogeneous data that is generated in manual genome annotation and comparative genomics. The application has been developed for the analysis of cytoskeletal and motor proteins (CyMoBase) but can easily be adapted for any protein.
Complex of simian virus 40 large-T antigen and host 53,000-molecular-weight protein in monkey cells.

PubMed Central

Harlow, E; Pim, D C; Crawford, L V

1981-01-01

Mouse cells transformed by simian virus 40 (SV40) have been shown to contain a complex of the virus-coded large-T antigen with a host 53,000-molecular-weight (53K) protein. Initial attempts to detect a similar complex in lytically infected cells were unsuccessful, and it therefore seemed that the complex might be peculiar to transformed or abortively transformed nonpermissive cells. Immunoprecipitation of [32P]phosphate-labeled extracts of SV40-infected CV-1 African green monkey kidney cells with antibodies specific for large-T or the 53K protein revealed that the large-T-53K protein complex was formed during lytic infections. Only a minor fraction of the large-T present was associated with 53K protein, and large-T and the 53K host protein cosedimented during centrifugation through sucrose gradients. We used monospecific sera and monoclonal antibodies to study the rate of synthesis and phosphorylation of the 53K protein during lytic infections. Infection of CV-1 cells with SV40 increased the rate of synthesis of the 53K protein fivefold over that in mock-infected cells. At the same time, the rate of phosphorylation of the 53K protein increased more than 30-fold compared with control cultures. Monkey cells transformed by UV-irradiated SV40 (Gluzman et al., J. Virol. 22:256-266, 1977) also contained the large-T-53K protein complex. The formation of the complex is therefore not a peculiarity of SV40-transformed rodent cells but is a common feature of SV40 infections. Images PMID:6163871
7 CFR 760.31 - Assignment.

Code of Federal Regulations, 2013 CFR

2013-01-01

... Assignment. No assignment shall be made of any indemnity payment due or to come due under the regulations in this subpart. Any assignment or attempted assignment of any indemnity payment due or to come due under...
7 CFR 760.31 - Assignment.

Code of Federal Regulations, 2012 CFR

2012-01-01

... Assignment. No assignment shall be made of any indemnity payment due or to come due under the regulations in this subpart. Any assignment or attempted assignment of any indemnity payment due or to come due under...
7 CFR 760.31 - Assignment.

Code of Federal Regulations, 2014 CFR

2014-01-01

... Assignment. No assignment shall be made of any indemnity payment due or to come due under the regulations in this subpart. Any assignment or attempted assignment of any indemnity payment due or to come due under...
7 CFR 760.31 - Assignment.

Code of Federal Regulations, 2010 CFR

2010-01-01

... Assignment. No assignment shall be made of any indemnity payment due or to come due under the regulations in this subpart. Any assignment or attempted assignment of any indemnity payment due or to come due under...
7 CFR 760.31 - Assignment.

Code of Federal Regulations, 2011 CFR

2011-01-01

... Assignment. No assignment shall be made of any indemnity payment due or to come due under the regulations in this subpart. Any assignment or attempted assignment of any indemnity payment due or to come due under...

Static assignment of complex stochastic tasks using stochastic majorization

NASA Technical Reports Server (NTRS)

Nicol, David; Simha, Rahul; Towsley, Don

1992-01-01

We consider the problem of statically assigning many tasks to a (smaller) system of homogeneous processors, where a task's structure is modeled as a branching process, and all tasks are assumed to have identical behavior. We show how the theory of majorization can be used to obtain a partial order among possible task assignments. Our results show that if the vector of numbers of tasks assigned to each processor under one mapping is majorized by that of another mapping, then the former mapping is better than the latter with respect to a large number of objective functions. In particular, we show how measurements of finishing time, resource utilization, and reliability are all captured by the theory. We also show how the theory may be applied to the problem of partitioning a pool of processors for distribution among parallelizable tasks.
Large-volume protein crystal growth for neutron macromolecular crystallography.

PubMed

Ng, Joseph D; Baird, James K; Coates, Leighton; Garcia-Ruiz, Juan M; Hodge, Teresa A; Huang, Sijay

2015-04-01

Neutron macromolecular crystallography (NMC) is the prevailing method for the accurate determination of the positions of H atoms in macromolecules. As neutron sources are becoming more available to general users, finding means to optimize the growth of protein crystals to sizes suitable for NMC is extremely important. Historically, much has been learned about growing crystals for X-ray diffraction. However, owing to new-generation synchrotron X-ray facilities and sensitive detectors, protein crystal sizes as small as in the nano-range have become adequate for structure determination, lessening the necessity to grow large crystals. Here, some of the approaches, techniques and considerations for the growth of crystals to significant dimensions that are now relevant to NMC are revisited. These include experimental strategies utilizing solubility diagrams, ripening effects, classical crystallization techniques, microgravity and theoretical considerations.
An Investigation into E-Tool Use for Formative Assignment Assessment--Status and Recommendations

ERIC Educational Resources Information Center

Heinrich, Eva; Milne, John; Moore, Maurice

2009-01-01

This article reports on a comprehensive study, investigating the use of e-tools for formative assignment assessment. The study conducted a large-scale literature review and interviews with 90 academics at five New Zealand tertiary institutions. The focus of the study was on formative assessment provided in assignments, an area in which educational…
NMR resonance assignments of the lantibiotic immunity protein NisI from Lactococcus lactis.

PubMed

Hacker, Carolin; Christ, Nina Alexandra; Duchardt-Ferner, Elke; Korn, Sophie; Berninger, Lucija; Kötter, Peter; Entian, Karl-Dieter; Wöhnert, Jens

2015-10-01

The lantibiotic nisin is a small antimicrobial peptide which acts against a wide range of Gram-positive bacteria. Nisin-producing Lactococcus lactis strains express four genes for self-protection against their own antimicrobial compound. This immunity system consists of the lipoprotein NisI and the ABC transporter NisFEG. NisI is attached to the outside of the cytoplasmic membrane via a covalently linked diacylglycerol anchor. Both the lipoprotein and the ABC transporter are needed for full immunity but the exact immunity mechanism is still unclear. To gain insights into the highly specific immunity mechanism of nisin producing strains on a structural level we present here the backbone resonance assignment of NisI (25.8 kDa) as well as the virtually complete (1)H,(15)N,(13)C chemical shift assignments for the isolated 12.7 kDa N-terminal and 14.6 kDa C-terminal domains of NisI.
Crystallization of the Large Membrane Protein Complex Photosystem I in a Microfluidic Channel

PubMed Central

Abdallah, Bahige G.; Kupitz, Christopher; Fromme, Petra; Ros, Alexandra

2014-01-01

Traditional macroscale protein crystallization is accomplished non-trivially by exploring a range of protein concentrations and buffers in solution until a suitable combination is attained. This methodology is time consuming and resource intensive, hindering protein structure determination. Even more difficulties arise when crystallizing large membrane protein complexes such as photosystem I (PSI) due to their large unit cells dominated by solvent and complex characteristics that call for even stricter buffer requirements. Structure determination techniques tailored for these ‘difficult to crystallize’ proteins such as femtosecond nanocrystallography are being developed, yet still need specific crystal characteristics. Here, we demonstrate a simple and robust method to screen protein crystallization conditions at low ionic strength in a microfluidic device. This is realized in one microfluidic experiment using low sample amounts, unlike traditional methods where each solution condition is set up separately. Second harmonic generation microscopy via Second Order Nonlinear Imaging of Chiral Crystals (SONICC) was applied for the detection of nanometer and micrometer sized PSI crystals within microchannels. To develop a crystallization phase diagram, crystals imaged with SONICC at specific channel locations were correlated to protein and salt concentrations determined by numerical simulations of the time-dependent diffusion process along the channel. Our method demonstrated that a portion of the PSI crystallization phase diagram could be reconstructed in excellent agreement with crystallization conditions determined by traditional methods. We postulate that this approach could be utilized to efficiently study and optimize crystallization conditions for a wide range of proteins that are poorly understood to date. PMID:24191698
Bridge over troubled proline: assignment of intrinsically disordered proteins using (HCA)CON(CAN)H and (HCA)N(CA)CO(N)H experiments concomitantly with HNCO and i(HCA)CO(CA)NH.

PubMed

Hellman, Maarit; Piirainen, Henni; Jaakola, Veli-Pekka; Permi, Perttu

2014-01-01

NMR spectroscopy is by far the most versatile and information rich technique to study intrinsically disordered proteins (IDPs). While NMR is able to offer residue level information on structure and dynamics, assignment of chemical shift resonances in IDPs is not a straightforward process. Consequently, numerous pulse sequences and assignment protocols have been developed during past several years, targeted especially for the assignment of IDPs, including experiments that employ H(N), H(α) or (13)C detection combined with two to six indirectly detected dimensions. Here we propose two new HN-detection based pulse sequences, (HCA)CON(CAN)H and (HCA)N(CA)CO(N)H, that provide correlations with (1)H(N)(i - 1), (13)C'(i - 1) and (15)N(i), and (1)H(N)(i + 1), (13)C'(i) and (15)N(i) frequencies, respectively. Most importantly, they offer sequential links across the proline bridges and enable filling the single proline gaps during the assignment. We show that the novel experiments can efficiently complement the information available from existing HNCO and intraresidual i(HCA)CO(CA)NH pulse sequences and their concomitant usage enabled >95 % assignment of backbone resonances in cytoplasmic tail of adenosine receptor A2A in comparison to 73 % complete assignment using the HNCO/i(HCA)CO(CA)NH data alone.
Dynamics of domain coverage of the protein sequence universe.

PubMed

Rekapalli, Bhanu; Wuichet, Kristin; Peterson, Gregory D; Zhulin, Igor B

2012-11-16

The currently known protein sequence space consists of millions of sequences in public databases and is rapidly expanding. Assigning sequences to families leads to a better understanding of protein function and the nature of the protein universe. However, a large portion of the current protein space remains unassigned and is referred to as its "dark matter". Here we suggest that true size of "dark matter" is much larger than stated by current definitions. We propose an approach to reducing the size of "dark matter" by identifying and subtracting regions in protein sequences that are not likely to contain any domain. Recent improvements in computational domain modeling result in a decrease, albeit slowly, in the relative size of "dark matter"; however, its absolute size increases substantially with the growth of sequence data.
Measuring and comparing structural fluctuation patterns in large protein datasets.

PubMed

Fuglebakk, Edvin; Echave, Julián; Reuter, Nathalie

2012-10-01

The function of a protein depends not only on its structure but also on its dynamics. This is at the basis of a large body of experimental and theoretical work on protein dynamics. Further insight into the dynamics-function relationship can be gained by studying the evolutionary divergence of protein motions. To investigate this, we need appropriate comparative dynamics methods. The most used dynamical similarity score is the correlation between the root mean square fluctuations (RMSF) of aligned residues. Despite its usefulness, RMSF is in general less evolutionarily conserved than the native structure. A fundamental issue is whether RMSF is not as conserved as structure because dynamics is less conserved or because RMSF is not the best property to use to study its conservation. We performed a systematic assessment of several scores that quantify the (dis)similarity between protein fluctuation patterns. We show that the best scores perform as well as or better than structural dissimilarity, as assessed by their consistency with the SCOP classification. We conclude that to uncover the full extent of the evolutionary conservation of protein fluctuation patterns, it is important to measure the directions of fluctuations and their correlations between sites. Nathalie.Reuter@mbi.uib.no Supplementary data are available at Bioinformatics Online.
The protein-protein interaction network of eyestalk, Y-organ and hepatopancreas in Chinese mitten crab Eriocheir sinensis.

PubMed

Hao, Tong; Zeng, Zheng; Wang, Bin; Zhang, Yichen; Liu, Yichen; Geng, Xuyun; Sun, Jinsheng

2014-03-27

The protein-protein interaction network (PIN) is an effective information tool for understanding the complex biological processes inside the cell and solving many biological problems such as signaling pathway identification and prediction of protein functions. Eriocheir sinensis is a highly-commercial aquaculture species with an unclear proteome background which hinders the construction and development of PIN for E. sinensis. However, in recent years, the development of next-generation deep-sequencing techniques makes it possible to get high throughput data of E. sinensis tanscriptome and subsequently obtain a systematic overview of the protein-protein interaction system. In this work we sequenced the transcriptional RNA of eyestalk, Y-organ and hepatopancreas in E. sinensis and generated a PIN of E. sinensis which included 3,223 proteins and 35,787 interactions. Each protein-protein interaction in the network was scored according to the homology and genetic relationship. The signaling sub-network, representing the signal transduction pathways in E. sinensis, was extracted from the global network, which depicted a global view of the signaling systems in E. sinensis. Seven basic signal transduction pathways were identified in E. sinensis. By investigating the evolution paths of the seven pathways, we found that these pathways got mature in different evolutionary stages. Moreover, the functions of unclassified proteins and unigenes in the PIN of E. sinensis were predicted. Specifically, the functions of 549 unclassified proteins related to 864 unclassified unigenes were assigned, which respectively covered 76% and 73% of all the unclassified proteins and unigenes in the network. The PIN generated in this work is the first large-scale PIN of aquatic crustacean, thereby providing a paradigmatic blueprint of the aquatic crustacean interactome. Signaling sub-network extracted from the global PIN depicts the interaction of different signaling proteins and the evolutionary
Rice Ribosomal Protein Large Subunit Genes and Their Spatio-temporal and Stress Regulation

PubMed Central

Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Madhav, Sheshu M.; Kirti, P. B.

2016-01-01

Ribosomal proteins (RPs) are well-known for their role in mediating protein synthesis and maintaining the stability of the ribosomal complex, which includes small and large subunits. In the present investigation, in a genome-wide survey, we predicted that the large subunit of rice ribosomes is encoded by at least 123 genes including individual gene copies, distributed throughout the 12 chromosomes. We selected 34 candidate genes, each having 2–3 identical copies, for a detailed characterization of their gene structures, protein properties, cis-regulatory elements and comprehensive expression analysis. RPL proteins appear to be involved in interactions with other RP and non-RP proteins and their encoded RNAs have a higher content of alpha-helices in their predicted secondary structures. The majority of RPs have binding sites for metal and non-metal ligands. Native expression profiling of 34 ribosomal protein large (RPL) subunit genes in tissues covering the major stages of rice growth shows that they are predominantly expressed in vegetative tissues and seedlings followed by meiotically active tissues like flowers. The putative promoter regions of these genes also carry cis-elements that respond specifically to stress and signaling molecules. All the 34 genes responded differentially to the abiotic stress treatments. Phytohormone and cold treatments induced significant up-regulation of several RPL genes, while heat and H2O2 treatments down-regulated a majority of them. Furthermore, infection with a bacterial pathogen, Xanthomonas oryzae, which causes leaf blight also induced the expression of 80% of the RPL genes in leaves. Although the expression of RPL genes was detected in all the tissues studied, they are highly responsive to stress and signaling molecules indicating that their encoded proteins appear to have roles in stress amelioration besides house-keeping. This shows that the RPL gene family is a valuable resource for manipulation of stress tolerance in
Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.

PubMed

The, Matthew; MacCoss, Michael J; Noble, William S; Käll, Lukas

2016-11-01

Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator's processing speed has been sufficient for typical data sets consisting of hundreds of thousands of PSMs. With our new scalable approach, we can now also analyze millions of PSMs in a matter of minutes on a commodity computer. Furthermore, with the increasing awareness for the need for reliable statistics on the protein level, we compared several easy-to-understand protein inference methods and implemented the best-performing method-grouping proteins by their corresponding sets of theoretical peptides and then considering only the best-scoring peptide for each protein-in the Percolator package. We used Percolator 3.0 to analyze the data from a recent study of the draft human proteome containing 25 million spectra (PM:24870542). The source code and Ubuntu, Windows, MacOS, and Fedora binary packages are available from http://percolator.ms/ under an Apache 2.0 license. Graphical Abstract ᅟ.
Fair Package Assignment

NASA Astrophysics Data System (ADS)

Lahaie, Sébastien; Parkes, David C.

We consider the problem of fair allocation in the package assignment model, where a set of indivisible items, held by single seller, must be efficiently allocated to agents with quasi-linear utilities. A fair assignment is one that is efficient and envy-free. We consider a model where bidders have superadditive valuations, meaning that items are pure complements. Our central result is that core outcomes are fair and even coalition-fair over this domain, while fair distributions may not even exist for general valuations. Of relevance to auction design, we also establish that the core is equivalent to the set of anonymous-price competitive equilibria, and that superadditive valuations are a maximal domain that guarantees the existence of anonymous-price competitive equilibrium. Our results are analogs of core equivalence results for linear prices in the standard assignment model, and for nonlinear, non-anonymous prices in the package assignment model with general valuations.
Large-volume protein crystal growth for neutron macromolecular crystallography

DOE PAGES

Ng, Joseph D.; Baird, James K.; Coates, Leighton; ...

2015-03-30

Neutron macromolecular crystallography (NMC) is the prevailing method for the accurate determination of the positions of H atoms in macromolecules. As neutron sources are becoming more available to general users, finding means to optimize the growth of protein crystals to sizes suitable for NMC is extremely important. Historically, much has been learned about growing crystals for X-ray diffraction. However, owing to new-generation synchrotron X-ray facilities and sensitive detectors, protein crystal sizes as small as in the nano-range have become adequate for structure determination, lessening the necessity to grow large crystals. Here, some of the approaches, techniques and considerations for themore » growth of crystals to significant dimensions that are now relevant to NMC are revisited. We report that these include experimental strategies utilizing solubility diagrams, ripening effects, classical crystallization techniques, microgravity and theoretical considerations.« less
Large-volume protein crystal growth for neutron macromolecular crystallography

PubMed Central

Ng, Joseph D.; Baird, James K.; Coates, Leighton; Garcia-Ruiz, Juan M.; Hodge, Teresa A.; Huang, Sijay

2015-01-01

Neutron macromolecular crystallography (NMC) is the prevailing method for the accurate determination of the positions of H atoms in macromolecules. As neutron sources are becoming more available to general users, finding means to optimize the growth of protein crystals to sizes suitable for NMC is extremely important. Historically, much has been learned about growing crystals for X-ray diffraction. However, owing to new-generation synchrotron X-ray facilities and sensitive detectors, protein crystal sizes as small as in the nano-range have become adequate for structure determination, lessening the necessity to grow large crystals. Here, some of the approaches, techniques and considerations for the growth of crystals to significant dimensions that are now relevant to NMC are revisited. These include experimental strategies utilizing solubility diagrams, ripening effects, classical crystallization techniques, microgravity and theoretical considerations. PMID:25849493
Large-volume protein crystal growth for neutron macromolecular crystallography

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ng, Joseph D.; Baird, James K.; Coates, Leighton

Neutron macromolecular crystallography (NMC) is the prevailing method for the accurate determination of the positions of H atoms in macromolecules. As neutron sources are becoming more available to general users, finding means to optimize the growth of protein crystals to sizes suitable for NMC is extremely important. Historically, much has been learned about growing crystals for X-ray diffraction. However, owing to new-generation synchrotron X-ray facilities and sensitive detectors, protein crystal sizes as small as in the nano-range have become adequate for structure determination, lessening the necessity to grow large crystals. Here, some of the approaches, techniques and considerations for themore » growth of crystals to significant dimensions that are now relevant to NMC are revisited. We report that these include experimental strategies utilizing solubility diagrams, ripening effects, classical crystallization techniques, microgravity and theoretical considerations.« less
PREFACE: Protein protein interactions: principles and predictions

NASA Astrophysics Data System (ADS)

Nussinov, Ruth; Tsai, Chung-Jung

2005-06-01

Proteins are the `workhorses' of the cell. Their roles span functions as diverse as being molecular machines and signalling. They carry out catalytic reactions, transport, form viral capsids, traverse membranes and form regulated channels, transmit information from DNA to RNA, making possible the synthesis of new proteins, and they are responsible for the degradation of unnecessary proteins and nucleic acids. They are the vehicles of the immune response and are responsible for viral entry into the cell. Given their importance, considerable effort has been centered on the prediction of protein function. A prime way to do this is through identification of binding partners. If the function of at least one of the components with which the protein interacts is known, that should let us assign its function(s) and the pathway(s) in which it plays a role. This holds since the vast majority of their chores in the living cell involve protein-protein interactions. Hence, through the intricate network of these interactions we can map cellular pathways, their interconnectivities and their dynamic regulation. Their identification is at the heart of functional genomics; their prediction is crucial for drug discovery. Knowledge of the pathway, its topology, length, and dynamics may provide useful information for forecasting side effects. The goal of predicting protein-protein interactions is daunting. Some associations are obligatory, others are continuously forming and dissociating. In principle, from the physical standpoint, any two proteins can interact, but under what conditions and at which strength? The principles of protein-protein interactions are general: the non-covalent interactions of two proteins are largely the outcome of the hydrophobic effect, which drives the interactions. In addition, hydrogen bonds and electrostatic interactions play important roles. Thus, many of the interactions observed in vitro are the outcome of experimental overexpression. Protein disorder
Method for protein structure alignment

DOEpatents

Blankenbecler, Richard; Ohlsson, Mattias; Peterson, Carsten; Ringner, Markus

2005-02-22

This invention provides a method for protein structure alignment. More particularly, the present invention provides a method for identification, classification and prediction of protein structures. The present invention involves two key ingredients. First, an energy or cost function formulation of the problem simultaneously in terms of binary (Potts) assignment variables and real-valued atomic coordinates. Second, a minimization of the energy or cost function by an iterative method, where in each iteration (1) a mean field method is employed for the assignment variables and (2) exact rotation and/or translation of atomic coordinates is performed, weighted with the corresponding assignment variables.
How large B-factors can be in protein crystal structures.

PubMed

Carugo, Oliviero

2018-02-23

Protein crystal structures are potentially over-interpreted since they are routinely refined without any restraint on the upper limit of atomic B-factors. Consequently, some of their atoms, undetected in the electron density maps, are allowed to reach extremely large B-factors, even above 100 square Angstroms, and their final positions are purely speculative and not based on any experimental evidence. A strategy to define B-factors upper limits is described here, based on the analysis of protein crystal structures deposited in the Protein Data Bank prior 2008, when the tendency to allow B-factor to arbitrary inflate was limited. This B-factor upper limit (B_max) is determined by extrapolating the relationship between crystal structure average B-factor and percentage of crystal volume occupied by solvent (pcVol) to pcVol =100%, when, ab absurdo, the crystal contains only liquid solvent, the structure of which is, by definition, undetectable in electron density maps. It is thus possible to highlight structures with average B-factors larger than B_max, which should be considered with caution by the users of the information deposited in the Protein Data Bank, in order to avoid scientifically deleterious over-interpretations.
Plagiarism-Proofing Assignments

ERIC Educational Resources Information Center

Johnson, Doug

2004-01-01

Mr. Johnson has discovered that the higher the level of student engagement and creativity, the lower the probability of plagiarism. For teachers who would like to see such desirable results, he describes the characteristics of assignments that are most likely to produce them. Two scenarios of types of assignments that avoid plagiarism are…
Support Vector Machines Trained with Evolutionary Algorithms Employing Kernel Adatron for Large Scale Classification of Protein Structures.

PubMed

Arana-Daniel, Nancy; Gallegos, Alberto A; López-Franco, Carlos; Alanís, Alma Y; Morales, Jacob; López-Franco, Adriana

2016-01-01

With the increasing power of computers, the amount of data that can be processed in small periods of time has grown exponentially, as has the importance of classifying large-scale data efficiently. Support vector machines have shown good results classifying large amounts of high-dimensional data, such as data generated by protein structure prediction, spam recognition, medical diagnosis, optical character recognition and text classification, etc. Most state of the art approaches for large-scale learning use traditional optimization methods, such as quadratic programming or gradient descent, which makes the use of evolutionary algorithms for training support vector machines an area to be explored. The present paper proposes an approach that is simple to implement based on evolutionary algorithms and Kernel-Adatron for solving large-scale classification problems, focusing on protein structure prediction. The functional properties of proteins depend upon their three-dimensional structures. Knowing the structures of proteins is crucial for biology and can lead to improvements in areas such as medicine, agriculture and biofuels.

Proteins with similar architecture exhibit similar large-scale dynamic behavior.

PubMed Central

Keskin, O; Jernigan, R L; Bahar, I

2000-01-01

We have investigated the similarities and differences in the computed dynamic fluctuations exhibited by six members of a protein fold family with a coarse-grained Gaussian network model. Specifically, we consider the cofactor binding fragment of CysB; the lysine/arginine/ornithine-binding protein (LAO); the enzyme porphobilinogen deaminase (PBGD); the ribose-binding protein (RBP); the N-terminal lobe of ovotransferrin in apo-form (apo-OVOT); and the leucine/isoleucine/valine-binding protein (LIVBP). All have domains that resemble a Rossmann fold, but there are also some significant differences. Results indicate that similar global dynamic behavior is preserved for the members of a fold family, and that differences usually occur in regions only where specific function is localized. The present work is a computational demonstration that the scaffold of a protein fold may be utilized for diverse purposes. LAO requires a bound ligand before it conforms to the large-scale fluctuation behavior of the three other members of the family, CysB, PBGD, and RBP, all of which contain a substrate (cofactor) at the active site cleft. The dynamics of the ligand-free enzymes LIVBP and apo-OVOT, on the other hand, concur with that of unliganded LAO. The present results suggest that it is possible to construct structure alignments based on dynamic fluctuation behavior. PMID:10733987
Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase)

PubMed Central

Odronitz, Florian; Kollmar, Martin

2006-01-01

Background Annotation of protein sequences of eukaryotic organisms is crucial for the understanding of their function in the cell. Manual annotation is still by far the most accurate way to correctly predict genes. The classification of protein sequences, their phylogenetic relation and the assignment of function involves information from various sources. This often leads to a collection of heterogeneous data, which is hard to track. Cytoskeletal and motor proteins consist of large and diverse superfamilies comprising up to several dozen members per organism. Up to date there is no integrated tool available to assist in the manual large-scale comparative genomic analysis of protein families. Description Pfarao (Protein Family Application for Retrieval, Analysis and Organisation) is a database driven online working environment for the analysis of manually annotated protein sequences and their relationship. Currently, the system can store and interrelate a wide range of information about protein sequences, species, phylogenetic relations and sequencing projects as well as links to literature and domain predictions. Sequences can be imported from multiple sequence alignments that are generated during the annotation process. A web interface allows to conveniently browse the database and to compile tabular and graphical summaries of its content. Conclusion We implemented a protein sequence-centric web application to store, organize, interrelate, and present heterogeneous data that is generated in manual genome annotation and comparative genomics. The application has been developed for the analysis of cytoskeletal and motor proteins (CyMoBase) but can easily be adapted for any protein. PMID:17134497
Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations

PubMed Central

Okazaki, Kei-ichi; Koga, Nobuyasu; Takada, Shoji; Onuchic, Jose N.; Wolynes, Peter G.

2006-01-01

Biomolecules often undergo large-amplitude motions when they bind or release other molecules. Unlike macroscopic machines, these biomolecular machines can partially disassemble (unfold) and then reassemble (fold) during such transitions. Here we put forward a minimal structure-based model, the “multiple-basin model,” that can directly be used for molecular dynamics simulation of even very large biomolecular systems so long as the endpoints of the conformational change are known. We investigate the model by simulating large-scale motions of four proteins: glutamine-binding protein, S100A6, dihydrofolate reductase, and HIV-1 protease. The mechanisms of conformational transition depend on the protein basin topologies and change with temperature near the folding transition. The conformational transition rate varies linearly with driving force over a fairly large range. This linearity appears to be a consequence of partial unfolding during the conformational transition. PMID:16877541
Chemical shift assignments of the first and second RRMs of Nrd1, a fission yeast MAPK-target RNA binding protein.

PubMed

Kobayashi, Ayaho; Kanaba, Teppei; Satoh, Ryosuke; Ito, Yutaka; Sugiura, Reiko; Mishima, Masaki

2017-10-01

Negative regulator differentiation 1 (Nrd1), a fission yeast RNA binding protein, modulates cytokinesis and sexual development and contributes to stress granule formation in response to environmental stresses. Nrd1 comprises four RRM domains and binds and stabilizes Cdc4 mRNA that encodes the myosin II light chain. Nrd1 binds the Cpc2 fission-yeast RACK1 homolog, and the interaction promotes Nrd1 localization to stress granules. Interestingly, Pmk1 mitogen-activated protein kinase phosphorylates Thr40 in the unstructured N-terminal region and Thr126 in the first RRM domain of Nrd1. Phosphorylation significantly reduces RNA-binding activity and likely modulates Nrd1 function. To reveal the relationship between the structure and function of Nrd1 and how phosphorylation affects structure, we used heteronuclear NMR techniques to investigate the three-dimensional structure of Nrd1. Here we report the 1 H, 13 C, and 15 N resonance assignments of RRM1-RRM2 (residues 108-284) comprising the first and second RRMs obtained using heteronuclear NMR techniques. Secondary structures derived from the chemical shifts are reported. These data should contribute to the understanding of the three-dimensional structure of the RRM1-RRM2 region of Nrd1 and the perturbation caused by phosphorylation.
An ensemble framework for clustering protein-protein interaction networks.

PubMed

Asur, Sitaram; Ucar, Duygu; Parthasarathy, Srinivasan

2007-07-01

Protein-Protein Interaction (PPI) networks are believed to be important sources of information related to biological processes and complex metabolic functions of the cell. The presence of biologically relevant functional modules in these networks has been theorized by many researchers. However, the application of traditional clustering algorithms for extracting these modules has not been successful, largely due to the presence of noisy false positive interactions as well as specific topological challenges in the network. In this article, we propose an ensemble clustering framework to address this problem. For base clustering, we introduce two topology-based distance metrics to counteract the effects of noise. We develop a PCA-based consensus clustering technique, designed to reduce the dimensionality of the consensus problem and yield informative clusters. We also develop a soft consensus clustering variant to assign multifaceted proteins to multiple functional groups. We conduct an empirical evaluation of different consensus techniques using topology-based, information theoretic and domain-specific validation metrics and show that our approaches can provide significant benefits over other state-of-the-art approaches. Our analysis of the consensus clusters obtained demonstrates that ensemble clustering can (a) produce improved biologically significant functional groupings; and (b) facilitate soft clustering by discovering multiple functional associations for proteins. Supplementary data are available at Bioinformatics online.
Protein Composition of Trypanosoma brucei Mitochondrial Membranes

PubMed Central

Acestor, Nathalie; Panigrahi, Aswini K.; Ogata, Yuko; Anupama, Atashi; Stuart, Kenneth D.

2010-01-01

Mitochondria consist of four compartments, outer membrane, intermembrane space, inner membrane and matrix; each harboring specific functions and structures. In this study, we used mass spectrometry (LC-MS/MS) to characterize the protein composition of Trypanosoma brucei mitochondrial membranes, which were enriched by different biochemical fractionation techniques. The analyses identified 202 proteins that contain one or more transmembrane domain(s) and/or positive GRAVY scores. Of these, various criteria were used to assign 72 proteins to mitochondrial membranes with high confidence, and 106 with moderate to low confidence. The sub-cellular localization of a selected subset of 13 membrane assigned proteins was confirmed by tagging and immunofluorescence analysis. While most proteins assigned to mitochondrial membrane have putative roles in metabolic, energy generating, and transport processes, ~50% have no known function. These studies result in a comprehensive profile of the composition and sub-organellar location of proteins in the T. brucei mitochondrion thus, providing useful information on mitochondrial functions. PMID:19834910
Dynamics of domain coverage of the protein sequence universe

PubMed Central

2012-01-01

Background The currently known protein sequence space consists of millions of sequences in public databases and is rapidly expanding. Assigning sequences to families leads to a better understanding of protein function and the nature of the protein universe. However, a large portion of the current protein space remains unassigned and is referred to as its “dark matter”. Results Here we suggest that true size of “dark matter” is much larger than stated by current definitions. We propose an approach to reducing the size of “dark matter” by identifying and subtracting regions in protein sequences that are not likely to contain any domain. Conclusions Recent improvements in computational domain modeling result in a decrease, albeit slowly, in the relative size of “dark matter”; however, its absolute size increases substantially with the growth of sequence data. PMID:23157439
Enabling Large-Scale Design, Synthesis and Validation of Small Molecule Protein-Protein Antagonists

PubMed Central

Koes, David; Khoury, Kareem; Huang, Yijun; Wang, Wei; Bista, Michal; Popowicz, Grzegorz M.; Wolf, Siglinde; Holak, Tad A.; Dömling, Alexander; Camacho, Carlos J.

2012-01-01

Although there is no shortage of potential drug targets, there are only a handful known low-molecular-weight inhibitors of protein-protein interactions (PPIs). One problem is that current efforts are dominated by low-yield high-throughput screening, whose rigid framework is not suitable for the diverse chemotypes present in PPIs. Here, we developed a novel pharmacophore-based interactive screening technology that builds on the role anchor residues, or deeply buried hot spots, have in PPIs, and redesigns these entry points with anchor-biased virtual multicomponent reactions, delivering tens of millions of readily synthesizable novel compounds. Application of this approach to the MDM2/p53 cancer target led to high hit rates, resulting in a large and diverse set of confirmed inhibitors, and co-crystal structures validate the designed compounds. Our unique open-access technology promises to expand chemical space and the exploration of the human interactome by leveraging in-house small-scale assays and user-friendly chemistry to rationally design ligands for PPIs with known structure. PMID:22427896
Batch Immunostaining for Large-Scale Protein Detection in the Whole Monkey Brain

PubMed Central

Zangenehpour, Shahin; Burke, Mark W.; Chaudhuri, Avi; Ptito, Maurice

2009-01-01

Immunohistochemistry (IHC) is one of the most widely used laboratory techniques for the detection of target proteins in situ. Questions concerning the expression pattern of a target protein across the entire brain are relatively easy to answer when using IHC in small brains, such as those of rodents. However, answering the same questions in large and convoluted brains, such as those of primates presents a number of challenges. Here we present a systematic approach for immunodetection of target proteins in an adult monkey brain. This approach relies on the tissue embedding and sectioning methodology of NeuroScience Associates (NSA) as well as tools developed specifically for batch-staining of free-floating sections. It results in uniform staining of a set of sections which, at a particular interval, represents the entire brain. The resulting stained sections can be subjected to a wide variety of analytical procedures in order to measure protein levels, the population of neurons expressing a certain protein. PMID:19636291
Expanded microbial genome coverage and improved protein family annotation in the COG database.

PubMed

Galperin, Michael Y; Makarova, Kira S; Wolf, Yuri I; Koonin, Eugene V

2015-01-01

Microbial genome sequencing projects produce numerous sequences of deduced proteins, only a small fraction of which have been or will ever be studied experimentally. This leaves sequence analysis as the only feasible way to annotate these proteins and assign to them tentative functions. The Clusters of Orthologous Groups of proteins (COGs) database (http://www.ncbi.nlm.nih.gov/COG/), first created in 1997, has been a popular tool for functional annotation. Its success was largely based on (i) its reliance on complete microbial genomes, which allowed reliable assignment of orthologs and paralogs for most genes; (ii) orthology-based approach, which used the function(s) of the characterized member(s) of the protein family (COG) to assign function(s) to the entire set of carefully identified orthologs and describe the range of potential functions when there were more than one; and (iii) careful manual curation of the annotation of the COGs, aimed at detailed prediction of the biological function(s) for each COG while avoiding annotation errors and overprediction. Here we present an update of the COGs, the first since 2003, and a comprehensive revision of the COG annotations and expansion of the genome coverage to include representative complete genomes from all bacterial and archaeal lineages down to the genus level. This re-analysis of the COGs shows that the original COG assignments had an error rate below 0.5% and allows an assessment of the progress in functional genomics in the past 12 years. During this time, functions of many previously uncharacterized COGs have been elucidated and tentative functional assignments of many COGs have been validated, either by targeted experiments or through the use of high-throughput methods. A particularly important development is the assignment of functions to several widespread, conserved proteins many of which turned out to participate in translation, in particular rRNA maturation and tRNA modification. The new version of the
Prediction of protein tertiary structure from sequences using a very large back-propagation neural network

DOE Office of Scientific and Technical Information (OSTI.GOV)

Liu, X.; Wilcox, G.L.

1993-12-31

We have implemented large scale back-propagation neural networks on a 544 node Connection Machine, CM-5, using the C language in MIMD mode. The program running on 512 processors performs backpropagation learning at 0.53 Gflops, which provides 76 million connection updates per second. We have applied the network to the prediction of protein tertiary structure from sequence information alone. A neural network with one hidden layer and 40 million connections is trained to learn the relationship between sequence and tertiary structure. The trained network yields predicted structures of some proteins on which it has not been trained given only their sequences.more » Presentation of the Fourier transform of the sequences accentuates periodicity in the sequence and yields good generalization with greatly increased training efficiency. Training simulations with a large, heterologous set of protein structures (111 proteins from CM-5 time) to solutions with under 2% RMS residual error within the training set (random responses give an RMS error of about 20%). Presentation of 15 sequences of related proteins in a testing set of 24 proteins yields predicted structures with less than 8% RMS residual error, indicating good apparent generalization.« less
Photoswitchable red fluorescent protein with a large Stokes shift.

PubMed

Piatkevich, Kiryl D; English, Brian P; Malashkevich, Vladimir N; Xiao, Hui; Almo, Steven C; Singer, Robert H; Verkhusha, Vladislav V

2014-10-23

A subclass of fluorescent proteins (FPs), large Stokes shift (LSS) FP, are characterized by increased spread between excitation and emission maxima. We report a photoswitchable variant of a red FP with an LSS, PSLSSmKate, which initially exhibits excitation and emission at 445 and 622 nm, but violet irradiation photoswitches PSLSSmKate into a common red form with excitation and emission at 573 and 621 nm. We characterize spectral, photophysical, and biochemical properties of PSLSSmKate in vitro and in mammalian cells and determine its crystal structure in the LSS form. Mass spectrometry, mutagenesis, and spectroscopy of PSLSSmKate allow us to propose molecular mechanisms for the LSS, pH dependence, and light-induced chromophore transformation. We demonstrate the applicability of PSLSSmKate to superresolution photoactivated localization microscopy and protein dynamics in live cells. Given its promising properties, we expect that PSLSSmKate-like phenotype will be further used for photoactivatable imaging and tracking multiple populations of intracellular objects.
Diets with High or Low Protein Content and Glycemic Index for Weight-Loss Maintenance

PubMed Central

Larsen, Thomas Meinert; Dalskov, Stine-Mathilde; van Baak, Marleen; Jebb, Susan A.; Papadaki, Angeliki; Pfeiffer, Andreas F.H.; Martinez, J. Alfredo; Handjieva-Darlenska, Teodora; Kunešová, Marie; Pihlsgård, Mats; Stender, Steen; Holst, Claus; Saris, Wim H.M.; Astrup, Arne

2012-01-01

Background Studies of weight-control diets that are high in protein or low in glycemic index have reached varied conclusions, probably owing to the fact that the studies had insufficient power. Methods We enrolled overweight adults from eight European countries who had lost at least 8% of their initial body weight with a 3.3-MJ (800-kcal) low-calorie diet. Participants were randomly assigned, in a two-by-two factorial design, to one of five ad libitum diets to prevent weight regain over a 26-week period: a low-protein and low-glycemic-index diet, a low-protein and high-glycemic-index diet, a high-protein and low-glycemic-index diet, a high-protein and high-glycemic-index diet, or a control diet. Results A total of 1209 adults were screened (mean age, 41 years; body-mass index [the weight in kilograms divided by the square of the height in meters], 34), of whom 938 entered the low-calorie-diet phase of the study. A total of 773 participants who completed that phase were randomly assigned to one of the five maintenance diets; 548 completed the intervention (71%). Fewer participants in the high-protein and the low-glycemic-index groups than in the low-protein–high-glycemic-index group dropped out of the study (26.4% and 25.6%, respectively, vs. 37.4%; P = 0.02 and P = 0.01 for the respective comparisons). The mean initial weight loss with the low-calorie diet was 11.0 kg. In the analysis of participants who completed the study, only the low-protein–high-glycemic-index diet was associated with subsequent significant weight regain (1.67 kg; 95% confidence interval [CI], 0.48 to 2.87). In an intention-to-treat analysis, the weight regain was 0.93 kg less (95% CI, 0.31 to 1.55) in the groups assigned to a high-protein diet than in those assigned to a low-protein diet (P = 0.003) and 0.95 kg less (95% CI, 0.33 to 1.57) in the groups assigned to a low-glycemic-index diet than in those assigned to a high-glycemic-index diet (P = 0.003). The analysis involving
Large-scale identification of c-MYC-associated proteins using a combined TAP/MudPIT approach.

PubMed

Koch, Heike B; Zhang, Ru; Verdoodt, Berlinda; Bailey, Aaron; Zhang, Chang-Dong; Yates, John R; Menssen, Antje; Hermeking, Heiko

2007-01-15

The c-MYC oncogene encodes a transcription factor, which is sufficient and necessary for the induction of cellular proliferation. However, the c-MYC protein is a relatively weak transactivator suggesting that it may have other functions. To identify protein interactors which may reveal new functions or represent regulators of c-MYC we systematically identified proteins associated with c-MYC in vivo using a proteomic approach. We combined tandem affinity purification (TAP) with the mass spectral multidimensional protein identification technology (MudPIT). Thereby, 221 c-MYC-associated proteins were identified. Among them were 17 previously known c-MYC-interactors. Selected new c-MYC-associated proteins (DBC-1, FBX29, KU70, MCM7, Mi2-beta/CHD4, RNA Pol II, RFC2, RFC3, SV40 Large T Antigen, TCP1alpha, U5-116kD, ZNF281) were confirmed independently. For association with MCM7, SV40 Large T Antigen and DBC-1 the functionally important MYC-box II region was required, whereas FBX29 and Mi2-beta interacted via MYC-box II and the BR-HLH-LZ motif. In addition, regulators of c-MYC activity were identified: ectopic expression of FBX29, an E3 ubiquitin ligase, decreased c-MYC protein levels and inhibited c-MYC transactivation, whereas knock-down of FBX29 elevated the concentration of c-MYC. Furthermore, sucrose gradient analysis demonstrated that c-MYC is present in numerous complexes with varying size and composition, which may accommodate the large number of new c-MYC-associated proteins identified here and mediate the diverse functions of c-MYC. Our results suggest that c-MYC, besides acting as a mitogenic transcription factor, regulates cellular proliferation by direct association with protein complexes involved in multiple synthetic processes required for cell division, as for example DNA-replication/repair and RNA-processing. Furthermore, this first comprehensive description of the c-MYC-associated sub-proteome will facilitate further studies aimed to elucidate the biology
Systemic transport of Alfalfa mosaic virus can be mediated by the movement proteins of several viruses assigned to five genera of the 30K family.

PubMed

Fajardo, Thor V M; Peiró, Ana; Pallás, Vicente; Sánchez-Navarro, Jesús

2013-03-01

We previously showed that the movement protein (MP) gene of Alfalfa mosaic virus (AMV) is functionally exchangeable for the cell-to-cell transport of the corresponding genes of Tobacco mosaic virus (TMV), Brome mosaic virus, Prunus necrotic ringspot virus, Cucumber mosaic virus and Cowpea mosaic virus. We have analysed the capacity of the heterologous MPs to systemically transport the corresponding chimeric AMV genome. All MPs were competent in systemic transport but required the fusion at their C terminus of the coat protein-interacting C-terminal 44 aa (A44) of the AMV MP. Except for the TMV MP, the presence of the hybrid virus in upper leaves correlated with the capacity to move locally. These results suggest that all the MPs assigned to the 30K superfamily should be exchangeable not only for local virus movement but also for systemic transport when the A44 fragment is present.
48 CFR 227.7010 - Assignments.

Code of Federal Regulations, 2010 CFR

2010-10-01

... OF DEFENSE GENERAL CONTRACTING REQUIREMENTS PATENTS, DATA, AND COPYRIGHTS Infringement Claims, Licenses, and Assignments 227.7010 Assignments. (a) The clause at 252.227-7011 is an example which may be used in contracts of assignment of patent rights to the Government. (b) To facilitate proof of...
Wildlife forensic science: A review of genetic geographic origin assignment.

PubMed

Ogden, Rob; Linacre, Adrian

2015-09-01

Wildlife forensic science has become a key means of enforcing legislation surrounding the illegal trade in protected and endangered species. A relatively new dimension to this area of forensic science is to determine the geographic origin of a seized sample. This review focuses on DNA testing, which relies on assignment of an unknown sample to its genetic population of origin. Key examples of this are the trade in timber, fish and ivory and these are used only to illustrate the large number of species for which this type of testing is potentially available. The role of mitochondrial and nuclear DNA markers is discussed, alongside a comparison of neutral markers with those exhibiting signatures of selection, which potentially offer much higher levels of assignment power to address specific questions. A review of assignment tests is presented along with detailed methods for evaluating error rates and considerations for marker selection. The availability and quality of reference data are of paramount importance to support assignment applications and ensure reliability of any conclusions drawn. The genetic methods discussed have been developed initially as investigative tools but comment is made regarding their use in courts. The potential to compliment DNA markers with elemental assays for greater assignment power is considered and finally recommendations are made for the future of this type of testing. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0

NASA Astrophysics Data System (ADS)

The, Matthew; MacCoss, Michael J.; Noble, William S.; Käll, Lukas

2016-11-01

Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator's processing speed has been sufficient for typical data sets consisting of hundreds of thousands of PSMs. With our new scalable approach, we can now also analyze millions of PSMs in a matter of minutes on a commodity computer. Furthermore, with the increasing awareness for the need for reliable statistics on the protein level, we compared several easy-to-understand protein inference methods and implemented the best-performing method—grouping proteins by their corresponding sets of theoretical peptides and then considering only the best-scoring peptide for each protein—in the Percolator package. We used Percolator 3.0 to analyze the data from a recent study of the draft human proteome containing 25 million spectra (PM:24870542). The source code and Ubuntu, Windows, MacOS, and Fedora binary packages are available from http://percolator.ms/ under an Apache 2.0 license.
Evaluation of assigned-value uncertainty for complex calibrator value assignment processes: a prealbumin example.

PubMed

Middleton, John; Vaks, Jeffrey E

2007-04-01

Errors of calibrator-assigned values lead to errors in the testing of patient samples. The ability to estimate the uncertainties of calibrator-assigned values and other variables minimizes errors in testing processes. International Organization of Standardization guidelines provide simple equations for the estimation of calibrator uncertainty with simple value-assignment processes, but other methods are needed to estimate uncertainty in complex processes. We estimated the assigned-value uncertainty with a Monte Carlo computer simulation of a complex value-assignment process, based on a formalized description of the process, with measurement parameters estimated experimentally. This method was applied to study uncertainty of a multilevel calibrator value assignment for a prealbumin immunoassay. The simulation results showed that the component of the uncertainty added by the process of value transfer from the reference material CRM470 to the calibrator is smaller than that of the reference material itself (<0.8% vs 3.7%). Varying the process parameters in the simulation model allowed for optimizing the process, while keeping the added uncertainty small. The patient result uncertainty caused by the calibrator uncertainty was also found to be small. This method of estimating uncertainty is a powerful tool that allows for estimation of calibrator uncertainty for optimization of various value assignment processes, with a reduced number of measurements and reagent costs, while satisfying the requirements to uncertainty. The new method expands and augments existing methods to allow estimation of uncertainty in complex processes.
Structural Case Assignment in Korean

ERIC Educational Resources Information Center

Koak, Heeshin

2012-01-01

In this dissertation, I aim to provide a theory on the distribution of structural Case in Korean. I propose the following Structural Case Assignment Hypothesis (SCAH) regarding the assignment of structural Case: "Structural Case is assigned by phase heads (C: nominative; v: accusative) to every argument in the c-command domain of the phase…

Interrogation of Mammalian Protein Complex Structure, Function, and Membership Using Genome-Scale Fitness Screens.

PubMed

Pan, Joshua; Meyers, Robin M; Michel, Brittany C; Mashtalir, Nazar; Sizemore, Ann E; Wells, Jonathan N; Cassel, Seth H; Vazquez, Francisca; Weir, Barbara A; Hahn, William C; Marsh, Joseph A; Tsherniak, Aviad; Kadoch, Cigall

2018-05-23

Protein complexes are assemblies of subunits that have co-evolved to execute one or many coordinated functions in the cellular environment. Functional annotation of mammalian protein complexes is critical to understanding biological processes, as well as disease mechanisms. Here, we used genetic co-essentiality derived from genome-scale RNAi- and CRISPR-Cas9-based fitness screens performed across hundreds of human cancer cell lines to assign measures of functional similarity. From these measures, we systematically built and characterized functional similarity networks that recapitulate known structural and functional features of well-studied protein complexes and resolve novel functional modules within complexes lacking structural resolution, such as the mammalian SWI/SNF complex. Finally, by integrating functional networks with large protein-protein interaction networks, we discovered novel protein complexes involving recently evolved genes of unknown function. Taken together, these findings demonstrate the utility of genetic perturbation screens alone, and in combination with large-scale biophysical data, to enhance our understanding of mammalian protein complexes in normal and disease states. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
A constraint logic programming approach to associate 1D and 3D structural components for large protein complexes.

PubMed

Dal Palù, Alessandro; Pontelli, Enrico; He, Jing; Lu, Yonggang

2007-01-01

The paper describes a novel framework, constructed using Constraint Logic Programming (CLP) and parallelism, to determine the association between parts of the primary sequence of a protein and alpha-helices extracted from 3D low-resolution descriptions of large protein complexes. The association is determined by extracting constraints from the 3D information, regarding length, relative position and connectivity of helices, and solving these constraints with the guidance of a secondary structure prediction algorithm. Parallelism is employed to enhance performance on large proteins. The framework provides a fast, inexpensive alternative to determine the exact tertiary structure of unknown proteins.
"C.R.E.A.T.E."-ing Unique Primary-Source Research Paper Assignments for a Pleasure and Pain Course Teaching Neuroscientific Principles in a Large General Education Undergraduate Course.

PubMed

Bodnar, Richard J; Rotella, Francis M; Loiacono, Ilyssa; Coke, Tricia; Olsson, Kerstin; Barrientos, Alicia; Blachorsky, Lauren; Warshaw, Deena; Buras, Agata; Sanchez, Ciara M; Azad, Raihana; Stellar, James R

2016-01-01

A large (250 registrants) General Education lecture course, Pleasure and Pain, presented basic neuroscience principles as they related to animal and human models of pleasure and pain by weaving basic findings related to food and drug addiction and analgesic states with human studies examining empathy, social neuroscience and neuroeconomics. In its first four years, the course grade was based on weighted scores from two multiple-choice exams and a five-page review of three unique peer-reviewed research articles. Although well-registered and well-received, 18% of the students received Incomplete grades, primarily due to failing to submit the paper that went largely unresolved and eventually resulted in a failing grade. To rectify this issue, a modified version of the C.R.E.A.T.E. (Consider, Read, Elucidate hypotheses, Analyze and interpret data, Think of the next Experiment) method replaced the paper with eight structured assignments focusing on an initial general-topic article, the introduction-methods, and results-discussion of each of three related peer-review neuroscience-related articles, and a final summary. Compliance in completing these assignments was very high, resulting in only 11 INC grades out of 228 students. Thus, use of the C.R.E.A.T.E. method reduced the percentage of problematic INC grades from 18% to 4.8%, a 73% decline, without changing the overall grade distribution. Other analyses suggested the students achieved a deeper understanding of the scientific process using the C.R.E.A.T.E. method relative to the original term paper assignment.
Competing Pathways and Multiple Folding Nuclei in a Large Multidomain Protein, Luciferase.

PubMed

Scholl, Zackary N; Yang, Weitao; Marszalek, Piotr E

2017-05-09

Proteins obtain their final functional configuration through incremental folding with many intermediate steps in the folding pathway. If known, these intermediate steps could be valuable new targets for designing therapeutics and the sequence of events could elucidate the mechanism of refolding. However, determining these intermediate steps is hardly an easy feat, and has been elusive for most proteins, especially large, multidomain proteins. Here, we effectively map part of the folding pathway for the model large multidomain protein, Luciferase, by combining single-molecule force-spectroscopy experiments and coarse-grained simulation. Single-molecule refolding experiments reveal the initial nucleation of folding while simulations corroborate these stable core structures of Luciferase, and indicate the relative propensities for each to propagate to the final folded native state. Both experimental refolding and Monte Carlo simulations of Markov state models generated from simulation reveal that Luciferase most often folds along a pathway originating from the nucleation of the N-terminal domain, and that this pathway is the least likely to form nonnative structures. We then engineer truncated variants of Luciferase whose sequences corresponded to the putative structure from simulation and we use atomic force spectroscopy to determine their unfolding and stability. These experimental results corroborate the structures predicted from the folding simulation and strongly suggest that they are intermediates along the folding pathway. Taken together, our results suggest that initial Luciferase refolding occurs along a vectorial pathway and also suggest a mechanism that chaperones may exploit to prevent misfolding. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Applications of random forest feature selection for fine-scale genetic population assignment.

PubMed

Sylvester, Emma V A; Bentzen, Paul; Bradbury, Ian R; Clément, Marie; Pearce, Jon; Horne, John; Beiko, Robert G

2018-02-01

Genetic population assignment used to inform wildlife management and conservation efforts requires panels of highly informative genetic markers and sensitive assignment tests. We explored the utility of machine-learning algorithms (random forest, regularized random forest and guided regularized random forest) compared with F ST ranking for selection of single nucleotide polymorphisms (SNP) for fine-scale population assignment. We applied these methods to an unpublished SNP data set for Atlantic salmon ( Salmo salar ) and a published SNP data set for Alaskan Chinook salmon ( Oncorhynchus tshawytscha ). In each species, we identified the minimum panel size required to obtain a self-assignment accuracy of at least 90% using each method to create panels of 50-700 markers Panels of SNPs identified using random forest-based methods performed up to 7.8 and 11.2 percentage points better than F ST -selected panels of similar size for the Atlantic salmon and Chinook salmon data, respectively. Self-assignment accuracy ≥90% was obtained with panels of 670 and 384 SNPs for each data set, respectively, a level of accuracy never reached for these species using F ST -selected panels. Our results demonstrate a role for machine-learning approaches in marker selection across large genomic data sets to improve assignment for management and conservation of exploited populations.
Large oncosomes contain distinct protein cargo and represent a separate functional class of tumor-derived extracellular vesicles

PubMed Central

Minciacchi, Valentina R.; You, Sungyong; Spinelli, Cristiana; Morley, Samantha; Zandian, Mandana; Aspuria, Paul-Joseph; Cavallini, Lorenzo; Ciardiello, Chiara; Sobreiro, Mariana Reis; Morello, Matteo; Kharmate, Geetanjali; Jang, Su Chul; Kim, Dae-Kyum; Hosseini-Beheshti, Elham; Guns, Emma Tomlinson; Gleave, Martin; Gho, Yong Song; Mathivanan, Suresh; Yang, Wei; Freeman, Michael R.; Di Vizio, Dolores

2015-01-01

Large oncosomes (LO) are atypically large (1-10μm diameter) cancer-derived extracellular vesicles (EVs), originating from the shedding of membrane blebs and associated with advanced disease. We report that 25% of the proteins, identified by a quantitative proteomics analysis, are differentially represented in large and nano-sized EVs from prostate cancer cells. Proteins enriched in large EVs included enzymes involved in glucose, glutamine and amino acid metabolism, all metabolic processes relevant to cancer. Glutamine metabolism was altered in cancer cells exposed to large EVs, an effect that was not observed upon treatment with exosomes. Large EVs exhibited discrete buoyant densities in iodixanol (OptiPrepTM) gradients. Fluorescent microscopy of large EVs revealed an appearance consistent with LO morphology, indicating that these structures can be categorized as LO. Among the proteins enriched in LO, cytokeratin 18 (CK18) was one of the most abundant (within the top 5th percentile) and was used to develop an assay to detect LO in the circulation and tissues of mice and patients with prostate cancer. These observations indicate that LO represent a discrete EV type that may play a distinct role in tumor progression and that may be a source of cancer-specific markers. PMID:25857301
Structure based alignment and clustering of proteins (STRALCP)

DOEpatents

Zemla, Adam T.; Zhou, Carol E.; Smith, Jason R.; Lam, Marisa W.

2013-06-18

Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a "structural footprint" to the cluster.
Evaluating an Earlybird Scheme: Encouraging Early Assignment Writing and Revising

ERIC Educational Resources Information Center

Unsworth, Kerrie; Kauter, Katherine

2008-01-01

It is widely suggested that feedback on assignments is useful to students' learning, however, little research has examined how this feedback may be provided in large classes or the actual effects of such a scheme. We designed and implemented a voluntary "earlybird scheme" that provided detailed feedback to undergraduate Business students on a…
TRP channel proteins and signal transduction.

PubMed

Minke, Baruch; Cook, Boaz

2002-04-01

TRP channel proteins constitute a large and diverse family of proteins that are expressed in many tissues and cell types. This family was designated TRP because of a spontaneously occurring Drosophila mutant lacking TRP that responded to a continuous light with a transient receptor potential (hence TRP). In addition to responses to light, TRPs mediate responses to nerve growth factor, pheromones, olfaction, mechanical, chemical, temperature, pH, osmolarity, vasorelaxation of blood vessels, and metabolic stress. Furthermore, mutations in several members of TRP-related channel proteins are responsible for several diseases, such as several tumors and neurodegenerative disorders. TRP-related channel proteins are found in a variety of organisms, tissues, and cell types, including nonexcitable, smooth muscle, and neuronal cells. The large functional diversity of TRPs is also reflected in their diverse permeability to ions, although, in general, they are classified as nonselective cationic channels. The molecular domains that are conserved in all members of the TRP family constitute parts of the transmembrane domains and in most members also the ankyrin-like repeats at the NH2 terminal of the protein and a "TRP domain" at the COOH terminal, which is a highly conserved 25-amino acid stretch with still unknown function. All of the above features suggest that members of the TRP family are "special assignment" channels, which are recruited to diverse signaling pathways. The channels' roles and characteristics such as gating mechanism, regulation, and permeability are determined by evolution according to the specific functional requirements.
De-Coding Writing Assignments.

ERIC Educational Resources Information Center

Simon, Linda

1991-01-01

Argues that understanding assignments is the first step toward successful college writing. Urges instructors to support students by helping them to decode assignments. Breaks down instructions into individual tasks including (1) writing an essay, (2) examining an issue, (3) reviewing articles and books, and (4) focusing on some texts. Defines each…
Neandertals' large lower thorax may represent adaptation to high protein diet.

PubMed

Ben-Dor, Miki; Gopher, Avi; Barkai, Ran

2016-07-01

Humans are limited in their capacity to convert protein into energy. We present a hypothesis that a "bell" shaped thorax and a wide pelvis evolved in Neandertals, at least in part, as an adaptation to a high protein diet. A high protein diet created a need to house an enlarged liver and urinary system in a wider lower trunk. To test the hypothesis, we applied a model developed to identify points of nutritional stress. A ratio of obligatory dietary fat to total animal fat and protein sourced calories is calculated based on various known and estimated parameters. Stress is identified when the obligatory dietary fat ratio is higher than fat content ratios in available prey. The model predicts that during glacial winters, when carbohydrates weren't available, 74%-85% of Neandertals' caloric intake would have had to come from animal fat. Large animals contain around 50% fat calories, and their fat content is diminished during winter, so a significant stressful dietary fat deficit was identified by the model. This deficit could potentially be ameliorated by an increased capability to convert protein into energy. Given that high protein consumption is associated with larger liver and kidneys in animal models, it appears likely that the enlarged inferior section of the Neandertals thorax and possibly, in part, also his wide pelvis, represented an adaptation to provide encasement for those enlarged organs. Behavioral and evolutionary implications of the hypothesis are also discussed. Am J Phys Anthropol 160:367-378, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Backbone resonance assignments of the PRYSPRY domain of TRIM25.

PubMed

Kong, Chen; Penumutchu, Srinivasa R; Hung, Kuo-Wei; Huang, Huiying; Lin, Tianwei; Yu, Chin

2015-10-01

TRIM25 is a member of the tripartite motif (TRIM) family and has been implicated in the regulation of innate immune signaling via the RIG-I (retinoic acid-inducible gene-I) pathway for antiviral defense. As the essential first step towards the structural and functional characterization of the TRIM25/RIG-I interaction, the backbone resonance of the PRYSPRY domain of TRIM25 is assigned here based on triple-resonance experiments using uniformly [(2)H, (13)C, (15)N]-labeled protein.
47 CFR 74.1202 - Frequency assignment.

Code of Federal Regulations, 2013 CFR

2013-10-01

... FM Broadcast Booster Stations § 74.1202 Frequency assignment. (a) An applicant for a new FM broadcast... broadcast booster station will be assigned the channel assigned to its primary station. [35 FR 15388, Oct. 2...
47 CFR 74.1202 - Frequency assignment.

Code of Federal Regulations, 2011 CFR

2011-10-01

... FM Broadcast Booster Stations § 74.1202 Frequency assignment. (a) An applicant for a new FM broadcast... broadcast booster station will be assigned the channel assigned to its primary station. [35 FR 15388, Oct. 2...
47 CFR 74.1202 - Frequency assignment.

Code of Federal Regulations, 2012 CFR

2012-10-01

... FM Broadcast Booster Stations § 74.1202 Frequency assignment. (a) An applicant for a new FM broadcast... broadcast booster station will be assigned the channel assigned to its primary station. [35 FR 15388, Oct. 2...
47 CFR 74.1202 - Frequency assignment.

Code of Federal Regulations, 2014 CFR

2014-10-01

... FM Broadcast Booster Stations § 74.1202 Frequency assignment. (a) An applicant for a new FM broadcast... broadcast booster station will be assigned the channel assigned to its primary station. [35 FR 15388, Oct. 2...
An automated method for finding molecular complexes in large protein interaction networks

PubMed Central

Bader, Gary D; Hogue, Christopher WV

2003-01-01

Background Recent advances in proteomics technologies such as two-hybrid, phage display and mass spectrometry have enabled us to create a detailed map of biomolecular interaction networks. Initial mapping efforts have already produced a wealth of data. As the size of the interaction set increases, databases and computational methods will be required to store, visualize and analyze the information in order to effectively aid in knowledge discovery. Results This paper describes a novel graph theoretic clustering algorithm, "Molecular Complex Detection" (MCODE), that detects densely connected regions in large protein-protein interaction networks that may represent molecular complexes. The method is based on vertex weighting by local neighborhood density and outward traversal from a locally dense seed protein to isolate the dense regions according to given parameters. The algorithm has the advantage over other graph clustering methods of having a directed mode that allows fine-tuning of clusters of interest without considering the rest of the network and allows examination of cluster interconnectivity, which is relevant for protein networks. Protein interaction and complex information from the yeast Saccharomyces cerevisiae was used for evaluation. Conclusion Dense regions of protein interaction networks can be found, based solely on connectivity data, many of which correspond to known protein complexes. The algorithm is not affected by a known high rate of false positives in data from high-throughput interaction techniques. The program is available from . PMID:12525261
Optimizing Marine Security Guard Assignments

DTIC Science & Technology

2011-06-01

Bangkok, Thailand East Asia and Pacific 18 4 Fort Lauderdale, Florida Western Hemisphere - South 13 5 Frankfurt, Germany Western Europe and Scandinavia 15...Marine-billet assignment using similar fit criteria, such as rank and gender . 3. Minimize the amount of movement when filling the billets. That is, their...more than once. Gender Female MSGs will not be assigned to embassies that are not configured for females. DC Only DC-qualified MSGs are assigned to DC
Cell-free synthesis of membrane subunits of ATP synthase in phospholipid bicelles: NMR shows subunit a fold similar to the protein in the cell membrane

PubMed Central

Uhlemann, Eva-Maria E; Pierson, Hannah E; Fillingame, Robert H; Dmitriev, Oleg Y

2012-01-01

NMR structure determination of large membrane proteins is hampered by broad spectral lines, overlap, and ambiguity of signal assignment. Chemical shift and NOE assignment can be facilitated by amino acid selective isotope labeling in cell-free protein synthesis system. However, many biological detergents are incompatible with the cell-free synthesis, and membrane proteins often have to be synthesized in an insoluble form. We report cell-free synthesis of subunits a and c of the proton channel of Escherichia coli ATP synthase in a soluble form in a mixture of phosphatidylcholine derivatives. In comparison, subunit a was purified from the cell-free system and from the bacterial cell membranes. NMR spectra of both preparations were similar, indicating that our procedure for cell-free synthesis produces protein structurally similar to that prepared from the cell membranes. PMID:22162071
Proton nuclear magnetic resonance studies on the variant-3 neurotoxin from Centruroides sculpturatus Ewing: Sequential assignment of resonances

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nettesheim, D.G.; Klevit, R.E.; Drobny, G.

1989-02-21

The authors report the sequential assignment of resonances to specific residues in the proton nuclear magnetic resonance spectrum of the variant-3 neurotoxin from the scorpion Centruroides sculpturatus Ewing (range southwestern U.S.A.). A combination of two-dimensional NMR experiments such as 2D-COSY, 2D-NOESY, and single- and double-RELAY coherence transfer spectroscopy has been employed on samples of the protein dissolved in D{sub 2}O and in H{sub 2}O for assignment purposes. These studies provide a basis for the determination of the solution-phase conformation of this protein and for undertaking detailed structure-function studies of these neurotoxins that modulate the flow of sodium current by bindingmore » to the sodium channels of excitable membranes.« less

Value assignment of nutrient concentrations in five standard reference materials and six reference materials.

PubMed

Sharpless, K E; Gill, L M

2000-01-01

A number of food-matrix reference materials (RMs) are available from the National Institute of Standards and Technology (NIST) and from Agriculture Canada through NIST. Most of these materials were originally value-assigned for their elemental composition (major, minor, and trace elements), but no additional nutritional information was provided. Two of the materials were certified for selected organic constituents. Ten of these materials (Standard Reference Material [SRM] 1,563 Cholesterol and Fat-Soluble Vitamins in Coconut Oil [Natural and Fortified], SRM 1,566b Oyster Tissue, SRM 1,570a Spinach Leaves, SRM 1,974a Organics in Mussel Tissue (Mytilus edulis), RM 8,415 Whole Egg Powder, RM 8,418 Wheat Gluten, RM 8,432 Corn Starch, RM 8,433 Corn Bran, RM 8,435 Whole Milk Powder, and RM 8,436 Durum Wheat Flour) were recently distributed by NIST to 4 laboratories with expertise in food analysis for the measurement of proximates (solids, fat, protein, etc.), calories, and total dietary fiber, as appropriate. SRM 1846 Infant Formula was distributed as a quality control sample for the proximates and for analysis for individual fatty acids. Two of the materials (Whole Egg Powder and Whole Milk Powder) were distributed in an earlier interlaboratory comparison exercise in which they were analyzed for several vitamins. Value assignment of analyte concentrations in these 11 SRMs and RMs, based on analyses by the collaborating laboratories, is described in this paper. These materials are intended primarily for validation of analytical methods for the measurement of nutrients in foods of similar composition (based on AOAC INTERNATIONAL's fat-protein-carbohydrate triangle). They may also be used as "primary control materials" in the value assignment of in-house control materials of similar composition. The addition of proximate information for 10 existing reference materials means that RMs are now available from NIST with assigned values for proximates in 6 of the 9 sectors of
Optimizing high performance computing workflow for protein functional annotation

PubMed Central

Stanberry, Larissa; Rekepalli, Bhanu; Liu, Yuan; Giblock, Paul; Higdon, Roger; Montague, Elizabeth; Broomall, William; Kolker, Natali; Kolker, Eugene

2014-01-01

Functional annotation of newly sequenced genomes is one of the major challenges in modern biology. With modern sequencing technologies, the protein sequence universe is rapidly expanding. Newly sequenced bacterial genomes alone contain over 7.5 million proteins. The rate of data generation has far surpassed that of protein annotation. The volume of protein data makes manual curation infeasible, whereas a high compute cost limits the utility of existing automated approaches. In this work, we present an improved and optmized automated workflow to enable large-scale protein annotation. The workflow uses high performance computing architectures and a low complexity classification algorithm to assign proteins into existing clusters of orthologous groups of proteins. On the basis of the Position-Specific Iterative Basic Local Alignment Search Tool the algorithm ensures at least 80% specificity and sensitivity of the resulting classifications. The workflow utilizes highly scalable parallel applications for classification and sequence alignment. Using Extreme Science and Engineering Discovery Environment supercomputers, the workflow processed 1,200,000 newly sequenced bacterial proteins. With the rapid expansion of the protein sequence universe, the proposed workflow will enable scientists to annotate big genome data. PMID:25313296
Optimizing high performance computing workflow for protein functional annotation.

PubMed

Stanberry, Larissa; Rekepalli, Bhanu; Liu, Yuan; Giblock, Paul; Higdon, Roger; Montague, Elizabeth; Broomall, William; Kolker, Natali; Kolker, Eugene

2014-09-10

Functional annotation of newly sequenced genomes is one of the major challenges in modern biology. With modern sequencing technologies, the protein sequence universe is rapidly expanding. Newly sequenced bacterial genomes alone contain over 7.5 million proteins. The rate of data generation has far surpassed that of protein annotation. The volume of protein data makes manual curation infeasible, whereas a high compute cost limits the utility of existing automated approaches. In this work, we present an improved and optmized automated workflow to enable large-scale protein annotation. The workflow uses high performance computing architectures and a low complexity classification algorithm to assign proteins into existing clusters of orthologous groups of proteins. On the basis of the Position-Specific Iterative Basic Local Alignment Search Tool the algorithm ensures at least 80% specificity and sensitivity of the resulting classifications. The workflow utilizes highly scalable parallel applications for classification and sequence alignment. Using Extreme Science and Engineering Discovery Environment supercomputers, the workflow processed 1,200,000 newly sequenced bacterial proteins. With the rapid expansion of the protein sequence universe, the proposed workflow will enable scientists to annotate big genome data.
Interactive Effects of Indigestible Carbohydrates, Protein Type, and Protein Level on Biomarkers of Large Intestine Health in Rats

PubMed Central

Taciak, Marcin; Barszcz, Marcin; Tuśnio, Anna; Pastuszewska, Barbara

2015-01-01

The effects of indigestible carbohydrates, protein type, and protein level on large intestine health were examined in rats. For 21 days, 12 groups of six 12-week-old male Wistar rats were fed diets with casein (CAS), or potato protein concentrate (PPC), providing 14% (lower protein level; LP), or 20% (higher protein level; HP) protein, and containing cellulose, resistant potato starch, or pectin. Fermentation end-products, pH, and β-glucuronidase levels in cecal digesta, and ammonia levels in colonic digesta were determined. Cecal digesta, tissue weights, cecal and colon morphology, and colonocyte DNA damage were also analyzed. Digesta pH was lower, whereas relative mass of cecal tissue and digesta were higher in rats fed pectin diets than in those fed cellulose. Cecal parameters were greater in rats fed PPC and HP diets than in those fed CAS and LP diets, respectively. Short-chain fatty acid (SCFA) concentrations were unaffected by protein or carbohydrate type. Total SCFA, acetic acid, and propionic acid concentrations were greater in rats fed LP diets than in those fed HP. Cecal pool of isobutyric and isovaleric acids was greater in rats fed PPC than in those fed CAS diets. PPC diets decreased phenol concentration and increased ammonia concentration in cecal and colonic digesta, respectively. Cecal crypt depth was greater in rats fed PPC and HP diets, and was unaffected by carbohydrates; whereas colonic crypt depth was greater in rats fed cellulose. Myenteron thickness in the cecum was unaffected by nutrition, but was greater in the colon of rats fed cellulose. Colonocyte DNA damage was greater in rats fed LP diets than in those fed HP diets, and was unaffected by carbohydrate or protein type. It was found that nutritional factors decreasing cecal digesta weight contribute to greater phenol production, increased DNA damage, and reduced ammonia concentration in the colon. PMID:26536028
Chemical shift assignments of the partially deuterated Fyn SH2-SH3 domain.

PubMed

Kieken, Fabien; Loth, Karine; van Nuland, Nico; Tompa, Peter; Lenaerts, Tom

2018-04-01

Src Homology 2 and 3 (SH2 and SH3) are two key protein interaction modules involved in regulating the activity of many proteins such as tyrosine kinases and phosphatases by respective recognition of phosphotyrosine and proline-rich regions. In the Src family kinases, the inactive state of the protein is the direct result of the interaction of the SH2 and the SH3 domain with intra-molecular regions, leading to a closed structure incompetent with substrate modification. Here, we report the 1 H, 15 N and 13 C backbone- and side-chain chemical shift assignments of the partially deuterated Fyn SH3-SH2 domain and structural differences between tandem and single domains. The BMRB accession number is 27165.
Adaptable Assignment

DOT National Transportation Integrated Search

1997-01-01

This paper reports on a practical, simple method for adjusting a vehicle trip table so that the resulting assignments more closely match available traffic counts. "Practical" means that this is not purely a research effort - the procedure described h...
A new system for naming ribosomal proteins

PubMed Central

Ban, Nenad; Beckmann, Roland; Cate, Jamie HD; Dinman, Jonathan D; Dragon, François; Ellis, Steven R; Lafontaine, Denis LJ; Lindahl, Lasse; Liljas, Anders; Lipton, Jeffrey M; McAlear, Michael A; Moore, Peter B; Noller, Harry F; Ortega, Joaquin; Panse, Vikram Govind; Ramakrishnan, V; Spahn, Christian MT; Steitz, Thomas A; Tchorzewski, Marek; Tollervey, David; Warren, Alan J; Williamson, James R; Wilson, Daniel; Yonath, Ada; Yusupov, Marat

2015-01-01

A system for naming ribosomal proteins is described that the authors intend to use in the future. They urge others to adopt it. The objective is to eliminate the confusion caused by the assignment of identical names to ribosomal proteins from different species that are unrelated in structure and function. In the system proposed here, homologous ribosomal proteins are assigned the same name, regardless of species. It is designed so that new names are similar enough to old names to be easily recognized, but are written in a format that unambiguously identifies them as ‘new system’ names. PMID:24524803
Real life working shift assignment problem

NASA Astrophysics Data System (ADS)

Sze, San-Nah; Kwek, Yeek-Ling; Tiong, Wei-King; Chiew, Kang-Leng

2017-07-01

This study concerns about the working shift assignment in an outlet of Supermarket X in Eastern Mall, Kuching. The working shift assignment needs to be solved at least once in every month. Current approval process of working shifts is too troublesome and time-consuming. Furthermore, the management staff cannot have an overview of manpower and working shift schedule. Thus, the aim of this study is to develop working shift assignment simulation and propose a working shift assignment solution. The main objective for this study is to fulfill manpower demand at minimum operation cost. Besides, the day off and meal break policy should be fulfilled accordingly. Demand based heuristic is proposed to assign working shift and the quality of the solution is evaluated by using the real data.
28 CFR 301.103 - Inmate work assignments.

Code of Federal Regulations, 2014 CFR

2014-07-01

... 28 Judicial Administration 2 2014-07-01 2014-07-01 false Inmate work assignments. 301.103 Section... COMPENSATION General § 301.103 Inmate work assignments. The unit team of each inmate, which ordinarily designates work assignments, or whoever makes work assignments, shall review appropriate medical records...
28 CFR 301.103 - Inmate work assignments.

Code of Federal Regulations, 2012 CFR

2012-07-01

... 28 Judicial Administration 2 2012-07-01 2012-07-01 false Inmate work assignments. 301.103 Section... COMPENSATION General § 301.103 Inmate work assignments. The unit team of each inmate, which ordinarily designates work assignments, or whoever makes work assignments, shall review appropriate medical records...
28 CFR 301.103 - Inmate work assignments.

Code of Federal Regulations, 2011 CFR

2011-07-01

... 28 Judicial Administration 2 2011-07-01 2011-07-01 false Inmate work assignments. 301.103 Section... COMPENSATION General § 301.103 Inmate work assignments. The unit team of each inmate, which ordinarily designates work assignments, or whoever makes work assignments, shall review appropriate medical records...
28 CFR 301.103 - Inmate work assignments.

Code of Federal Regulations, 2013 CFR

2013-07-01

... 28 Judicial Administration 2 2013-07-01 2013-07-01 false Inmate work assignments. 301.103 Section... COMPENSATION General § 301.103 Inmate work assignments. The unit team of each inmate, which ordinarily designates work assignments, or whoever makes work assignments, shall review appropriate medical records...
28 CFR 544.74 - Work assignment limitations.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 28 Judicial Administration 2 2010-07-01 2010-07-01 false Work assignment limitations. 544.74... EDUCATION Literacy Program § 544.74 Work assignment limitations. These limitations on work assignment... order to be considered for a commissary work assignment above minimum pay level, an institution work...
Dual expression of MYC and BCL2 proteins predicts worse outcomes in diffuse large B-cell lymphoma.

PubMed

Clark Schneider, Kelli M; Banks, Peter M; Collie, Angela M B; Lanigan, Christopher P; Manilich, Elena; Durkin, Lisa M; Hill, Brian T; Hsi, Eric D

2016-07-01

Recent studies suggested that MYC and BCL2 protein co-expression is an independent indicator of poor prognosis in diffuse large B-cell lymphoma. However, the immunohistochemistry protocols for dual-expression staining and the scoring cut-offs vary by study. Sixty-nine cases of diffuse large B-cell lymphoma were evaluated for MYC and BCL2 protein expression using various cut-offs that have been recommended in prior studies. Independent of the International Prognostic Index risk group, cases with dual protein expression of BCL2 and MYC using ≥50%/40% cut-offs and ≥70%/40% had significantly shorter overall survival than cases without. It was verified in this patient population that the use of BCL2 and MYC immunohistochemistry, performed with available in vitro diagnostic-cleared antibodies, provides rapid prognostic information in patients with de novo diffuse large B-cell lymphoma. This study has practical implications for diagnostic laboratories and serves as a guide for implementation in the setting of future clinical trials.
24 CFR 221.770 - Assignment option.

Code of Federal Regulations, 2010 CFR

2010-04-01

... 24 Housing and Urban Development 2 2010-04-01 2010-04-01 false Assignment option. 221.770 Section... § 221.770 Assignment option. A mortgagee holding a conditional or firm commitment issued on or before... mortgagee's approved underwriter on or before November 30, 1983) has the option to assign, transfer and...
Genome-wide protein-protein interactions and protein function exploration in cyanobacteria

PubMed Central

Lv, Qi; Ma, Weimin; Liu, Hui; Li, Jiang; Wang, Huan; Lu, Fang; Zhao, Chen; Shi, Tieliu

2015-01-01

Genome-wide network analysis is well implemented to study proteins of unknown function. Here, we effectively explored protein functions and the biological mechanism based on inferred high confident protein-protein interaction (PPI) network in cyanobacteria. We integrated data from seven different sources and predicted 1,997 PPIs, which were evaluated by experiments in molecular mechanism, text mining of literatures in proved direct/indirect evidences, and “interologs” in conservation. Combined the predicted PPIs with known PPIs, we obtained 4,715 no-redundant PPIs (involving 3,231 proteins covering over 90% of genome) to generate the PPI network. Based on the PPI network, terms in Gene ontology (GO) were assigned to function-unknown proteins. Functional modules were identified by dissecting the PPI network into sub-networks and analyzing pathway enrichment, with which we investigated novel function of underlying proteins in protein complexes and pathways. Examples of photosynthesis and DNA repair indicate that the network approach is a powerful tool in protein function analysis. Overall, this systems biology approach provides a new insight into posterior functional analysis of PPIs in cyanobacteria. PMID:26490033
Protein crystal growth in microgravity: Temperature induced large scale crystallization of insulin

NASA Technical Reports Server (NTRS)

Long, Marianna M.; Delucas, Larry J.; Smith, C.; Carson, M.; Moore, K.; Harrington, Michael D.; Pillion, D. J.; Bishop, S. P.; Rosenblum, W. M.; Naumann, R. J.

1994-01-01

One of the major stumbling blocks that prevents rapid structure determination using x-ray crystallography is macro-molecular crystal growth. There are many examples where crystallization takes longer than structure determination. In some cases, it is impossible to grow useful crystals on earth. Recent experiments conducted in conjuction with NASA on various Space Shuttle missions have demonstrated that protein crystals often grow larger and display better internal molecular order than their earth-grown counterparts. This paper reports results from three Shuttle flights using the Protein Crystallization Facility (PCF). The PCF hardware produced large, high-quality insulin crystals by using a temperature change as the sole means to affect protein solubility and thus, crystallization. The facility consists of cylinders/containers with volumes of 500, 200, 100, and 50 ml. Data from the three Shuttle flights demonstrated that larger, higher resolution crystals (as evidenced by x-ray diffraction data) were obtained from the microgravity experiments when compared to earth-grown crystals.
Large-scale De Novo Prediction of Physical Protein-Protein Association*

PubMed Central

Elefsinioti, Antigoni; Saraç, Ömer Sinan; Hegele, Anna; Plake, Conrad; Hubner, Nina C.; Poser, Ina; Sarov, Mihail; Hyman, Anthony; Mann, Matthias; Schroeder, Michael; Stelzl, Ulrich; Beyer, Andreas

2011-01-01

Information about the physical association of proteins is extensively used for studying cellular processes and disease mechanisms. However, complete experimental mapping of the human interactome will remain prohibitively difficult in the near future. Here we present a map of predicted human protein interactions that distinguishes functional association from physical binding. Our network classifies more than 5 million protein pairs predicting 94,009 new interactions with high confidence. We experimentally tested a subset of these predictions using yeast two-hybrid analysis and affinity purification followed by quantitative mass spectrometry. Thus we identified 462 new protein-protein interactions and confirmed the predictive power of the network. These independent experiments address potential issues of circular reasoning and are a distinctive feature of this work. Analysis of the physical interactome unravels subnetworks mediating between different functional and physical subunits of the cell. Finally, we demonstrate the utility of the network for the analysis of molecular mechanisms of complex diseases by applying it to genome-wide association studies of neurodegenerative diseases. This analysis provides new evidence implying TOMM40 as a factor involved in Alzheimer's disease. The network provides a high-quality resource for the analysis of genomic data sets and genetic association studies in particular. Our interactome is available via the hPRINT web server at: www.print-db.org. PMID:21836163
Accounting for Protein Subcellular Localization: A Compartmental Map of the Rat Liver Proteome*

PubMed Central

Jadot, Michel; Boonen, Marielle; Thirion, Jaqueline; Wang, Nan; Xing, Jinchuan; Zhao, Caifeng; Tannous, Abla; Qian, Meiqian; Zheng, Haiyan; Everett, John K.; Moore, Dirk F.; Sleat, David E.; Lobel, Peter

2017-01-01

Accurate knowledge of the intracellular location of proteins is important for numerous areas of biomedical research including assessing fidelity of putative protein-protein interactions, modeling cellular processes at a system-wide level and investigating metabolic and disease pathways. Many proteins have not been localized, or have been incompletely localized, partly because most studies do not account for entire subcellular distribution. Thus, proteins are frequently assigned to one organelle whereas a significant fraction may reside elsewhere. As a step toward a comprehensive cellular map, we used subcellular fractionation with classic balance sheet analysis and isobaric labeling/quantitative mass spectrometry to assign locations to >6000 rat liver proteins. We provide quantitative data and error estimates describing the distribution of each protein among the eight major cellular compartments: nucleus, mitochondria, lysosomes, peroxisomes, endoplasmic reticulum, Golgi, plasma membrane and cytosol. Accounting for total intracellular distribution improves quality of organelle assignments and assigns proteins with multiple locations. Protein assignments and supporting data are available online through the Prolocate website (http://prolocate.cabm.rutgers.edu). As an example of the utility of this data set, we have used organelle assignments to help analyze whole exome sequencing data from an infant dying at 6 months of age from a suspected neurodegenerative lysosomal storage disorder of unknown etiology. Sequencing data was prioritized using lists of lysosomal proteins comprising well-established residents of this organelle as well as novel candidates identified in this study. The latter included copper transporter 1, encoded by SLC31A1, which we localized to both the plasma membrane and lysosome. The patient harbors two predicted loss of function mutations in SLC31A1, suggesting that this may represent a heretofore undescribed recessive lysosomal storage disease
Accounting for Protein Subcellular Localization: A Compartmental Map of the Rat Liver Proteome.

PubMed

Jadot, Michel; Boonen, Marielle; Thirion, Jaqueline; Wang, Nan; Xing, Jinchuan; Zhao, Caifeng; Tannous, Abla; Qian, Meiqian; Zheng, Haiyan; Everett, John K; Moore, Dirk F; Sleat, David E; Lobel, Peter

2017-02-01

Accurate knowledge of the intracellular location of proteins is important for numerous areas of biomedical research including assessing fidelity of putative protein-protein interactions, modeling cellular processes at a system-wide level and investigating metabolic and disease pathways. Many proteins have not been localized, or have been incompletely localized, partly because most studies do not account for entire subcellular distribution. Thus, proteins are frequently assigned to one organelle whereas a significant fraction may reside elsewhere. As a step toward a comprehensive cellular map, we used subcellular fractionation with classic balance sheet analysis and isobaric labeling/quantitative mass spectrometry to assign locations to >6000 rat liver proteins. We provide quantitative data and error estimates describing the distribution of each protein among the eight major cellular compartments: nucleus, mitochondria, lysosomes, peroxisomes, endoplasmic reticulum, Golgi, plasma membrane and cytosol. Accounting for total intracellular distribution improves quality of organelle assignments and assigns proteins with multiple locations. Protein assignments and supporting data are available online through the Prolocate website (http://prolocate.cabm.rutgers.edu). As an example of the utility of this data set, we have used organelle assignments to help analyze whole exome sequencing data from an infant dying at 6 months of age from a suspected neurodegenerative lysosomal storage disorder of unknown etiology. Sequencing data was prioritized using lists of lysosomal proteins comprising well-established residents of this organelle as well as novel candidates identified in this study. The latter included copper transporter 1, encoded by SLC31A1, which we localized to both the plasma membrane and lysosome. The patient harbors two predicted loss of function mutations in SLC31A1, suggesting that this may represent a heretofore undescribed recessive lysosomal storage disease

24 CFR 221.255 - Assignment option.

Code of Federal Regulations, 2010 CFR

2010-04-01

... 24 Housing and Urban Development 2 2010-04-01 2010-04-01 false Assignment option. 221.255 Section... Assignment option. (a) A mortgagee holding a mortgage insured pursuant to a conditional or firm commitment issued on or before November 30, 1983 has the option to assign, transfer and deliver to the Commissioner...
48 CFR 1552.211-74 - Work assignments.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 48 Federal Acquisition Regulations System 6 2011-10-01 2011-10-01 false Work assignments. 1552.211... Work assignments. As prescribed in 1511.011-74, insert the following contract clause in cost-reimbursement type term form contracts when work assignments are to be used. Work Assignments (APR 1984) (a) The...
“C.R.E.A.T.E.”-ing Unique Primary-Source Research Paper Assignments for a Pleasure and Pain Course Teaching Neuroscientific Principles in a Large General Education Undergraduate Course

PubMed Central

Bodnar, Richard J.; Rotella, Francis M.; Loiacono, Ilyssa; Coke, Tricia; Olsson, Kerstin; Barrientos, Alicia; Blachorsky, Lauren; Warshaw, Deena; Buras, Agata; Sanchez, Ciara M.; Azad, Raihana; Stellar, James R.

2016-01-01

A large (250 registrants) General Education lecture course, Pleasure and Pain, presented basic neuroscience principles as they related to animal and human models of pleasure and pain by weaving basic findings related to food and drug addiction and analgesic states with human studies examining empathy, social neuroscience and neuroeconomics. In its first four years, the course grade was based on weighted scores from two multiple-choice exams and a five-page review of three unique peer-reviewed research articles. Although well-registered and well-received, 18% of the students received Incomplete grades, primarily due to failing to submit the paper that went largely unresolved and eventually resulted in a failing grade. To rectify this issue, a modified version of the C.R.E.A.T.E. (Consider, Read, Elucidate hypotheses, Analyze and interpret data, Think of the next Experiment) method replaced the paper with eight structured assignments focusing on an initial general-topic article, the introduction-methods, and results-discussion of each of three related peer-review neuroscience-related articles, and a final summary. Compliance in completing these assignments was very high, resulting in only 11 INC grades out of 228 students. Thus, use of the C.R.E.A.T.E. method reduced the percentage of problematic INC grades from 18% to 4.8%, a 73% decline, without changing the overall grade distribution. Other analyses suggested the students achieved a deeper understanding of the scientific process using the C.R.E.A.T.E. method relative to the original term paper assignment. PMID:27385918
Large-scale identification of odorant-binding proteins and chemosensory proteins from expressed sequence tags in insects

PubMed Central

2009-01-01

Background Insect odorant binding proteins (OBPs) and chemosensory proteins (CSPs) play an important role in chemical communication of insects. Gene discovery of these proteins is a time-consuming task. In recent years, expressed sequence tags (ESTs) of many insect species have accumulated, thus providing a useful resource for gene discovery. Results We have developed a computational pipeline to identify OBP and CSP genes from insect ESTs. In total, 752,841 insect ESTs were examined from 54 species covering eight Orders of Insecta. From these ESTs, 142 OBPs and 177 CSPs were identified, of which 117 OBPs and 129 CSPs are new. The complete open reading frames (ORFs) of 88 OBPs and 123 CSPs were obtained by electronic elongation. We randomly chose 26 OBPs from eight species of insects, and 21 CSPs from four species for RT-PCR validation. Twenty two OBPs and 16 CSPs were confirmed by RT-PCR, proving the efficiency and reliability of the algorithm. Together with all family members obtained from the NCBI (OBPs) or the UniProtKB (CSPs), 850 OBPs and 237 CSPs were analyzed for their structural characteristics and evolutionary relationship. Conclusions A large number of new OBPs and CSPs were found, providing the basis for deeper understanding of these proteins. In addition, the conserved motif and evolutionary analysis provide some new insights into the evolution of insect OBPs and CSPs. Motif pattern fine-tune the functions of OBPs and CSPs, leading to the minor difference in binding sex pheromone or plant volatiles in different insect Orders. PMID:20034407
From Story to Analysis: Reflection and Uptake in the Literacy Narrative Assignment

ERIC Educational Resources Information Center

Alexander, Kara Poe

2015-01-01

The literacy narrative assignment is popular with composition instructors because of the reflection it encourages in students. Previously, scholars have claimed that students demonstrate reflection in literacy narratives when they critique dominant ideologies. Largely absent, however, is research on what other elements might indicate reflection…
Technological Support for Assignment Assessment: A New Zealand Higher Education Survey

ERIC Educational Resources Information Center

Milne, John; Heinrich, Eva; Morrison, David

2008-01-01

This article presents selected aspects of a large study on the use of e-learning tools in support of assignment assessment, which was comprised of a substantial literature review, an investigation into electronic tools, and interviews with 90 academics at New Zealand tertiary institutions. The article makes two main contributions. Based on the…
Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms.

PubMed

Lin, Xiaotong; Liu, Mei; Chen, Xue-wen

2009-04-29

Protein-protein interactions play vital roles in nearly all cellular processes and are involved in the construction of biological pathways such as metabolic and signal transduction pathways. Although large-scale experiments have enabled the discovery of thousands of previously unknown linkages among proteins in many organisms, the high-throughput interaction data is often associated with high error rates. Since protein interaction networks have been utilized in numerous biological inferences, the inclusive experimental errors inevitably affect the quality of such prediction. Thus, it is essential to assess the quality of the protein interaction data. In this paper, a novel Bayesian network-based integrative framework is proposed to assess the reliability of protein-protein interactions. We develop a cross-species in silico model that assigns likelihood scores to individual protein pairs based on the information entirely extracted from model organisms. Our proposed approach integrates multiple microarray datasets and novel features derived from gene ontology. Furthermore, the confidence scores for cross-species protein mappings are explicitly incorporated into our model. Applying our model to predict protein interactions in the human genome, we are able to achieve 80% in sensitivity and 70% in specificity. Finally, we assess the overall quality of the experimentally determined yeast protein-protein interaction dataset. We observe that the more high-throughput experiments confirming an interaction, the higher the likelihood score, which confirms the effectiveness of our approach. This study demonstrates that model organisms certainly provide important information for protein-protein interaction inference and assessment. The proposed method is able to assess not only the overall quality of an interaction dataset, but also the quality of individual protein-protein interactions. We expect the method to continually improve as more high quality interaction data from more
Introducing chemical biology applications to introductory organic chemistry students using series of weekly assignments.

PubMed

Kanin, Maralee R; Pontrello, Jason K

2016-01-01

Calls to bring interdisciplinary content and examples into introductory science courses have increased, yet strategies that involve course restructuring often suffer from the need for a significant faculty commitment to motivate change. Minimizing the need for dramatic course reorganization, the structure, reactivity, and chemical biology applications of classes of biological monomers and polymers have been integrated into introductory organic chemistry courses through three series of semester-long weekly assignments that explored (a) Carbohydrates and Oligosaccharides, (b) Amino Acids, Peptides, and Proteins, and (c) Nucleosides, Nucleotides, and Nucleic Acids. Comparisons of unannounced pre- and post tests revealed improved understanding of a reaction introduced in the assignments, and course examinations evaluated cumulative assignment topics. Course surveys revealed that demonstrating biologically relevant applications consistently throughout the semesters enhanced student interest in the connection between basic organic chemistry content and its application to new and unfamiliar bio-related examples. Covering basic material related to these classes of molecules outside of the classroom opened lecture time to allow the instructor to further build on information developed through the weekly assignments, teaching advanced topics and applications typically not covered in an introductory organic chemistry lecture course. Assignments were implemented as homework, either with or without accompanying discussion, in both laboratory and lecture organic courses within the context of the existing course structures. © 2015 The International Union of Biochemistry and Molecular Biology.
Using interviews to understand the assignment mechanism in a nonexperimental study: the case of eighth grade algebra.

PubMed

Rickles, Jordan H

2011-10-01

Many inquiries regarding the causal effects of policies or programs are based on research designs where the treatment assignment process is unknown, and thus valid inferences depend on tenuous assumptions about the assignment mechanism. This article draws attention to the importance of understanding the assignment mechanism in policy and program evaluation studies, and illustrates how information collected through interviews can develop a richer understanding of the assignment mechanism. Focusing on the issue of student assignment to algebra in 8th grade, I show how a preliminary data collection effort aimed at understanding the assignment mechanism is particularly beneficial in multisite observational studies in education. The findings, based on ten interviews and administrative data from a large school district, draw attention to the often ignored heterogeneity in the assignment mechanism across schools. These findings likely extend beyond the current research project in question to related educational policy issues such as ability grouping, tracking, differential course taking, and curricular intensity, as well as other social programs in which the assignment mechanism can differ across sites.
Geographical assignment of hospitalists in an urban teaching hospital: feasibility and impact on efficiency and provider satisfaction.

PubMed

Bryson, Christine; Boynton, Greta; Stepczynski, Anna; Garb, Jane; Kleppel, Reva; Irani, Farzan; Natanasabapathy, Siva; Stefan, Mihaela S

2017-10-01

To evaluate whether implementation of a geographic model of assigning hospitalists is feasible and sustainable in a large hospitalist program and assess its impact on provider satisfaction, perceived efficiency and patient outcomes. Pre (3 months) - post (12 months) intervention study conducted from June 2014 through September 2015 at a tertiary care medical center with a large hospitalist program caring for patients scattered in 4 buildings and 16 floors. Hospitalists were assigned to a particular nursing unit (geographic assignment) with a goal of having over 80% of their assigned patients located on their assigned unit. Satisfaction and perceived efficiency were assessed through a survey administered before and after the intervention. Geographic assignment percentage increased from an average of 60% in the pre-intervention period to 93% post-intervention. The number of hospitalists covering a 32 bed unit decreased from 8-10 pre to 2-3 post-intervention. A majority of physicians (87%) thought that geography had a positive impact on the overall quality of care. Respondents reported that they felt that geography increased time spent with patient/caregivers to discuss plan of care (p < 0.001); improved communication with nurses (p = 0.0009); and increased sense of teamwork with nurses/case managers (p < 0.001). Mean length of stay (4.54 vs 4.62 days), 30-day readmission rates (16.0% vs 16.6%) and patient satisfaction (79.9 vs 77.3) did not change significantly between the pre- and post-implementation period. The discharge before noon rate improved slightly (47.5% - 54.1%). Implementation of a unit-based model in a large hospitalist program is feasible and sustainable with appropriate planning and support. The geographical model of care increased provider satisfaction and perceived efficiency; it also facilitated the implementation of other key interventions such as interdisciplinary rounds.
DWARF – a data warehouse system for analyzing protein families

PubMed Central

Fischer, Markus; Thai, Quan K; Grieb, Melanie; Pleiss, Jürgen

2006-01-01

Background The emerging field of integrative bioinformatics provides the tools to organize and systematically analyze vast amounts of highly diverse biological data and thus allows to gain a novel understanding of complex biological systems. The data warehouse DWARF applies integrative bioinformatics approaches to the analysis of large protein families. Description The data warehouse system DWARF integrates data on sequence, structure, and functional annotation for protein fold families. The underlying relational data model consists of three major sections representing entities related to the protein (biochemical function, source organism, classification to homologous families and superfamilies), the protein sequence (position-specific annotation, mutant information), and the protein structure (secondary structure information, superimposed tertiary structure). Tools for extracting, transforming and loading data from public available resources (ExPDB, GenBank, DSSP) are provided to populate the database. The data can be accessed by an interface for searching and browsing, and by analysis tools that operate on annotation, sequence, or structure. We applied DWARF to the family of α/β-hydrolases to host the Lipase Engineering database. Release 2.3 contains 6138 sequences and 167 experimentally determined protein structures, which are assigned to 37 superfamilies 103 homologous families. Conclusion DWARF has been designed for constructing databases of large structurally related protein families and for evaluating their sequence-structure-function relationships by a systematic analysis of sequence, structure and functional annotation. It has been applied to predict biochemical properties from sequence, and serves as a valuable tool for protein engineering. PMID:17094801
FOAM (Functional Ontology Assignments for Metagenomes): A Hidden Markov Model (HMM) database with environmental focus

DOE PAGES

Prestat, Emmanuel; David, Maude M.; Hultman, Jenni; ...

2014-09-26

A new functional gene database, FOAM (Functional Ontology Assignments for Metagenomes), was developed to screen environmental metagenomic sequence datasets. FOAM provides a new functional ontology dedicated to classify gene functions relevant to environmental microorganisms based on Hidden Markov Models (HMMs). Sets of aligned protein sequences (i.e. ‘profiles’) were tailored to a large group of target KEGG Orthologs (KOs) from which HMMs were trained. The alignments were checked and curated to make them specific to the targeted KO. Within this process, sequence profiles were enriched with the most abundant sequences available to maximize the yield of accurate classifier models. An associatedmore » functional ontology was built to describe the functional groups and hierarchy. FOAM allows the user to select the target search space before HMM-based comparison steps and to easily organize the results into different functional categories and subcategories. FOAM is publicly available at http://portal.nersc.gov/project/m1317/FOAM/.« less
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.

PubMed

Theofilatos, Konstantinos; Pavlopoulou, Niki; Papasavvas, Christoforos; Likothanassis, Spiros; Dimitrakopoulos, Christos; Georgopoulos, Efstratios; Moschopoulos, Charalampos; Mavroudi, Seferina

2015-03-01

Proteins are considered to be the most important individual components of biological systems and they combine to form physical protein complexes which are responsible for certain molecular functions. Despite the large availability of protein-protein interaction (PPI) information, not much information is available about protein complexes. Experimental methods are limited in terms of time, efficiency, cost and performance constraints. Existing computational methods have provided encouraging preliminary results, but they phase certain disadvantages as they require parameter tuning, some of them cannot handle weighted PPI data and others do not allow a protein to participate in more than one protein complex. In the present paper, we propose a new fully unsupervised methodology for predicting protein complexes from weighted PPI graphs. The proposed methodology is called evolutionary enhanced Markov clustering (EE-MC) and it is a hybrid combination of an adaptive evolutionary algorithm and a state-of-the-art clustering algorithm named enhanced Markov clustering. EE-MC was compared with state-of-the-art methodologies when applied to datasets from the human and the yeast Saccharomyces cerevisiae organisms. Using public available datasets, EE-MC outperformed existing methodologies (in some datasets the separation metric was increased by 10-20%). Moreover, when applied to new human datasets its performance was encouraging in the prediction of protein complexes which consist of proteins with high functional similarity. In specific, 5737 protein complexes were predicted and 72.58% of them are enriched for at least one gene ontology (GO) function term. EE-MC is by design able to overcome intrinsic limitations of existing methodologies such as their inability to handle weighted PPI networks, their constraint to assign every protein in exactly one cluster and the difficulties they face concerning the parameter tuning. This fact was experimentally validated and moreover, new
Studying interregional wildland fire engine assignments for large fire suppression

Treesearch

Erin J. Belval; Yu Wei; David E. Calkin; Crystal S. Stonesifer; Matthew P. Thompson; John R. Tipton

2017-01-01

One crucial component of large fire response in the United States (US) is the sharing of wildland firefighting resources between regions: resources from regions experiencing low fire activity supplement resources in regions experiencing high fire activity. An important step towards improving the efficiency of resource sharing and related policies is to develop a better...
Sequential /sup 1/H NMR assignments and secondary structure of hen egg white lysozyme in solution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Redfield, C.; Dobson, C.M.

Assignments of /sup 1/H NMR resonances of 121 of the 129 residues of hen egg white lysozyme have been obtained by sequence-specific methods. Spin systems were identified with phase-sensitive two-dimensional (2-D) correlated spectroscopy and single and double relayed coherence transfer spectroscopy. For key types of amino acid residues, particularly alanine, threonine, valine, and glycine, complete spin systems were identified. For other residues a less complete definition of the spin system was found to be adequate for the purpose of sequential assignment. Sequence-specific assignments were achieved by phase-sensitive 2-D nuclear Overhauser enhancement spectroscopy (NOESY). Exploitation of the wide range of hydrogenmore » exchange rates found in lysozyme was a useful approach to overcoming the problem of spectral overlap. The sequential assignment was built up from 21 peptide segments ranging in length from 2 to 13 residues. The NOESY spectra were also used to provide information about the secondary structure of the protein in solution. Three helical regions and two regions of ..beta..-sheet were identified from the NOESY data; these regions are identical with those found in the X-ray structure of hen lysozyme. Slowly exchanging amides are generally correlated with hydrogen bonding identified in the X-ray structure; a number of exceptions to this general trend were, however, found. The results presented in this paper indicate that highly detailed information can be obtained from 2-D NMR spectra of a protein that is significantly larger than those studies previously.« less
Replicating and Extending Research on the Partial Assignment Completion Effect: Is Sunk Cost Related to Partial Assignment Completion Strength?

ERIC Educational Resources Information Center

Hawthorn-Embree, Meredith L.; Taylor, Emily P.; Skinner, Christopher H.; Parkhurst, John; Nalls, Meagan L.

2014-01-01

After students acquire a skill, mastery often requires them to choose to engage in assigned academic activities (e.g., independent seatwork, and homework). Although students may be more likely to choose to work on partially completed assignments than on new assignments, the partial assignment completion (PAC) effect may not be very powerful. The…
cOSPREY: A Cloud-Based Distributed Algorithm for Large-Scale Computational Protein Design

PubMed Central

Pan, Yuchao; Dong, Yuxi; Zhou, Jingtian; Hallen, Mark; Donald, Bruce R.; Xu, Wei

2016-01-01

Abstract Finding the global minimum energy conformation (GMEC) of a huge combinatorial search space is the key challenge in computational protein design (CPD) problems. Traditional algorithms lack a scalable and efficient distributed design scheme, preventing researchers from taking full advantage of current cloud infrastructures. We design cloud OSPREY (cOSPREY), an extension to a widely used protein design software OSPREY, to allow the original design framework to scale to the commercial cloud infrastructures. We propose several novel designs to integrate both algorithm and system optimizations, such as GMEC-specific pruning, state search partitioning, asynchronous algorithm state sharing, and fault tolerance. We evaluate cOSPREY on three different cloud platforms using different technologies and show that it can solve a number of large-scale protein design problems that have not been possible with previous approaches. PMID:27154509
On-demand high-capacity ride-sharing via dynamic trip-vehicle assignment.

PubMed

Alonso-Mora, Javier; Samaranayake, Samitha; Wallar, Alex; Frazzoli, Emilio; Rus, Daniela

2017-01-17

Ride-sharing services are transforming urban mobility by providing timely and convenient transportation to anybody, anywhere, and anytime. These services present enormous potential for positive societal impacts with respect to pollution, energy consumption, congestion, etc. Current mathematical models, however, do not fully address the potential of ride-sharing. Recently, a large-scale study highlighted some of the benefits of car pooling but was limited to static routes with two riders per vehicle (optimally) or three (with heuristics). We present a more general mathematical model for real-time high-capacity ride-sharing that (i) scales to large numbers of passengers and trips and (ii) dynamically generates optimal routes with respect to online demand and vehicle locations. The algorithm starts from a greedy assignment and improves it through a constrained optimization, quickly returning solutions of good quality and converging to the optimal assignment over time. We quantify experimentally the tradeoff between fleet size, capacity, waiting time, travel delay, and operational costs for low- to medium-capacity vehicles, such as taxis and van shuttles. The algorithm is validated with ∼3 million rides extracted from the New York City taxicab public dataset. Our experimental study considers ride-sharing with rider capacity of up to 10 simultaneous passengers per vehicle. The algorithm applies to fleets of autonomous vehicles and also incorporates rebalancing of idling vehicles to areas of high demand. This framework is general and can be used for many real-time multivehicle, multitask assignment problems.
On-demand high-capacity ride-sharing via dynamic trip-vehicle assignment

PubMed Central

Alonso-Mora, Javier; Samaranayake, Samitha; Wallar, Alex; Frazzoli, Emilio; Rus, Daniela

2017-01-01

Ride-sharing services are transforming urban mobility by providing timely and convenient transportation to anybody, anywhere, and anytime. These services present enormous potential for positive societal impacts with respect to pollution, energy consumption, congestion, etc. Current mathematical models, however, do not fully address the potential of ride-sharing. Recently, a large-scale study highlighted some of the benefits of car pooling but was limited to static routes with two riders per vehicle (optimally) or three (with heuristics). We present a more general mathematical model for real-time high-capacity ride-sharing that (i) scales to large numbers of passengers and trips and (ii) dynamically generates optimal routes with respect to online demand and vehicle locations. The algorithm starts from a greedy assignment and improves it through a constrained optimization, quickly returning solutions of good quality and converging to the optimal assignment over time. We quantify experimentally the tradeoff between fleet size, capacity, waiting time, travel delay, and operational costs for low- to medium-capacity vehicles, such as taxis and van shuttles. The algorithm is validated with ∼3 million rides extracted from the New York City taxicab public dataset. Our experimental study considers ride-sharing with rider capacity of up to 10 simultaneous passengers per vehicle. The algorithm applies to fleets of autonomous vehicles and also incorporates rebalancing of idling vehicles to areas of high demand. This framework is general and can be used for many real-time multivehicle, multitask assignment problems. PMID:28049820
Metabolic Pathway Assignment of Plant Genes based on Phylogenetic Profiling–A Feasibility Study

PubMed Central

Weißenborn, Sandra; Walther, Dirk

2017-01-01

Despite many developed experimental and computational approaches, functional gene annotation remains challenging. With the rapidly growing number of sequenced genomes, the concept of phylogenetic profiling, which predicts functional links between genes that share a common co-occurrence pattern across different genomes, has gained renewed attention as it promises to annotate gene functions based on presence/absence calls alone. We applied phylogenetic profiling to the problem of metabolic pathway assignments of plant genes with a particular focus on secondary metabolism pathways. We determined phylogenetic profiles for 40,960 metabolic pathway enzyme genes with assigned EC numbers from 24 plant species based on sequence and pathway annotation data from KEGG and Ensembl Plants. For gene sequence family assignments, needed to determine the presence or absence of particular gene functions in the given plant species, we included data of all 39 species available at the Ensembl Plants database and established gene families based on pairwise sequence identities and annotation information. Aside from performing profiling comparisons, we used machine learning approaches to predict pathway associations from phylogenetic profiles alone. Selected metabolic pathways were indeed found to be composed of gene families of greater than expected phylogenetic profile similarity. This was particularly evident for primary metabolism pathways, whereas for secondary pathways, both the available annotation in different species as well as the abstraction of functional association via distinct pathways proved limiting. While phylogenetic profile similarity was generally not found to correlate with gene co-expression, direct physical interactions of proteins were reflected by a significantly increased profile similarity suggesting an application of phylogenetic profiling methods as a filtering step in the identification of protein-protein interactions. This feasibility study highlights the

Recognition of functional sites in protein structures.

PubMed

Shulman-Peleg, Alexandra; Nussinov, Ruth; Wolfson, Haim J

2004-06-04

Recognition of regions on the surface of one protein, that are similar to a binding site of another is crucial for the prediction of molecular interactions and for functional classifications. We first describe a novel method, SiteEngine, that assumes no sequence or fold similarities and is able to recognize proteins that have similar binding sites and may perform similar functions. We achieve high efficiency and speed by introducing a low-resolution surface representation via chemically important surface points, by hashing triangles of physico-chemical properties and by application of hierarchical scoring schemes for a thorough exploration of global and local similarities. We proceed to rigorously apply this method to functional site recognition in three possible ways: first, we search a given functional site on a large set of complete protein structures. Second, a potential functional site on a protein of interest is compared with known binding sites, to recognize similar features. Third, a complete protein structure is searched for the presence of an a priori unknown functional site, similar to known sites. Our method is robust and efficient enough to allow computationally demanding applications such as the first and the third. From the biological standpoint, the first application may identify secondary binding sites of drugs that may lead to side-effects. The third application finds new potential sites on the protein that may provide targets for drug design. Each of the three applications may aid in assigning a function and in classification of binding patterns. We highlight the advantages and disadvantages of each type of search, provide examples of large-scale searches of the entire Protein Data Base and make functional predictions.
LiveBench-1: continuous benchmarking of protein structure prediction servers.

PubMed

Bujnicki, J M; Elofsson, A; Fischer, D; Rychlewski, L

2001-02-01

We present a novel, continuous approach aimed at the large-scale assessment of the performance of available fold-recognition servers. Six popular servers were investigated: PDB-Blast, FFAS, T98-lib, GenTHREADER, 3D-PSSM, and INBGU. The assessment was conducted using as prediction targets a large number of selected protein structures released from October 1999 to April 2000. A target was selected if its sequence showed no significant similarity to any of the proteins previously available in the structural database. Overall, the servers were able to produce structurally similar models for one-half of the targets, but significantly accurate sequence-structure alignments were produced for only one-third of the targets. We further classified the targets into two sets: easy and hard. We found that all servers were able to find the correct answer for the vast majority of the easy targets if a structurally similar fold was present in the server's fold libraries. However, among the hard targets--where standard methods such as PSI-BLAST fail--the most sensitive fold-recognition servers were able to produce similar models for only 40% of the cases, half of which had a significantly accurate sequence-structure alignment. Among the hard targets, the presence of updated libraries appeared to be less critical for the ranking. An "ideally combined consensus" prediction, where the results of all servers are considered, would increase the percentage of correct assignments by 50%. Each server had a number of cases with a correct assignment, where the assignments of all the other servers were wrong. This emphasizes the benefits of considering more than one server in difficult prediction tasks. The LiveBench program (http://BioInfo.PL/LiveBench) is being continued, and all interested developers are cordially invited to join.
1H, 15N and 13C NMR Assignments of Mouse Methionine Sulfoxide Reductase B2

PubMed Central

Breivik, Åshild S.; Aachmann, Finn L.; Sal, Lena S.; Kim, Hwa-Young; Del Conte, Rebecca; Gladyshev, Vadim N.; Dikiy, Alexander

2011-01-01

A recombinant mouse methionine-r-sulfoxide reductase 2 (MsrB2ΔS) isotopically labeled with 15N and 15N/13C was generated. We report here the 1H, 15N and 13C NMR assignments of the reduced form of this protein. PMID:19636904
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag

PubMed Central

Zhang, Mengjie; Martin, Jody L.; Kumar, Mohit; de Tombe, Pieter P.

2015-01-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1′ cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. PMID:26386113
7 CFR 784.15 - Assignments.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 7 Agriculture 7 2010-01-01 2010-01-01 false Assignments. 784.15 Section 784.15 Agriculture Regulations of the Department of Agriculture (Continued) FARM SERVICE AGENCY, DEPARTMENT OF AGRICULTURE SPECIAL PROGRAMS 2004 EWE LAMB REPLACEMENT AND RETENTION PAYMENT PROGRAM § 784.15 Assignments. Any person...
A collaborative filtering approach for protein-protein docking scoring functions.

PubMed

Bourquard, Thomas; Bernauer, Julie; Azé, Jérôme; Poupon, Anne

2011-04-22

A protein-protein docking procedure traditionally consists in two successive tasks: a search algorithm generates a large number of candidate conformations mimicking the complex existing in vivo between two proteins, and a scoring function is used to rank them in order to extract a native-like one. We have already shown that using Voronoi constructions and a well chosen set of parameters, an accurate scoring function could be designed and optimized. However to be able to perform large-scale in silico exploration of the interactome, a near-native solution has to be found in the ten best-ranked solutions. This cannot yet be guaranteed by any of the existing scoring functions. In this work, we introduce a new procedure for conformation ranking. We previously developed a set of scoring functions where learning was performed using a genetic algorithm. These functions were used to assign a rank to each possible conformation. We now have a refined rank using different classifiers (decision trees, rules and support vector machines) in a collaborative filtering scheme. The scoring function newly obtained is evaluated using 10 fold cross-validation, and compared to the functions obtained using either genetic algorithms or collaborative filtering taken separately. This new approach was successfully applied to the CAPRI scoring ensembles. We show that for 10 targets out of 12, we are able to find a near-native conformation in the 10 best ranked solutions. Moreover, for 6 of them, the near-native conformation selected is of high accuracy. Finally, we show that this function dramatically enriches the 100 best-ranking conformations in near-native structures.
A Collaborative Filtering Approach for Protein-Protein Docking Scoring Functions

PubMed Central

Bourquard, Thomas; Bernauer, Julie; Azé, Jérôme; Poupon, Anne

2011-01-01

A protein-protein docking procedure traditionally consists in two successive tasks: a search algorithm generates a large number of candidate conformations mimicking the complex existing in vivo between two proteins, and a scoring function is used to rank them in order to extract a native-like one. We have already shown that using Voronoi constructions and a well chosen set of parameters, an accurate scoring function could be designed and optimized. However to be able to perform large-scale in silico exploration of the interactome, a near-native solution has to be found in the ten best-ranked solutions. This cannot yet be guaranteed by any of the existing scoring functions. In this work, we introduce a new procedure for conformation ranking. We previously developed a set of scoring functions where learning was performed using a genetic algorithm. These functions were used to assign a rank to each possible conformation. We now have a refined rank using different classifiers (decision trees, rules and support vector machines) in a collaborative filtering scheme. The scoring function newly obtained is evaluated using 10 fold cross-validation, and compared to the functions obtained using either genetic algorithms or collaborative filtering taken separately. This new approach was successfully applied to the CAPRI scoring ensembles. We show that for 10 targets out of 12, we are able to find a near-native conformation in the 10 best ranked solutions. Moreover, for 6 of them, the near-native conformation selected is of high accuracy. Finally, we show that this function dramatically enriches the 100 best-ranking conformations in near-native structures. PMID:21526112
Machine Learning-based Classification of Diffuse Large B-cell Lymphoma Patients by Their Protein Expression Profiles.

PubMed

Deeb, Sally J; Tyanova, Stefka; Hummel, Michael; Schmidt-Supprian, Marc; Cox, Juergen; Mann, Matthias

2015-11-01

Characterization of tumors at the molecular level has improved our knowledge of cancer causation and progression. Proteomic analysis of their signaling pathways promises to enhance our understanding of cancer aberrations at the functional level, but this requires accurate and robust tools. Here, we develop a state of the art quantitative mass spectrometric pipeline to characterize formalin-fixed paraffin-embedded tissues of patients with closely related subtypes of diffuse large B-cell lymphoma. We combined a super-SILAC approach with label-free quantification (hybrid LFQ) to address situations where the protein is absent in the super-SILAC standard but present in the patient samples. Shotgun proteomic analysis on a quadrupole Orbitrap quantified almost 9,000 tumor proteins in 20 patients. The quantitative accuracy of our approach allowed the segregation of diffuse large B-cell lymphoma patients according to their cell of origin using both their global protein expression patterns and the 55-protein signature obtained previously from patient-derived cell lines (Deeb, S. J., D'Souza, R. C., Cox, J., Schmidt-Supprian, M., and Mann, M. (2012) Mol. Cell. Proteomics 11, 77-89). Expression levels of individual segregation-driving proteins as well as categories such as extracellular matrix proteins behaved consistently with known trends between the subtypes. We used machine learning (support vector machines) to extract candidate proteins with the highest segregating power. A panel of four proteins (PALD1, MME, TNFAIP8, and TBC1D4) is predicted to classify patients with low error rates. Highly ranked proteins from the support vector analysis revealed differential expression of core signaling molecules between the subtypes, elucidating aspects of their pathobiology. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Exogenous spatial attention influences figure-ground assignment.

PubMed

Vecera, Shaun P; Flevaris, Anastasia V; Filapek, Joseph C

2004-01-01

In a hierarchical stage account of vision, figure-ground assignment is thought to be completed before the operation of focal spatial attention. Results of previous studies have supported this account by showing that unpredictive, exogenous spatial precues do not influence figure-ground assignment, although voluntary attention can influence figure-ground assignment. However, in these studies, attention was not summoned directly to a region in a figure-ground display. In three experiments, we addressed the relationship between figure-ground assignment and visuospatial attention. In Experiment 1, we replicated the finding that exogenous precues do not influence figure-ground assignment when they direct attention outside of a figure-ground stimulus. In Experiment 2, we demonstrated that exogenous attention can influence figure-ground assignment if it is directed to one of the regions in a figure-ground stimulus. In Experiment 3, we demonstrated that exogenous attention can influence figure-ground assignment in displays that contain a Gestalt figure-ground cue; this result suggests that figure-ground processes are not entirely completed prior to the operation of focal spatial attention. Exogenous spatial attention acts as a cue for figure-ground assignment and can affect the outcome of figure-ground processes.
Drug Transporters and Na+/H+ Exchange Regulatory Factor PSD-95/Drosophila Discs Large/ZO-1 Proteins

PubMed Central

Walsh, Dustin R.; Nolin, Thomas D.

2015-01-01

Drug transporters govern the absorption, distribution, and elimination of pharmacologically active compounds. Members of the solute carrier and ATP binding-cassette drug transporter family mediate cellular drug uptake and efflux processes, thereby coordinating the vectorial movement of drugs across epithelial barriers. To exert their physiologic and pharmacological function in polarized epithelia, drug transporters must be targeted and stabilized to appropriate regions of the cell membrane (i.e., apical versus basolateral). Despite the critical importance of drug transporter membrane targeting, the mechanisms that underlie these processes are largely unknown. Several clinically significant drug transporters possess a recognition sequence that binds to PSD-95/Drosophila discs large/ZO-1 (PDZ) proteins. PDZ proteins, such as the Na+/H+ exchanger regulatory factor (NHERF) family, act to stabilize and organize membrane targeting of multiple transmembrane proteins, including many clinically relevant drug transporters. These PDZ proteins are normally abundant at apical membranes, where they tether membrane-delimited transporters. NHERF expression is particularly high at the apical membrane in polarized tissue such as intestinal, hepatic, and renal epithelia, tissues important to drug disposition. Several recent studies have highlighted NHERF proteins as determinants of drug transporter function secondary to their role in controlling membrane abundance and localization. Mounting evidence strongly suggests that NHERF proteins may have clinically significant roles in pharmacokinetics and pharmacodynamics of several pharmacologically active compounds and may affect drug action in cancer and chronic kidney disease. For these reasons, NHERF proteins represent a novel class of post-translational mediators of drug transport and novel targets for new drug development. PMID:26092975
A large-scale test of free-energy simulation estimates of protein-ligand binding affinities.

PubMed

Mikulskis, Paulius; Genheden, Samuel; Ryde, Ulf

2014-10-27

We have performed a large-scale test of alchemical perturbation calculations with the Bennett acceptance-ratio (BAR) approach to estimate relative affinities for the binding of 107 ligands to 10 different proteins. Employing 20-Å truncated spherical systems and only one intermediate state in the perturbations, we obtain an error of less than 4 kJ/mol for 54% of the studied relative affinities and a precision of 0.5 kJ/mol on average. However, only four of the proteins gave acceptable errors, correlations, and rankings. The results could be improved by using nine intermediate states in the simulations or including the entire protein in the simulations using periodic boundary conditions. However, 27 of the calculated affinities still gave errors of more than 4 kJ/mol, and for three of the proteins the results were not satisfactory. This shows that the performance of BAR calculations depends on the target protein and that several transformations gave poor results owing to limitations in the molecular-mechanics force field or the restricted sampling possible within a reasonable simulation time. Still, the BAR results are better than docking calculations for most of the proteins.
Hierarchical and Helical Self-assembly of ADP-ribosyl Cyclase into Large-scale Protein Microtubes

PubMed Central

Liu, Qun; Kriksunov, Irina A.; Wang, Zhongwu; Graeff, Richard; Lee, Hon Cheung; Hao, Quan

2013-01-01

Proteins are macromolecules with characteristic structures and biological functions. It is extremely challenging to obtain protein microtube structures through self-assembly as proteins are very complex and flexible. Here we present a strategy showing how a specific protein, ADP-ribosyl cyclase, helically self-assembles from monomers into hexagonal nanochains and further to highly ordered crystalline microtubes. The structures of protein nanochains and consequently self-assembled superlattice were determined by X-ray crystallography at 4.5 Å resolution and imaged by Scanning Electron Microscopy. The protein initially forms into dimers that have a fixed size of 5.6 nm, and then, helically self-assembles into 35.6 nm long hexagonal nanochains. One such nanochain consists of six dimers (12 monomers) that stack in order by a pseudo P61 screw axis. Seven nanochains produce a series of largescale assemblies, nanorods, forming the building blocks for microrods. A proposed aging process of microrods results in the formation of hollow microstructures. Synthesis and characterization of large scale self-assembled protein microtubes may pave a new pathway, capable of not only understanding the self-assembly dynamics of biological materials, but also directing design and fabrication of multifunctional nanobuilding blocks with particular applications in biomedical engineering. PMID:18956900
Phase modulated 2D HSQC-TOCSY for unambiguous assignment of overlapping spin systems

NASA Astrophysics Data System (ADS)

Singh, Amrinder; Dubey, Abhinav; Adiga, Satish K.; Atreya, Hanudatta S.

2018-01-01

We present a new method that allows one to unambiguously resolve overlapping spin systems often encountered in biomolecular systems such as peptides and proteins or in samples containing a mixture of different molecules such as in metabolomics. We address this problem using the recently proposed phase modulation approach. By evolving the 1H chemical shifts in a conventional two dimensional (2D) HSQC-TOCSY experiment for a fixed delay period, the phase/intensity of set of cross peaks belonging to one spin system are modulated differentially relative to those of its overlapping counterpart, resulting in their discrimination and recognition. The method thus accelerates the process of identification and resonance assignment of individual compounds in complex mixtures. This approach facilitated the assignment of molecules in the embryo culture medium used in human assisted reproductive technology.
Two New Tools for Glycopeptide Analysis Researchers: A Glycopeptide Decoy Generator and a Large Data Set of Assigned CID Spectra of Glycopeptides.

PubMed

Lakbub, Jude C; Su, Xiaomeng; Zhu, Zhikai; Patabandige, Milani W; Hua, David; Go, Eden P; Desaire, Heather

2017-08-04

The glycopeptide analysis field is tightly constrained by a lack of effective tools that translate mass spectrometry data into meaningful chemical information, and perhaps the most challenging aspect of building effective glycopeptide analysis software is designing an accurate scoring algorithm for MS/MS data. We provide the glycoproteomics community with two tools to address this challenge. The first tool, a curated set of 100 expert-assigned CID spectra of glycopeptides, contains a diverse set of spectra from a variety of glycan types; the second tool, Glycopeptide Decoy Generator, is a new software application that generates glycopeptide decoys de novo. We developed these tools so that emerging methods of assigning glycopeptides' CID spectra could be rigorously tested. Software developers or those interested in developing skills in expert (manual) analysis can use these tools to facilitate their work. We demonstrate the tools' utility in assessing the quality of one particular glycopeptide software package, GlycoPep Grader, which assigns glycopeptides to CID spectra. We first acquired the set of 100 expert assigned CID spectra; then, we used the Decoy Generator (described herein) to generate 20 decoys per target glycopeptide. The assigned spectra and decoys were used to test the accuracy of GlycoPep Grader's scoring algorithm; new strengths and weaknesses were identified in the algorithm using this approach. Both newly developed tools are freely available. The software can be downloaded at http://glycopro.chem.ku.edu/GPJ.jar.
Sequence-specific sup 1 H NMR resonance assignments of Bacillus subtilis HPr: Use of spectra obtained from mutants to resolve spectral overlap

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wittekind, M.; Klevit, R.E.; Reizer, J.

1990-08-07

On the basis of an analysis of two-dimensional {sup 1}H NMR spectra, the complete sequence-specific {sup 1}H NMR assignments are presented for the phosphocarrier protein HPr from the Gram-positive bacterium Bacillus subtilis. During the assignment procedure, extensive use was made of spectra obtained from point mutants of HPr in order to resolve spectral overlap and to provide verification of assignments. Regions of regular secondary structure were identified by characteristic patterns of sequential backbone proton NOEs and slowly exchanging amide protons. B subtilis HPr contains four {beta}-strands that form a single antiparallel {beta}-sheet and two well-defined {alpha}-helices. There are two stretchesmore » of extended backbone structure, one of which contains the active site His{sub 15}. The overall fold of the protein is very similar to that of Escherichia coli HPr determined by NMR studies.« less
Evolution of Protein Lipograms: A Bioinformatics Problem

ERIC Educational Resources Information Center

White, Harold B., III; Dhurjati, Prasad

2006-01-01

A protein lacking one of the 20 common amino acids is a protein lipogram. This open-ended problem-based learning assignment deals with the evolution of proteins with biased amino acid composition. It has students query protein and metabolic databases to test the hypothesis that natural selection has reduced the frequency of each amino acid…
Consensus Assignments for Water Vapor Lines Not Assigned on the HITRAN Database: 13,200 to 16,500/cm

NASA Technical Reports Server (NTRS)

Giver, Lawerence P.; Chackerian, Charles, Jr.; Freedman, Richard S.; Varanasi, Prasad; Gore, Warren (Technical Monitor)

2000-01-01

There are nearly 800 water Vapor-lines in the 13,200-16,500/cm region that do not have rovibrational assignments in the HITRAN database. The positions and intensities in the database were determined by Mandin et al., but assignments could not be determined at that time. Polyansky, et al. have now assigned over 600 of the unassigned lines in the 11,200-16,500/cm region. Schwenke has also given rovibrational assignments to many of these unassigned lines throughout the visible and near-infrared. Both articles changed the assignments of some HITRAN lines. Carleer et al. extend assignments to some weaker lines measured by them on new spectra with excellent signal/noise. However, some lines measured by Mandin et al. were omitted by Carleer, et al. because of blends due to lower spectral resolution. The rovibrational assignments of Polyansky et al. completely agree with those in Schwenke's article for only about 200 lines. However, Schwenke's ab initio line list is available on his internet site (http://ccf.arc.nasa.gov/-dschwenke). A detailed comparison of the Polyansky et al.line list, the Carleer et al.line list, and Schwenke's ab initio line list shows a larger number of agreements. In many cases the disagreement is only about the vibrational and/or rotational upper level, while there is agreement on the lower state assignment and energy level, "E", which is of primary importance for atmospheric applications. We will present a line list of "consensus" assignments in the 13,200-16,500/cm region for consideration of inclusion on the HITRAN and GEISA databases. This will substantially reduce the number of unassigned lines on the databases in this spectral region.
Progressive response of large intestinal bacterial community and fermentation to the stepwise decrease of dietary crude protein level in growing pigs.

PubMed

Peng, Yu; Yu, Kaifan; Mu, Chunlong; Hang, Suqin; Che, Lianqiang; Zhu, Weiyun

2017-07-01

The study aimed to determine the effects of reduction of dietary crude protein (CP) level with balanced essential amino acids (EAA) on intestinal bacteria and their metabolites of growing pigs. Forty pigs (initial BW 13.50 ± 0.50 kg, 45 ± 2 days of age) were randomly assigned to four dietary treatments containing CP levels at 20.00% (normal crude protein, NP); 17.16% (medium crude protein, MP); 15.30% (low crude protein, LP); and 13.90% (extremely low crude protein, ELP), respectively. Crystalline AAs were added to meet the EAA requirement of pigs. After 4-week feeding, eight pigs per treatment (n = 8) were randomly selected and slaughtered for sampling of ileal, cecal, and colonic digesta and mucosa. Pigs with moderately reduced CP level had increased bacterial diversity, with the Shannon diversity indices for the colon digesta in the LP group and mucosa in the MP and LP groups significantly (P < 0.05) higher than those in the NP and ELP groups. As the CP level reduces, the Bifidobacterium population were linearly decreased (P < 0.05) both in ileum, cecum, and colon, and the ELP group had the lowest Bifidobacterium population in the cecum and colon, with its value significantly lower than NP and MP groups (P < 0.05). However, the ELP group had the highest population of Escherichia coli in the colon, with its value significantly higher than the LP group (P < 0.05). For bacterial metabolites, as CP level decreased, total short-chain fatty acid (T-SCFA), acetate, and butyrate were linearly increased (linear, P < 0.05) in the ileum, while all SCFAs except formate in the cecum and T-SCFA and acetate in the colon, were linearly decreased (P < 0.05). Reducing CP level led to a linear decrease of microbial crude protein (MCP) in the ileum (P < 0.05) and ammonia in all intestine segments (P < 0.05). The spermidine in cecum and total amines, cadaverine, methylamine, and spermidine in colon were shown a quadratic change (P < 0.05) as dietary CP
LARGE, an intellectual disability-associated protein, regulates AMPA-type glutamate receptor trafficking and memory.

PubMed

Seo, Bo Am; Cho, Taesup; Lee, Daniel Z; Lee, Joong-Jae; Lee, Boyoung; Kim, Seong-Wook; Shin, Hee-Sup; Kang, Myoung-Goo

2018-06-18

Mutations in the human LARGE gene result in severe intellectual disability and muscular dystrophy. How LARGE mutation leads to intellectual disability, however, is unclear. In our proteomic study, LARGE was found to be a component of the AMPA-type glutamate receptor (AMPA-R) protein complex, a main player for learning and memory in the brain. Here, our functional study of LARGE showed that LARGE at the Golgi apparatus (Golgi) negatively controlled AMPA-R trafficking from the Golgi to the plasma membrane, leading to down-regulated surface and synaptic AMPA-R targeting. In LARGE knockdown mice, long-term potentiation (LTP) was occluded by synaptic AMPA-R overloading, resulting in impaired contextual fear memory. These findings indicate that the fine-tuning of AMPA-R trafficking by LARGE at the Golgi is critical for hippocampus-dependent memory in the brain. Our study thus provides insights into the pathophysiology underlying cognitive deficits in brain disorders associated with intellectual disability.
Robotic large-scale application of wheat cell-free translation to structural studies including membrane proteins

PubMed Central

Beebe, Emily T.; Makino, Shin-ichi; Nozawa, Akira; Matsubara, Yuko; Frederick, Ronnie O.; Primm, John G.; Goren, Michael A.; Fox, Brian G.

2010-01-01

The use of the Protemist XE, an automated discontinuous-batch protein synthesis robot, in cell-free translation is reported. The soluble Galdieria sulphuraria protein DCN1 was obtained in greater than 2 mg total synthesis yield per mL of reaction mixture from the Protemist XE, and the structure was subsequently solved by X-ray crystallography using material from one 10 mL synthesis (PDB ID: 3KEV). The Protemist XE was also capable of membrane protein translation. Thus human sigma-1 receptor was translated in the presence of unilamellar liposomes and bacteriorhodopsin was translated directly into detergent micelles in the presence of all-trans-retinal. The versatility, ease of use, and compact size of the Protemist XE robot demonstrate its suitability for large-scale synthesis of many classes of proteins. PMID:20637905

25 CFR 535.2 - Assignments.

Code of Federal Regulations, 2010 CFR

2010-04-01

... 25 Indians 2 2010-04-01 2010-04-01 false Assignments. 535.2 Section 535.2 Indians NATIONAL INDIAN GAMING COMMISSION, DEPARTMENT OF THE INTERIOR MANAGEMENT CONTRACT PROVISIONS POST-APPROVAL PROCEDURES... disapprove an assignment applying the standards of, and within the time provided by §§ 535.1(d) and 535.1(e...
Thermal motion in proteins: Large effects on the time-averaged interaction energies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Goethe, Martin, E-mail: martingoethe@ub.edu; Rubi, J. Miguel; Fita, Ignacio

As a consequence of thermal motion, inter-atomic distances in proteins fluctuate strongly around their average values, and hence, also interaction energies (i.e. the pair-potentials evaluated at the fluctuating distances) are not constant in time but exhibit pronounced fluctuations. These fluctuations cause that time-averaged interaction energies do generally not coincide with the energy values obtained by evaluating the pair-potentials at the average distances. More precisely, time-averaged interaction energies behave typically smoother in terms of the average distance than the corresponding pair-potentials. This averaging effect is referred to as the thermal smoothing effect. Here, we estimate the strength of the thermal smoothingmore » effect on the Lennard-Jones pair-potential for globular proteins at ambient conditions using x-ray diffraction and simulation data of a representative set of proteins. For specific atom species, we find a significant smoothing effect where the time-averaged interaction energy of a single atom pair can differ by various tens of cal/mol from the Lennard-Jones potential at the average distance. Importantly, we observe a dependency of the effect on the local environment of the involved atoms. The effect is typically weaker for bulky backbone atoms in beta sheets than for side-chain atoms belonging to other secondary structure on the surface of the protein. The results of this work have important practical implications for protein software relying on free energy expressions. We show that the accuracy of free energy expressions can largely be increased by introducing environment specific Lennard-Jones parameters accounting for the fact that the typical thermal motion of protein atoms depends strongly on their local environment.« less
Thermal motion in proteins: Large effects on the time-averaged interaction energies

NASA Astrophysics Data System (ADS)

Goethe, Martin; Fita, Ignacio; Rubi, J. Miguel

2016-03-01

As a consequence of thermal motion, inter-atomic distances in proteins fluctuate strongly around their average values, and hence, also interaction energies (i.e. the pair-potentials evaluated at the fluctuating distances) are not constant in time but exhibit pronounced fluctuations. These fluctuations cause that time-averaged interaction energies do generally not coincide with the energy values obtained by evaluating the pair-potentials at the average distances. More precisely, time-averaged interaction energies behave typically smoother in terms of the average distance than the corresponding pair-potentials. This averaging effect is referred to as the thermal smoothing effect. Here, we estimate the strength of the thermal smoothing effect on the Lennard-Jones pair-potential for globular proteins at ambient conditions using x-ray diffraction and simulation data of a representative set of proteins. For specific atom species, we find a significant smoothing effect where the time-averaged interaction energy of a single atom pair can differ by various tens of cal/mol from the Lennard-Jones potential at the average distance. Importantly, we observe a dependency of the effect on the local environment of the involved atoms. The effect is typically weaker for bulky backbone atoms in beta sheets than for side-chain atoms belonging to other secondary structure on the surface of the protein. The results of this work have important practical implications for protein software relying on free energy expressions. We show that the accuracy of free energy expressions can largely be increased by introducing environment specific Lennard-Jones parameters accounting for the fact that the typical thermal motion of protein atoms depends strongly on their local environment.
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag.

PubMed

Zhang, Mengjie; Martin, Jody L; Kumar, Mohit; Khairallah, Ramzi J; de Tombe, Pieter P

2015-11-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1' cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. Copyright © 2015 the American Physiological Society.
GOLabeler: Improving Sequence-based Large-scale Protein Function Prediction by Learning to Rank.

PubMed

You, Ronghui; Zhang, Zihan; Xiong, Yi; Sun, Fengzhu; Mamitsuka, Hiroshi; Zhu, Shanfeng

2018-03-07

Gene Ontology (GO) has been widely used to annotate functions of proteins and understand their biological roles. Currently only <1% of more than 70 million proteins in UniProtKB have experimental GO annotations, implying the strong necessity of automated function prediction (AFP) of proteins, where AFP is a hard multilabel classification problem due to one protein with a diverse number of GO terms. Most of these proteins have only sequences as input information, indicating the importance of sequence-based AFP (SAFP: sequences are the only input). Furthermore homology-based SAFP tools are competitive in AFP competitions, while they do not necessarily work well for so-called difficult proteins, which have <60% sequence identity to proteins with annotations already. Thus the vital and challenging problem now is how to develop a method for SAFP, particularly for difficult proteins. The key of this method is to extract not only homology information but also diverse, deep- rooted information/evidence from sequence inputs and integrate them into a predictor in a both effective and efficient manner. We propose GOLabeler, which integrates five component classifiers, trained from different features, including GO term frequency, sequence alignment, amino acid trigram, domains and motifs, and biophysical properties, etc., in the framework of learning to rank (LTR), a paradigm of machine learning, especially powerful for multilabel classification. The empirical results obtained by examining GOLabeler extensively and thoroughly by using large-scale datasets revealed numerous favorable aspects of GOLabeler, including significant performance advantage over state-of-the-art AFP methods. http://datamining-iip.fudan.edu.cn/golabeler. zhusf@fudan.edu.cn. Supplementary data are available at Bioinformatics online.
A general path for large-scale solubilization of cellular proteins: From membrane receptors to multiprotein complexes

PubMed Central

Pullara, Filippo; Guerrero-Santoro, Jennifer; Calero, Monica; Zhang, Qiangmin; Peng, Ye; Spåhr, Henrik; Kornberg, Guy L.; Cusimano, Antonella; Stevenson, Hilary P.; Santamaria-Suarez, Hugo; Reynolds, Shelley L.; Brown, Ian S.; Monga, Satdarshan P.S.; Van Houten, Bennett; Rapić-Otrin, Vesna; Calero, Guillermo; Levine, Arthur S.

2014-01-01

Expression of recombinant proteins in bacterial or eukaryotic systems often results in aggregation rendering them unavailable for biochemical or structural studies. Protein aggregation is a costly problem for biomedical research. It forces research laboratories and the biomedical industry to search for alternative, more soluble, non-human proteins and limits the number of potential “druggable” targets. In this study we present a highly reproducible protocol that introduces the systematic use of an extensive number of detergents to solubilize aggregated proteins expressed in bacterial and eukaryotic systems. We validate the usefulness of this protocol by solubilizing traditionally difficult human protein targets to milligram quantities and confirm their biological activity. We use this method to solubilize monomeric or multimeric components of multi-protein complexes and demonstrate its efficacy to reconstitute large cellular machines. This protocol works equally well on cytosolic, nuclear and membrane proteins and can be easily adapted to a high throughput format. PMID:23137940
5 CFR 351.701 - Assignment involving displacement.

Code of Federal Regulations, 2011 CFR

2011-01-01

... 5 Administrative Personnel 1 2011-01-01 2011-01-01 false Assignment involving displacement. 351.701 Section 351.701 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT CIVIL SERVICE REGULATIONS REDUCTION IN FORCE Assignment Rights (Bump and Retreat) § 351.701 Assignment involving displacement. (a...
5 CFR 351.701 - Assignment involving displacement.

Code of Federal Regulations, 2012 CFR

2012-01-01

... 5 Administrative Personnel 1 2012-01-01 2012-01-01 false Assignment involving displacement. 351.701 Section 351.701 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT CIVIL SERVICE REGULATIONS REDUCTION IN FORCE Assignment Rights (Bump and Retreat) § 351.701 Assignment involving displacement. (a...
5 CFR 351.701 - Assignment involving displacement.

Code of Federal Regulations, 2013 CFR

2013-01-01

... 5 Administrative Personnel 1 2013-01-01 2013-01-01 false Assignment involving displacement. 351.701 Section 351.701 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT CIVIL SERVICE REGULATIONS REDUCTION IN FORCE Assignment Rights (Bump and Retreat) § 351.701 Assignment involving displacement. (a...
5 CFR 351.701 - Assignment involving displacement.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 5 Administrative Personnel 1 2010-01-01 2010-01-01 false Assignment involving displacement. 351.701 Section 351.701 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT CIVIL SERVICE REGULATIONS REDUCTION IN FORCE Assignment Rights (Bump and Retreat) § 351.701 Assignment involving displacement. (a...
5 CFR 351.701 - Assignment involving displacement.

Code of Federal Regulations, 2014 CFR

2014-01-01

... 5 Administrative Personnel 1 2014-01-01 2014-01-01 false Assignment involving displacement. 351.701 Section 351.701 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT CIVIL SERVICE REGULATIONS REDUCTION IN FORCE Assignment Rights (Bump and Retreat) § 351.701 Assignment involving displacement. (a...
A new test set for validating predictions of protein-ligand interaction.

PubMed

Nissink, J Willem M; Murray, Chris; Hartshorn, Mike; Verdonk, Marcel L; Cole, Jason C; Taylor, Robin

2002-12-01

We present a large test set of protein-ligand complexes for the purpose of validating algorithms that rely on the prediction of protein-ligand interactions. The set consists of 305 complexes with protonation states assigned by manual inspection. The following checks have been carried out to identify unsuitable entries in this set: (1) assessing the involvement of crystallographically related protein units in ligand binding; (2) identification of bad clashes between protein side chains and ligand; and (3) assessment of structural errors, and/or inconsistency of ligand placement with crystal structure electron density. In addition, the set has been pruned to assure diversity in terms of protein-ligand structures, and subsets are supplied for different protein-structure resolution ranges. A classification of the set by protein type is available. As an illustration, validation results are shown for GOLD and SuperStar. GOLD is a program that performs flexible protein-ligand docking, and SuperStar is used for the prediction of favorable interaction sites in proteins. The new CCDC/Astex test set is freely available to the scientific community (http://www.ccdc.cam.ac.uk). Copyright 2002 Wiley-Liss, Inc.
Prediction of protein tertiary structure to low resolution: performance for a large and structurally diverse test set.

PubMed

Eyrich, V A; Standley, D M; Friesner, R A

1999-05-14

We report the tertiary structure predictions for 95 proteins ranging in size from 17 to 160 residues starting from known secondary structure. Predictions are obtained from global minimization of an empirical potential function followed by the application of a refined atomic overlap potential. The minimization strategy employed represents a variant of the Monte Carlo plus minimization scheme of Li and Scheraga applied to a reduced model of the protein chain. For all of the cases except beta-proteins larger than 75 residues, a native-like structure, usually 4-6 A root-mean-square deviation from the native, is located. For beta-proteins larger than 75 residues, the energy gap between native-like structures and the lowest energy structures produced in the simulation is large, so that low RMSD structures are not generated starting from an unfolded state. This is attributed to the lack of an explicit hydrogen bond term in the potential function, which we hypothesize is necessary to stabilize large assemblies of beta-strands. Copyright 1999 Academic Press.
Disordered nucleiome: Abundance of intrinsic disorder in the DNA- and RNA-binding proteins in 1121 species from Eukaryota, Bacteria and Archaea.

PubMed

Wang, Chen; Uversky, Vladimir N; Kurgan, Lukasz

2016-05-01

Intrinsically disordered proteins (IDPs) are abundant in various proteomes, where they play numerous important roles and complement biological activities of ordered proteins. Among functions assigned to IDPs are interactions with nucleic acids. However, often, such assignments are made based on the guilty-by-association principle. The validity of the extension of these correlations to all nucleic acid binding proteins has never been analyzed on a large scale across all domains of life. To fill this gap, we perform a comprehensive computational analysis of the abundance of intrinsic disorder and intrinsically disordered domains in nucleiomes (∼548 000 nucleic acid binding proteins) of 1121 species from Archaea, Bacteria and Eukaryota. Nucleiome is a whole complement of proteins involved in interactions with nucleic acids. We show that relative to other proteins in the corresponding proteomes, the DNA-binding proteins have significantly increased disorder content and are significantly enriched in disordered domains in Eukaryotes but not in Archaea and Bacteria. The RNA-binding proteins are significantly enriched in the disordered domains in Bacteria, Archaea and Eukaryota, while the overall abundance of disorder in these proteins is significantly increased in Bacteria, Archaea, animals and fungi. The high abundance of disorder in nucleiomes supports the notion that the nucleic acid binding proteins often require intrinsic disorder for their functions and regulation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Amino Acid Distribution Rules Predict Protein Fold: Protein Grammar for Beta-Strand Sandwich-Like Structures

PubMed Central

Kister, Alexander

2015-01-01

We present an alternative approach to protein 3D folding prediction based on determination of rules that specify distribution of “favorable” residues, that are mainly responsible for a given fold formation, and “unfavorable” residues, that are incompatible with that fold, in polypeptide sequences. The process of determining favorable and unfavorable residues is iterative. The starting assumptions are based on the general principles of protein structure formation as well as structural features peculiar to a protein fold under investigation. The initial assumptions are tested one-by-one for a set of all known proteins with a given structure. The assumption is accepted as a “rule of amino acid distribution” for the protein fold if it holds true for all, or near all, structures. If the assumption is not accepted as a rule, it can be modified to better fit the data and then tested again in the next step of the iterative search algorithm, or rejected. We determined the set of amino acid distribution rules for a large group of beta sandwich-like proteins characterized by a specific arrangement of strands in two beta sheets. It was shown that this set of rules is highly sensitive (~90%) and very specific (~99%) for identifying sequences of proteins with specified beta sandwich fold structure. The advantage of the proposed approach is that it does not require that query proteins have a high degree of homology to proteins with known structure. So long as the query protein satisfies residue distribution rules, it can be confidently assigned to its respective protein fold. Another advantage of our approach is that it allows for a better understanding of which residues play an essential role in protein fold formation. It may, therefore, facilitate rational protein engineering design. PMID:25625198
7 CFR 900.106 - Assignment of mediator.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 7 Agriculture 8 2010-01-01 2010-01-01 false Assignment of mediator. 900.106 Section 900.106 Agriculture Regulations of the Department of Agriculture (Continued) AGRICULTURAL MARKETING SERVICE (Marketing... Assignment of mediator. The Director of the Division shall assign a mediator, from the group designated by...
48 CFR 1511.011-74 - Work assignments.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 48 Federal Acquisition Regulations System 6 2011-10-01 2011-10-01 false Work assignments. 1511.011... ACQUISITION PLANNING DESCRIBING AGENCY NEEDS 1511.011-74 Work assignments. The Contracting Officer shall insert the contract clause at 1552.211-74, Work Assignments, in cost-reimbursement type term form...
DNA-Directed Assembly of Capture Tools for Constitutional Studies of Large Protein Complexes.

PubMed

Meyer, Rebecca; Faesen, Alex; Vogel, Katrin; Jeganathan, Sadasivam; Musacchio, Andrea; Niemeyer, Christof M

2015-06-10

Large supramolecular protein complexes, such as the molecular machinery involved in gene regulation, cell signaling, or cell division, are key in all fundamental processes of life. Detailed elucidation of structure and dynamics of such complexes can be achieved by reverse-engineering parts of the complexes in order to probe their interactions with distinctive binding partners in vitro. The exploitation of DNA nanostructures to mimic partially assembled supramolecular protein complexes in which the presence and state of two or more proteins are decisive for binding of additional building blocks is reported here. To this end, four-way DNA Holliday junction motifs bearing a fluorescein and a biotin tag, for tracking and affinity capture, respectively, are site-specifically functionalized with centromeric protein (CENP) C and CENP-T. The latter serves as baits for binding of the so-called KMN component, thereby mimicking early stages of the assembly of kinetochores, structures that mediate and control the attachment of microtubules to chromosomes in the spindle apparatus. Results from pull-down experiments are consistent with the hypothesis that CENP-C and CENP-T may bind cooperatively to the KMN network. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
1H, 15N, 13C resonance assignment of human GAP-43.

PubMed

Flamm, Andrea Gabriele; Żerko, Szymon; Zawadzka-Kazimierczuk, Anna; Koźmiński, Wiktor; Konrat, Robert; Coudevylle, Nicolas

2016-04-01

GAP-43 is a 25 kDa neuronal intrinsically disordered protein, highly abundant in the neuronal growth cone during development and regeneration. The exact molecular function(s) of GAP-43 remains unclear but it appears to be involved in growth cone guidance and actin cytoskeleton organization. Therefore, GAP-43 seems to play an important role in neurotransmitter vesicle fusion and recycling, long-term potentiation, spatial memory formation and learning. Here we report the nearly complete assignment of recombinant human GAP-43.
A new algorithm using cross-assignment for label-free quantitation with LC-LTQ-FT MS.

PubMed

Andreev, Victor P; Li, Lingyun; Cao, Lei; Gu, Ye; Rejtar, Tomas; Wu, Shiaw-Lin; Karger, Barry L

2007-06-01

A new algorithm is described for label-free quantitation of relative protein abundances across multiple complex proteomic samples. Q-MEND is based on the denoising and peak picking algorithm, MEND, previously developed in our laboratory. Q-MEND takes advantage of the high resolution and mass accuracy of the hybrid LTQ-FT MS mass spectrometer (or other high-resolution mass spectrometers, such as a Q-TOF MS). The strategy, termed "cross-assignment", is introduced to increase substantially the number of quantitated proteins. In this approach, all MS/MS identifications for the set of analyzed samples are combined into a master ID list, and then each LC-MS run is searched for the features that can be assigned to a specific identification from that master list. The reliability of quantitation is enhanced by quantitating separately all peptide charge states, along with a scoring procedure to filter out less reliable peptide abundance measurements. The effectiveness of Q-MEND is illustrated in the relative quantitative analysis of Escherichia coli samples spiked with known amounts of non-E. coli protein digests. A mean quantitation accuracy of 7% and mean precision of 15% is demonstrated. Q-MEND can perform relative quantitation of a set of LC-MS data sets without manual intervention and can generate files compatible with the Guidelines for Proteomic Data Publication.

48 CFR 1552.211-74 - Work assignments.

Code of Federal Regulations, 2010 CFR

2010-10-01

...-reimbursement type term form contracts when work assignments are to be used. Work Assignments (APR 1984) (a) The... assignment or similar tasking document, the Contractor shall provide a conflict of interest certification... directly related to a site, the Contractor is only required to provide a conflict of interest certification...
A large scale membrane-binding protein conformational change that initiates at small length scales

NASA Astrophysics Data System (ADS)

Grandpre, Trevor; Andorf, Matthew; Chakravarthy, Srinivas; Lamb, Robert; Poor, Taylor; Landahl, Eric

2013-03-01

The fusion (F) protein of parainfluenza virus 5 (PIV5) is a membrane-bound, homotrimeric glycoprotein located on the surface of PIV5 viral envelopes. Upon being triggered by the receptor-binding protein (HN), F undergoes a greater than 100Å ATP-independent refolding event. This refolding event results in the insertion of a hydrophobic fusion peptide into the membrane of the target cell, followed by the desolvation and subsequent fusion event as the two membranes are brought together. Isothermal calorimetry and hydrophobic dye incorporation experiments indicate that the soluble construct of the F protein undergoes a conformational rearrangement event at around 55 deg C. We present the results of an initial Time-Resolved Small-Angle X-Ray Scattering (TR-SAXS) study of this large scale, entropically driven conformational change using a temperature jump. Although we the measured radius of gyration of this protein changes on a 110 second timescale, we find that the x-ray scattering intensity at higher angles (corresponding to smaller length scales in the protein) changes nearly an order of magnitude faster. We believe this may be a signature of entropically-driven conformational change. To whom correspondence should be addressed
Screening and large-scale expression of membrane proteins in mammalian cells for structural studies.

PubMed

Goehring, April; Lee, Chia-Hsueh; Wang, Kevin H; Michel, Jennifer Carlisle; Claxton, Derek P; Baconguis, Isabelle; Althoff, Thorsten; Fischer, Suzanne; Garcia, K Christopher; Gouaux, Eric

2014-11-01

Structural, biochemical and biophysical studies of eukaryotic membrane proteins are often hampered by difficulties in overexpression of the candidate molecule. Baculovirus transduction of mammalian cells (BacMam), although a powerful method to heterologously express membrane proteins, can be cumbersome for screening and expression of multiple constructs. We therefore developed plasmid Eric Gouaux (pEG) BacMam, a vector optimized for use in screening assays, as well as for efficient production of baculovirus and robust expression of the target protein. In this protocol, we show how to use small-scale transient transfection and fluorescence-detection size-exclusion chromatography (FSEC) experiments using a GFP-His8-tagged candidate protein to screen for monodispersity and expression level. Once promising candidates are identified, we describe how to generate baculovirus, transduce HEK293S GnTI(-) (N-acetylglucosaminyltransferase I-negative) cells in suspension culture and overexpress the candidate protein. We have used these methods to prepare pure samples of chicken acid-sensing ion channel 1a (cASIC1) and Caenorhabditis elegans glutamate-gated chloride channel (GluCl) for X-ray crystallography, demonstrating how to rapidly and efficiently screen hundreds of constructs and accomplish large-scale expression in 4-6 weeks.
Lexical Stress Assignment in Italian Developmental Dyslexia

ERIC Educational Resources Information Center

Paizi, Despina; Zoccolotti, Pierluigi; Burani, Cristina

2011-01-01

Stress assignment to Italian polysyllabic words is unpredictable, because stress is neither marked nor predicted by rule. Stress assignment, especially to low frequency words, has been reported to be a function of stress dominance and stress neighbourhood. Two experiments investigate stress assignment in sixth-grade, skilled and dyslexic, readers.…
29 CFR 1403.4 - Assignment of mediators.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 29 Labor 4 2010-07-01 2010-07-01 false Assignment of mediators. 1403.4 Section 1403.4 Labor Regulations Relating to Labor (Continued) FEDERAL MEDIATION AND CONCILIATION SERVICE FUNCTIONS AND DUTIES § 1403.4 Assignment of mediators. The Federal Service will assign one or more mediators to each labor...
Large-scale modelling of the divergent spectrin repeats in nesprins: giant modular proteins.

PubMed

Autore, Flavia; Pfuhl, Mark; Quan, Xueping; Williams, Aisling; Roberts, Roland G; Shanahan, Catherine M; Fraternali, Franca

2013-01-01

Nesprin-1 and nesprin-2 are nuclear envelope (NE) proteins characterized by a common structure of an SR (spectrin repeat) rod domain and a C-terminal transmembrane KASH [Klarsicht-ANC-Syne-homology] domain and display N-terminal actin-binding CH (calponin homology) domains. Mutations in these proteins have been described in Emery-Dreifuss muscular dystrophy and attributed to disruptions of interactions at the NE with nesprins binding partners, lamin A/C and emerin. Evolutionary analysis of the rod domains of the nesprins has shown that they are almost entirely composed of unbroken SR-like structures. We present a bioinformatical approach to accurate definition of the boundaries of each SR by comparison with canonical SR structures, allowing for a large-scale homology modelling of the 74 nesprin-1 and 56 nesprin-2 SRs. The exposed and evolutionary conserved residues identify important pbs for protein-protein interactions that can guide tailored binding experiments. Most importantly, the bioinformatics analyses and the 3D models have been central to the design of selected constructs for protein expression. 1D NMR and CD spectra have been performed of the expressed SRs, showing a folded, stable, high content α-helical structure, typical of SRs. Molecular Dynamics simulations have been performed to study the structural and elastic properties of consecutive SRs, revealing insights in the mechanical properties adopted by these modules in the cell.
Streaming fragment assignment for real-time analysis of sequencing experiments

PubMed Central

Roberts, Adam; Pachter, Lior

2013-01-01

We present eXpress, a software package for highly efficient probabilistic assignment of ambiguously mapping sequenced fragments. eXpress uses a streaming algorithm with linear run time and constant memory use. It can determine abundances of sequenced molecules in real time, and can be applied to ChIP-seq, metagenomics and other large-scale sequencing data. We demonstrate its use on RNA-seq data, showing greater efficiency than other quantification methods. PMID:23160280
Pedigree reconstruction from SNP data: parentage assignment, sibship clustering and beyond.

PubMed

Huisman, Jisca

2017-09-01

Data on hundreds or thousands of single nucleotide polymorphisms (SNPs) provide detailed information about the relationships between individuals, but currently few tools can turn this information into a multigenerational pedigree. I present the r package sequoia, which assigns parents, clusters half-siblings sharing an unsampled parent and assigns grandparents to half-sibships. Assignments are made after consideration of the likelihoods of all possible first-, second- and third-degree relationships between the focal individuals, as well as the traditional alternative of being unrelated. This careful exploration of the local likelihood surface is implemented in a fast, heuristic hill-climbing algorithm. Distinction between the various categories of second-degree relatives is possible when likelihoods are calculated conditional on at least one parent of each focal individual. Performance was tested on simulated data sets with realistic genotyping error rate and missingness, based on three different large pedigrees (N = 1000-2000). This included a complex pedigree with overlapping generations, occasional close inbreeding and some unknown birth years. Parentage assignment was highly accurate down to about 100 independent SNPs (error rate <0.1%) and fast (<1 min) as most pairs can be excluded from being parent-offspring based on opposite homozygosity. For full pedigree reconstruction, 40% of parents were assumed nongenotyped. Reconstruction resulted in low error rates (<0.3%), high assignment rates (>99%) in limited computation time (typically <1 h) when at least 200 independent SNPs were used. In three empirical data sets, relatedness estimated from the inferred pedigree was strongly correlated to genomic relatedness. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
Preparation and characterization of monodisperse large-porous silica microspheres as the matrix for protein separation.

PubMed

Xia, Hongjun; Wan, Guangping; Zhao, Junlong; Liu, Jiawei; Bai, Quan

2016-11-04

High performance liquid chromatography (HPLC) is a kind of efficient separation technology and has been used widely in many fields. Micro-sized porous silica microspheres as the most popular matrix have been used for fast separation and analysis in HPLC. In this paper, the monodisperse large-porous silica microspheres with controllable size and structure were successfully synthesized with polymer microspheres as the templates and characterized. First, the poly(glycidyl methacrylate-co-ethyleneglycol dimethacrylate) microspheres (P GMA-EDMA ) were functionalized with tetraethylenepentamine (TEPA) to generate amino groups which act as a catalyst in hydrolysis of tetraethyl orthosilicate (TEOS) to form Si-containing low molecular weight species. Then the low molecular weight species diffused into the functionalized P GMA-EDMA microspheres by induction force of the amino groups to form polymer/silica hybrid microspheres. Finally, the organic polymer templates were removed by calcination, and the large-porous silica microspheres were obtained. The compositions, morphology, size distribution, specific surface area and pore size distribution of the porous silica microspheres were characterized by infrared analyzer, scanning-electron microscopy, dynamic laser scattering, the mercury intrusion method and thermal gravimetric analysis, respectively. The results show that the agglomeration of the hybrid microspheres can be overcome when the templates were functionalized with TEPA as amination reagent, and the yield of 95.7% of the monodisperse large-porous silica microspheres can be achieved with high concentration of polymer templates. The resulting large-porous silica microspheres were modified with octadecyltrichlorosilane (ODS) and the chromatographic evaluation was performed by separating the proteins and the digest of BSA. The baseline separation of seven kinds of protein standards was achieved, and the column delivered a better performance when separating BSA digests
Characterization of Thylakoid-Derived Lipid-Protein Particles Bearing the Large Subunit of Ribulose-1,5-Bisphosphate Carboxylase/Oxygenase.

PubMed Central

Smith, M. D.; Ghosh, S.; Dumbroff, E. B.; Thompson, J. E.

1997-01-01

Lipid-protein particles bearing the 55-kD ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) (EC 4.1.1.39) large subunit (RLSU) and no detectable corresponding Rubisco small subunit (RSSU) were isolated from the stroma of intact chloroplasts by flotation centrifugation. Stromal RLSU-bearing particles appear to originate from thylakoids because they can also be generated in vitro by illumination of isolated thylakoids. Their formation in vitro is largely heat denaturable and is facilitated by light or ATP. RLSU-containing lipid-protein particles range from 0.05 to 0.10 [mu]m in radius, contain the same fatty acids as thylakoids, but have a 10- to 15-fold higher free-to-esterified fatty acid ratio than thylakoids. RLSU-bearing lipid-protein particles with no detectable RSSU were also immunopurified from the populations of both stromal lipid-protein particles and those generated in vitro from illuminated thylakoids. Protease shaving indicated that the RLSU is embedded in the lipid-protein particles and that there is also a protease-protected RLSU in thylakoids. These observations collectively indicate that the RLSU associated with thylakoids is released into the stroma by light-facilitated blebbing of lipid-protein particles. The release of RLSU-containing particles may in turn be coordinated with the assembly of Rubisco holoenzyme because chaperonin 60 is also associated with lipid-protein particles isolated from stroma. PMID:12223858
The hepatitis B virus large surface protein (LHBs) is a transcriptional activator.

PubMed

Hildt, E; Saher, G; Bruss, V; Hofschneider, P H

1996-11-01

It has been shown that a C-terminally truncated form of the middle-sized hepatitis B virus (HBV) surface protein (MHBst) functions as a transcriptional activator. This function is dependent on the cytosolic orientation of the N-terminal PreS2 domain of MHBst, but in the case of wild-type MHBs, the PreS2 domain is contranslationally translocated into the ER lumen. Recent reports demonstrated that the PreS2 domain of the large HBV surface protein (LHBs) initially remains on the cytosolic side of the ER membrane after translation. Therefore, the question arose as to whether the LHBs protein exhibits the same transcriptional activator function as MHBst. We show that LHBs, like MHBst, is indeed able to activate a variety of promoter elements. There is evidence for a PKC-dependent activation of AP-1 and NF-kappa B by LHBs. Downstream of the PKC the functionality of c-Raf-1 kinase is a prerequisite for LHBs-dependent activation of AP-1 and NF-kappa B since inhibition of c-Raf-1 kinase abolishes LHBs-dependent transcriptional activation of AP-1 and NF-kappa B.
Assignment of protonation states in proteins and ligands: combining pKa prediction with hydrogen bonding network optimization.

PubMed

Krieger, Elmar; Dunbrack, Roland L; Hooft, Rob W W; Krieger, Barbara

2012-01-01

Among the many applications of molecular modeling, drug design is probably the one with the highest demands on the accuracy of the underlying structures. During lead optimization, the position of every atom in the binding site should ideally be known with high precision to identify those chemical modifications that are most likely to increase drug affinity. Unfortunately, X-ray crystallography at common resolution yields an electron density map that is too coarse, since the chemical elements and their protonation states cannot be fully resolved.This chapter describes the steps required to fill in the missing knowledge, by devising an algorithm that can detect and resolve the ambiguities. First, the pK (a) values of acidic and basic groups are predicted. Second, their potential protonation states are determined, including all permutations (considering for example protons that can jump between the oxygens of a phosphate group). Third, those groups of atoms are identified that can adopt alternative but indistinguishable conformations with essentially the same electron density. Fourth, potential hydrogen bond donors and acceptors are located. Finally, all these data are combined in a single "configuration energy function," whose global minimum is found with the SCWRL algorithm, which employs dead-end elimination and graph theory. As a result, one obtains a complete model of the protein and its bound ligand, with ambiguous groups rotated to the best orientation and with protonation states assigned considering the current pH and the H-bonding network. An implementation of the algorithm has been available since 2008 as part of the YASARA modeling & simulation program.
Evaluation of Automatically Assigned Job-Specific Interview Modules.

PubMed

Friesen, Melissa C; Lan, Qing; Ge, Calvin; Locke, Sarah J; Hosgood, Dean; Fritschi, Lin; Sadkowsky, Troy; Chen, Yu-Cheng; Wei, Hu; Xu, Jun; Lam, Tai Hing; Kwong, Yok Lam; Chen, Kexin; Xu, Caigang; Su, Yu-Chieh; Chiu, Brian C H; Ip, Kai Ming Dennis; Purdue, Mark P; Bassig, Bryan A; Rothman, Nat; Vermeulen, Roel

2016-08-01

In community-based epidemiological studies, job- and industry-specific 'modules' are often used to systematically obtain details about the subject's work tasks. The module assignment is often made by the interviewer, who may have insufficient occupational hygiene knowledge to assign the correct module. We evaluated, in the context of a case-control study of lymphoid neoplasms in Asia ('AsiaLymph'), the performance of an algorithm that provided automatic, real-time module assignment during a computer-assisted personal interview. AsiaLymph's occupational component began with a lifetime occupational history questionnaire with free-text responses and three solvent exposure screening questions. To assign each job to one of 23 study-specific modules, an algorithm automatically searched the free-text responses to the questions 'job title' and 'product made or services provided by employer' using a list of module-specific keywords, comprising over 5800 keywords in English, Traditional and Simplified Chinese. Hierarchical decision rules were used when the keyword match triggered multiple modules. If no keyword match was identified, a generic solvent module was assigned if the subject responded 'yes' to any of the three solvent screening questions. If these question responses were all 'no', a work location module was assigned, which redirected the subject to the farming, teaching, health professional, solvent, or industry solvent modules or ended the questions for that job, depending on the location response. We conducted a reliability assessment that compared the algorithm-assigned modules to consensus module assignments made by two industrial hygienists for a subset of 1251 (of 11409) jobs selected using a stratified random selection procedure using module-specific strata. Discordant assignments between the algorithm and consensus assignments (483 jobs) were qualitatively reviewed by the hygienists to evaluate the potential information lost from missed questions with using
7 CFR 97.130 - Recording of assignments.

Code of Federal Regulations, 2010 CFR

2010-01-01

... PLANT VARIETY AND PROTECTION Assignments and Recording § 97.130 Recording of assignments. (a) Any assignment of an application for a certificate, or of a certificate of plant variety protection, or of any interest in a variety, or any license or grant and conveyance of any right to use of the variety, may be...
Mining protein function from text using term-based support vector machines

PubMed Central

Rice, Simon B; Nenadic, Goran; Stapley, Benjamin J

2005-01-01

Background Text mining has spurred huge interest in the domain of biology. The goal of the BioCreAtIvE exercise was to evaluate the performance of current text mining systems. We participated in Task 2, which addressed assigning Gene Ontology terms to human proteins and selecting relevant evidence from full-text documents. We approached it as a modified form of the document classification task. We used a supervised machine-learning approach (based on support vector machines) to assign protein function and select passages that support the assignments. As classification features, we used a protein's co-occurring terms that were automatically extracted from documents. Results The results evaluated by curators were modest, and quite variable for different problems: in many cases we have relatively good assignment of GO terms to proteins, but the selected supporting text was typically non-relevant (precision spanning from 3% to 50%). The method appears to work best when a substantial set of relevant documents is obtained, while it works poorly on single documents and/or short passages. The initial results suggest that our approach can also mine annotations from text even when an explicit statement relating a protein to a GO term is absent. Conclusion A machine learning approach to mining protein function predictions from text can yield good performance only if sufficient training data is available, and significant amount of supporting data is used for prediction. The most promising results are for combined document retrieval and GO term assignment, which calls for the integration of methods developed in BioCreAtIvE Task 1 and Task 2. PMID:15960835
Natural bond orbital analysis in the ONETEP code: applications to large protein systems.

PubMed

Lee, Louis P; Cole, Daniel J; Payne, Mike C; Skylaris, Chris-Kriton

2013-03-05

First principles electronic structure calculations are typically performed in terms of molecular orbitals (or bands), providing a straightforward theoretical avenue for approximations of increasing sophistication, but do not usually provide any qualitative chemical information about the system. We can derive such information via post-processing using natural bond orbital (NBO) analysis, which produces a chemical picture of bonding in terms of localized Lewis-type bond and lone pair orbitals that we can use to understand molecular structure and interactions. We present NBO analysis of large-scale calculations with the ONETEP linear-scaling density functional theory package, which we have interfaced with the NBO 5 analysis program. In ONETEP calculations involving thousands of atoms, one is typically interested in particular regions of a nanosystem whilst accounting for long-range electronic effects from the entire system. We show that by transforming the Non-orthogonal Generalized Wannier Functions of ONETEP to natural atomic orbitals, NBO analysis can be performed within a localized region in such a way that ensures the results are identical to an analysis on the full system. We demonstrate the capabilities of this approach by performing illustrative studies of large proteins--namely, investigating changes in charge transfer between the heme group of myoglobin and its ligands with increasing system size and between a protein and its explicit solvent, estimating the contribution of electronic delocalization to the stabilization of hydrogen bonds in the binding pocket of a drug-receptor complex, and observing, in situ, the n → π* hyperconjugative interactions between carbonyl groups that stabilize protein backbones. Copyright © 2012 Wiley Periodicals, Inc.
Low resolution 1H NMR assignment of proton populations in pound cake and its polymeric ingredients.

PubMed

Luyts, A; Wilderjans, E; Waterschoot, J; Van Haesendonck, I; Brijs, K; Courtin, C M; Hills, B; Delcour, J A

2013-08-15

Based on a model system approach, five different proton populations were distinguished in pound cake crumb using one dimensional low resolution (1)H NMR spectroscopy. In free induction decay (FID) measurements, proton populations were assigned to (i) non-exchanging CH protons of crystalline starch, proteins and crystalline fat and (ii) non-exchanging CH protons of amorphous starch and gluten, which are in little contact with water. In Carr-Purcell-Meiboom-Gill (CPMG) measurements, three proton populations were distinguished. The CPMG population with the lowest mobility and the FID population with the highest mobility represent the same proton population. The two CPMG proton populations with the highest mobility were assigned to exchanging protons (i.e., protons of water, starch, gluten, egg proteins and sugar) and protons of lipids (i.e., protons of egg yolk lipids and amorphous lipid fraction of margarine) respectively. Based on their spin-lattice relaxation times (T1), two dimensional (1)H NMR spectroscopy further resolved the two proton populations with the highest mobility into three and two proton populations, respectively. Copyright © 2013 Elsevier Ltd. All rights reserved.
Aligator: A computational tool for optimizing total chemical synthesis of large proteins.

PubMed

Jacobsen, Michael T; Erickson, Patrick W; Kay, Michael S

2017-09-15

The scope of chemical protein synthesis (CPS) continues to expand, driven primarily by advances in chemical ligation tools (e.g., reversible solubilizing groups and novel ligation chemistries). However, the design of an optimal synthesis route can be an arduous and fickle task due to the large number of theoretically possible, and in many cases problematic, synthetic strategies. In this perspective, we highlight recent CPS tool advances and then introduce a new and easy-to-use program, Aligator (Automated Ligator), for analyzing and designing the most efficient strategies for constructing large targets using CPS. As a model set, we selected the E. coli ribosomal proteins and associated factors for computational analysis. Aligator systematically scores and ranks all feasible synthetic strategies for a particular CPS target. The Aligator script methodically evaluates potential peptide segments for a target using a scoring function that includes solubility, ligation site quality, segment lengths, and number of ligations to provide a ranked list of potential synthetic strategies. We demonstrate the utility of Aligator by analyzing three recent CPS projects from our lab: TNFα (157 aa), GroES (97 aa), and DapA (312 aa). As the limits of CPS are extended, we expect that computational tools will play an increasingly important role in the efficient execution of ambitious CPS projects such as production of a mirror-image ribosome. Copyright © 2017 Elsevier Ltd. All rights reserved.
Large scale analysis of protein-binding cavities using self-organizing maps and wavelet-based surface patches to describe functional properties, selectivity discrimination, and putative cross-reactivity.

PubMed

Kupas, Katrin; Ultsch, Alfred; Klebe, Gerhard

2008-05-15

A new method to discover similar substructures in protein binding pockets, independently of sequence and folding patterns or secondary structure elements, is introduced. The solvent-accessible surface of a binding pocket, automatically detected as a depression on the protein surface, is divided into a set of surface patches. Each surface patch is characterized by its shape as well as by its physicochemical characteristics. Wavelets defined on surfaces are used for the description of the shape, as they have the great advantage of allowing a comparison at different resolutions. The number of coefficients to describe the wavelets can be chosen with respect to the size of the considered data set. The physicochemical characteristics of the patches are described by the assignment of the exposed amino acid residues to one or more of five different properties determinant for molecular recognition. A self-organizing neural network is used to project the high-dimensional feature vectors onto a two-dimensional layer of neurons, called a map. To find similarities between the binding pockets, in both geometrical and physicochemical features, a clustering of the projected feature vector is performed using an automatic distance- and density-based clustering algorithm. The method was validated with a small training data set of 109 binding cavities originating from a set of enzymes covering 12 different EC numbers. A second test data set of 1378 binding cavities, extracted from enzymes of 13 different EC numbers, was then used to prove the discriminating power of the algorithm and to demonstrate its applicability to large scale analyses. In all cases, members of the data set with the same EC number were placed into coherent regions on the map, with small distances between them. Different EC numbers are separated by large distances between the feature vectors. A third data set comprising three subfamilies of endopeptidases is used to demonstrate the ability of the algorithm to
Current Progress in Tonoplast Proteomics Reveals Insights into the Function of the Large Central Vacuole

PubMed Central

Trentmann, Oliver; Haferkamp, Ilka

2013-01-01

Vacuoles of plants fulfill various biologically important functions, like turgor generation and maintenance, detoxification, solute sequestration, or protein storage. Different types of plant vacuoles (lytic versus protein storage) are characterized by different functional properties apparently caused by a different composition/abundance and regulation of transport proteins in the surrounding membrane, the tonoplast. Proteome analyses allow the identification of vacuolar proteins and provide an informative basis for assigning observed transport processes to specific carriers or channels. This review summarizes techniques required for vacuolar proteome analyses, like e.g., isolation of the large central vacuole or tonoplast membrane purification. Moreover, an overview about diverse published vacuolar proteome studies is provided. It becomes evident that qualitative proteomes from different plant species represent just the tip of the iceberg. During the past few years, mass spectrometry achieved immense improvement concerning its accuracy, sensitivity, and application. As a consequence, modern tonoplast proteome approaches are suited for detecting alterations in membrane protein abundance in response to changing environmental/physiological conditions and help to clarify the regulation of tonoplast transport processes. PMID:23459586

Theoretical and subjective bit assignments in transform picture

NASA Technical Reports Server (NTRS)

Jones, H. W., Jr.

1977-01-01

It is shown that all combinations of symmetrical input distributions with difference distortion measures give a bit assignment rule identical to the well-known rule for a Gaussian input distribution with mean-square error. Published work is examined to show that the bit assignment rule is useful for transforms of full pictures, but subjective bit assignments for transform picture coding using small block sizes are significantly different from the theoretical bit assignment rule. An intuitive explanation is based on subjective design experience, and a subjectively obtained bit assignment rule is given.
Computational Approaches to Simulation and Analysis of Large Conformational Transitions in Proteins

NASA Astrophysics Data System (ADS)

Seyler, Sean L.

In a typical living cell, millions to billions of proteins--nanomachines that fluctuate and cycle among many conformational states--convert available free energy into mechanochemical work. A fundamental goal of biophysics is to ascertain how 3D protein structures encode specific functions, such as catalyzing chemical reactions or transporting nutrients into a cell. Protein dynamics span femtosecond timescales (i.e., covalent bond oscillations) to large conformational transition timescales in, and beyond, the millisecond regime (e.g., glucose transport across a phospholipid bilayer). Actual transition events are fast but rare, occurring orders of magnitude faster than typical metastable equilibrium waiting times. Equilibrium molecular dynamics (EqMD) can capture atomistic detail and solute-solvent interactions, but even microseconds of sampling attainable nowadays still falls orders of magnitude short of transition timescales, especially for large systems, rendering observations of such "rare events" difficult or effectively impossible. Advanced path-sampling methods exploit reduced physical models or biasing to produce plausible transitions while balancing accuracy and efficiency, but quantifying their accuracy relative to other numerical and experimental data has been challenging. Indeed, new horizons in elucidating protein function necessitate that present methodologies be revised to more seamlessly and quantitatively integrate a spectrum of methods, both numerical and experimental. In this dissertation, experimental and computational methods are put into perspective using the enzyme adenylate kinase (AdK) as an illustrative example. We introduce Path Similarity Analysis (PSA)--an integrative computational framework developed to quantify transition path similarity. PSA not only reliably distinguished AdK transitions by the originating method, but also traced pathway differences between two methods back to charge-charge interactions (neglected by the
1H, 13C, and 15N backbone assignment and secondary structure of the receptor-binding domain of vascular endothelial growth factor.

PubMed Central

Fairbrother, W. J.; Champe, M. A.; Christinger, H. W.; Keyt, B. A.; Starovasnik, M. A.

1997-01-01

Nearly complete sequence-specific 1H, 13C, and 15N resonance assignments are reported for the backbone atoms of the receptor-binding domain of vascular endothelial growth factor (VEGF), a 23-kDa homodimeric protein that is a major regulator of both normal and pathological angiogenesis. The assignment strategy relied on the use of seven 3D triple-resonance experiments [HN(CO)CA, HNCA, HNCO, (HCA)CONH, HN(COCA)HA, HN(CA)HA, and CBCA-(CO)NH] and a 3D 15N-TOCSY-HSQC experiment recorded on a 0.5 mM (12 mg/mL) sample at 500 MHz, pH 7.0, 45 degrees C. Under these conditions, 15N relaxation data show that the protein has a rotational correlation time of 15.0 ns. Despite this unusually long correlation time, assignments were obtained for 94 of the 99 residues; 8 residues lack amide 1H and 15N assignments, presumably due to rapid exchange of the amide 1H with solvent under the experimental conditions used. The secondary structure of the protein was deduced from the chemical shift indices of the 1H alpha, 13C alpha, 13C beta, and 13CO nuclei, and from analysis of backbone NOEs observed in a 3D 15N-NOESY-HSQC spectrum. Two helices and a significant amount of beta-sheet structure were identified, in general agreement with the secondary structure found in a recently determined crystal structure of a similar VEGF construct [Muller YA et al., 1997, Proc Natl Acad Sci USA 94:7192-7197]. PMID:9336848
1H, 15N and 13C resonance assignments for free and IEEVD peptide-bound forms of the tetratricopeptide repeat domain from the human E3 ubiquitin ligase CHIP.

PubMed

Zhang, Huaqun; McGlone, Cameron; Mannion, Matthew M; Page, Richard C

2017-04-01

The ubiquitin ligase CHIP catalyzes covalent attachment of ubiquitin to unfolded proteins chaperoned by the heat shock proteins Hsp70/Hsc70 and Hsp90. CHIP interacts with Hsp70/Hsc70 and Hsp90 by binding of a C-terminal IEEVD motif found in Hsp70/Hsc70 and Hsp90 to the tetratricopeptide repeat (TPR) domain of CHIP. Although recruitment of heat shock proteins to CHIP via interaction with the CHIP-TPR domain is well established, alterations in structure and dynamics of CHIP upon binding are not well understood. In particular, the absence of a structure for CHIP-TPR in the free form presents a significant limitation upon studies seeking to rationally design inhibitors that may disrupt interactions between CHIP and heat shock proteins. Here we report the 1 H, 13 C, and 15 N backbone and side chain chemical shift assignments for CHIP-TPR in the free form, and backbone chemical shift assignments for CHIP-TPR in the IEEVD-bound form. The NMR resonance assignments will enable further studies examining the roles of dynamics and structure in regulating interactions between CHIP and the heat shock proteins Hsp70/Hsc70 and Hsp90.
Evaluation of Automatically Assigned Job-Specific Interview Modules

PubMed Central

Friesen, Melissa C.; Lan, Qing; Ge, Calvin; Locke, Sarah J.; Hosgood, Dean; Fritschi, Lin; Sadkowsky, Troy; Chen, Yu-Cheng; Wei, Hu; Xu, Jun; Lam, Tai Hing; Kwong, Yok Lam; Chen, Kexin; Xu, Caigang; Su, Yu-Chieh; Chiu, Brian C. H.; Ip, Kai Ming Dennis; Purdue, Mark P.; Bassig, Bryan A.; Rothman, Nat; Vermeulen, Roel

2016-01-01

Objective: In community-based epidemiological studies, job- and industry-specific ‘modules’ are often used to systematically obtain details about the subject’s work tasks. The module assignment is often made by the interviewer, who may have insufficient occupational hygiene knowledge to assign the correct module. We evaluated, in the context of a case–control study of lymphoid neoplasms in Asia (‘AsiaLymph’), the performance of an algorithm that provided automatic, real-time module assignment during a computer-assisted personal interview. Methods: AsiaLymph’s occupational component began with a lifetime occupational history questionnaire with free-text responses and three solvent exposure screening questions. To assign each job to one of 23 study-specific modules, an algorithm automatically searched the free-text responses to the questions ‘job title’ and ‘product made or services provided by employer’ using a list of module-specific keywords, comprising over 5800 keywords in English, Traditional and Simplified Chinese. Hierarchical decision rules were used when the keyword match triggered multiple modules. If no keyword match was identified, a generic solvent module was assigned if the subject responded ‘yes’ to any of the three solvent screening questions. If these question responses were all ‘no’, a work location module was assigned, which redirected the subject to the farming, teaching, health professional, solvent, or industry solvent modules or ended the questions for that job, depending on the location response. We conducted a reliability assessment that compared the algorithm-assigned modules to consensus module assignments made by two industrial hygienists for a subset of 1251 (of 11409) jobs selected using a stratified random selection procedure using module-specific strata. Discordant assignments between the algorithm and consensus assignments (483 jobs) were qualitatively reviewed by the hygienists to evaluate the potential
Protein-transitions in and out of the dough matrix in wheat flour mixing.

PubMed

Wang, Xiaolong; Appels, Rudi; Zhang, Xiaoke; Bekes, Ferenc; Torok, Kitti; Tomoskozi, Sandor; Diepeveen, Dean; Ma, Wujun; Islam, Shahidul

2017-02-15

Sequential protein behavior in the wheat dough matrix under continuous mixing and heating treatment has been studied using Mixolab-dough samples from two Australian wheat cultivars, Westonia and Wyalkatchem. Size exclusion high performance liquid chromatography (SE-HPLC) and two-dimensional gel electrophoresis (2-DGE) analysis indicated that 32min (80°C) was a critical time point in forming large protein complexes and loosing extractability of several protein groups like y-type high molecular weight glutenin subunits (HMW-GSs), gamma-gliadins, beta-amylases, serpins, and metabolic proteins with higher mass. Up to 32min (80°C) Westonia showed higher protein extractability compared to Wyalkatchem although it was in the opposite direction thereafter. Twenty differentially expressed proteins could be assigned to chromosomes 1D, 3A, 4A, 4B, 4D, 6A, 6B, 7A and 7B. The results expanded the range of proteins associated with changes in the gluten-complex during processing and provided targets for selecting new genetic variants associated with altered quality attributes of the flour. Copyright © 2016 Elsevier Ltd. All rights reserved.
Tracing the geographic origin of traded leopard body parts in the indian subcontinent with DNA-based assignment tests.

PubMed

Mondol, Samrat; Sridhar, Vanjulavalli; Yadav, Prasanjeet; Gubbi, Sanjay; Ramakrishnan, Uma

2015-04-01

Illicit trade in wildlife products is rapidly decimating many species across the globe. Such trade is often underestimated for wide-ranging species until it is too late for the survival of their remaining populations. Policing this trade could be vastly improved if one could reliably determine geographic origins of illegal wildlife products and identify areas where greater enforcement is needed. Using DNA-based assignment tests (i.e., samples are assigned to geographic locations), we addressed these factors for leopards (Panthera pardus) on the Indian subcontinent. We created geography-specific allele frequencies from a genetic reference database of 173 leopards across India to infer geographic origins of DNA samples from 40 seized leopard skins. Sensitivity analyses of samples of known geographic origins and assignments of seized skins demonstrated robust assignments for Indian leopards. We found that confiscated pelts seized in small numbers were not necessarily from local leopards. The geographic footprint of large seizures appeared to be bigger than the cumulative footprint of several smaller seizures, indicating widespread leopard poaching across the subcontinent. Our seized samples had male-biased sex ratios, especially the large seizures. From multiple seized sample assignments, we identified central India as a poaching hotspot for leopards. The techniques we applied can be used to identify origins of seized illegal wildlife products and trade routes at the subcontinent scale and beyond. © 2014 Society for Conservation Biology.
Survey of large protein complexes D. vulgaris reveals great structural diversity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Han, B.-G.; Dong, M.; Liu, H.

2009-08-15

An unbiased survey has been made of the stable, most abundant multi-protein complexes in Desulfovibrio vulgaris Hildenborough (DvH) that are larger than Mr {approx} 400 k. The quaternary structures for 8 of the 16 complexes purified during this work were determined by single-particle reconstruction of negatively stained specimens, a success rate {approx}10 times greater than that of previous 'proteomic' screens. In addition, the subunit compositions and stoichiometries of the remaining complexes were determined by biochemical methods. Our data show that the structures of only two of these large complexes, out of the 13 in this set that have recognizable functions,more » can be modeled with confidence based on the structures of known homologs. These results indicate that there is significantly greater variability in the way that homologous prokaryotic macromolecular complexes are assembled than has generally been appreciated. As a consequence, we suggest that relying solely on previously determined quaternary structures for homologous proteins may not be sufficient to properly understand their role in another cell of interest.« less
48 CFR 208.7002 - Assignment authority.

Code of Federal Regulations, 2012 CFR

2012-10-01

... activities concerned. (b) Under the Integrated Materiel Management Program, assignments are made by the... 48 Federal Acquisition Regulations System 3 2012-10-01 2012-10-01 false Assignment authority. 208.7002 Section 208.7002 Federal Acquisition Regulations System DEFENSE ACQUISITION REGULATIONS SYSTEM...
48 CFR 208.7002 - Assignment authority.

Code of Federal Regulations, 2013 CFR

2013-10-01

... activities concerned. (b) Under the Integrated Materiel Management Program, assignments are made by the... 48 Federal Acquisition Regulations System 3 2013-10-01 2013-10-01 false Assignment authority. 208.7002 Section 208.7002 Federal Acquisition Regulations System DEFENSE ACQUISITION REGULATIONS SYSTEM...
48 CFR 208.7002 - Assignment authority.

Code of Federal Regulations, 2011 CFR

2011-10-01

... activities concerned. (b) Under the Integrated Materiel Management Program, assignments are made by the... 48 Federal Acquisition Regulations System 3 2011-10-01 2011-10-01 false Assignment authority. 208.7002 Section 208.7002 Federal Acquisition Regulations System DEFENSE ACQUISITION REGULATIONS SYSTEM...
48 CFR 208.7002 - Assignment authority.

Code of Federal Regulations, 2014 CFR

2014-10-01

... activities concerned. (b) Under the Integrated Materiel Management Program, assignments are made by the... 48 Federal Acquisition Regulations System 3 2014-10-01 2014-10-01 false Assignment authority. 208.7002 Section 208.7002 Federal Acquisition Regulations System DEFENSE ACQUISITION REGULATIONS SYSTEM...
Joint Inference of Population Assignment and Demographic History

PubMed Central

Choi, Sang Chul; Hey, Jody

2011-01-01

A new approach to assigning individuals to populations using genetic data is described. Most existing methods work by maximizing Hardy–Weinberg and linkage equilibrium within populations, neither of which will apply for many demographic histories. By including a demographic model, within a likelihood framework based on coalescent theory, we can jointly study demographic history and population assignment. Genealogies and population assignments are sampled from a posterior distribution using a general isolation-with-migration model for multiple populations. A measure of partition distance between assignments facilitates not only the summary of a posterior sample of assignments, but also the estimation of the posterior density for the demographic history. It is shown that joint estimates of assignment and demographic history are possible, including estimation of population phylogeny for samples from three populations. The new method is compared to results of a widely used assignment method, using simulated and published empirical data sets. PMID:21775468
Assignment of selected hyperfine proton NMR resonances in the met forms of Glycera dibranchiata monomer hemoglobins and comparisons with sperm whale metmyoglobin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Constantinidis, I.; Satterlee, J.D.; Pandey, R.K.

1988-04-19

This work indicates a high degree of purity for our preparations of all three of the primary Glycera dibranchiata monomer hemoglobins and details assignments of the heme methyl and vinyl protons in the hyperfine shift region of the ferric (aquo.) protein forms. The assignments were carried out by reconstituting the apoproteins of each component with selectively deuteriated hemes. The results indicate that even though the individual component preparations consist of essentially a single protein, the proton NMR spectra indicate spectroscopic heterogeneity. Evidence is presented for identification and classification of major and minor protein forms that are present in solutions ofmore » each component. Finally, in contrast to previous results, a detailed analysis of the proton hyperfine shift patterns of the major and minor forms of each component, in comparison to the major and minor forms of metmyoglobin, leads to the conclusions that the corresponding forms of the proteins from each species have strikingly similar heme-globin contacts and display nearly identical heme electronic structures and coordination numbers.« less
From protein-protein interactions to protein co-expression networks: a new perspective to evaluate large-scale proteomic data.

PubMed

Vella, Danila; Zoppis, Italo; Mauri, Giancarlo; Mauri, Pierluigi; Di Silvestre, Dario

2017-12-01

The reductionist approach of dissecting biological systems into their constituents has been successful in the first stage of the molecular biology to elucidate the chemical basis of several biological processes. This knowledge helped biologists to understand the complexity of the biological systems evidencing that most biological functions do not arise from individual molecules; thus, realizing that the emergent properties of the biological systems cannot be explained or be predicted by investigating individual molecules without taking into consideration their relations. Thanks to the improvement of the current -omics technologies and the increasing understanding of the molecular relationships, even more studies are evaluating the biological systems through approaches based on graph theory. Genomic and proteomic data are often combined with protein-protein interaction (PPI) networks whose structure is routinely analyzed by algorithms and tools to characterize hubs/bottlenecks and topological, functional, and disease modules. On the other hand, co-expression networks represent a complementary procedure that give the opportunity to evaluate at system level including organisms that lack information on PPIs. Based on these premises, we introduce the reader to the PPI and to the co-expression networks, including aspects of reconstruction and analysis. In particular, the new idea to evaluate large-scale proteomic data by means of co-expression networks will be discussed presenting some examples of application. Their use to infer biological knowledge will be shown, and a special attention will be devoted to the topological and module analysis.
Differential protein expression, DNA binding and interaction with SV40 large tumour antigen implicate the p63-family of proteins in replicative senescence.

PubMed

Djelloul, Siham; Tarunina, Marina; Barnouin, Karin; Mackay, Alan; Jat, Parmjit S

2002-02-07

P53 activity plays a key role in mammalian cells when they undergo replicative senescence at their Hayflick limit. To determine whether p63 proteins, members of the family of p53-related genes, are also involved in this process, we examined their expression in serially passaged rat embryo fibroblasts. Upon senescence, two truncated DeltaNp63 proteins decreased in abundance whereas two TAp63 isoforms accumulated. 2-D gel analysis showed that the DeltaNp63 proteins underwent post-translational modifications in both proliferating and senescent cells. Direct binding of DeltaNp63 proteins to a p53 consensus motif was greater in proliferating cells than senescent cells. In contrast p63alpha isoforms bound to DNA in a p53 dependent manner and this was higher in senescent cells than proliferating cells. An interaction of p63alpha proteins with SV40 large tumour antigen was also detected and ectopic expression of DeltaNp63alpha can extend the lifespan of rat embryo fibroblasts. Taken together the results indicate that p63 proteins may play a role in replicative senescence either by competition for p53 DNA binding sites or by direct interaction with p53 protein bound to DNA.
MetaboID: a graphical user interface package for assignment of 1H NMR spectra of bodyfluids and tissues.

PubMed

MacKinnon, Neil; Somashekar, Bagganahalli S; Tripathi, Pratima; Ge, Wencheng; Rajendiran, Thekkelnaycke M; Chinnaiyan, Arul M; Ramamoorthy, Ayyalusamy

2013-01-01

Nuclear magnetic resonance based measurements of small molecule mixtures continues to be confronted with the challenge of spectral assignment. While multi-dimensional experiments are capable of addressing this challenge, the imposed time constraint becomes prohibitive, particularly with the large sample sets commonly encountered in metabolomic studies. Thus, one-dimensional spectral assignment is routinely performed, guided by two-dimensional experiments on a selected sample subset; however, a publicly available graphical interface for aiding in this process is currently unavailable. We have collected spectral information for 360 unique compounds from publicly available databases including chemical shift lists and authentic full resolution spectra, supplemented with spectral information for 25 compounds collected in-house at a proton NMR frequency of 900 MHz. This library serves as the basis for MetaboID, a Matlab-based user interface designed to aid in the one-dimensional spectral assignment process. The tools of MetaboID were built to guide resonance assignment in order of increasing confidence, starting from cursory compound searches based on chemical shift positions to analysis of authentic spike experiments. Together, these tools streamline the often repetitive task of spectral assignment. The overarching goal of the integrated toolbox of MetaboID is to centralize the one dimensional spectral assignment process, from providing access to large chemical shift libraries to providing a straightforward, intuitive means of spectral comparison. Such a toolbox is expected to be attractive to both experienced and new metabolomic researchers as well as general complex mixture analysts. Copyright © 2012 Elsevier Inc. All rights reserved.
Extra Large G-Protein Interactome Reveals Multiple Stress Response Function and Partner-Dependent XLG Subcellular Localization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Liang, Ying; Gao, Yajun; Jones, Alan M.

The three-member family of Arabidopsis extra-large G proteins (XLG1-3) defines the prototype of an atypical Ga subunit in the heterotrimeric G protein complex. Some recent evidence indicate that XLG subunits operate along with its Gbg dimer in root morphology, stress responsiveness, and cytokinin induced development, however downstream targets of activated XLG proteins in the stress pathways are rarely known. In order to assemble a set of candidate XLG-targeted proteins, a yeast two-hybrid complementation-based screen was performed using XLG protein baits to query interactions between XLG and partner protein found in glucose-treated seedlings, roots, and Arabidopsis cells in culture. Seventy twomore » interactors were identified and >60% of a test set displayed in vivo interaction with XLG proteins. Gene co-expression analysis shows that >70% of the interactors are positively correlated with the corresponding XLG partners. Gene Ontology enrichment for all the candidates indicates stress responses and posits a molecular mechanism involving a specific set of transcription factor partners to XLG. Genes encoding two of these transcription factors, SZF1 and 2, require XLG proteins for full NaCl-induced expression. Furthermore, the subcellular localization of the XLG proteins in the nucleus, endosome, and plasma membrane is dependent on the specific interacting partner.« less
Extra Large G-Protein Interactome Reveals Multiple Stress Response Function and Partner-Dependent XLG Subcellular Localization

DOE PAGES

Liang, Ying; Gao, Yajun; Jones, Alan M.

2017-06-13

The three-member family of Arabidopsis extra-large G proteins (XLG1-3) defines the prototype of an atypical Ga subunit in the heterotrimeric G protein complex. Some recent evidence indicate that XLG subunits operate along with its Gbg dimer in root morphology, stress responsiveness, and cytokinin induced development, however downstream targets of activated XLG proteins in the stress pathways are rarely known. In order to assemble a set of candidate XLG-targeted proteins, a yeast two-hybrid complementation-based screen was performed using XLG protein baits to query interactions between XLG and partner protein found in glucose-treated seedlings, roots, and Arabidopsis cells in culture. Seventy twomore » interactors were identified and >60% of a test set displayed in vivo interaction with XLG proteins. Gene co-expression analysis shows that >70% of the interactors are positively correlated with the corresponding XLG partners. Gene Ontology enrichment for all the candidates indicates stress responses and posits a molecular mechanism involving a specific set of transcription factor partners to XLG. Genes encoding two of these transcription factors, SZF1 and 2, require XLG proteins for full NaCl-induced expression. Furthermore, the subcellular localization of the XLG proteins in the nucleus, endosome, and plasma membrane is dependent on the specific interacting partner.« less
Efficiency of Adaptive Temperature-Based Replica Exchange for Sampling Large-Scale Protein Conformational Transitions.

PubMed

Zhang, Weihong; Chen, Jianhan

2013-06-11

Temperature-based replica exchange (RE) is now considered a principal technique for enhanced sampling of protein conformations. It is also recognized that existence of sharp cooperative transitions (such as protein folding/unfolding) can lead to temperature exchange bottlenecks and significantly reduce the sampling efficiency. Here, we revisit two adaptive temperature-based RE protocols, namely, exchange equalization (EE) and current maximization (CM), that were previously examined using atomistic simulations (Lee and Olson, J. Chem. Physics2011, 134, 24111). Both protocols aim to overcome exchange bottlenecks by adaptively adjusting the simulation temperatures, either to achieve uniform exchange rates (in EE) or to maximize temperature diffusion (CM). By designing a realistic yet computationally tractable coarse-grained protein model, one can sample many reversible folding/unfolding transitions using conventional constant temperature molecular dynamics (MD), standard REMD, EE-REMD, and CM-REMD. This allows rigorous evaluation of the sampling efficiency, by directly comparing the rates of folding/unfolding transitions and convergence of various thermodynamic properties of interest. The results demonstrate that both EE and CM can indeed enhance temperature diffusion compared to standard RE, by ∼3- and over 10-fold, respectively. Surprisingly, the rates of reversible folding/unfolding transitions are similar in all three RE protocols. The convergence rates of several key thermodynamic properties, including the folding stability and various 1D and 2D free energy surfaces, are also similar. Therefore, the efficiency of RE protocols does not appear to be limited by temperature diffusion, but by the inherent rates of spontaneous large-scale conformational rearrangements. This is particularly true considering that virtually all RE simulations of proteins in practice involve exchange attempt frequencies (∼ps(-1)) that are several orders of magnitude faster than the

28 CFR 548.17 - Work assignments.

Code of Federal Regulations, 2012 CFR

2012-07-01

... 28 Judicial Administration 2 2012-07-01 2012-07-01 false Work assignments. 548.17 Section 548.17 Judicial Administration BUREAU OF PRISONS, DEPARTMENT OF JUSTICE INSTITUTIONAL MANAGEMENT RELIGIOUS PROGRAMS Religious Beliefs and Practices of Committed Offenders § 548.17 Work assignments. When the...
28 CFR 548.17 - Work assignments.

Code of Federal Regulations, 2011 CFR

2011-07-01

... 28 Judicial Administration 2 2011-07-01 2011-07-01 false Work assignments. 548.17 Section 548.17 Judicial Administration BUREAU OF PRISONS, DEPARTMENT OF JUSTICE INSTITUTIONAL MANAGEMENT RELIGIOUS PROGRAMS Religious Beliefs and Practices of Committed Offenders § 548.17 Work assignments. When the...
28 CFR 548.17 - Work assignments.

Code of Federal Regulations, 2014 CFR

2014-07-01

... 28 Judicial Administration 2 2014-07-01 2014-07-01 false Work assignments. 548.17 Section 548.17 Judicial Administration BUREAU OF PRISONS, DEPARTMENT OF JUSTICE INSTITUTIONAL MANAGEMENT RELIGIOUS PROGRAMS Religious Beliefs and Practices of Committed Offenders § 548.17 Work assignments. When the...
28 CFR 548.17 - Work assignments.

Code of Federal Regulations, 2013 CFR

2013-07-01

... 28 Judicial Administration 2 2013-07-01 2013-07-01 false Work assignments. 548.17 Section 548.17 Judicial Administration BUREAU OF PRISONS, DEPARTMENT OF JUSTICE INSTITUTIONAL MANAGEMENT RELIGIOUS PROGRAMS Religious Beliefs and Practices of Committed Offenders § 548.17 Work assignments. When the...
28 CFR 548.17 - Work assignments.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 28 Judicial Administration 2 2010-07-01 2010-07-01 false Work assignments. 548.17 Section 548.17 Judicial Administration BUREAU OF PRISONS, DEPARTMENT OF JUSTICE INSTITUTIONAL MANAGEMENT RELIGIOUS PROGRAMS Religious Beliefs and Practices of Committed Offenders § 548.17 Work assignments. When the...
Direct detection of x-rays for protein crystallography employing a thick, large area CCD

DOEpatents

Atac, Muzaffer; McKay, Timothy

1999-01-01

An apparatus and method for directly determining the crystalline structure of a protein crystal. The crystal is irradiated by a finely collimated x-ray beam. The interaction of the x-ray beam with the crystal produces scattered x-rays. These scattered x-rays are detected by means of a large area, thick CCD which is capable of measuring a significant number of scattered x-rays which impact its surface. The CCD is capable of detecting the position of impact of the scattered x-ray on the surface of the CCD and the quantity of scattered x-rays which impact the same cell or pixel. This data is then processed in real-time and the processed data is outputted to produce a image of the structure of the crystal. If this crystal is a protein the molecular structure of the protein can be determined from the data received.
The reference frame of figure-ground assignment.

PubMed

Vecera, Shaun P

2004-10-01

Figure-ground assignment involves determining which visual regions are foreground figures and which are backgrounds. Although figure-ground processes provide important inputs to high-level vision, little is known about the reference frame in which the figure's features and parts are defined. Computational approaches have suggested a retinally based, viewer-centered reference frame for figure-ground assignment, but figural assignment could also be computed on the basis of environmental regularities in an environmental reference frame. The present research used a newly discovered cue, lower region, to examine the reference frame of figure-ground assignment. Possible reference frames were misaligned by changing the orientation of viewers by having them tilt their heads (Experiments 1 and 2) or turn them upside down (Experiment 3). The results of these experiments indicated that figure-ground perception followed the orientation of the viewer, suggesting a viewer-centered reference frame for figure-ground assignment.
Screening and large-scale expression of membrane proteins in mammalian cells for structural studies

PubMed Central

Goehring, April; Lee, Chia-Hsueh; Wang, Kevin H.; Michel, Jennifer Carlisle; Claxton, Derek P.; Baconguis, Isabelle; Althoff, Thorsten; Fischer, Suzanne; Garcia, K. Christopher; Gouaux, Eric

2014-01-01

Structural, biochemical and biophysical studies of eukaryotic membrane proteins are often hampered by difficulties in over-expression of the candidate molecule. Baculovirus transduction of mammalian cells (BacMam), although a powerful method to heterologously express membrane proteins, can be cumbersome for screening and expression of multiple constructs. We therefore developed plasmid Eric Gouaux (pEG) BacMam, a vector optimized for use in screening assays, as well as for efficient production of baculovirus and robust expression of the target protein. In this protocol we show how to use small-scale transient transfection and fluorescence-detection, size-exclusion chromatography (FSEC) experiments using a GFP-His8 tagged candidate protein to screen for monodispersity and expression level. Once promising candidates are identified, we describe how to generate baculovirus, transduce HEK293S GnTI− (N-acetylglucosaminyltransferase I-negative) cells in suspension culture, and over-express the candidate protein. We have used these methods to prepare pure samples of chicken acid-sensing ion channel 1a (cASIC1) and Caenorhabditis elegans glutamate-gated chloride channel (GluCl), for X-ray crystallography, demonstrating how to rapidly and efficiently screen hundreds of constructs and accomplish large-scale expression in 4-6 weeks. PMID:25299155
My Favorite Assignment.

ERIC Educational Resources Information Center

Hebert, Margaret; And Others

1991-01-01

Contains seven brief articles which offer assignments designed to help students perform job searches, write job application letters, answer difficult questions, write letters of resignation, alleviate fears of public speaking, use the interview effectively in the business communication, and develop listening skills. (PRA)
Effects of tea polyphenols on the post-mortem integrity of large yellow croaker (Pseudosciaena crocea) fillet proteins.

PubMed

Zhao, Jin; Lv, Weijin; Wang, Jinlin; Li, Jianrong; Liu, Xiaoxiang; Zhu, Junli

2013-12-01

Tea polyphenols (TP) are known to be important for the post-mortem deterioration of fish muscle and can enhance food quality. To shed light on the influence of TP on the status of large yellow croaker muscle proteins, control and treated fillets (0.1% TP, 0.2% TP and 0.3% TP, w/v) were analysed periodically for myofibrillar protein functional properties (Ca(2+)-ATPase activity, surface hydrophobicity, total sulfhydryl content, emulsion stability index and rheological behaviour). Degradation of the myofibrillar protein myosin could be clearly observed; several proteins were also observed to vary in abundance following post-mortem storage for 25 days. The present study offers new evidence that TP have an effective impact on muscle protein integrity post-mortem. Copyright © 2013 Elsevier Ltd. All rights reserved.
Functional assignment of solute-binding proteins of ABC transporters using a fluorescence-based thermal shift assay.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Giulliani, S. E.; Frank, A. E.; Collart, F. R.

2008-12-08

We have used a fluorescence-based thermal shift (FTS) assay to identify amino acids that bind to solute-binding proteins in the bacterial ABC transporter family. The assay was validated with a set of six proteins with known binding specificity and was consistently able to map proteins with their known binding ligands. The assay also identified additional candidate binding ligands for several of the amino acid-binding proteins in the validation set. We extended this approach to additional targets and demonstrated the ability of the FTS assay to unambiguously identify preferential binding for several homologues of amino acid-binding proteins with known specificity andmore » to functionally annotate proteins of unknown binding specificity. The assay is implemented in a microwell plate format and provides a rapid approach to validate an anticipated function or to screen proteins of unknown function. The ABC-type transporter family is ubiquitous and transports a variety of biological compounds, but the current annotation of the ligand-binding proteins is limited to mostly generic descriptions of function. The results illustrate the feasibility of the FTS assay to improve the functional annotation of binding proteins associated with ABC-type transporters and suggest this approach that can also be extended to other protein families.« less
Computational approaches to protein inference in shotgun proteomics

PubMed Central

2012-01-01

Shotgun proteomics has recently emerged as a powerful approach to characterizing proteomes in biological samples. Its overall objective is to identify the form and quantity of each protein in a high-throughput manner by coupling liquid chromatography with tandem mass spectrometry. As a consequence of its high throughput nature, shotgun proteomics faces challenges with respect to the analysis and interpretation of experimental data. Among such challenges, the identification of proteins present in a sample has been recognized as an important computational task. This task generally consists of (1) assigning experimental tandem mass spectra to peptides derived from a protein database, and (2) mapping assigned peptides to proteins and quantifying the confidence of identified proteins. Protein identification is fundamentally a statistical inference problem with a number of methods proposed to address its challenges. In this review we categorize current approaches into rule-based, combinatorial optimization and probabilistic inference techniques, and present them using integer programing and Bayesian inference frameworks. We also discuss the main challenges of protein identification and propose potential solutions with the goal of spurring innovative research in this area. PMID:23176300
fast_protein_cluster: parallel and optimized clustering of large-scale protein modeling data.

PubMed

Hung, Ling-Hong; Samudrala, Ram

2014-06-15

fast_protein_cluster is a fast, parallel and memory efficient package used to cluster 60 000 sets of protein models (with up to 550 000 models per set) generated by the Nutritious Rice for the World project. fast_protein_cluster is an optimized and extensible toolkit that supports Root Mean Square Deviation after optimal superposition (RMSD) and Template Modeling score (TM-score) as metrics. RMSD calculations using a laptop CPU are 60× faster than qcprot and 3× faster than current graphics processing unit (GPU) implementations. New GPU code further increases the speed of RMSD and TM-score calculations. fast_protein_cluster provides novel k-means and hierarchical clustering methods that are up to 250× and 2000× faster, respectively, than Clusco, and identify significantly more accurate models than Spicker and Clusco. fast_protein_cluster is written in C++ using OpenMP for multi-threading support. Custom streaming Single Instruction Multiple Data (SIMD) extensions and advanced vector extension intrinsics code accelerate CPU calculations, and OpenCL kernels support AMD and Nvidia GPUs. fast_protein_cluster is available under the M.I.T. license. (http://software.compbio.washington.edu/fast_protein_cluster) © The Author 2014. Published by Oxford University Press.
HMMER Cut-off Threshold Tool (HMMERCTTER): Supervised classification of superfamily protein sequences with a reliable cut-off threshold.

PubMed

Pagnuco, Inti Anabela; Revuelta, María Victoria; Bondino, Hernán Gabriel; Brun, Marcel; Ten Have, Arjen

2018-01-01

Protein superfamilies can be divided into subfamilies of proteins with different functional characteristics. Their sequences can be classified hierarchically, which is part of sequence function assignation. Typically, there are no clear subfamily hallmarks that would allow pattern-based function assignation by which this task is mostly achieved based on the similarity principle. This is hampered by the lack of a score cut-off that is both sensitive and specific. HMMER Cut-off Threshold Tool (HMMERCTTER) adds a reliable cut-off threshold to the popular HMMER. Using a high quality superfamily phylogeny, it clusters a set of training sequences such that the cluster-specific HMMER profiles show cluster or subfamily member detection with 100% precision and recall (P&R), thereby generating a specific threshold as inclusion cut-off. Profiles and thresholds are then used as classifiers to screen a target dataset. Iterative inclusion of novel sequences to groups and the corresponding HMMER profiles results in high sensitivity while specificity is maintained by imposing 100% P&R self detection. In three presented case studies of protein superfamilies, classification of large datasets with 100% precision was achieved with over 95% recall. Limits and caveats are presented and explained. HMMERCTTER is a promising protein superfamily sequence classifier provided high quality training datasets are used. It provides a decision support system that aids in the difficult task of sequence function assignation in the twilight zone of sequence similarity. All relevant data and source codes are available from the Github repository at the following URL: https://github.com/BBCMdP/HMMERCTTER.
HMMER Cut-off Threshold Tool (HMMERCTTER): Supervised classification of superfamily protein sequences with a reliable cut-off threshold

PubMed Central

Pagnuco, Inti Anabela; Revuelta, María Victoria; Bondino, Hernán Gabriel; Brun, Marcel

2018-01-01

Background Protein superfamilies can be divided into subfamilies of proteins with different functional characteristics. Their sequences can be classified hierarchically, which is part of sequence function assignation. Typically, there are no clear subfamily hallmarks that would allow pattern-based function assignation by which this task is mostly achieved based on the similarity principle. This is hampered by the lack of a score cut-off that is both sensitive and specific. Results HMMER Cut-off Threshold Tool (HMMERCTTER) adds a reliable cut-off threshold to the popular HMMER. Using a high quality superfamily phylogeny, it clusters a set of training sequences such that the cluster-specific HMMER profiles show cluster or subfamily member detection with 100% precision and recall (P&R), thereby generating a specific threshold as inclusion cut-off. Profiles and thresholds are then used as classifiers to screen a target dataset. Iterative inclusion of novel sequences to groups and the corresponding HMMER profiles results in high sensitivity while specificity is maintained by imposing 100% P&R self detection. In three presented case studies of protein superfamilies, classification of large datasets with 100% precision was achieved with over 95% recall. Limits and caveats are presented and explained. Conclusions HMMERCTTER is a promising protein superfamily sequence classifier provided high quality training datasets are used. It provides a decision support system that aids in the difficult task of sequence function assignation in the twilight zone of sequence similarity. All relevant data and source codes are available from the Github repository at the following URL: https://github.com/BBCMdP/HMMERCTTER. PMID:29579071
Comparison of weight-loss diets with different compositions of fat, protein, and carbohydrates.

PubMed

Sacks, Frank M; Bray, George A; Carey, Vincent J; Smith, Steven R; Ryan, Donna H; Anton, Stephen D; McManus, Katherine; Champagne, Catherine M; Bishop, Louise M; Laranjo, Nancy; Leboff, Meryl S; Rood, Jennifer C; de Jonge, Lilian; Greenway, Frank L; Loria, Catherine M; Obarzanek, Eva; Williamson, Donald A

2009-02-26

The possible advantage for weight loss of a diet that emphasizes protein, fat, or carbohydrates has not been established, and there are few studies that extend beyond 1 year. We randomly assigned 811 overweight adults to one of four diets; the targeted percentages of energy derived from fat, protein, and carbohydrates in the four diets were 20, 15, and 65%; 20, 25, and 55%; 40, 15, and 45%; and 40, 25, and 35%. The diets consisted of similar foods and met guidelines for cardiovascular health. The participants were offered group and individual instructional sessions for 2 years. The primary outcome was the change in body weight after 2 years in two-by-two factorial comparisons of low fat versus high fat and average protein versus high protein and in the comparison of highest and lowest carbohydrate content. At 6 months, participants assigned to each diet had lost an average of 6 kg, which represented 7% of their initial weight; they began to regain weight after 12 months. By 2 years, weight loss remained similar in those who were assigned to a diet with 15% protein and those assigned to a diet with 25% protein (3.0 and 3.6 kg, respectively); in those assigned to a diet with 20% fat and those assigned to a diet with 40% fat (3.3 kg for both groups); and in those assigned to a diet with 65% carbohydrates and those assigned to a diet with 35% carbohydrates (2.9 and 3.4 kg, respectively) (P>0.20 for all comparisons). Among the 80% of participants who completed the trial, the average weight loss was 4 kg; 14 to 15% of the participants had a reduction of at least 10% of their initial body weight. Satiety, hunger, satisfaction with the diet, and attendance at group sessions were similar for all diets; attendance was strongly associated with weight loss (0.2 kg per session attended). The diets improved lipid-related risk factors and fasting insulin levels. Reduced-calorie diets result in clinically meaningful weight loss regardless of which macronutrients they emphasize
Comparison of Weight-Loss Diets with Different Compositions of Fat, Protein, and Carbohydrates

PubMed Central

Sacks, Frank M.; Bray, George A.; Carey, Vincent J.; Smith, Steven R.; Ryan, Donna H.; Anton, Stephen D.; McManus, Katherine; Champagne, Catherine M.; Bishop, Louise M.; Laranjo, Nancy; Leboff, Meryl S.; Rood, Jennifer C.; de Jonge, Lilian; Greenway, Frank L.; Loria, Catherine M.; Obarzanek, Eva; Williamson, Donald A.

2009-01-01

BACKGROUND The possible advantage for weight loss of a diet that emphasizes protein, fat, or carbohydrates has not been established, and there are few studies that extend beyond 1 year. METHODS We randomly assigned 811 overweight adults to one of four diets; the targeted percentages of energy derived from fat, protein, and carbohydrates in the four diets were 20, 15, and 65%; 20, 25, and 55%; 40, 15, and 45%; and 40, 25, and 35%. The diets consisted of similar foods and met guidelines for cardiovascular health. The participants were offered group and individual instructional sessions for 2 years. The primary outcome was the change in body weight after 2 years in two-by-two factorial comparisons of low fat versus high fat and average protein versus high protein and in the comparison of highest and lowest carbohydrate content. RESULTS At 6 months, participants assigned to each diet had lost an average of 6 kg, which represented 7% of their initial weight; they began to regain weight after 12 months. By 2 years, weight loss remained similar in those who were assigned to a diet with 15% protein and those assigned to a diet with 25% protein (3.0 and 3.6 kg, respectively); in those assigned to a diet with 20% fat and those assigned to a diet with 40% fat (3.3 kg for both groups); and in those assigned to a diet with 65% carbohydrates and those assigned to a diet with 35% carbohydrates (2.9 and 3.4 kg, respectively) (P>0.20 for all comparisons). Among the 80% of participants who completed the trial, the average weight loss was 4 kg; 14 to 15% of the participants had a reduction of at least 10% of their initial body weight. Satiety, hunger, satisfaction with the diet, and attendance at group sessions were similar for all diets; attendance was strongly associated with weight loss (0.2 kg per session attended). The diets improved lipid-related risk factors and fasting insulin levels. CONCLUSIONS Reduced-calorie diets result in clinically meaningful weight loss regardless of
31 CFR 306.56 - Assignment of securities registered in the names of or assigned to two or more persons.

Code of Federal Regulations, 2010 CFR

2010-07-01

... assigned to two or more persons in the alternative, for example, “John B. Smith or Mrs. Mary J. Smith” or “John B. Smith or Mrs. Mary J. Smith or the survivor,” may be assigned by one of them at maturity or... the names of or assigned to two or more persons jointly, for example, “John B. Smith and Mrs. Mary J...
46 CFR 387.5 - Surplus property assignment recommendation.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 46 Shipping 8 2011-10-01 2011-10-01 false Surplus property assignment recommendation. 387.5 Section 387.5 Shipping MARITIME ADMINISTRATION, DEPARTMENT OF TRANSPORTATION MISCELLANEOUS UTILIZATION AND... property assignment recommendation. Before any assignment recommendation is submitted to the Disposal...
46 CFR 387.5 - Surplus property assignment recommendation.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 46 Shipping 8 2010-10-01 2010-10-01 false Surplus property assignment recommendation. 387.5 Section 387.5 Shipping MARITIME ADMINISTRATION, DEPARTMENT OF TRANSPORTATION MISCELLANEOUS UTILIZATION AND... property assignment recommendation. Before any assignment recommendation is submitted to the Disposal...

Large-scale gene function analysis with the PANTHER classification system.

PubMed

Mi, Huaiyu; Muruganujan, Anushya; Casagrande, John T; Thomas, Paul D

2013-08-01

The PANTHER (protein annotation through evolutionary relationship) classification system (http://www.pantherdb.org/) is a comprehensive system that combines gene function, ontology, pathways and statistical analysis tools that enable biologists to analyze large-scale, genome-wide data from sequencing, proteomics or gene expression experiments. The system is built with 82 complete genomes organized into gene families and subfamilies, and their evolutionary relationships are captured in phylogenetic trees, multiple sequence alignments and statistical models (hidden Markov models or HMMs). Genes are classified according to their function in several different ways: families and subfamilies are annotated with ontology terms (Gene Ontology (GO) and PANTHER protein class), and sequences are assigned to PANTHER pathways. The PANTHER website includes a suite of tools that enable users to browse and query gene functions, and to analyze large-scale experimental data with a number of statistical tests. It is widely used by bench scientists, bioinformaticians, computer scientists and systems biologists. In the 2013 release of PANTHER (v.8.0), in addition to an update of the data content, we redesigned the website interface to improve both user experience and the system's analytical capability. This protocol provides a detailed description of how to analyze genome-wide experimental data with the PANTHER classification system.
Development, social norms, and assignment to task

PubMed Central

Fafchamps, Marcel

2011-01-01

Economic development involves a structural transformation in the way people are allocated to tasks. There is a shift from self-provision to market exchange, facilitating specialization. There is also a shift from self-employment to wage employment in large firms and organizations, driven by innovation and increasing returns to scale. Changes in allocation mechanisms require changes in norms and attitudes. Because different labor assignment domains coexist, conflicts arise among norms that apply to different domains, possibly resulting in dysfunctional outcomes. I argue that religion, humanism, and schools have all played an important historical role in fostering the changes in social norms and attitudes that are needed to accompany structural changes in the way economies allocate workers to tasks. PMID:22198757
Large-scale analysis of intrinsic disorder flavors and associated functions in the protein sequence universe.

PubMed

Necci, Marco; Piovesan, Damiano; Tosatto, Silvio C E

2016-12-01

Intrinsic disorder (ID) in proteins has been extensively described for the last decade; a large-scale classification of ID in proteins is mostly missing. Here, we provide an extensive analysis of ID in the protein universe on the UniProt database derived from sequence-based predictions in MobiDB. Almost half the sequences contain an ID region of at least five residues. About 9% of proteins have a long ID region of over 20 residues which are more abundant in Eukaryotic organisms and most frequently cover less than 20% of the sequence. A small subset of about 67,000 (out of over 80 million) proteins is fully disordered and mostly found in Viruses. Most proteins have only one ID, with short ID evenly distributed along the sequence and long ID overrepresented in the center. The charged residue composition of Das and Pappu was used to classify ID proteins by structural propensities and corresponding functional enrichment. Swollen Coils seem to be used mainly as structural components and in biosynthesis in both Prokaryotes and Eukaryotes. In Bacteria, they are confined in the nucleoid and in Viruses provide DNA binding function. Coils & Hairpins seem to be specialized in ribosome binding and methylation activities. Globules & Tadpoles bind antigens in Eukaryotes but are involved in killing other organisms and cytolysis in Bacteria. The Undefined class is used by Bacteria to bind toxic substances and mediate transport and movement between and within organisms in Viruses. Fully disordered proteins behave similarly, but are enriched for glycine residues and extracellular structures. © 2016 The Protein Society.
MALDI Top-Down sequencing: calling N- and C-terminal protein sequences with high confidence and speed.

PubMed

Suckau, Detlev; Resemann, Anja

2009-12-01

The ability to match Top-Down protein sequencing (TDS) results by MALDI-TOF to protein sequences by classical protein database searching was evaluated in this work. Resulting from these analyses were the protein identity, the simultaneous assignment of the N- and C-termini and protein sequences of up to 70 residues from either terminus. In combination with de novo sequencing using the MALDI-TDS data, even fusion proteins were assigned and the detailed sequence around the fusion site was elucidated. MALDI-TDS allowed to efficiently match protein sequences quickly and to validate recombinant protein structures-in particular, protein termini-on the level of undigested proteins.
UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

PubMed

Meinicke, Peter

2009-09-02

Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families. Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time. For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.
Figure-ground assignment to a translating contour: a preference for advancing vs. receding motion.

PubMed

Barenholtz, Elan; Tarr, Michael J

2009-05-28

Past research on figure-ground assignment to contours has largely considered static stimuli. Here we report a simple and extremely robust dynamic cue to figural assignment, based on whether the bounding region of a contour is growing larger within the field of view ("advancing") rather than smaller ("receding"). Subjects viewed a straight or jagged contour dividing two colored regions translating behind a virtual aperture and had to report which color they had seen "moving in front", effectively assigning figure to that side of the contour. Across three experiments, subjects showed a strong preference to assign figure such that the bounded contour was advancing. This was true regardless of the direction of motion of the contour and regardless of the initial/ending size of the bounded regions (i.e., the motion cue served to override the conventional cue to figure-ground of smaller area). In a fourth, control experiment, subjects showed no such bias when it was the aperture, rather than the contour, that moved, demonstrating that the effect depends on contour motion and not simply an increase in area. We discuss a possible explanation for this bias as well as the general implications regarding dynamic factors in form perception.
Replica exchange molecular dynamics simulation of structure variation from α/4β-fold to 3α-fold protein.

PubMed

Lazim, Raudah; Mei, Ye; Zhang, Dawei

2012-03-01

Replica exchange molecular dynamics (REMD) simulation provides an efficient conformational sampling tool for the study of protein folding. In this study, we explore the mechanism directing the structure variation from α/4β-fold protein to 3α-fold protein after mutation by conducting REMD simulation on 42 replicas with temperatures ranging from 270 K to 710 K. The simulation began from a protein possessing the primary structure of GA88 but the tertiary structure of GB88, two G proteins with "high sequence identity." Albeit the large Cα-root mean square deviation (RMSD) of the folded protein (4.34 Å at 270 K and 4.75 Å at 304 K), a variation in tertiary structure was observed. Together with the analysis of secondary structure assignment, cluster analysis and principal component, it provides insights to the folding and unfolding pathway of 3α-fold protein and α/4β-fold protein respectively paving the way toward the understanding of the ongoings during conformational variation.
Large-scale crystallization of proteins for purification and formulation.

PubMed

Hekmat, Dariusch

2015-07-01

Since about 170 years, salts were used to create supersaturated solutions and crystallize proteins. The dehydrating effect of salts as well as their kosmotropic or chaotropic character was revealed. Even the suitability of organic solvents for crystallization was already recognized. Interestingly, what was performed during the early times is still practiced today. A lot of effort was put into understanding the underlying physico-chemical interaction mechanisms leading to protein crystallization. However, it was understood that already the solvation of proteins is a highly complex process not to mention the intricate interrelation of electrostatic and hydrophobic interactions taking place. Although many basic questions are still unanswered, preparative protein crystallization was attempted as illustrated in the presented case studies. Due to the highly variable nature of crystallization, individual design of the crystallization process is needed in every single case. It was shown that preparative crystallization from impure protein solutions as a capture step is possible after applying adequate pre-treatment procedures like precipitation or extraction. Protein crystallization can replace one or more chromatography steps. It was further shown that crystallization can serve as an attractive alternative means for formulation of therapeutic proteins. Crystalline proteins can offer enhanced purity and enable highly concentrated doses of the active ingredient. Easy scalability of the proposed protein crystallization processes was shown using the maximum local energy dissipation as a suitable scale-up criterion. Molecular modeling and target-oriented protein engineering may allow protein crystallization to become part of a platform purification process in the near future.
Homework assignments in couple and family therapy.

PubMed

Dattilio, Frank M

2002-05-01

Homework has been cited as an integral part of a number of theoretical orientations and therapy formats; unfortunately, very little has been written about its use with couples and families. This is despite the fact that many couple and family therapists espouse the use of homework or out-of-session assignments in order to help the effects of therapy jell. This article reviews some of the empirical literature on homework assignments and their effectiveness in the domain of therapy for families and couples. It also highlights the effectiveness of and the need for out-of-session assignments in treatment. A case illustration is used to demonstrate how homework assignments may be used as a significant change agent in couple and family treatment. Copyright 2002 Wiley Periodicals, Inc.
Quadratic partial eigenvalue assignment in large-scale stochastic dynamic systems for resilient and economic design

DOE Office of Scientific and Technical Information (OSTI.GOV)

Das, Sonjoy; Goswami, Kundan; Datta, Biswa N.

2014-12-10

Failure of structural systems under dynamic loading can be prevented via active vibration control which shifts the damped natural frequencies of the systems away from the dominant range of loading spectrum. The damped natural frequencies and the dynamic load typically show significant variations in practice. A computationally efficient methodology based on quadratic partial eigenvalue assignment technique and optimization under uncertainty has been formulated in the present work that will rigorously account for these variations and result in an economic and resilient design of structures. A novel scheme based on hierarchical clustering and importance sampling is also developed in this workmore » for accurate and efficient estimation of probability of failure to guarantee the desired resilience level of the designed system. Numerical examples are presented to illustrate the proposed methodology.« less
Writing Assignments that Promote Active Learning

NASA Astrophysics Data System (ADS)

Narayanan, M.

2014-12-01

Encourage students to write a detailed, analytical report correlating classroom discussions to an important historical event or a current event. Motivate students interview an expert from industry on a topic that was discussed in class. Ask the students to submit a report with supporting sketches, drawings, circuit diagrams and graphs. Propose that the students generate a complete a set of reading responses pertaining to an assigned topic. Require each student to bring in one comment or one question about an assigned reading. The assignment should be a recent publication in an appropriate journal. Have the students conduct a web search on an assigned topic. Ask them to generate a set of ideas that can relate to classroom discussions. Provide the students with a study guide. The study guide should provide about 10 or 15 short topics. Quiz the students on one or two of the topics. Encourage the students to design or develop some creative real-world examples based on a chapter discussed or a topic of interest. Require that students originate, develop, support and defend a viewpoint using a specifically assigned material. Make the students practice using or utilizing a set of new technical terms they have encountered in an assigned chapter. Have students develop original examples explaining the different terms. Ask the students to select one important terminology from the previous classroom discussions. Encourage the students to explain why they selected that particular word. Ask them to talk about the importance of the terminology from the point of view of their educational objectives and future career. Angelo, T. A. (1991). Ten easy pieces: Assessing higher learning in four dimensions. In T. A. Angelo (Ed.), Classroom research: Early lessons from success (pp. 17-31). New Directions for Teaching and Learning, No. 46. San Francisco: Jossey-Bass.
47 CFR 78.18 - Frequency assignments.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 47 Telecommunication 4 2012-10-01 2012-10-01 false Frequency assignments. 78.18 Section 78.18... SERVICE Applications and Licenses § 78.18 Frequency assignments. (a) The Cable Television Relay Service is... Relay Service stations operating on frequencies in the sub-bands 18.3-18.58 GHz and 19.26-19.3 GHz that...
47 CFR 78.18 - Frequency assignments.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 47 Telecommunication 4 2010-10-01 2010-10-01 false Frequency assignments. 78.18 Section 78.18... SERVICE Applications and Licenses § 78.18 Frequency assignments. (a) The Cable Television Relay Service is... Relay Service stations operating on frequencies in the sub-bands 18.3-18.58 GHz and 19.26-19.3 GHz that...
47 CFR 78.18 - Frequency assignments.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 47 Telecommunication 4 2014-10-01 2014-10-01 false Frequency assignments. 78.18 Section 78.18... SERVICE Applications and Licenses § 78.18 Frequency assignments. (a) The Cable Television Relay Service is... Relay Service stations operating on frequencies in the sub-bands 18.3-18.58 GHz and 19.26-19.3 GHz that...
47 CFR 78.18 - Frequency assignments.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 47 Telecommunication 4 2011-10-01 2011-10-01 false Frequency assignments. 78.18 Section 78.18... SERVICE Applications and Licenses § 78.18 Frequency assignments. (a) The Cable Television Relay Service is... Relay Service stations operating on frequencies in the sub-bands 18.3-18.58 GHz and 19.26-19.3 GHz that...
47 CFR 78.18 - Frequency assignments.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 47 Telecommunication 4 2013-10-01 2013-10-01 false Frequency assignments. 78.18 Section 78.18... SERVICE Applications and Licenses § 78.18 Frequency assignments. (a) The Cable Television Relay Service is... Relay Service stations operating on frequencies in the sub-bands 18.3-18.58 GHz and 19.26-19.3 GHz that...
47 CFR 74.702 - Channel assignments.

Code of Federal Regulations, 2010 CFR

2010-10-01

..., AUXILIARY, SPECIAL BROADCAST AND OTHER PROGRAM DISTRIBUTIONAL SERVICES Low Power TV, TV Translator, and TV Booster Stations § 74.702 Channel assignments. (a) An applicant for a new low power TV or TV translator... standard VHF Channels (2 to 13 inclusive) may be assigned to a VHF low power TV or TV translator station...
Sensitive spectroscopic detection of large and denatured protein aggregates in solution by use of the fluorescent dye Nile red.

PubMed

Sutter, Marc; Oliveira, Sabrina; Sanders, Niek N; Lucas, Bart; van Hoek, Arie; Hink, Mark A; Visser, Antonie J W G; De Smedt, Stefaan C; Hennink, Wim E; Jiskoot, Wim

2007-03-01

The fluorescent dye Nile red was used as a probe for the sensitive detection of large, denatured aggregates of the model protein beta-galactosidase (E. coli) in solution. Aggregates were formed by irreversible heat denaturation of beta-galactosidase below and above the protein's unfolding temperature of 57.4 degrees C, and the presence of aggregates in heated solutions was confirmed by static light scattering. Interaction of Nile red with beta-galactosidase aggregates led to a shift of the emission maximum (lambda (max)) from 660 to 611 nm, and to an increase of fluorescence intensity. Time-resolved fluorescence and fluorescence correlation spectroscopy (FCS) measurements showed that Nile red detected large aggregates with hydrodynamic radii around 130 nm. By steady-state fluorescence measurements, it was possible to detect 1 nM of denatured and aggregated beta-galactosidase in solution. The comparison with size exclusion chromatography (SEC) showed that native beta-galactosidase and small aggregates thereof had no substantial effect on the fluorescence of Nile red. Large aggregates were not detected by SEC, because they were excluded from the column. The results with beta-galactosidase demonstrate the potential of Nile red for developing complementary analytical methods that overcome the size limitations of SEC, and can detect the formation of large protein aggregates at early stages.
Structural study of the membrane protein MscL using cell-free expression and solid-state NMR

NASA Astrophysics Data System (ADS)

Abdine, Alaa; Verhoeven, Michiel A.; Park, Kyu-Ho; Ghazi, Alexandre; Guittet, Eric; Berrier, Catherine; Van Heijenoort, Carine; Warschawski, Dror E.

2010-05-01

High-resolution structures of membrane proteins have so far been obtained mostly by X-ray crystallography, on samples where the protein is surrounded by detergent. Recent developments of solid-state NMR have opened the way to a new approach for the study of integral membrane proteins inside a membrane. At the same time, the extension of cell-free expression to the production of membrane proteins allows for the production of proteins tailor made for NMR. We present here an in situ solid-state NMR study of a membrane protein selectively labeled through the use of cell-free expression. The sample consists of MscL (mechano-sensitive channel of large conductance), a 75 kDa pentameric α-helical ion channel from Escherichia coli, reconstituted in a hydrated lipid bilayer. Compared to a uniformly labeled protein sample, the spectral crowding is greatly reduced in the cell-free expressed protein sample. This approach may be a decisive step required for spectral assignment and structure determination of membrane proteins by solid-state NMR.
Identifying and Remediating Student Misconceptions in Introductory Biology via Writing-to-Learn Assignments and Peer Review.

PubMed

Halim, Audrey S; Finkenstaedt-Quinn, Solaire A; Olsen, Laura J; Gere, Anne Ruggles; Shultz, Ginger V

2018-06-01

Student misconceptions are an obstacle in science, technology, engineering, and mathematics courses and unless remediated may continue causing difficulties in learning as students advance in their studies. Writing-to-learn assignments (WTL) are characterized by their ability to promote in-depth conceptual learning by allowing students to explore their understanding of a topic. This study sought to determine whether and what types of misconceptions are elicited by WTL assignments and how the process of peer review and revision leads to remediation or propagation of misconceptions. We examined four WTL assignments in an introductory biology course in which students first wrote about content by applying it to a realistic scenario, then participated in a peer-review process before revising their work. Misconceptions were identified in all four assignments, with the greatest number pertaining to protein structure and function. Additionally, in certain contexts, students used scientific terminology incorrectly. Analysis of the drafts and peer-review comments generated six profiles by which misconceptions were addressed through the peer-review process. The prevalent mode of remediation arose through directed peer-review comments followed by correction during revision. It was also observed that additional misconceptions were elicited as students revised their writing in response to general peer-review suggestions.

Maternal Obesity Induces Sustained Inflammation in Both Fetal and Offspring Large Intestine of Sheep

PubMed Central

Yan, Xu; Huang, Yan; Wang, Hui; Du, Min; Hess, Bret W.; Ford, Stephen P.; Nathanielsz, Peter W.; Zhu, Mei-Jun

2010-01-01

Background Both maternal obesity and inflammatory bowel diseases (IBDs) are increasing. It was hypothesized that maternal obesity induces an inflammatory response in the fetal large intestine, predisposing offspring to IBDs. Methods Nonpregnant ewes were assigned to a control (Con, 100% of National Research Council [NRC] recommendations) or obesogenic (OB, 150% of NRC) diet from 60 days before conception. The large intestine was sampled from fetuses at 135 days (term 150 days) after conception and from offspring lambs at 22.5 ± 0.5 months of age. Results Maternal obesity enhanced mRNA expression tumor necrosis factor (TNF)α, interleukin (IL)1α, IL1β, IL6, IL8, and monocyte/macrophage chemotactic protein-1 (MCP1), as well as macrophage markers, CD11b, CD14, and CD68 in fetal gut. mRNA expression of Toll-like receptor (TLR) 2 and TLR4 was increased in OB versus Con fetuses; correspondingly, inflammatory NF-κB and JNK signaling pathways were also upregulated. Both mRNA expression and protein content of transforming growth factor (TGF) β was increased. The IL-17A mRNA expression and protein content was higher in OB compared to Con samples, which was associated with fibrosis in the large intestine of OB fetuses. Similar inflammatory responses and enhanced fibrosis were detected in OB compared to Con offspring. Conclusions Maternal obesity induced inflammation and enhanced expression of proinflammatory cytokines in fetal and offspring large intestine, which correlated with increased TGFβ and IL17 expression. These data show that maternal obesity may predispose offspring gut to IBDs. PMID:21674707
Extending the Applicability of Exact Nuclear Overhauser Enhancements to Large Proteins and RNA.

PubMed

Nichols, Parker; Born, Alexandra; Henen, Morkos; Strotz, Dean; Chi, Celestine N; Güntert, Peter; Vögeli, Beat Rolf

2018-06-08

Distance-dependent NOEs are one of the most popular and important experimental restraints for calculating NMR structures. Despite this, they are mostly employed as semi-quantitative upper distance bounds, which discards a wealth of information that is encoded in the cross-relaxation rate constant. Information that is lost includes exact distances between protons and dynamics that occur on the sub-millisecond time-scale. Our recently introduced exact measurement of the NOE (eNOE) requires little additional experimental effort relative to other NMR observables. So far, we have used eNOEs to calculate multi-state ensembles of proteins up to ~150 residues. Here, we briefly revisit the eNOE methodology and present two new directions for the use of eNOEs: Applications to large proteins and RNA. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
32 CFR 241.6 - Length of assignments.

Code of Federal Regulations, 2011 CFR

2011-07-01

... Components and private sector organizations. (b) No assignment may commence after September 30, 2013, unless an individual began an assignment by September 30, 2013. This extension may be granted in 3-month...
28 CFR 301.103 - Inmate work assignments.

Code of Federal Regulations, 2010 CFR

2010-07-01

... designates work assignments, or whoever makes work assignments, shall review appropriate medical records... compatible with the inmate's physical ability or condition. [55 FR 9296, Mar. 12, 1990, as amended at 59 FR...
Enrichment and separation techniques for large-scale proteomics analysis of the protein post-translational modifications.

PubMed

Huang, Junfeng; Wang, Fangjun; Ye, Mingliang; Zou, Hanfa

2014-11-06

Comprehensive analysis of the post-translational modifications (PTMs) on proteins at proteome level is crucial to elucidate the regulatory mechanisms of various biological processes. In the past decades, thanks to the development of specific PTM enrichment techniques and efficient multidimensional liquid chromatography (LC) separation strategy, the identification of protein PTMs have made tremendous progress. A huge number of modification sites for some major protein PTMs have been identified by proteomics analysis. In this review, we first introduced the recent progresses of PTM enrichment methods for the analysis of several major PTMs including phosphorylation, glycosylation, ubiquitination, acetylation, methylation, and oxidation/reduction status. We then briefly summarized the challenges for PTM enrichment. Finally, we introduced the fractionation and separation techniques for efficient separation of PTM peptides in large-scale PTM analysis. Copyright © 2014 Elsevier B.V. All rights reserved.
4 CFR 3.1 - Appointment, promotion, and assignment.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 4 Accounts 1 2010-01-01 2010-01-01 false Appointment, promotion, and assignment. 3.1 Section 3.1 Accounts GOVERNMENT ACCOUNTABILITY OFFICE PERSONNEL SYSTEM EMPLOYMENT § 3.1 Appointment, promotion, and assignment. Employees of GAO shall be appointed, promoted and assigned solely on the basis of merit and...
7 CFR 400.305 - Assignment of Nonstandard Classifications.

Code of Federal Regulations, 2011 CFR

2011-01-01

... tables for the county. (b) Nonstandard classification assignment will be made each year, for the year... 7 Agriculture 6 2011-01-01 2011-01-01 false Assignment of Nonstandard Classifications. 400.305... Underwriting Classification System Regulations for the 1991 and Succeeding Crop Years § 400.305 Assignment of...
7 CFR 400.305 - Assignment of Nonstandard Classifications.

Code of Federal Regulations, 2010 CFR

2010-01-01

... tables for the county. (b) Nonstandard classification assignment will be made each year, for the year... 7 Agriculture 6 2010-01-01 2010-01-01 false Assignment of Nonstandard Classifications. 400.305... Underwriting Classification System Regulations for the 1991 and Succeeding Crop Years § 400.305 Assignment of...
Detecting Plagiarism in MS Access Assignments

ERIC Educational Resources Information Center

Singh, Anil

2013-01-01

Assurance of individual effort from students in computer-based assignments is a challenge. Due to digitization, students can easily use a copy of their friend's work and submit it as their own. Plagiarism in assignments puts students who cheat at par with those who work honestly and this compromises the learning evaluation process. Using a…
Absolute Points for Multiple Assignment Problems

ERIC Educational Resources Information Center

Adlakha, V.; Kowalski, K.

2006-01-01

An algorithm is presented to solve multiple assignment problems in which a cost is incurred only when an assignment is made at a given cell. The proposed method recursively searches for single/group absolute points to identify cells that must be loaded in any optimal solution. Unlike other methods, the first solution is the optimal solution. The…
NMR resonance assignments of a hypoallergenic isoform of the major birch pollen allergen Bet v 1.

PubMed

Ahammer, Linda; Grutsch, Sarina; Wallner, Michael; Ferreira, Fatima; Tollinger, Martin

2017-10-01

In Northern America and Europe a great number of people are suffering from birch pollen allergy and pollen related food allergies. The trigger for these immunological reactions is the 17.5 kDa major birch pollen allergen Bet v 1, which belongs to the family of PR-10 (pathogenesis-related) proteins. In nature, Bet v 1 occurs as a mixture of various isoforms that possess different immunological properties despite their high sequence identities. Bet v 1.0102 (Bet v 1d), which is investigated here, is a hypoallergenic isoform of Bet v 1 and a potential candidate for allergen-specific immunotherapy. We assigned the backbone and side chain 1 H, 13 C and 15 N resonances of this protein and predicted its secondary structure. The NMR-chemical shift data indicate that Bet v 1.0102 is composed of three α-helices and a seven stranded β-sheet, in agreement with the known structure of the hyperallergenic isoform Bet v 1.0101 (Bet v 1a). Our resonance assignments create the foundation for detailed characterization of the dynamic properties of Bet v 1 isoforms by NMR relaxation measurements.
Four signature motifs define the first class of structurally related large coiled-coil proteins in plants.

PubMed Central

Gindullis, Frank; Rose, Annkatrin; Patel, Shalaka; Meier, Iris

2002-01-01

Background Animal and yeast proteins containing long coiled-coil domains are involved in attaching other proteins to the large, solid-state components of the cell. One subgroup of long coiled-coil proteins are the nuclear lamins, which are involved in attaching chromatin to the nuclear envelope and have recently been implicated in inherited human diseases. In contrast to other eukaryotes, long coiled-coil proteins have been barely investigated in plants. Results We have searched the completed Arabidopsis genome and have identified a family of structurally related long coiled-coil proteins. Filament-like plant proteins (FPP) were identified by sequence similarity to a tomato cDNA that encodes a coiled-coil protein which interacts with the nuclear envelope-associated protein, MAF1. The FPP family is defined by four novel unique sequence motifs and by two clusters of long coiled-coil domains separated by a non-coiled-coil linker. All family members are expressed in a variety of Arabidopsis tissues. A homolog sharing the structural features was identified in the monocot rice, indicating conservation among angiosperms. Conclusion Except for myosins, this is the first characterization of a family of long coiled-coil proteins in plants. The tomato homolog of the FPP family binds in a yeast two-hybrid assay to a nuclear envelope-associated protein. This might suggest that FPP family members function in nuclear envelope biology. Because the full Arabidopsis genome does not appear to contain genes for lamins, it is of interest to investigate other long coiled-coil proteins, which might functionally replace lamins in the plant kingdom. PMID:11972898
7 CFR 1.77 - Assignment of priorities.

Code of Federal Regulations, 2014 CFR

2014-01-01

... Television Films § 1.77 Assignment of priorities. (a) Authority. (1) The Director of Information or his designee will make assignment of priorities for the U.S. Department of Agriculture for a television film...
7 CFR 1.77 - Assignment of priorities.

Code of Federal Regulations, 2013 CFR

2013-01-01

... Television Films § 1.77 Assignment of priorities. (a) Authority. (1) The Director of Information or his designee will make assignment of priorities for the U.S. Department of Agriculture for a television film...
7 CFR 1.77 - Assignment of priorities.

Code of Federal Regulations, 2011 CFR

2011-01-01

... Television Films § 1.77 Assignment of priorities. (a) Authority. (1) The Director of Information or his designee will make assignment of priorities for the U.S. Department of Agriculture for a television film...
7 CFR 1.77 - Assignment of priorities.

Code of Federal Regulations, 2012 CFR

2012-01-01

... Television Films § 1.77 Assignment of priorities. (a) Authority. (1) The Director of Information or his designee will make assignment of priorities for the U.S. Department of Agriculture for a television film...
Assigning historic responsibility for extreme weather events

NASA Astrophysics Data System (ADS)

Otto, Friederike E. L.; Skeie, Ragnhild B.; Fuglestvedt, Jan S.; Berntsen, Terje; Allen, Myles R.

2017-11-01

Recent scientific advances make it possible to assign extreme events to human-induced climate change and historical emissions. These developments allow losses and damage associated with such events to be assigned country-level responsibility.
7 CFR 1.77 - Assignment of priorities.

Code of Federal Regulations, 2010 CFR

2010-01-01

... Television Films § 1.77 Assignment of priorities. (a) Authority. (1) The Director of Information or his designee will make assignment of priorities for the U.S. Department of Agriculture for a television film...
1H, 15N, 13C resonance assignment of folded and 8 M urea-denatured state of SUMO from Drosophila melanogaster.

PubMed

Kumar, Dinesh; Kumar, Ashutosh; Misra, Jyoti Ranjan; Chugh, Jeetender; Sharma, Shilpy; Hosur, Ramakrishna V

2008-06-01

SUMO, an important post-translational modifier of variety of substrate proteins, regulates different cellular functions. Here, we report the NMR resonance assignment of the folded and 8 M urea-denatured state of SUMO from Drosophila melanogaster (dsmt3).
Quantum probability assignment limited by relativistic causality.

PubMed

Han, Yeong Deok; Choi, Taeseung

2016-03-14

Quantum theory has nonlocal correlations, which bothered Einstein, but found to satisfy relativistic causality. Correlation for a shared quantum state manifests itself, in the standard quantum framework, by joint probability distributions that can be obtained by applying state reduction and probability assignment that is called Born rule. Quantum correlations, which show nonlocality when the shared state has an entanglement, can be changed if we apply different probability assignment rule. As a result, the amount of nonlocality in quantum correlation will be changed. The issue is whether the change of the rule of quantum probability assignment breaks relativistic causality. We have shown that Born rule on quantum measurement is derived by requiring relativistic causality condition. This shows how the relativistic causality limits the upper bound of quantum nonlocality through quantum probability assignment.

The Protein Interactome of Streptococcus pneumoniae and Bacterial Meta-interactomes Improve Function Predictions.

PubMed

Wuchty, S; Rajagopala, S V; Blazie, S M; Parrish, J R; Khuri, S; Finley, R L; Uetz, P

2017-01-01

The functions of roughly a third of all proteins in Streptococcus pneumoniae , a significant human-pathogenic bacterium, are unknown. Using a yeast two-hybrid approach, we have determined more than 2,000 novel protein interactions in this organism. We augmented this network with meta-interactome data that we defined as the pool of all interactions between evolutionarily conserved proteins in other bacteria. We found that such interactions significantly improved our ability to predict a protein's function, allowing us to provide functional predictions for 299 S. pneumoniae proteins with previously unknown functions. IMPORTANCE Identification of protein interactions in bacterial species can help define the individual roles that proteins play in cellular pathways and pathogenesis. Very few protein interactions have been identified for the important human pathogen S. pneumoniae . We used an experimental approach to identify over 2,000 new protein interactions for S. pneumoniae , the most extensive interactome data for this bacterium to date. To predict protein function, we used our interactome data augmented with interactions from other closely related bacteria. The combination of the experimental data and meta-interactome data significantly improved the prediction results, allowing us to assign possible functions to a large number of poorly characterized proteins.
Mathematics Textbooks and the Teaching of Assigned Writing.

ERIC Educational Resources Information Center

Donlan, Dan

This paper initially presents the results of several studies concerning what kind of writing mathematics teachers assign and what kind of writing mathematics textbooks assign. By far, report-research was the most popular type of writing assigned in the surveyed textbooks. The types of reports students were asked to write include biography,…
The pepATTRACT web server for blind, large-scale peptide-protein docking.

PubMed

de Vries, Sjoerd J; Rey, Julien; Schindler, Christina E M; Zacharias, Martin; Tuffery, Pierre

2017-07-03

Peptide-protein interactions are ubiquitous in the cell and form an important part of the interactome. Computational docking methods can complement experimental characterization of these complexes, but current protocols are not applicable on the proteome scale. pepATTRACT is a novel docking protocol that is fully blind, i.e. it does not require any information about the binding site. In various stages of its development, pepATTRACT has participated in CAPRI, making successful predictions for five out of seven protein-peptide targets. Its performance is similar or better than state-of-the-art local docking protocols that do require binding site information. Here we present a novel web server that carries out the rigid-body stage of pepATTRACT. On the peptiDB benchmark, the web server generates a correct model in the top 50 in 34% of the cases. Compared to the full pepATTRACT protocol, this leads to some loss of performance, but the computation time is reduced from ∼18 h to ∼10 min. Combined with the fact that it is fully blind, this makes the web server well-suited for large-scale in silico protein-peptide docking experiments. The rigid-body pepATTRACT server is freely available at http://bioserv.rpbs.univ-paris-diderot.fr/services/pepATTRACT. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
28 CFR 100.13 - Directly assignable costs.

Code of Federal Regulations, 2011 CFR

2011-07-01

..., COMMUNICATIONS ASSISTANCE FOR LAW ENFORCEMENT ACT OF 1994 § 100.13 Directly assignable costs. (a) A cost is directly assignable to the CALEA compliance effort if it is a plant cost incurred specifically to meet the...
7 CFR 37.16 - OMB assigned numbers.

Code of Federal Regulations, 2014 CFR

2014-01-01

... 7 Agriculture 2 2014-01-01 2014-01-01 false OMB assigned numbers. 37.16 Section 37.16 Agriculture... ASSESS ORGANIC CERTIFYING AGENCIES § 37.16 OMB assigned numbers. The information collection and... Control Number 0581-0183. ...
Flexible taxonomic assignment of ambiguous sequencing reads

PubMed Central

2011-01-01

Background To characterize the diversity of bacterial populations in metagenomic studies, sequencing reads need to be accurately assigned to taxonomic units in a given reference taxonomy. Reads that cannot be reliably assigned to a unique leaf in the taxonomy (ambiguous reads) are typically assigned to the lowest common ancestor of the set of species that match it. This introduces a potentially severe error in the estimation of bacteria present in the sample due to false positives, since all species in the subtree rooted at the ancestor are implicitly assigned to the read even though many of them may not match it. Results We present a method that maps each read to a node in the taxonomy that minimizes a penalty score while balancing the relevance of precision and recall in the assignment through a parameter q. This mapping can be obtained in time linear in the number of matching sequences, because LCA queries to the reference taxonomy take constant time. When applied to six different metagenomic datasets, our algorithm produces different taxonomic distributions depending on whether coverage or precision is maximized. Including information on the quality of the reads reduces the number of unassigned reads but increases the number of ambiguous reads, stressing the relevance of our method. Finally, two measures of performance are described and results with a set of artificially generated datasets are discussed. Conclusions The assignment strategy of sequencing reads introduced in this paper is a versatile and a quick method to study bacterial communities. The bacterial composition of the analyzed samples can vary significantly depending on how ambiguous reads are assigned depending on the value of the q parameter. Validation of our results in an artificial dataset confirm that a combination of values of q produces the most accurate results. PMID:21211059
Many nonuniversal archaeal ribosomal proteins are found in conserved gene clusters

PubMed Central

WANG, JIACHEN; DASGUPTA, INDRANI; FOX, GEORGE E.

2009-01-01

The genomic associations of the archaeal ribosomal proteins, (r-proteins), were examined in detail. The archaeal versions of the universal r-protein genes are typically in clusters similar or identical and to those found in bacteria. Of the 35 nonuniversal archaeal r-protein genes examined, the gene encoding L18e was found to be associated with the conserved L13 cluster, whereas the genes for S4e, L32e and L19e were found in the archaeal version of the spc operon. Eleven nonuniversal protein genes were not associated with any common genomic context. Of the remaining 19 protein genes, 17 were convincingly assigned to one of 10 previously unrecognized gene clusters. Examination of the gene content of these clusters revealed multiple associations with genes involved in the initiation of protein synthesis, transcription or other cellular processes. The lack of such associations in the universal clusters suggests that initially the ribosome evolved largely independently of other processes. More recently it likely has evolved in concert with other cellular systems. It was also verified that a second copy of the gene encoding L7ae found in some bacteria is actually a homolog of the gene encoding L30e and should be annotated as such. PMID:19478915
Structure and sequence analyses of Bacteroides proteins BVU_4064 and BF1687 reveal presence of two novel predominantly-beta domains, predicted to be involved in lipid and cell surface interactions

DOE PAGES

Natarajan, Padmaja; Punta, Marco; Kumar, Abhinav; ...

2015-01-16

N-terminal domains of BVU_4064 and BF1687 proteins from Bacteroides vulgatus and Bacteroides fragilis respectively are members of the Pfam family PF12985 (DUF3869). Proteins containing a domain from this family can be found in most Bacteroides species and, in large numbers, in all human gut microbiome samples. Both BVU_4064 and BF1687 proteins have a consensus lipobox motif implying they are anchored to the membrane, but their functions are otherwise unknown. The C-terminal half of BVU_4064 is assigned to protein family PF12986 (DUF3870); the equivalent part of BF1687 was unclassified.
The Protein Interactome of Streptococcus pneumoniae and Bacterial Meta-interactomes Improve Function Predictions

PubMed Central

Rajagopala, S. V.; Blazie, S. M.; Parrish, J. R.; Khuri, S.; Finley, R. L.

2017-01-01

ABSTRACT The functions of roughly a third of all proteins in Streptococcus pneumoniae, a significant human-pathogenic bacterium, are unknown. Using a yeast two-hybrid approach, we have determined more than 2,000 novel protein interactions in this organism. We augmented this network with meta-interactome data that we defined as the pool of all interactions between evolutionarily conserved proteins in other bacteria. We found that such interactions significantly improved our ability to predict a protein’s function, allowing us to provide functional predictions for 299 S. pneumoniae proteins with previously unknown functions. IMPORTANCE Identification of protein interactions in bacterial species can help define the individual roles that proteins play in cellular pathways and pathogenesis. Very few protein interactions have been identified for the important human pathogen S. pneumoniae. We used an experimental approach to identify over 2,000 new protein interactions for S. pneumoniae, the most extensive interactome data for this bacterium to date. To predict protein function, we used our interactome data augmented with interactions from other closely related bacteria. The combination of the experimental data and meta-interactome data significantly improved the prediction results, allowing us to assign possible functions to a large number of poorly characterized proteins. PMID:28744484
The BG21 isoform of Golli myelin basic protein is intrinsically disordered with a highly flexible amino-terminal domain.

PubMed

Ahmed, Mumdooh A M; Bamm, Vladimir V; Harauz, George; Ladizhansky, Vladimir

2007-08-28

The genes of the oligodendrocyte lineage (Golli) encode a family of developmentally regulated isoforms of myelin basic protein. The "classic" MBP isoforms arise from transcription start site 3, whereas Golli-specific isoforms arise from transcription start site 1, and comprise both Golli-specific and classic MBP sequences. The Golli isoform BG21 has been suggested to play roles in myelination and T cell activation pathways. It is an intrinsically disordered protein, thereby presenting a large effective surface area for interaction with other proteins such as Golli-interacting protein. We have used multidimensional heteronuclear NMR spectroscopy to achieve sequence-specific resonance assignments of the recombinant murine BG21 in physiologically relevant buffer, to analyze its secondary structure using chemical shift indexing (CSI), and to investigate its backbone dynamics using 15N spin relaxation measurements. We have assigned 184 out of 199 residues unambiguously. The CSI analysis revealed little ordered secondary structure under these conditions, with only some small fragments having a slight tendency toward alpha-helicity, which may represent putative recognition motifs. The 15N relaxation and NOE measurements confirmed the general behavior of the protein as an extended polypeptide chain, with the N-terminal Golli-specific portion (residues S5-T69) being exceptionally flexible, even in comparison to other intrinsically disordered proteins that have been studied this way. The high degree of flexibility of this N-terminal region may be to provide additional plasticity, or conformational adaptability, in protein-protein interactions. Another highly mobile segment, A126-S127-G128-G129, may function as a hinge.
DeepText2GO: Improving large-scale protein function prediction with deep semantic text representation.

PubMed

You, Ronghui; Huang, Xiaodi; Zhu, Shanfeng

2018-06-06

As of April 2018, UniProtKB has collected more than 115 million protein sequences. Less than 0.15% of these proteins, however, have been associated with experimental GO annotations. As such, the use of automatic protein function prediction (AFP) to reduce this huge gap becomes increasingly important. The previous studies conclude that sequence homology based methods are highly effective in AFP. In addition, mining motif, domain, and functional information from protein sequences has been found very helpful for AFP. Other than sequences, alternative information sources such as text, however, may be useful for AFP as well. Instead of using BOW (bag of words) representation in traditional text-based AFP, we propose a new method called DeepText2GO that relies on deep semantic text representation, together with different kinds of available protein information such as sequence homology, families, domains, and motifs, to improve large-scale AFP. Furthermore, DeepText2GO integrates text-based methods with sequence-based ones by means of a consensus approach. Extensive experiments on the benchmark dataset extracted from UniProt/SwissProt have demonstrated that DeepText2GO significantly outperformed both text-based and sequence-based methods, validating its superiority. Copyright © 2018 Elsevier Inc. All rights reserved.
PROBABILISTIC CROSS-IDENTIFICATION IN CROWDED FIELDS AS AN ASSIGNMENT PROBLEM

DOE Office of Scientific and Technical Information (OSTI.GOV)

Budavári, Tamás; Basu, Amitabh, E-mail: budavari@jhu.edu, E-mail: basu.amitabh@jhu.edu

2016-10-01

One of the outstanding challenges of cross-identification is multiplicity: detections in crowded regions of the sky are often linked to more than one candidate associations of similar likelihoods. We map the resulting maximum likelihood partitioning to the fundamental assignment problem of discrete mathematics and efficiently solve the two-way catalog-level matching in the realm of combinatorial optimization using the so-called Hungarian algorithm. We introduce the method, demonstrate its performance in a mock universe where the true associations are known, and discuss the applicability of the new procedure to large surveys.
Probabilistic Cross-identification in Crowded Fields as an Assignment Problem

NASA Astrophysics Data System (ADS)

Budavári, Tamás; Basu, Amitabh

2016-10-01

One of the outstanding challenges of cross-identification is multiplicity: detections in crowded regions of the sky are often linked to more than one candidate associations of similar likelihoods. We map the resulting maximum likelihood partitioning to the fundamental assignment problem of discrete mathematics and efficiently solve the two-way catalog-level matching in the realm of combinatorial optimization using the so-called Hungarian algorithm. We introduce the method, demonstrate its performance in a mock universe where the true associations are known, and discuss the applicability of the new procedure to large surveys.
Assignment of chromosomal locus and evidence for alternatively spliced mRNAs of a human sperm membrane protein (hSMP-1).

PubMed

Wang, H; Miao, S; Chen, D; Wang, L; Koide, S S

1999-10-06

The gene (HSD-1) coding a human sperm membrane protein (hSMP-1) was isolated from a human testis cDNA expression library using antibodies found in the serum of an infertile woman. HSD-1 was localized to a single locus on chromosome 9 and assigned to band 9p12-p13 by fluorescent in situ hybridization (FISH) mapping and DAPI (4,6-diamidino-2-phenylindole) banding, using rat/human somatic cell hybrids and metaphase chromosomes of human lymphocytes. In rescreening a testis lambdagt10 cDNA expression library, the full-length cDNA (HSD-1) and several truncated cDNAs with heterologous regions were isolated from positive clones. The heterology consisted of deletion, insertion and alteration of the 5'-end. These heterologous truncated fragments may be produced by alternative splicing of mRNAs. Two recombinant prokaryotic expression vectors were constructed with one of the heterologous fragment (clone #26) with and without the alternative 5'-end. Escherichia coli transfected with the construct containing the alternative 5'-end failed to produce the recombinant product, whereas those transfected with the vector lacking the 5'-end produced hSMP-1. DNASIS analysis of the structure of #26 mRNA suggests that the 5'-end has a stable secondary configuration that may maintain the mRNA in an inactivated state, whereby hindering its translation and preventing the expression of the gene.
Structural basis for translational surveillance by the large ribosomal subunit-associated protein quality control complex

PubMed Central

Lyumkis, Dmitry; Oliveira dos Passos, Dario; Tahara, Erich B.; Webb, Kristofor; Bennett, Eric J.; Vinterbo, Staal; Potter, Clinton S.; Carragher, Bridget; Joazeiro, Claudio A. P.

2014-01-01

All organisms have evolved mechanisms to manage the stalling of ribosomes upon translation of aberrant mRNA. In eukaryotes, the large ribosomal subunit-associated quality control complex (RQC), composed of the listerin/Ltn1 E3 ubiquitin ligase and cofactors, mediates the ubiquitylation and extraction of ribosome-stalled nascent polypeptide chains for proteasomal degradation. How RQC recognizes stalled ribosomes and performs its functions has not been understood. Using single-particle cryoelectron microscopy, we have determined the structure of the RQC complex bound to stalled 60S ribosomal subunits. The structure establishes how Ltn1 associates with the large ribosomal subunit and properly positions its E3-catalytic RING domain to mediate nascent chain ubiquitylation. The structure also reveals that a distinguishing feature of stalled 60S particles is an exposed, nascent chain-conjugated tRNA, and that the Tae2 subunit of RQC, which facilitates Ltn1 binding, is responsible for selective recognition of stalled 60S subunits. RQC components are engaged in interactions across a large span of the 60S subunit surface, connecting the tRNA in the peptidyl transferase center to the distally located nascent chain tunnel exit. This work provides insights into a mechanism linking translation and protein degradation that targets defective proteins immediately after synthesis, while ignoring nascent chains in normally translating ribosomes. PMID:25349383
A Non-parametric Cutout Index for Robust Evaluation of Identified Proteins*

PubMed Central

Serang, Oliver; Paulo, Joao; Steen, Hanno; Steen, Judith A.

2013-01-01

This paper proposes a novel, automated method for evaluating sets of proteins identified using mass spectrometry. The remaining peptide-spectrum match score distributions of protein sets are compared to an empirical absent peptide-spectrum match score distribution, and a Bayesian non-parametric method reminiscent of the Dirichlet process is presented to accurately perform this comparison. Thus, for a given protein set, the process computes the likelihood that the proteins identified are correctly identified. First, the method is used to evaluate protein sets chosen using different protein-level false discovery rate (FDR) thresholds, assigning each protein set a likelihood. The protein set assigned the highest likelihood is used to choose a non-arbitrary protein-level FDR threshold. Because the method can be used to evaluate any protein identification strategy (and is not limited to mere comparisons of different FDR thresholds), we subsequently use the method to compare and evaluate multiple simple methods for merging peptide evidence over replicate experiments. The general statistical approach can be applied to other types of data (e.g. RNA sequencing) and generalizes to multivariate problems. PMID:23292186
Helix-packing motifs in membrane proteins.

PubMed

Walters, R F S; DeGrado, W F

2006-09-12

The fold of a helical membrane protein is largely determined by interactions between membrane-imbedded helices. To elucidate recurring helix-helix interaction motifs, we dissected the crystallographic structures of membrane proteins into a library of interacting helical pairs. The pairs were clustered according to their three-dimensional similarity (rmsd assigned to clusters consisting of at least five members. Surprisingly, three quarters of the helical pairs belong to one of five tightly clustered motifs whose structural features can be understood in terms of simple principles of helix-helix packing. Thus, the universe of common transmembrane helix-pairing motifs is relatively simple. The largest cluster, which comprises 29% of the library members, consists of an antiparallel motif with left-handed packing angles, and it is frequently stabilized by packing of small side chains occurring every seven residues in the sequence. Right-handed parallel and antiparallel structures show a similar tendency to segregate small residues to the helix-helix interface but spaced at four-residue intervals. Position-specific sequence propensities were derived for the most populated motifs. These structural and sequential motifs should be quite useful for the design and structural prediction of membrane proteins.
Visualisation and graph-theoretic analysis of a large-scale protein structural interactome

PubMed Central

Bolser, Dan; Dafas, Panos; Harrington, Richard; Park, Jong; Schroeder, Michael

2003-01-01

Background Large-scale protein interaction maps provide a new, global perspective with which to analyse protein function. PSIMAP, the Protein Structural Interactome Map, is a database of all the structurally observed interactions between superfamilies of protein domains with known three-dimensional structure in the PDB. PSIMAP incorporates both functional and evolutionary information into a single network. Results We present a global analysis of PSIMAP using several distinct network measures relating to centrality, interactivity, fault-tolerance, and taxonomic diversity. We found the following results: Centrality: we show that the center and barycenter of PSIMAP do not coincide, and that the superfamilies forming the barycenter relate to very general functions, while those constituting the center relate to enzymatic activity. Interactivity: we identify the P-loop and immunoglobulin superfamilies as the most highly interactive. We successfully use connectivity and cluster index, which characterise the connectivity of a superfamily's neighbourhood, to discover superfamilies of complex I and II. This is particularly significant as the structure of complex I is not yet solved. Taxonomic diversity: we found that highly interactive superfamilies are in general taxonomically very diverse and are thus amongst the oldest. Fault-tolerance: we found that the network is very robust as for the majority of superfamilies removal from the network will not break up the network. Conclusions Overall, we can single out the P-loop containing nucleotide triphosphate hydrolases superfamily as it is the most highly connected and has the highest taxonomic diversity. In addition, this superfamily has the highest interaction rank, is the barycenter of the network (it has the shortest average path to every other superfamily in the network), and is an articulation vertex, whose removal will disconnect the network. More generally, we conclude that the graph-theoretic and taxonomic analysis of
48 CFR 642.602 - Assignment and location.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 48 Federal Acquisition Regulations System 4 2010-10-01 2010-10-01 false Assignment and location. 642.602 Section 642.602 Federal Acquisition Regulations System DEPARTMENT OF STATE CONTRACT MANAGEMENT CONTRACT ADMINISTRATION AND AUDIT SERVICES Corporate Administration Contracting Officer 642.602 Assignment...
Enhanced conformational sampling technique provides an energy landscape view of large-scale protein conformational transitions.

PubMed

Shao, Qiang

2016-10-26

Large-scale conformational changes in proteins are important for their functions. Tracking the conformational change in real time at the level of a single protein molecule, however, remains a great challenge. In this article, we present a novel in silico approach with the combination of normal mode analysis and integrated-tempering-sampling molecular simulation (NMA-ITS) to give quantitative data for exploring the conformational transition pathway in multi-dimensional energy landscapes starting only from the knowledge of the two endpoint structures of the protein. The open-to-closed transitions of three proteins, including nCaM, AdK, and HIV-1 PR, were investigated using NMA-ITS simulations. The three proteins have varied structural flexibilities and domain communications in their respective conformational changes. The transition state structure in the conformational change of nCaM and the associated free-energy barrier are in agreement with those measured in a standard explicit-solvent REMD simulation. The experimentally measured transition intermediate structures of the intrinsically flexible AdK are captured by the conformational transition pathway measured here. The dominant transition pathways between the closed and fully open states of HIV-1 PR are very similar to those observed in recent REMD simulations. Finally, the evaluated relaxation times of the conformational transitions of three proteins are roughly at the same level as reported experimental data. Therefore, the NMA-ITS method is applicable for a variety of cases, providing both qualitative and quantitative insights into the conformational changes associated with the real functions of proteins.

Backbone resonance assignment of an insect arylalkylamine N-acetyltransferase from Bombyx mori reveals conformational heterogeneity.

PubMed

Aboalroub, Adam A; Zhang, Ziming; Keramisanou, Dimitra; Gelis, Ioannis

2017-04-01

Arylalkylamine N-acetyltransferases (AANATs) catalyze the transfer of an acetyl group from the acetyl-group donor, acetyl-CoA, to an arylalkylamine acceptor. Although a single AANAT has been identified in mammals, insects utilize multiple AANATs in a diverse array of biological processes. AANATs belong to the GCN5-related acetyltransferase (GNAT) superfamily of enzymes, which despite their overall very low sequence homology, are characterized by a well conserved catalytic core domain. The structural properties of many GNATs have been extensively studied by X-ray crystallography that revealed common features during the catalytic cycle. Here we report the 1 H, 13 C and 15 N backbone NMR resonance assignment of the 24 kDa AANAT3 from Bombyx mori (bmAANAT3) as a first step towards understanding the role of protein dynamics in the catalytic properties of AANATs. Our preliminary solution NMR studies reveal that bmAANAT3 is well-folded in solution. The P-loop, which is responsible for cofactor binding, is flexible in the free-state, while a large region of the enzyme interconverts between two distinct conformations in the slow exchange regime.
47 CFR 74.402 - Frequency assignment.

Code of Federal Regulations, 2011 CFR

2011-10-01

... two of the following 6.25 kHz segments may be stacked to form a channel which may be assigned for use... paragraph (e)(9) of this section. (b) Up to four of the following 7.5 kHz VHF segments and up to eight of the following 6.25 kHz UHF segments may be stacked to form a channel which may be assigned for use by...
47 CFR 74.402 - Frequency assignment.

Code of Federal Regulations, 2014 CFR

2014-10-01

... two of the following 6.25 kHz segments may be stacked to form a channel which may be assigned for use... paragraph (e)(9) of this section. (b) Up to four of the following 7.5 kHz VHF segments and up to eight of the following 6.25 kHz UHF segments may be stacked to form a channel which may be assigned for use by...
47 CFR 74.402 - Frequency assignment.

Code of Federal Regulations, 2012 CFR

2012-10-01

... two of the following 6.25 kHz segments may be stacked to form a channel which may be assigned for use... paragraph (e)(9) of this section. (b) Up to four of the following 7.5 kHz VHF segments and up to eight of the following 6.25 kHz UHF segments may be stacked to form a channel which may be assigned for use by...
47 CFR 74.402 - Frequency assignment.

Code of Federal Regulations, 2013 CFR

2013-10-01

... two of the following 6.25 kHz segments may be stacked to form a channel which may be assigned for use... paragraph (e)(9) of this section. (b) Up to four of the following 7.5 kHz VHF segments and up to eight of the following 6.25 kHz UHF segments may be stacked to form a channel which may be assigned for use by...
47 CFR 215.2 - Assignment of responsibilities.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 47 Telecommunication 5 2013-10-01 2013-10-01 false Assignment of responsibilities. 215.2 Section 215.2 Telecommunication OFFICE OF SCIENCE AND TECHNOLOGY POLICY AND NATIONAL SECURITY COUNCIL FEDERAL GOVERNMENT FOCAL POINT FOR ELECTROMAGNETIC PULSE (EMP) INFORMATION § 215.2 Assignment of responsibilities...
47 CFR 215.2 - Assignment of responsibilities.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 47 Telecommunication 5 2012-10-01 2012-10-01 false Assignment of responsibilities. 215.2 Section 215.2 Telecommunication OFFICE OF SCIENCE AND TECHNOLOGY POLICY AND NATIONAL SECURITY COUNCIL FEDERAL GOVERNMENT FOCAL POINT FOR ELECTROMAGNETIC PULSE (EMP) INFORMATION § 215.2 Assignment of responsibilities...
47 CFR 215.2 - Assignment of responsibilities.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 47 Telecommunication 5 2010-10-01 2010-10-01 false Assignment of responsibilities. 215.2 Section 215.2 Telecommunication OFFICE OF SCIENCE AND TECHNOLOGY POLICY AND NATIONAL SECURITY COUNCIL FEDERAL GOVERNMENT FOCAL POINT FOR ELECTROMAGNETIC PULSE (EMP) INFORMATION § 215.2 Assignment of responsibilities...
47 CFR 215.2 - Assignment of responsibilities.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 47 Telecommunication 5 2014-10-01 2014-10-01 false Assignment of responsibilities. 215.2 Section 215.2 Telecommunication OFFICE OF SCIENCE AND TECHNOLOGY POLICY AND NATIONAL SECURITY COUNCIL FEDERAL GOVERNMENT FOCAL POINT FOR ELECTROMAGNETIC PULSE (EMP) INFORMATION § 215.2 Assignment of responsibilities...
47 CFR 215.2 - Assignment of responsibilities.

Code of Federal Regulations, 2011 CFR

2011-10-01

... 47 Telecommunication 5 2011-10-01 2011-10-01 false Assignment of responsibilities. 215.2 Section 215.2 Telecommunication OFFICE OF SCIENCE AND TECHNOLOGY POLICY AND NATIONAL SECURITY COUNCIL FEDERAL GOVERNMENT FOCAL POINT FOR ELECTROMAGNETIC PULSE (EMP) INFORMATION § 215.2 Assignment of responsibilities...
Stress Assignment in Reading Italian Polysyllabic Pseudowords

ERIC Educational Resources Information Center

Sulpizio, Simone; Arduino, Lisa S.; Paizi, Despina; Burani, Cristina

2013-01-01

In 4 naming experiments we investigated how Italian readers assign stress to pseudowords. We assessed whether participants assign stress following distributional information such as stress neighborhood (the proportion and number of existent words sharing orthographic ending and stress pattern) and whether such distributional information affects…
Assigning breed origin to alleles in crossbred animals.

PubMed

Vandenplas, Jérémie; Calus, Mario P L; Sevillano, Claudia A; Windig, Jack J; Bastiaansen, John W M

2016-08-22

For some species, animal production systems are based on the use of crossbreeding to take advantage of the increased performance of crossbred compared to purebred animals. Effects of single nucleotide polymorphisms (SNPs) may differ between purebred and crossbred animals for several reasons: (1) differences in linkage disequilibrium between SNP alleles and a quantitative trait locus; (2) differences in genetic backgrounds (e.g., dominance and epistatic interactions); and (3) differences in environmental conditions, which result in genotype-by-environment interactions. Thus, SNP effects may be breed-specific, which has led to the development of genomic evaluations for crossbred performance that take such effects into account. However, to estimate breed-specific effects, it is necessary to know breed origin of alleles in crossbred animals. Therefore, our aim was to develop an approach for assigning breed origin to alleles of crossbred animals (termed BOA) without information on pedigree and to study its accuracy by considering various factors, including distance between breeds. The BOA approach consists of: (1) phasing genotypes of purebred and crossbred animals; (2) assigning breed origin to phased haplotypes; and (3) assigning breed origin to alleles of crossbred animals based on a library of assigned haplotypes, the breed composition of crossbred animals, and their SNP genotypes. The accuracy of allele assignments was determined for simulated datasets that include crosses between closely-related, distantly-related and unrelated breeds. Across these scenarios, the percentage of alleles of a crossbred animal that were correctly assigned to their breed origin was greater than 90 %, and increased with increasing distance between breeds, while the percentage of incorrectly assigned alleles was always less than 2 %. For the remaining alleles, i.e. 0 to 10 % of all alleles of a crossbred animal, breed origin could not be assigned. The BOA approach accurately assigns
Unifying Temporal and Structural Credit Assignment Problems

NASA Technical Reports Server (NTRS)

Agogino, Adrian K.; Tumer, Kagan

2004-01-01

Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the structural credit assignment problem of determining the contributions of a particular agent to a common task. Instead, time-extended single-agent systems have the temporal credit assignment problem of determining the contribution of a particular action to the quality of the full sequence of actions. Traditionally these two problems are considered different and are handled in separate ways. In this article we show how these two forms of the credit assignment problem are equivalent. In this unified frame-work, a single-agent Markov decision process can be broken down into a single-time-step multi-agent process. Furthermore we show that Monte-Carlo estimation or Q-learning (depending on whether the values of resulting actions in the episode are known at the time of learning) are equivalent to different agent utility functions in a multi-agent system. This equivalence shows how an often neglected issue in multi-agent systems is equivalent to a well-known deficiency in multi-time-step learning and lays the basis for solving time-extended multi-agent problems, where both credit assignment problems are present.
PONDEROSA, an automated 3D-NOESY peak picking program, enables automated protein structure determination.

PubMed

Lee, Woonghee; Kim, Jin Hae; Westler, William M; Markley, John L

2011-06-15

PONDEROSA (Peak-picking Of Noe Data Enabled by Restriction of Shift Assignments) accepts input information consisting of a protein sequence, backbone and sidechain NMR resonance assignments, and 3D-NOESY ((13)C-edited and/or (15)N-edited) spectra, and returns assignments of NOESY crosspeaks, distance and angle constraints, and a reliable NMR structure represented by a family of conformers. PONDEROSA incorporates and integrates external software packages (TALOS+, STRIDE and CYANA) to carry out different steps in the structure determination. PONDEROSA implements internal functions that identify and validate NOESY peak assignments and assess the quality of the calculated three-dimensional structure of the protein. The robustness of the analysis results from PONDEROSA's hierarchical processing steps that involve iterative interaction among the internal and external modules. PONDEROSA supports a variety of input formats: SPARKY assignment table (.shifts) and spectrum file formats (.ucsf), XEASY proton file format (.prot), and NMR-STAR format (.star). To demonstrate the utility of PONDEROSA, we used the package to determine 3D structures of two proteins: human ubiquitin and Escherichia coli iron-sulfur scaffold protein variant IscU(D39A). The automatically generated structural constraints and ensembles of conformers were as good as or better than those determined previously by much less automated means. The program, in the form of binary code along with tutorials and reference manuals, is available at http://ponderosa.nmrfam.wisc.edu/.
Integer Linear Programming for Constrained Multi-Aspect Committee Review Assignment

PubMed Central

Karimzadehgan, Maryam; Zhai, ChengXiang

2011-01-01

Automatic review assignment can significantly improve the productivity of many people such as conference organizers, journal editors and grant administrators. A general setup of the review assignment problem involves assigning a set of reviewers on a committee to a set of documents to be reviewed under the constraint of review quota so that the reviewers assigned to a document can collectively cover multiple topic aspects of the document. No previous work has addressed such a setup of committee review assignments while also considering matching multiple aspects of topics and expertise. In this paper, we tackle the problem of committee review assignment with multi-aspect expertise matching by casting it as an integer linear programming problem. The proposed algorithm can naturally accommodate any probabilistic or deterministic method for modeling multiple aspects to automate committee review assignments. Evaluation using a multi-aspect review assignment test set constructed using ACM SIGIR publications shows that the proposed algorithm is effective and efficient for committee review assignments based on multi-aspect expertise matching. PMID:22711970
7 CFR 762.160 - Assignment of guarantee.

Code of Federal Regulations, 2014 CFR

2014-01-01

... 7 Agriculture 7 2014-01-01 2014-01-01 false Assignment of guarantee. 762.160 Section 762.160 Agriculture Regulations of the Department of Agriculture (Continued) FARM SERVICE AGENCY, DEPARTMENT OF AGRICULTURE SPECIAL PROGRAMS GUARANTEED FARM LOANS § 762.160 Assignment of guarantee. (a) The following...
28 CFR 524.72 - CIM assignment categories.

Code of Federal Regulations, 2010 CFR

2010-07-01

... groups (e.g., prison gangs), which have a history of disrupting operations and security in either state... assignment categories. CIM cases are classified according to the following assignments: (a) Witness Security... safety of these individuals. There are two types of Witness Security cases: Department of Justice...
36 CFR 292.22 - Land category assignments.

Code of Federal Regulations, 2013 CFR

2013-07-01

... the following four land categories: (i) Farm/forest/grazing land. (ii) Mining land. (iii) Residential.... Lands assigned to the Commercial, Residential, or Mining category may be reclassified as farm/forest... 36 Parks, Forests, and Public Property 2 2013-07-01 2013-07-01 false Land category assignments...
36 CFR 292.22 - Land category assignments.

Code of Federal Regulations, 2011 CFR

2011-07-01

... the following four land categories: (i) Farm/forest/grazing land. (ii) Mining land. (iii) Residential.... Lands assigned to the Commercial, Residential, or Mining category may be reclassified as farm/forest... 36 Parks, Forests, and Public Property 2 2011-07-01 2011-07-01 false Land category assignments...
36 CFR 292.22 - Land category assignments.

Code of Federal Regulations, 2012 CFR

2012-07-01

... the following four land categories: (i) Farm/forest/grazing land. (ii) Mining land. (iii) Residential.... Lands assigned to the Commercial, Residential, or Mining category may be reclassified as farm/forest... 36 Parks, Forests, and Public Property 2 2012-07-01 2012-07-01 false Land category assignments...

36 CFR 292.22 - Land category assignments.

Code of Federal Regulations, 2014 CFR

2014-07-01

... the following four land categories: (i) Farm/forest/grazing land. (ii) Mining land. (iii) Residential.... Lands assigned to the Commercial, Residential, or Mining category may be reclassified as farm/forest... 36 Parks, Forests, and Public Property 2 2014-07-01 2014-07-01 false Land category assignments...
Maximizing Completion and Comprehension of Reading Assignments

ERIC Educational Resources Information Center

Owen, Leanne R.

2017-01-01

The author presents self-report data from students in three upper-level undergraduate courses to illustrate the comparative effectiveness of different out-of-class assessment approaches in promoting completion and comprehension of reading assignments. Students reported agreeing or strongly agreeing that all three assignments motivated them to…
istar: a web platform for large-scale protein-ligand docking.

PubMed

Li, Hongjian; Leung, Kwong-Sak; Ballester, Pedro J; Wong, Man-Hon

2014-01-01

Protein-ligand docking is a key computational method in the design of starting points for the drug discovery process. We are motivated by the desire to automate large-scale docking using our popular docking engine idock and thus have developed a publicly-accessible web platform called istar. Without tedious software installation, users can submit jobs using our website. Our istar website supports 1) filtering ligands by desired molecular properties and previewing the number of ligands to dock, 2) monitoring job progress in real time, and 3) visualizing ligand conformations and outputting free energy and ligand efficiency predicted by idock, binding affinity predicted by RF-Score, putative hydrogen bonds, and supplier information for easy purchase, three useful features commonly lacked on other online docking platforms like DOCK Blaster or iScreen. We have collected 17,224,424 ligands from the All Clean subset of the ZINC database, and revamped our docking engine idock to version 2.0, further improving docking speed and accuracy, and integrating RF-Score as an alternative rescoring function. To compare idock 2.0 with the state-of-the-art AutoDock Vina 1.1.2, we have carried out a rescoring benchmark and a redocking benchmark on the 2,897 and 343 protein-ligand complexes of PDBbind v2012 refined set and CSAR NRC HiQ Set 24Sept2010 respectively, and an execution time benchmark on 12 diverse proteins and 3,000 ligands of different molecular weight. Results show that, under various scenarios, idock achieves comparable success rates while outperforming AutoDock Vina in terms of docking speed by at least 8.69 times and at most 37.51 times. When evaluated on the PDBbind v2012 core set, our istar platform combining with RF-Score manages to reproduce Pearson's correlation coefficient and Spearman's correlation coefficient of as high as 0.855 and 0.859 respectively between the experimental binding affinity and the predicted binding affinity of the docked conformation. istar
Maternal obesity induces sustained inflammation in both fetal and offspring large intestine of sheep.

PubMed

Yan, Xu; Huang, Yan; Wang, Hui; Du, Min; Hess, Bret W; Ford, Stephen P; Nathanielsz, Peter W; Zhu, Mei-Jun

2011-07-01

Both maternal obesity and inflammatory bowel diseases (IBDs) are increasing. It was hypothesized that maternal obesity induces an inflammatory response in the fetal large intestine, predisposing offspring to IBDs. Nonpregnant ewes were assigned to a control (Con, 100% of National Research Council [NRC] recommendations) or obesogenic (OB, 150% of NRC) diet from 60 days before conception. The large intestine was sampled from fetuses at 135 days (term 150 days) after conception and from offspring lambs at 22.5 ± 0.5 months of age. Maternal obesity enhanced mRNA expression tumor necrosis factor (TNF)α, interleukin (IL)1α, IL1β, IL6, IL8, and monocyte/macrophage chemotactic protein-1 (MCP1), as well as macrophage markers, CD11b, CD14, and CD68 in fetal gut. mRNA expression of Toll-like receptor (TLR) 2 and TLR4 was increased in OB versus Con fetuses; correspondingly, inflammatory NF-κB and JNK signaling pathways were also upregulated. Both mRNA expression and protein content of transforming growth factor (TGF) β was increased. The IL-17A mRNA expression and protein content was higher in OB compared to Con samples, which was associated with fibrosis in the large intestine of OB fetuses. Similar inflammatory responses and enhanced fibrosis were detected in OB compared to Con offspring. Maternal obesity induced inflammation and enhanced expression of proinflammatory cytokines in fetal and offspring large intestine, which correlated with increased TGFβ and IL17 expression. These data show that maternal obesity may predispose offspring gut to IBDs. Copyright © 2010 Crohn's & Colitis Foundation of America, Inc.
One of My Favorite Assignments: Automated Teller Machine Simulation.

ERIC Educational Resources Information Center

Oberman, Paul S.

2001-01-01

Describes an assignment for an introductory computer science class that requires the student to write a software program that simulates an automated teller machine. Highlights include an algorithm for the assignment; sample file contents; language features used; assignment variations; and discussion points. (LRW)
7 CFR 1900.5 - Assignment of cases.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 7 Agriculture 12 2010-01-01 2010-01-01 false Assignment of cases. 1900.5 Section 1900.5 Agriculture Regulations of the Department of Agriculture (Continued) RURAL HOUSING SERVICE, RURAL BUSINESS... REGULATIONS GENERAL Delegations of Authority § 1900.5 Assignment of cases. The State Director may, in writing...
47 CFR 74.786 - Digital channel assignments.

Code of Federal Regulations, 2010 CFR

2010-10-01

... Translator, and TV Booster Stations § 74.786 Digital channel assignments. (a) An applicant for a new low power television or television translator digital station or for changes in the facilities of an... VHF digital low power television or television translator station. Channels 5 and 6 assigned in Alaska...
Immunogenicity assessment during the development of protein therapeutics.

PubMed

Rosenberg, Amy S; Sauna, Zuben E

2018-05-01

Here we provide a critical review of the state of the art with respect to non-clinical assessments of immunogenicity for therapeutic proteins. The number of studies on immunogenicity published annually has more than doubled in the last 5 years. The science and technology, which have reached a critical mass, provide multiple of non-clinical approaches (computational, in vitro, ex vivo and animal models) to first predict and then to modify or eliminate T-cell or B-cell epitopes via de-immunization strategies. We discuss how these may be used in the context of drug development in assigning the immunogenicity risk of new and marketed therapeutic proteins. Protein therapeutics represents a large share of the pharma market and provide medical interventions for some of the most complex and intractable diseases. Immunogenicity (the development of antibodies to therapeutic proteins) is an important concern for both the safety and efficacy of protein therapeutics as immune responses may neutralize the activity of life-saving and highly effective protein therapeutics and induce hypersensitivity responses including anaphylaxis. The non-clinical computational tools and experimental technologies that offer a comprehensive and increasingly accurate estimation of immunogenic potential are surveyed here. This critical review also discusses technologies which are promising but are not as yet ready for routine use. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Detecting protein-protein interactions in the intact cell of Bacillus subtilis (ATCC 6633).

PubMed

Winters, Michael S; Day, R A

2003-07-01

The salt bridge, paired group-specific reagent cyanogen (ethanedinitrile; C(2)N(2)) converts naturally occurring pairs of functional groups into covalently linked products. Cyanogen readily permeates cell walls and membranes. When the paired groups are shared between associated proteins, isolation of the covalently linked proteins allows their identity to be assigned. Examination of organisms of known genome sequence permits identification of the linked proteins by mass spectrometric techniques applied to peptides derived from them. The cyanogen-linked proteins were isolated by polyacrylamide gel electrophoresis. Digestion of the isolated proteins with proteases of known specificity afforded sets of peptides that could be analyzed by mass spectrometry. These data were compared with those derived theoretically from the Swiss Protein Database by computer-based comparisons (Protein Prospector; http://prospector.ucsf.edu). Identification of associated proteins in the ribosome of Bacillus subtilis strain ATCC 6633 showed that there is an association homology with the association patterns of the ribosomal proteins of Haloarcula marismortui and Thermus thermophilus. In addition, other proteins involved in protein biosynthesis were shown to be associated with ribosomal proteins.
Detecting Protein-Protein Interactions in the Intact Cell of Bacillus subtilis (ATCC 6633)

PubMed Central

Winters, Michael S.; Day, R. A.

2003-01-01

The salt bridge, paired group-specific reagent cyanogen (ethanedinitrile; C2N2) converts naturally occurring pairs of functional groups into covalently linked products. Cyanogen readily permeates cell walls and membranes. When the paired groups are shared between associated proteins, isolation of the covalently linked proteins allows their identity to be assigned. Examination of organisms of known genome sequence permits identification of the linked proteins by mass spectrometric techniques applied to peptides derived from them. The cyanogen-linked proteins were isolated by polyacrylamide gel electrophoresis. Digestion of the isolated proteins with proteases of known specificity afforded sets of peptides that could be analyzed by mass spectrometry. These data were compared with those derived theoretically from the Swiss Protein Database by computer-based comparisons (Protein Prospector; http://prospector.ucsf.edu). Identification of associated proteins in the ribosome of Bacillus subtilis strain ATCC 6633 showed that there is an association homology with the association patterns of the ribosomal proteins of Haloarcula marismortui and Thermus thermophilus. In addition, other proteins involved in protein biosynthesis were shown to be associated with ribosomal proteins. PMID:12837803
MYC protein expression and genetic alterations have prognostic impact in patients with diffuse large B-cell lymphoma treated with immunochemotherapy

PubMed Central

Valera, Alexandra; López-Guillermo, Armando; Cardesa-Salzmann, Teresa; Climent, Fina; González-Barca, Eva; Mercadal, Santiago; Espinosa, Íñigo; Novelli, Silvana; Briones, Javier; Mate, José L.; Salamero, Olga; Sancho, Juan M.; Arenillas, Leonor; Serrano, Sergi; Erill, Nadina; Martínez, Daniel; Castillo, Paola; Rovira, Jordina; Martínez, Antonio; Campo, Elias; Colomo, Luis

2013-01-01

MYC alterations influence the survival of patients with diffuse large B-cell lymphoma. Most studies have focused on MYC translocations but there is little information regarding the impact of numerical alterations and protein expression. We analyzed the genetic alterations and protein expression of MYC, BCL2, BCL6, and MALT1 in 219 cases of diffuse large B-cell lymphoma. MYC rearrangement occurred as the sole abnormality (MYC single-hit) in 3% of cases, MYC and concurrent BCL2 and/or BCL6 rearrangements (MYC double/triple-hit) in 4%, MYC amplifications in 2% and MYC gains in 19%. MYC single-hit, MYC double/triple-hit and MYC amplifications, but not MYC gains or other gene rearrangements, were associated with unfavorable progression-free survival and overall survival. MYC protein expression, evaluated using computerized image analysis, captured the unfavorable prognosis of MYC translocations/amplifications and identified an additional subset of patients without gene alterations but with similar poor prognosis. Patients with tumors expressing both MYC/BCL2 had the worst prognosis, whereas those with double-negative tumors had the best outcome. High MYC expression was associated with shorter overall survival irrespectively of the International Prognostic Index and BCL2 expression. In conclusion, MYC protein expression identifies a subset of diffuse large B-cell lymphoma with very poor prognosis independently of gene alterations and other prognostic parameters. PMID:23716551
MYC protein expression and genetic alterations have prognostic impact in patients with diffuse large B-cell lymphoma treated with immunochemotherapy.

PubMed

Valera, Alexandra; López-Guillermo, Armando; Cardesa-Salzmann, Teresa; Climent, Fina; González-Barca, Eva; Mercadal, Santiago; Espinosa, Iñigo; Novelli, Silvana; Briones, Javier; Mate, José L; Salamero, Olga; Sancho, Juan M; Arenillas, Leonor; Serrano, Sergi; Erill, Nadina; Martínez, Daniel; Castillo, Paola; Rovira, Jordina; Martínez, Antonio; Campo, Elias; Colomo, Luis

2013-10-01

MYC alterations influence the survival of patients with diffuse large B-cell lymphoma. Most studies have focused on MYC translocations but there is little information regarding the impact of numerical alterations and protein expression. We analyzed the genetic alterations and protein expression of MYC, BCL2, BCL6, and MALT1 in 219 cases of diffuse large B-cell lymphoma. MYC rearrangement occurred as the sole abnormality (MYC single-hit) in 3% of cases, MYC and concurrent BCL2 and/or BCL6 rearrangements (MYC double/triple-hit) in 4%, MYC amplifications in 2% and MYC gains in 19%. MYC single-hit, MYC double/triple-hit and MYC amplifications, but not MYC gains or other gene rearrangements, were associated with unfavorable progression-free survival and overall survival. MYC protein expression, evaluated using computerized image analysis, captured the unfavorable prognosis of MYC translocations/amplifications and identified an additional subset of patients without gene alterations but with similar poor prognosis. Patients with tumors expressing both MYC/BCL2 had the worst prognosis, whereas those with double-negative tumors had the best outcome. High MYC expression was associated with shorter overall survival irrespectively of the International Prognostic Index and BCL2 expression. In conclusion, MYC protein expression identifies a subset of diffuse large B-cell lymphoma with very poor prognosis independently of gene alterations and other prognostic parameters.
Software Tool for Researching Annotations of Proteins (STRAP): Open-Source Protein Annotation Software with Data Visualization

PubMed Central

Bhatia, Vivek N.; Perlman, David H.; Costello, Catherine E.; McComb, Mark E.

2009-01-01

In order that biological meaning may be derived and testable hypotheses may be built from proteomics experiments, assignments of proteins identified by mass spectrometry or other techniques must be supplemented with additional notation, such as information on known protein functions, protein-protein interactions, or biological pathway associations. Collecting, organizing, and interpreting this data often requires the input of experts in the biological field of study, in addition to the time-consuming search for and compilation of information from online protein databases. Furthermore, visualizing this bulk of information can be challenging due to the limited availability of easy-to-use and freely available tools for this process. In response to these constraints, we have undertaken the design of software to automate annotation and visualization of proteomics data in order to accelerate the pace of research. Here we present the Software Tool for Researching Annotations of Proteins (STRAP) – a user-friendly, open-source C# application. STRAP automatically obtains gene ontology (GO) terms associated with proteins in a proteomics results ID list using the freely accessible UniProtKB and EBI GOA databases. Summarized in an easy-to-navigate tabular format, STRAP includes meta-information on the protein in addition to complimentary GO terminology. Additionally, this information can be edited by the user so that in-house expertise on particular proteins may be integrated into the larger dataset. STRAP provides a sortable tabular view for all terms, as well as graphical representations of GO-term association data in pie (biological process, cellular component and molecular function) and bar charts (cross comparison of sample sets) to aid in the interpretation of large datasets and differential analyses experiments. Furthermore, proteins of interest may be exported as a unique FASTA-formatted file to allow for customizable re-searching of mass spectrometry data, and gene
Using sound to solve syntactic problems: the role of phonology in grammatical category assignments.

PubMed

Kelly, M H

1992-04-01

One ubiquitous problem in language processing involves the assignment of words to the correct grammatical category, such as noun or verb. In general, semantic and syntactic cues have been cited as the principal information for grammatical category assignment, to the neglect of possible phonological cues. This neglect is unwarranted, and the following claims are made: (a) Numerous correlations between phonology and grammatical class exist, (b) some of these correlations are large and can pervade the entire lexicon of a language and hence can involve thousands of words, (c) experiments have repeatedly found that adults and children have learned these correlations, and (d) explanations for how these correlations arose can be proposed and evaluated. Implications of these phenomena for language representation and processing are discussed.
7 CFR 1210.540 - OMB assigned numbers.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 7 Agriculture 10 2010-01-01 2010-01-01 false OMB assigned numbers. 1210.540 Section 1210.540 Agriculture Regulations of the Department of Agriculture (Continued) AGRICULTURAL MARKETING SERVICE (MARKETING... Management and Budget (OMB) under the provisions of 44 U.S.C. chapter 35 and have been assigned OMB Control...
Learning cellular sorting pathways using protein interactions and sequence motifs.

PubMed

Lin, Tien-Ho; Bar-Joseph, Ziv; Murphy, Robert F

2011-11-01

Proper subcellular localization is critical for proteins to perform their roles in cellular functions. Proteins are transported by different cellular sorting pathways, some of which take a protein through several intermediate locations until reaching its final destination. The pathway a protein is transported through is determined by carrier proteins that bind to specific sequence motifs. In this article, we present a new method that integrates protein interaction and sequence motif data to model how proteins are sorted through these sorting pathways. We use a hidden Markov model (HMM) to represent protein sorting pathways. The model is able to determine intermediate sorting states and to assign carrier proteins and motifs to the sorting pathways. In simulation studies, we show that the method can accurately recover an underlying sorting model. Using data for yeast, we show that our model leads to accurate prediction of subcellular localization. We also show that the pathways learned by our model recover many known sorting pathways and correctly assign proteins to the path they utilize. The learned model identified new pathways and their putative carriers and motifs and these may represent novel protein sorting mechanisms. Supplementary results and software implementation are available from http://murphylab.web.cmu.edu/software/2010_RECOMB_pathways/.
Learning Cellular Sorting Pathways Using Protein Interactions and Sequence Motifs

PubMed Central

Lin, Tien-Ho; Bar-Joseph, Ziv

2011-01-01

Abstract Proper subcellular localization is critical for proteins to perform their roles in cellular functions. Proteins are transported by different cellular sorting pathways, some of which take a protein through several intermediate locations until reaching its final destination. The pathway a protein is transported through is determined by carrier proteins that bind to specific sequence motifs. In this article, we present a new method that integrates protein interaction and sequence motif data to model how proteins are sorted through these sorting pathways. We use a hidden Markov model (HMM) to represent protein sorting pathways. The model is able to determine intermediate sorting states and to assign carrier proteins and motifs to the sorting pathways. In simulation studies, we show that the method can accurately recover an underlying sorting model. Using data for yeast, we show that our model leads to accurate prediction of subcellular localization. We also show that the pathways learned by our model recover many known sorting pathways and correctly assign proteins to the path they utilize. The learned model identified new pathways and their putative carriers and motifs and these may represent novel protein sorting mechanisms. Supplementary results and software implementation are available from http://murphylab.web.cmu.edu/software/2010_RECOMB_pathways/. PMID:21999284
Protein crystal growth in microgravity review of large scale temperature induction method: Bovine insulin, human insulin and human α-interferon

NASA Astrophysics Data System (ADS)

Long, Marianna M.; Bishop, John Bradford; Delucas, Lawrence J.; Nagabhushan, Tattanhalli L.; Reichert, Paul; Smith, G. David

1997-01-01

The Protein Crystal Growth Facility (PCF) is space-flight hardware that accommodates large scale protein crystal growth experiments using temperature change as the inductive step. Recent modifications include specialized instrumentation for monitoring crystal nucleation with laser light scattering. This paper reviews results from its first seven flights on the Space Shuttle, the last with laser light scattering instrumentation in place. The PCF's objective is twofold: (1) the production of high quality protein crystals for x-ray analysis and subsequent structure-based drug design and (2) preparation of a large quantity of relatively contaminant free crystals for use as time-release protein pharmaceuticals. The first three Shuttle flights with bovine insulin constituted the PCF's proof of concept, demonstrating that the space-grown crystals were larger and diffracted to higher resolution than their earth-grown counterparts. The later four PCF missions were used to grow recombinant human insulin crystals for x-ray analysis and continue productions trials aimed at the development of a processing facility for crystalline recombinant a-interferon.
Protein Annotators' Assistant: A Novel Application of Information Retrieval Techniques.

ERIC Educational Resources Information Center

Wise, Michael J.

2000-01-01

Protein Annotators' Assistant (PAA) is a software system which assists protein annotators in assigning functions to newly sequenced proteins. PAA employs a number of information retrieval techniques in a novel setting and is thus related to text categorization, where multiple categories may be suggested, except that in this case none of the…
Impact of an All-or-Nothing Assignment Requirement under Medicare

PubMed Central

Mitchell, Janet B.; Cromwell, Jerry

1983-01-01

In an effort to raise assignment rates, some policymakers have considered dropping Medicare's case-by-case assignment option. Physicians would have to decide whether to accept all of their patients on assignment, or none of them. In a 1976 national survey, over two-thirds of the physicians stated they would take none of their patients on assignment if forced to choose. Simulation analysis showed that in that event, assignment rates nationwide would fall almost 10 percent. The mean supply of assigned visits would actually increase 11 percent for general practitioners, while decreasing 12-25 percent for general surgeons, internists, and obstetricians/gynecologists. PMID:10310000

Testing the Effectiveness of Online Assignments in Theory of Finance

ERIC Educational Resources Information Center

Batu, Michael; Bower, Nancy; Lun, Esmond; Sadanand, Asha

2018-01-01

The authors investigated the effectiveness of online versus paper assignments using final examination scores in three cohorts of theory of finance. In particular, two cohorts were exposed to online assignments while another cohort was exposed to traditional assignments. The central result is that exposure to online assignments robustly leads to…
Assignment Choice: Do Students Choose Briefer Assignments or Finishing What They Started?

ERIC Educational Resources Information Center

Hawthorn-Embree, Meredith L.; Skinner, Christopher H.; Parkhurst, John; O'Neil, Michael; Conley, Elisha

2010-01-01

Academic skill development requires engagement in effortful academic behaviors. Although students may be more likely to choose to engage in behaviors that require less effort, they also may be motivated to complete assignments that they have already begun. Seventh-grade students (N = 88) began a mathematics computation worksheet, but were stopped…
Identification of structural domains in proteins by a graph heuristic.

PubMed

Wernisch, L; Hunting, M; Wodak, S J

1999-05-15

A novel automatic procedure for identifying domains from protein atomic coordinates is presented. The procedure, termed STRUDL (STRUctural Domain Limits), does not take into account information on secondary structures and handles any number of domains made up of contiguous or non-contiguous chain segments. The core algorithm uses the Kernighan-Lin graph heuristic to partition the protein into residue sets which display minimum interactions between them. These interactions are deduced from the weighted Voronoi diagram. The generated partitions are accepted or rejected on the basis of optimized criteria, representing basic expected physical properties of structural domains. The graph heuristic approach is shown to be very effective, it approximates closely the exact solution provided by a branch and bound algorithm for a number of test proteins. In addition, the overall performance of STRUDL is assessed on a set of 787 representative proteins from the Protein Data Bank by comparison to domain definitions in the CATH protein classification. The domains assigned by STRUDL agree with the CATH assignments in at least 81% of the tested proteins. This result is comparable to that obtained previously using PUU (Holm and Sander, Proteins 1994;9:256-268), the only other available algorithm designed to identify domains with any number of non-contiguous chain segments. A detailed discussion of the structures for which our assignments differ from those in CATH brings to light some clear inconsistencies between the concept of structural domains based on minimizing inter-domain interactions and that of delimiting structural motifs that represent acceptable folding topologies or architectures. Considering both concepts as complementary and combining them in a layered approach might be the way forward.
24 CFR 241.895 - Assignment of insured loans.

Code of Federal Regulations, 2010 CFR

2010-04-01

... AUTHORITIES SUPPLEMENTARY FINANCING FOR INSURED PROJECT MORTGAGES Contract Rights and Obligations-Multifamily Projects Without a HUD-Insured or HUD-Held Mortgage Assignments § 241.895 Assignment of insured loans. (a...
3D Complex: A Structural Classification of Protein Complexes

PubMed Central

Levy, Emmanuel D; Pereira-Leal, Jose B; Chothia, Cyrus; Teichmann, Sarah A

2006-01-01

Most of the proteins in a cell assemble into complexes to carry out their function. It is therefore crucial to understand the physicochemical properties as well as the evolution of interactions between proteins. The Protein Data Bank represents an important source of information for such studies, because more than half of the structures are homo- or heteromeric protein complexes. Here we propose the first hierarchical classification of whole protein complexes of known 3-D structure, based on representing their fundamental structural features as a graph. This classification provides the first overview of all the complexes in the Protein Data Bank and allows nonredundant sets to be derived at different levels of detail. This reveals that between one-half and two-thirds of known structures are multimeric, depending on the level of redundancy accepted. We also analyse the structures in terms of the topological arrangement of their subunits and find that they form a small number of arrangements compared with all theoretically possible ones. This is because most complexes contain four subunits or less, and the large majority are homomeric. In addition, there is a strong tendency for symmetry in complexes, even for heteromeric complexes. Finally, through comparison of Biological Units in the Protein Data Bank with the Protein Quaternary Structure database, we identified many possible errors in quaternary structure assignments. Our classification, available as a database and Web server at http://www.3Dcomplex.org, will be a starting point for future work aimed at understanding the structure and evolution of protein complexes. PMID:17112313
Gain weighted eigenspace assignment

NASA Technical Reports Server (NTRS)

Davidson, John B.; Andrisani, Dominick, II

1994-01-01

This report presents the development of the gain weighted eigenspace assignment methodology. This provides a designer with a systematic methodology for trading off eigenvector placement versus gain magnitudes, while still maintaining desired closed-loop eigenvalue locations. This is accomplished by forming a cost function composed of a scalar measure of error between desired and achievable eigenvectors and a scalar measure of gain magnitude, determining analytical expressions for the gradients, and solving for the optimal solution by numerical iteration. For this development the scalar measure of gain magnitude is chosen to be a weighted sum of the squares of all the individual elements of the feedback gain matrix. An example is presented to demonstrate the method. In this example, solutions yielding achievable eigenvectors close to the desired eigenvectors are obtained with significant reductions in gain magnitude compared to a solution obtained using a previously developed eigenspace (eigenstructure) assignment method.
Assigning crystallographic electron densities with free energy calculations—The case of the fluoride channel Fluc

PubMed Central

2018-01-01

Approximately 90% of the structures in the Protein Data Bank (PDB) were obtained by X-ray crystallography or electron microscopy. Whereas the overall quality of structure is considered high, thanks to a wide range of tools for structure validation, uncertainties may arise from density maps of small molecules, such as organic ligands, ions or water, which are non-covalently bound to the biomolecules. Even with some experience and chemical intuition, the assignment of such disconnected electron densities is often far from obvious. In this study, we suggest the use of molecular dynamics (MD) simulations and free energy calculations, which are well-established computational methods, to aid in the assignment of ambiguous disconnected electron densities. Specifically, estimates of (i) relative binding affinities, for instance between an ion and water, (ii) absolute binding free energies, i.e., free energies for transferring a solute from bulk solvent to a binding site, and (iii) stability assessments during equilibrium simulations may reveal the most plausible assignments. We illustrate this strategy using the crystal structure of the fluoride specific channel (Fluc), which contains five disconnected electron densities previously interpreted as four fluoride and one sodium ion. The simulations support the assignment of the sodium ion. In contrast, calculations of relative and absolute binding free energies as well as stability assessments during free MD simulations suggest that four of the densities represent water molecules instead of fluoride. The assignment of water is compatible with the loss of these densities in the non-conductive F82I/F85I mutant of Fluc. We critically discuss the role of the ion force fields for the calculations presented here. Overall, these findings indicate that MD simulations and free energy calculations are helpful tools for modeling water and ions into crystallographic density maps. PMID:29771936
26 CFR 26.2651-1 - Generation assignment.

Code of Federal Regulations, 2011 CFR

2011-04-01

... 26 Internal Revenue 14 2011-04-01 2010-04-01 true Generation assignment. 26.2651-1 Section 26.2651... TAXES GENERATION-SKIPPING TRANSFER TAX REGULATIONS UNDER THE TAX REFORM ACT OF 1986 § 26.2651-1 Generation assignment. (a) Special rule for persons with a deceased parent—(1) In general. This paragraph (a...
26 CFR 26.2651-1 - Generation assignment.

Code of Federal Regulations, 2012 CFR

2012-04-01

... 26 Internal Revenue 14 2012-04-01 2012-04-01 false Generation assignment. 26.2651-1 Section 26... GIFT TAXES GENERATION-SKIPPING TRANSFER TAX REGULATIONS UNDER THE TAX REFORM ACT OF 1986 § 26.2651-1 Generation assignment. (a) Special rule for persons with a deceased parent—(1) In general. This paragraph (a...
26 CFR 26.2651-1 - Generation assignment.

Code of Federal Regulations, 2014 CFR

2014-04-01

... 26 Internal Revenue 14 2014-04-01 2013-04-01 true Generation assignment. 26.2651-1 Section 26.2651... TAXES GENERATION-SKIPPING TRANSFER TAX REGULATIONS UNDER THE TAX REFORM ACT OF 1986 § 26.2651-1 Generation assignment. (a) Special rule for persons with a deceased parent—(1) In general. This paragraph (a...
26 CFR 26.2651-1 - Generation assignment.

Code of Federal Regulations, 2013 CFR

2013-04-01

... 26 Internal Revenue 14 2013-04-01 2013-04-01 false Generation assignment. 26.2651-1 Section 26... GIFT TAXES GENERATION-SKIPPING TRANSFER TAX REGULATIONS UNDER THE TAX REFORM ACT OF 1986 § 26.2651-1 Generation assignment. (a) Special rule for persons with a deceased parent—(1) In general. This paragraph (a...
The Manifesto Assignment: Study with Women Prisoners

ERIC Educational Resources Information Center

Moore, Lisa Jean

2016-01-01

Lisa Jean Moore describes a class assignment for her Social and Cultural Change course in which she was required to accommodate the material limitations of teaching within a highly regulated and restricted environment. This assignment, carried out in a women's prison, could be adapted for traditional college settings and travel abroad programs,…
Selling the Home Territory: Assignments for International MBA Students.

ERIC Educational Resources Information Center

Reep, Diana C.

2000-01-01

Describes the author's approach to a MBA writing course functioning to connect the writing and oral presentation assignments to the students' current jobs. Notes that most international students lack an understanding of American business practices. Develops the assignments around what the students do know. Notes three assignments including a…
QuickProbs 2: Towards rapid construction of high-quality alignments of large protein families

PubMed Central

Gudyś, Adam; Deorowicz, Sebastian

2017-01-01

The ever-increasing size of sequence databases caused by the development of high throughput sequencing, poses to multiple alignment algorithms one of the greatest challenges yet. As we show, well-established techniques employed for increasing alignment quality, i.e., refinement and consistency, are ineffective when large protein families are investigated. We present QuickProbs 2, an algorithm for multiple sequence alignment. Based on probabilistic models, equipped with novel column-oriented refinement and selective consistency, it offers outstanding accuracy. When analysing hundreds of sequences, Quick-Probs 2 is noticeably better than ClustalΩ and MAFFT, the previous leaders for processing numerous protein families. In the case of smaller sets, for which consistency-based methods are the best performing, QuickProbs 2 is also superior to the competitors. Due to low computational requirements of selective consistency and utilization of massively parallel architectures, presented algorithm has similar execution times to ClustalΩ, and is orders of magnitude faster than full consistency approaches, like MSAProbs or PicXAA. All these make QuickProbs 2 an excellent tool for aligning families ranging from few, to hundreds of proteins. PMID:28139687
PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank.

PubMed

Tusnády, Gábor E; Dosztányi, Zsuzsanna; Simon, István

2005-01-01

PDB_TM is a database for transmembrane proteins with known structures. It aims to collect all transmembrane proteins that are deposited in the protein structure database (PDB) and to determine their membrane-spanning regions. These assignments are based on the TMDET algorithm, which uses only structural information to locate the most likely position of the lipid bilayer and to distinguish between transmembrane and globular proteins. This algorithm was applied to all PDB entries and the results were collected in the PDB_TM database. By using TMDET algorithm, the PDB_TM database can be automatically updated every week, keeping it synchronized with the latest PDB updates. The PDB_TM database is available at http://www.enzim.hu/PDB_TM.
Interactive effects of protein and carbohydrates on production of microbial metabolites in the large intestine of growing pigs.

PubMed

Taciak, Marcin; Barszcz, Marcin; Święch, Ewa; Tuśnio, Anna; Bachanek, Ilona

2017-06-01

The study aimed at determining the effect of protein type and indigestible carbohydrates on the concentration of microbial metabolites in the large intestine of pigs. The experiment involved 36 pigs (15 kg initial body weight) divided into six groups, fed cereal-based diets with highly digestible casein (CAS) or potato protein concentrate (PPC) of lower ileal digestibility. Each diet was supplemented with cellulose, raw potato starch or pectin. After 2 weeks of feeding, pigs were sacrificed and samples of caecal and ascending, transverse and descending colon digesta were collected for analyses of microbial metabolites. PPC increased the concentration of ammonia, p-cresol, indole, n-butyrate, isovalerate and most of the amines in comparison with CAS. Pectin reduced the production of p-cresol, indole, phenylethylamine and isovalerate in the large intestine compared with potato starch. Starch and pectin increased mainly the concentration of n-butyrate and n-valerate in the colon compared to cellulose. Interaction affected mainly amines. Feeding PPC diet with potato starch considerably increased putrescine, cadaverine, tyramine and total amines concentrations compared with PPC diets with pectin and cellulose, whereas feeding CAS diet with starch reduced their concentrations. There was also a significant effect of interaction between diet and intestinal segment on microbial metabolites. In conclusion, PPC intensifies proteolysis in the large intestine and also n-butyrate production. Raw starch and pectin similarly increase n-butyrate concentration but pectin inhibits proteolysis more efficiently than starch. The interactive effects of both factors indicate that pectin and cellulose may beneficially affect fermentative processes in case of greater protein flow to the large intestine.
NMRNet: A deep learning approach to automated peak picking of protein NMR spectra.

PubMed

Klukowski, Piotr; Augoff, Michal; Zieba, Maciej; Drwal, Maciej; Gonczarek, Adam; Walczak, Michal J

2018-03-14

Automated selection of signals in protein NMR spectra, known as peak picking, has been studied for over 20 years, nevertheless existing peak picking methods are still largely deficient. Accurate and precise automated peak picking would accelerate the structure calculation, and analysis of dynamics and interactions of macromolecules. Recent advancement in handling big data, together with an outburst of machine learning techniques, offer an opportunity to tackle the peak picking problem substantially faster than manual picking and on par with human accuracy. In particular, deep learning has proven to systematically achieve human-level performance in various recognition tasks, and thus emerges as an ideal tool to address automated identification of NMR signals. We have applied a convolutional neural network for visual analysis of multidimensional NMR spectra. A comprehensive test on 31 manually-annotated spectra has demonstrated top-tier average precision (AP) of 0.9596, 0.9058 and 0.8271 for backbone, side-chain and NOESY spectra, respectively. Furthermore, a combination of extracted peak lists with automated assignment routine, FLYA, outperformed other methods, including the manual one, and led to correct resonance assignment at the levels of 90.40%, 89.90% and 90.20% for three benchmark proteins. The proposed model is a part of a Dumpling software (platform for protein NMR data analysis), and is available at https://dumpling.bio/. michaljerzywalczak@gmail.compiotr.klukowski@pwr.edu.pl. Supplementary data are available at Bioinformatics online.
Predicting Faculty Intentions to Assign Writing in Their Classes

ERIC Educational Resources Information Center

Trafimow, David; Ruckel, Lindsay M.; Stovall, Shelly; Raut, Yogesh J.

2017-01-01

Teachers who offer undergraduate courses agree widely on the importance of writing assignments to further undergraduate education. And yet, there is a great deal of variance among teachers in their writing assignments; some teachers assign no writing whatsoever. To determine the variables that influence the decisions of teachers about whether to…
47 CFR 78.35 - Assignment or transfer of control.

Code of Federal Regulations, 2013 CFR

2013-10-01

... CABLE TELEVISION RELAY SERVICE Applications and Licenses § 78.35 Assignment or transfer of control. (a) No assignment of the license of a cable television relay station or transfer of control of a CARS... 47 Telecommunication 4 2013-10-01 2013-10-01 false Assignment or transfer of control. 78.35...
47 CFR 78.35 - Assignment or transfer of control.

Code of Federal Regulations, 2014 CFR

2014-10-01

... CABLE TELEVISION RELAY SERVICE Applications and Licenses § 78.35 Assignment or transfer of control. (a) No assignment of the license of a cable television relay station or transfer of control of a CARS... 47 Telecommunication 4 2014-10-01 2014-10-01 false Assignment or transfer of control. 78.35...

Some links on this page may take you to non-federal websites. Their policies may differ from this site.